Publication Details
The dawn of a text-dependent society: deepfakes as a threat to speech verification systems
deepfakes, speech verification, voice biometrics, machine learning, cybersecurity
We are already aware that deepfakes pose threats to humankind. Nowadays, mostly as fake news or disinformation; however, there are still unexplored areas such as using deepfakes to spoof voice verification. We present a real-world use case for spoofing voice authentication in a customer care call center. Based on this scenario, we evaluate the feasibility of attacking such a system and create an attacker profile. For this purpose, we examine three available speech synthesis tools and discuss their usability. We use these tools and acquired knowledge to generate a dataset including deepfake speech and assess the resilience of voice biometrics systems against deepfakes. We prove that voice biometrics systems are indeed vulnerable to deepfake powered attacks. The most significant outcome is the proposal of text-dependent verification as a novel countermeasure for presented attacks. Text-dependent verification provides higher security than text-independent verification and can be used today as the simplest protection method against deepfakes.
@INPROCEEDINGS{FITPUB12595, author = "Anton Firc and Kamil Malinka", title = "The dawn of a text-dependent society: deepfakes as a threat to speech verification systems", pages = "1646--1655", booktitle = "SAC '22: Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing", year = 2022, location = "New York, NY, US", publisher = "Association for Computing Machinery", ISBN = "978-1-4503-8713-2", doi = "10.1145/3477314.3507013", language = "english", url = "https://www.fit.vut.cz/research/publication/12595" }