Publication Details

Clustering Unsupervised Representations as Defense against Poisoning Attacks on Speech Commands Classification System

THEBAUD Thomas, JOSHI Sonal, LI Henry, ŠŮSTEK Martin, VILLALBA Lopez Jesus Antonio, KHUDANPUR Sanjeev and DEHAK Najim. Clustering Unsupervised Representations as Defense against Poisoning Attacks on Speech Commands Classification System. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei: IEEE Signal Processing Society, 2023, pp. 1-8. ISBN 979-8-3503-0689-7. Available from: https://ieeexplore.ieee.org/document/10389650
Czech title
Shlukování reprezentací získaní pomocí učení bez učitele za účelem ochrany klasifikátoru řeči proti poisoning útokům
Type
conference paper
Language
english
Authors
Thebaud Thomas ()
Joshi Sonal ()
Li Henry ()
Šůstek Martin, Ing. (DCGM FIT BUT)
Villalba Lopez Jesus Antonio (JHU)
Khudanpur Sanjeev (JHU)
Dehak Najim (JHU)
URL
Keywords

poisoning attack, unsupervised representa-
tions, clustering, Speech commands, defense against attacks
on speech systems

Abstract

Poisoning attacks entail attackers intentionally tampering with training data. In this paper, we consider a dirty-label poisoning attack scenario on a speech commands classification system. The threat model assumes that certain utterances from one of the classes (source class) are poisoned by superimposing a trigger on it, and its label is changed to another class selected by the attacker (target class). We propose a filtering defense against such an attack. First, we use DIstillation with NO labels (DINO) to learn unsupervised representations for all the training examples. Next, we use K-means and LDA to cluster these representations. Finally, we keep the utterances with the most repeated label in their cluster for training and discard the rest. For a 10% poisoned source class, we demonstrate a drop in attack success rate from 99.75% to 0.25%. We test our defense against a variety of threat models, including different target and source classes, as well as trigger variations.

Published
2023
Pages
1-8
Proceedings
Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
Conference
2023 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), Taipei, TW
ISBN
979-8-3503-0689-7
Publisher
IEEE Signal Processing Society
Place
Taipei, TW
DOI
BibTeX
@INPROCEEDINGS{FITPUB13088,
   author = "Thomas Thebaud and Sonal Joshi and Henry Li and Martin \v{S}\r{u}stek and Antonio Jesus Lopez Villalba and Sanjeev Khudanpur and Najim Dehak",
   title = "Clustering Unsupervised Representations as Defense against Poisoning Attacks on Speech Commands Classification System",
   pages = "1--8",
   booktitle = "Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)",
   year = 2023,
   location = "Taipei, TW",
   publisher = "IEEE Signal Processing Society",
   ISBN = "979-8-3503-0689-7",
   doi = "10.1109/ASRU57964.2023.10389650",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/13088"
}
Back to top