Publication Details
Gender Independent Discriminative Speaker Recognition in I-Vector Space
Glembek Ondřej, Ing., Ph.D. (DCGM FIT BUT)
Brummer Johan Nikolaas Langenhoven (Agnitio)
de Villiers Edward (Agnitio)
Laface Pietro, prof. (POLITO)
speaker recognition, gender recognition, PLDA models, GI Pairwise SVM
This paper describes speaker recognition systems that are trained with gender dependent features and tested with known gender trails.
Speaker recognition systems attain their best accuracy when trained with gender dependent features and tested with known gender trials. In real applications, howevcer, gender labels are often not given. In this work we illustrate the design of a system that does not make use of the gender labels both in training and in test, i.e. a completely Gender Independent (GI) system. It relies on discriminative training, where the trials are i-vector pairs, and the discrimination is between the hypothesis that the pair of feature vectors in the trial belong to the same speaker or to different speakers. We demonstrate that this pairwise discriminative training can be interpreted as a procedure that estimates the parameters of the best (second order) approximation of the log-likelihood ratio score function, and that a pairwise SVM can be used for training a gender independent system. Our results show that a pairwise GI SVM, saving memory and execution time, achieves on the last NIST evaluationscomplet state-of-the-art performance, comparable to a Gender Dependent(GD) system.
@INPROCEEDINGS{FITPUB9942, author = "Sandro Cumani and Ond\v{r}ej Glembek and Langenhoven Nikolaas Johan Brummer and Edward Villiers de and Pietro Laface", title = "Gender Independent Discriminative Speaker Recognition in I-Vector Space", pages = "4361--4364", booktitle = "Proc. International Conference on Acoustics, Speech, and Signal P", year = 2012, location = "Kyoto, JP", publisher = "IEEE Signal Processing Society", ISBN = "978-1-4673-0044-5", doi = "10.1109/ICASSP.2012.6288885", language = "english", url = "https://www.fit.vut.cz/research/publication/9942" }