Result Details

Gender Independent Discriminative Speaker Recognition in I-Vector Space

CUMANI, S.; GLEMBEK, O.; BRUMMER, J.; DE VILLIERS, E.; LAFACE, P. Gender Independent Discriminative Speaker Recognition in I-Vector Space. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 4361-4364. ISBN: 978-1-4673-0044-5.

Type

conference paper

Language

English

Authors

Cumani Sandro, Ph.D.
Glembek Ondřej, Ing., Ph.D., FIT (FIT), DCGM (FIT)
Brummer Johan Nikolaas Langenhoven, Dr.
de Villiers Edward
Laface Pietro

Abstract

This paper describes speaker recognition systems that are trained with gender dependent features and tested with known gender trails.

Keywords

speaker recognition, gender recognition, PLDA models, GI Pairwise SVM

URL

https://www.fit.vut.cz/research/group/speech/public/publi/2012/cumani…

Annotation

Speaker recognition systems attain their best accuracy when trained with gender dependent features and tested with known gender trials. In real applications, howevcer, gender labels are often not given. In this work we illustrate the design of a system that does not make use of the gender labels both in training and in test, i.e. a completely Gender Independent (GI) system. It relies on discriminative training, where the trials are i-vector pairs, and the discrimination is between the hypothesis that the pair of feature vectors in the trial belong to the same speaker or to different speakers. We demonstrate that this pairwise discriminative training can be interpreted as a procedure that estimates the parameters of the best (second order) approximation of the log-likelihood ratio score function, and that a pairwise SVM can be used for training a gender independent system. Our results show that a pairwise GI SVM, saving memory and execution time, achieves on the last NIST evaluationscomplet state-of-the-art performance, comparable to a Gender Dependent(GD) system.

Published

2012

Pages

4361–4364

Proceedings

Proc. International Conference on Acoustics, Speech, and Signal P

Conference

The 37th International Conference on Acoustics, Speech, and Signal Processing

ISBN

978-1-4673-0044-5

Publisher

IEEE Signal Processing Society

Place

Kyoto

DOI

10.1109/ICASSP.2012.6288885

BibTeX

@inproceedings{BUT91484,
  author="Sandro {Cumani} and Ondřej {Glembek} and Johan Nikolaas Langenhoven {Brummer} and Edward {de Villiers} and Pietro {Laface}",
  title="Gender Independent Discriminative Speaker Recognition in I-Vector Space",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal P",
  year="2012",
  pages="4361--4364",
  publisher="IEEE Signal Processing Society",
  address="Kyoto",
  doi="10.1109/ICASSP.2012.6288885",
  isbn="978-1-4673-0044-5",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/cumani_icassp2012_0004361.pdf"
}

Projects

Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running

Research groups

Výzkumná skupina dolování dat z řeči BUT Speech@FIT (RG SPEECH)

Departments

Ústav počítačové grafiky a multimédií (DCGM)