Publication Details
Discriminative Training Techniques for Acoustic Language Identification
Matějka Pavel, Ing., Ph.D. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
language identification, language recognition, acoustic modeling, disriminative training, maximum mutual information
This paper presents comparison of Maximum Likelihood (ML)
and discriminative Maximum Mutual Information (MMI) training
for acoustic modeling in language identification (LID). Clear advantage of MMI over ML training is shown. The final error rate compares favorably to other results published on NIST 2003 data.
This paper presents comparison of Maximum Likelihood (ML)
and discriminative Maximum Mutual Information (MMI) training
for acoustic modeling in language identification (LID). Both approaches are compared on state-of-the-art shifted delta-cepstra features, the results are reported on data from NIST 2003 evaluations. Clear advantage of MMI over ML training is shown. Further improvements of acoustic LID are discussed: Heteroscedastic Linear Discriminant Analysis (HLDA) for feature de-correlation and dimensionality reduction and Ergodic Hidden Markov models (EHMM) for better modeling of dynamics in the acoustic space. The final error rate compares favorably to other results published on NIST 2003 data.
@INPROCEEDINGS{FITPUB8132, author = "Luk\'{a}\v{s} Burget and Pavel Mat\v{e}jka and Jan \v{C}ernock\'{y}", title = "Discriminative Training Techniques for Acoustic Language Identification", pages = "209--212", booktitle = "Proceedings of ICASSP 2006", year = 2006, location = "Toulouse, FR", language = "english", url = "https://www.fit.vut.cz/research/publication/8132" }