Publication Details
Patrol Team Language Identification System for DARPA RATS P1 Evaluation
Plchot Oldřich, Ing., Ph.D. (DCGM FIT BUT)
Soufifar Mehdi Mohammad, Ing. (FIT BUT)
Glembek Ondřej, Ing., Ph.D. (DCGM FIT BUT)
D'Haro Luis Fernando (UPN)
Veselý Karel, Ing., Ph.D. (DCGM FIT BUT)
Grézl František, Ing., Ph.D. (DCGM FIT BUT)
Ma Jeff (Raytheon BBN)
Matsoukas Spyros (Raytheon BBN)
Dehak Najim (CRIM)
language identification, noisy speech
In this paper we present four systems that were part of the Patrol Team Language Identification system for the DARPA RATS project.
This paper describes the language identification (LID) system developed by the Patrol team for the first phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state of the art detection capabilities on audio from highly degraded communication channels. We show that techniques originally developed for LID on telephone speech (e.g., for the NIST language recognition evaluations) remain effective on the noisy RATS data, provided that careful consideration is applied when designing the training and development sets. In addition, we show significant improvements from the use of Wiener filtering, neural network based and language dependent i-vector modeling, and fusion.
@INPROCEEDINGS{FITPUB10098, author = "Pavel Mat\v{e}jka and Old\v{r}ich Plchot and Mohammad Mehdi Soufifar and Ond\v{r}ej Glembek and Fernando Luis D'Haro and Karel Vesel\'{y} and Franti\v{s}ek Gr\'{e}zl and Jeff Ma and Spyros Matsoukas and Najim Dehak", title = "Patrol Team Language Identification System for DARPA RATS P1 Evaluation", pages = "1--4", booktitle = "Proceedings of Interspeech 2012", journal = "Proceedings of Interspeech - on-line", volume = 2012, number = 9, year = 2012, location = "Portland, Oregon, US", publisher = "International Speech Communication Association", ISBN = "978-1-62276-759-5", ISSN = "1990-9772", language = "english", url = "https://www.fit.vut.cz/research/publication/10098" }