Project Details
DARPA Robust Automatic Transcription of Speech (RATS) - RATS Patrol I
Project Period: 23. 9. 2010 - 30. 6. 2014
Project Type: contract
Code: D10PC20015
Partner: Raytheon BBN Technologies
Keywords: speech recognition, speaker recognition, language recognition, keyword spotting, robustness, noise, transmission channels
Existing speech signal processing technologies are inadequate for most noisy or degraded speech signals that are important to military intelligence. The Robust Automatic Transcription of Speech (RATS) program is creating algorithms and software for performing the following tasks on potentially speech-containing signals received over communication channels that are extremely noisy and/or highly distorted: Speech Activity Detection, Language Identification, Speaker Identification and Key Word Spotting.
Andrla Petr, Ing. (UPGM FIT VUT), team leader
Cipr Tomáš, Ing. (UPGM FIT VUT), team leader
Černocký Jan, prof. Dr. Ing. (UPGM FIT VUT), team leader
Grézl František, Ing., Ph.D. (UPGM FIT VUT), team leader
Chalupníček Kamil, Ing. (UPGM FIT VUT), team leader
Otáhalová Sylva (UPGM FIT VUT), team leader
Szőke Igor, Ing., Ph.D. (UPGM FIT VUT), team leader
2017
- PLCHOT Oldřich, MATĚJKA Pavel, SILNOVA Anna, NOVOTNÝ Ondřej, DIEZ Sánchez Mireia, ROHDIN Johan A., GLEMBEK Ondřej, BRÜMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, BUERA Luis, KENNY Patrick, ALAM Jahangir and BHATTACHARYA Gautam. Analysis and Description of ABC Submission to NIST SRE 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1348-1352. ISSN 1990-9772.
- MATĚJKA Pavel, NOVOTNÝ Ondřej, PLCHOT Oldřich, BURGET Lukáš, DIEZ Sánchez Mireia and ČERNOCKÝ Jan. Analysis of Score Normalization in Multilingual Speaker Recognition. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1567-1571. ISSN 1990-9772.
2014
- GLEMBEK Ondřej, MA Jeff, MATĚJKA Pavel, ZHANG Bing, PLCHOT Oldřich, BURGET Lukáš and MATSOUKAS Spyros. Domain Adaptation Via Within-class Covariance Correction in I-Vector Based Speaker Recognition Systems. In: Proceedings of ICASSP 2014. Florence: IEEE Signal Processing Society, 2014, pp. 4060-4064. ISBN 978-1-4799-2892-7.
- MATĚJKA Pavel, ZHANG Le, NG Tim, MALLIDI Sri Harish, GLEMBEK Ondřej, MA Jeff and ZHANG Bing. Neural Network Bottleneck Features for Language Identification. In: Proceedings of Odyssey 2014. Joensuu: International Speech Communication Association, 2014, pp. 299-304. ISSN 2312-2846.
- BAHARI Mohamad H., DEHAK Najim, VAN hamme Hugo, BURGET Lukáš, ALI Ahmed M. and GLASS Jim. Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 22, no. 7, 2014, pp. 1117-1129. ISSN 2329-9290.
- CUMANI Sandro, LAFACE Pietro and PLCHOT Oldřich. On the use of i-vector posterior distributions in Probabilistic Linear Discriminant Analysis. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 22, no. 4, 2014, pp. 846-857. ISSN 2329-9290.
- PLCHOT Oldřich, DIEZ Sánchez Mireia, SOUFIFAR Mehdi and BURGET Lukáš. PLLR Features in Language Recognition System for RATS. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 3048-3051. ISBN 978-1-63439-435-2.
- NG Tim, HSIAO Roger, ZHANG Le, KARAKOS Damianos, MALLIDI Sri Harish, KARAFIÁT Martin, VESELÝ Karel, SZŐKE Igor, ZHANG Bing, NGUYEN Long and SCHWARTZ Richard. Progress in the BBN Keyword Search System for the DARPA RATS Program. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 959-963. ISBN 978-1-63439-435-2.
- MARTÍNEZ González David, BURGET Lukáš, STAFYLAKIS Themos, LEI Yun, KENNY Patrick and LLEIDA Eduardo. Unscented Transform For Ivector-based Noisy Speaker Recognition. In: Proceedings of ICASSP 2014. Florence: IEEE Signal Processing Society, 2014, pp. 4070-4074. ISBN 978-1-4799-2892-7.
2013
- PLCHOT Oldřich, MATSOUKAS Spyros, MATĚJKA Pavel, DEHAK Najim, MA Jeff, CUMANI Sandro, GLEMBEK Ondřej, HEŘMANSKÝ Hynek, MESGARANI Nima, SOUFIFAR Mehdi Mohammad, THOMAS Samuel, ZHANG Bing and ZHOU Xinhui et al. Developing A Speaker Identification System For The DARPA RATS Project. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 6768-6772. ISBN 978-1-4799-0355-9.
- CUMANI Sandro, BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, LAFACE Pietro, PLCHOT Oldřich and VASILAKAKIS Vasileios. Pairwise Discriminative Speaker Verification in the I-Vector Space. IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 6, 2013, pp. 1217-1227. ISSN 1558-7916.
- CUMANI Sandro, PLCHOT Oldřich and LAFACE Pietro. Probabilistic Linear Discriminant Analysis Of I-Vector Posterior Distributions. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 7644-7648. ISBN 978-1-4799-0355-9.
- SOUFIFAR Mehdi Mohammad, BURGET Lukáš, PLCHOT Oldřich, CUMANI Sandro and ČERNOCKÝ Jan. Regularized Subspace n-Gram Model for Phonotactic iVector Extraction. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, pp. 74-78. ISBN 978-1-62993-443-3. ISSN 2308-457X.
2012
- LEI Yun, BURGET Lukáš and SCHEFFER Nicolas. Bilinear Factor Analysis for iVector Based Speaker Verification. In: Proceedings of Interspeech. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5.
- BRUMMER Johan Nikolaas Langenhoven, CUMANI Sandro, GLEMBEK Ondřej, KARAFIÁT Martin, MATĚJKA Pavel, PEŠÁN Jan, PLCHOT Oldřich, SOUFIFAR Mehdi Mohammad, DE Villiers Edward and ČERNOCKÝ Jan. Description and analysis of the Brno276 system for LRE2011. In: Proceedings of Odyssey 2012: The Speaker and Language Recognition Workshop. Singapore: International Speech Communication Association, 2012, pp. 216-223. ISBN 978-981-07-3093-2.
- NG Tim, ZHANG Bing, NGUYEN Long, MATSOUKAS Spyros, ZHOU Xinhui, MESGARANI Nima, VESELÝ Karel and MATĚJKA Pavel. Developing a Speech Activity Detection System for the DARPA RATS Program. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
- MATĚJKA Pavel, PLCHOT Oldřich, SOUFIFAR Mehdi Mohammad, GLEMBEK Ondřej, D'HARO Luis Fernando, VESELÝ Karel, GRÉZL František, MA Jeff, MATSOUKAS Spyros and DEHAK Najim. Patrol Team Language Identification System for DARPA RATS P1 Evaluation. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
- D'HARO Luis Fernando, GLEMBEK Ondřej, PLCHOT Oldřich, MATĚJKA Pavel, SOUFIFAR Mehdi Mohammad, CORDOBA Ricardo and ČERNOCKÝ Jan. Phonotactic Language Recognition using i-vectors and Phoneme Posteriogram Counts. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
- PLCHOT Oldřich, KARAFIÁT Martin, BRUMMER Johan Nikolaas Langenhoven, GLEMBEK Ondřej, MATĚJKA Pavel, DE Villiers Edward and ČERNOCKÝ Jan. Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification. In: Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapore: International Speech Communication Association, 2012, pp. 330-333. ISBN 978-981-07-3093-2.
2011
- SOUFIFAR Mehdi, KOCKMANN Marcel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej and SVENDSEN Torbjorn. iVector Approach to Phonotactic Language Recognition. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 2913-2916. ISBN 978-1-61839-270-1. ISSN 1990-9772.
- MARTÍNEZ González David, PLCHOT Oldřich, BURGET Lukáš, GLEMBEK Ondřej and MATĚJKA Pavel. Language Recognition in iVectors Space. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 861-864. ISBN 978-1-61839-270-1. ISSN 1990-9772.