Publication Details
Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition
Hubeika Valiantsina, Ing. (DCGM FIT BUT)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Schwarz Petr, Ing., Ph.D. (DCGM FIT BUT)
Matějka Pavel, Ing., Ph.D. (DCGM FIT BUT)
Language Identification (LID), Broadcast data, Phone call detection, Channel compensation
The work is on Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition.
This paper presents a procedure of acquiring linguistic data from the broadcast media and its use in language recognition. The goal of this work is to answer the question whether the automatically obtained data from broadcasts can replace or augment to the continuous telephone speech. The main challenges are channel compensation issues and great portion of unspontaneous speech in broadcasts. The experimental results are obtained on NIST LRE 2007 evaluation system, using both NIST provided training data and data, obtained from broadcasts.
@INPROCEEDINGS{FITPUB8723, author = "Old\v{r}ich Plchot and Valiantsina Hubeika and Luk\'{a}\v{s} Burget and Petr Schwarz and Pavel Mat\v{e}jka", title = "Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition", pages = "477--483", booktitle = "Proc. 11th International Conference on Text, Speech and Dialogue", year = 2008, location = "Berlin, DE", publisher = "Springer Verlag", ISBN = "978-3-540-87390-7", language = "english", url = "https://www.fit.vut.cz/research/publication/8723" }