Publication Details
Phonotactic Language Identification using High Quality Phoneme Recognition
Schwarz Petr, Ing., Ph.D. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
Chytil Pavel, Ing., Ph.D. (FEEC BUT)
Language identificaion, phoneme recognition
Phoneme Recognizers followed by Language Modeling (PRLM) have
consistently yielded top performance in language identification
(LID) task. Several phone recognizers are compared and results are reported on NIST LRE 2003.
Phoneme Recognizers followed by Language Modeling (PRLM) have
consistently yielded top performance in language identification
(LID) task. Parallel ordering of PRLMs (PPRLM) improves
performance even more.
Since tokenizer is the most important part of
LID system the high quality phoneme recognizer is employed. Two
different multilingual databases for training phoneme recognizers are
compared and the amount of sufficient training data is studied.
Reported results are on data from NIST
2003 LID evaluation. Our four PRLM systems have Equal Error Rate
(EER) of 2.4\% on 12 languages task. This result compares
favorably to the best known result from this task.
@INPROCEEDINGS{FITPUB7762, author = "Pavel Mat\v{e}jka and Petr Schwarz and Jan \v{C}ernock\'{y} and Pavel Chytil", title = "Phonotactic Language Identification using High Quality Phoneme Recognition", pages = "2237--2240", booktitle = "Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology", journal = "European Speech Communication", volume = 2005, number = 9, year = 2005, location = "Lisbon, PT", publisher = "International Speech Communication Association", ISSN = "1018-4074", language = "english", url = "https://www.fit.vut.cz/research/publication/7762" }