Project Details
Language-Independent Keyword Spotting
Project Period: 1. 1. 2012 - 31. 12. 2014
Project Type: grant
Code: GPP202/12/P567
Agency: Czech Science Foundation
Program: Postdoctoral grants (Postdoktorandské granty)
Keywords: keyword spotting, query-by-example, language independent, hidden Markov models, artificial neural network
This project aims to research and develop a language-independent keyword spotter for speech. Keywords will be entered as examples (Query-by-Example). The results are intended for speech search in settings where current approaches fail: exotic languages with insufficient or no training data, and recordings in which speakers switch languages within a conversation. The first goal is to define evaluation data for several languages and to evaluate state-of-the-art Query-by-Example systems in a cross-lingual environment. The main goals are: (1) to design and evaluate an approach to language-independent high-level feature extraction from speech, using a combination of several language-dependent artificial neural network classifiers; (2) to design and evaluate a GMM/HMM approach to Query-by-Example, where it will be important to estimate the keyword model correctly from several examples and to investigate training of the universal background model. The achieved results will also be compared with standard language-dependent approaches.
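As a rough illustration of goal (1), one simple way to combine several language-dependent classifiers is to concatenate their per-frame posterior outputs into a single feature vector. The sketch below is only an assumption about what such a combination could look like, not the project's actual method; the function name `stack_log_posteriors`, the flooring constant, and the toy posterior values are all hypothetical.

```python
import math


def stack_log_posteriors(per_classifier, floor=1e-10):
    """Concatenate per-frame posteriors from several classifiers.

    per_classifier: list of sequences, one per (hypothetical) language-dependent
    classifier; each is a list of frames, each frame a list of posteriors.
    Returns one list of frames whose vectors hold the floored log-posteriors
    of all classifiers, in classifier order.
    """
    if not per_classifier:
        raise ValueError("at least one classifier output is required")
    n_frames = len(per_classifier[0])
    if any(len(p) != n_frames for p in per_classifier):
        raise ValueError("all classifiers must score the same number of frames")

    features = []
    for t in range(n_frames):
        frame = []
        for posteriors in per_classifier:
            # Floor before the log so zero posteriors stay finite.
            frame.extend(math.log(max(v, floor)) for v in posteriors[t])
        features.append(frame)
    return features


# Toy example: two frames scored by two classifiers with 2 and 3 output classes.
classifier_a = [[0.9, 0.1], [0.2, 0.8]]
classifier_b = [[0.5, 0.25, 0.25], [0.0, 1.0, 0.0]]
stacked = stack_log_posteriors([classifier_a, classifier_b])
```

Each stacked frame has as many dimensions as the classifiers have output classes in total (five here), so a downstream Query-by-Example model sees evidence from every language-dependent classifier at once.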
Janda Miloš, Ing. (UPGM FIT VUT), team leader
Veselý Karel, Ing., Ph.D. (UPGM FIT VUT), team leader
2015
- SZŐKE Igor, SKÁCEL Miroslav, ČERNOCKÝ Jan and BURGET Lukáš. Coping with Channel Mismatch in Query-By-Example - BUT QUESST 2014. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, pp. 5838-5842. ISBN 978-1-4673-6997-8.
- ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., BUZO Andi, METZE Florian, SZŐKE Igor and PENAGARIKANO Mikel. QUESST 2014: Evaluating Query-By-Example Speech Search in a Zero-Resource. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, pp. 5833-5837. ISBN 978-1-4673-6997-8.
2014
- SZŐKE Igor, SKÁCEL Miroslav and BURGET Lukáš. BUT QUESST 2014 System Description. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014, pp. 1-2. ISSN 1613-0073.
- SZŐKE Igor, BURGET Lukáš, GRÉZL František, ČERNOCKÝ Jan and ONDEL Yang Lucas Antoine Francois. Calibration and Fusion of Query-by-example Systems - BUT SWS 2013. In: Proceedings of ICASSP 2014. Florence: IEEE Signal Processing Society, 2014, pp. 7899-7903. ISBN 978-1-4799-2892-7.
- ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., SZŐKE Igor, BUZO Andi and METZE Florian. Query by Example Search on Speech at MediaEval 2014. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014, pp. 1-2. ISSN 1613-0073.
- ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., SZŐKE Igor, BUZO Andi and METZE Florian et al. Query-by-example Spoken Term Detection Evaluation on Low-resource Languages. In: Proceedings of the 4th International Workshop on Spoken Language Technologies for Under-resourced Languages SLTU-2014. St. Petersburg, Russia. St. Petersburg: International Speech Communication Association, 2014, pp. 24-31. ISBN 978-5-8088-0908-6.
2013
- SZŐKE Igor, BURGET Lukáš, GRÉZL František and ONDEL Yang Lucas Antoine Francois. BUT SWS 2013 - Massive Parallel Approach. In: Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop. Barcelona: CEUR-WS.org, 2013, pp. 1-2. ISSN 1613-0073.
- ANGUERA Xavier, METZE Florian, BUZO Andi, SZŐKE Igor and RODRIGUEZ-FUENTES Luis J. The Spoken Web Search Task. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2013, pp. 1-2. ISSN 1613-0073.
2012
- SZŐKE Igor, FAPŠO Michal and VESELÝ Karel. BUT2012 Approaches for Spoken Web Search - MediaEval 2012. In: Working Notes Proceedings of the MediaEval 2012 Workshop. Pisa: CEUR-WS.org, 2012, pp. 1-2. ISSN 1613-0073.
- TEJEDOR Javier, FAPŠO Michal, SZŐKE Igor, ČERNOCKÝ Jan and GRÉZL František. Comparison of methods for language-dependent and language-independent query-by-example spoken term detection. ACM Transactions on Information Systems (TOIS), vol. 2012, no. 30, pp. 1-34. ISSN 1046-8188.
- SZŐKE Igor, FAPŠO Michal, ŽIŽKA Josef, BERAN Vítězslav and ČERNOCKÝ Jan. Efektivní přístup ke znalostem v audio-vizuálních záznamech [Efficient access to knowledge in audio-visual recordings]. In: Proceedings of the Annual Database Conference. Prague: The University of Technology Košice, 2012, pp. 57-74. ISBN 978-80-553-1049-7.