Project Details
Recognition of Keywords and Actions in Audiovisual Data (Rozpoznávání klíčových slov a akcí v audiovizuálních datech)
Project Period: 26. 10. 2004 - 26. 10. 2006
Project Type: grant
Code: 119/2004
Agency: CESNET National Research and Education Network
Program:
Keywords: speech recognition, feature extraction, meeting data, audio-video processing, keyword spotting
The goal of the project is the automatic processing of recorded meeting data, which will allow users to browse directly to required events in audio/visual databases according to selected criteria. The proposed algorithms will also be adapted for use in large vocabulary continuous speech recognition tasks. Developing a robust, efficient system that any user can work with depends on training and testing the individual algorithms. These algorithms are generally very computationally expensive, especially when processing real meeting data and large-vocabulary databases. The cluster proposed in this project can significantly increase the computational capacity at our faculty, so that modern algorithms and trends in automatic speech recognition can be applied. We also expect the new cluster to be used in other areas of research at the Faculty of Information Technology.
Karafiát Martin, Ing., Ph.D. (UPGM FIT VUT), team leader
Kašpárek Tomáš, Ing., Ph.D. (CVT FIT VUT), team leader
Sumec Stanislav, Ing., Ph.D. (UPGM FIT VUT), team leader
2007
- GRÉZL František, KARAFIÁT Martin, KONTÁR Stanislav and ČERNOCKÝ Jan. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, pp. 757-760. ISBN 1-4244-0728-1.
- ČERNOCKÝ Jan, SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, KOPECKÝ Jiří, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, OPARIN Ilya, SMRŽ Pavel and MATĚJKA Pavel. Search in speech for public security and defense. In: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07). Washington D.C.: IEEE Signal Processing Society, 2007, pp. 1-7. ISBN 1-4244-1226-9.
- MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr, GLEMBEK Ondřej, KARAFIÁT Martin, GRÉZL František, ČERNOCKÝ Jan, VAN LEEUWEN David, BRÜMMER Niko and STRASHEIM Albert. STBU system for the NIST 2006 speaker recognition evaluation. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, pp. 221-224. ISBN 1-4244-0728-1.
2006
- MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr and ČERNOCKÝ Jan. Brno University of Technology System for NIST 2005 Language Recognition Evaluation. In: Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop. San Juan, 2006, pp. 57-64. ISBN 1-4244-0472-X.
- BURGET Lukáš, MATĚJKA Pavel and ČERNOCKÝ Jan. Discriminative Training Techniques for Acoustic Language Identification. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 209-212.
- SCHWARZ Petr, MATĚJKA Pavel and ČERNOCKÝ Jan. Hierarchical structures of neural networks for phoneme recognition. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 325-328.
- FAPŠO Michal, SMRŽ Pavel, SCHWARZ Petr, SZŐKE Igor, SCHWARZ Milan, ČERNOCKÝ Jan, KARAFIÁT Martin and BURGET Lukáš. Information Retrieval from Spoken Documents. In: Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006, pp. 410-416. ISBN 3-540-32205-1.
- MATĚJKA Pavel, SCHWARZ Petr, BURGET Lukáš and ČERNOCKÝ Jan. Use of anti-models to further improve state-of-the-art PRLM language recognition system. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 197-200.