Project Details
Hlasové technologie v podpoře informační společnosti
Project Period: 1. 1. 2002 - 31. 12. 2004
Project Type: grant
Code: GA102/02/0124
Agency: Czech Science Foundation
English title
Voice technologies for support of information society
Keywords
speech processing, recognition, coding
Abstract
Voice technologies for support of information society
Team members
Černocký Jan, prof. Dr. Ing. (UPGM FIT VUT), research leader
Burget Lukáš, doc. Ing., Ph.D. (UPGM FIT VUT), team leader
Grézl František, Ing., Ph.D. (UPGM FIT VUT), team leader
Karafiát Martin, Ing., Ph.D. (UPGM FIT VUT), team leader
Motlíček Petr, doc. Ing., Ph.D. (UPGM FIT VUT), team leader
Schwarz Petr, Ing., Ph.D. (UPGM FIT VUT), team leader
Publications
2006
- FAPŠO Michal, SCHWARZ Petr, SZŐKE Igor, SMRŽ Pavel, SCHWARZ Milan, ČERNOCKÝ Jan, KARAFIÁT Martin and BURGET Lukáš. Search Engine for Information Retrieval from Speech Records. In: Proceedings of the Third International Seminar on Computer Treatment of Slavic and East European Languages. Bratislava, 2006, pp. 100-101. Detail
2005
- ČERNOCKÝ Jan and LAMPA Petr. Teaching signals - making it automatic, making it fun. In: Proc. Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005, p. 4. ISBN 80-214-2904-6. Detail
2004
- MATĚJKA Pavel, SZŐKE Igor, SCHWARZ Petr and ČERNOCKÝ Jan. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Lecture Notes in Computer Science, vol. 2004, no. 3206, p. 8. ISSN 0302-9743. Detail
- MATĚJKA Pavel, SZŐKE Igor, SCHWARZ Petr and ČERNOCKÝ Jan. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. In: Proceedings of 7th International Conference Text, Speech and Dialogue 2004. Brno: Springer Verlag, 2004, p. 8. ISBN 3-540-23049-1. Detail
- KARAFIÁT Martin, GRÉZL František and BURGET Lukáš. Combination of MFCC and TRAP features for LVCSR of meeting data. Martigny, 2004. Detail
- BURGET Lukáš. Combination of Speech Features Using Smoothed Heteroscedastic Linear Discriminant Analysis. In: Proc. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co., 2004, pp. 2549-2552. Detail
- GRÉZL František. Combinations of TRAP-based systems. In: Proc. Seventh International conference on Text, Speech and Dialogue. Brno: Faculty of Informatics MU, 2004, pp. 323-330. ISBN 3-540-23049-1. Detail
- MATĚJKA Pavel, ČERNOCKÝ Jan and SIGMUND Milan. Introduction to Automatic Language Identification. In: Conference Proceedings of Radioelektronika 2004. Brno: Slovak University of Technology in Bratislava, 2004, p. 4. ISBN 80-227-2017-8. Detail
- SZŐKE Igor and MOTLÍČEK Petr. Kódování řeči na velmi nízkých bitových rychlostech. GACR 102/02/0124 "Hlasové technologie v podpoře informační společnosti", souhrnný přehled aktivit řešitelských kolektivů. Praha: Faculty of Electrical Engineering, Czech Technical University, 2004. ISBN 80-01-02957-3. Detail
- BURGET Lukáš. Measurement of Complementarity of Recognition Systems. In: Proc. Seventh International conference on Text, Speech and Dialogue. Lecture Notes in Artificial Intelligence (LNAI) subseries of LNCS series as Volume 3206. Brno: Springer Verlag, 2004, pp. 283-290. ISBN 3-540-23049-1. Detail
- MOTLÍČEK Petr. Modelování spektra a časových trajektorií v rozpoznávání řeči. GACR 102/02/0124 "Hlasové technologie v podpoře informační společnosti", souhrnný přehled aktivit řešitelských kolektivů. Praha, 2004. ISBN 80-01-02957-3. Detail
- MOTLÍČEK Petr and ČERNOCKÝ Jan. Multimodal Phoneme Recognition of Meeting Data. In: 7th International Conference, TSD 2004 Brno, Czech Republic, September 2004 Proceedings. Brno: Springer Verlag, 2004, pp. 379-384. ISBN 3-540-23049-1. ISSN 0302-9743. Detail
- MOTLÍČEK Petr and ČERNOCKÝ Jan. Multimodal Phoneme Recognition of Meeting Data. Lecture Notes in Computer Science, vol. 2004, no. 3206, p. 6. ISSN 0302-9743. Detail
- FOUSEK Petr, SVOJANOVSKÝ Petr, GRÉZL František and HEŘMANSKÝ Hynek. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. In: Proc. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co., 2004, pp. 348-351. ISSN 1225-4111. Detail
- SCHWARZ Petr and MATĚJKA Pavel. Phoneme Recognition from a Long Temporal Context. Martigny, 2004. Detail
- SCHWARZ Petr, MATĚJKA Pavel and ČERNOCKÝ Jan. Phoneme Recognition from a Long Temporal Context. In: poster at JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Martigny: Institute for Perceptual Artificial Intelligence, 2004, pp. 1-1. Detail
- MOTLÍČEK Petr, BURGET Lukáš and ČERNOCKÝ Jan. PHONEME RECOGNITION OF MEETINGS USING AUDIO-VISUAL DATA. AMI Workshop. Martigny, 2004. Detail
- MATĚJKA Pavel. Review of Automatic Language Identification. In: Proceedings of 10th Conference and Competition STUDENT EEICT 2004 Volume 2. Brno, 2004, p. 5. ISBN 80-214-2635-7. Detail
- MOTLÍČEK Petr. Segmentace nahrávek živých jednání podle mluvčího. In: Sborník příspěvků a prezentací akce Odborné semináře 2004. REL03V. Brno: Department of Radioelectronics FEEC BUT, 2004, p. 28. Detail
- SZŐKE Igor. Speech units automatically generated by ergodic hidden Markov model. In: Proceedings of 10th Conference and Competition STUDENT EEICT 2004. Brno: Faculty of Electrical Engineering and Communication BUT, 2004, p. 5. Detail
- SCHWARZ Petr, MATĚJKA Pavel and ČERNOCKÝ Jan. Towards Lower Error Rates in Phoneme Recognition. Lecture Notes in Computer Science, vol. 2004, no. 3206, p. 8. ISSN 0302-9743. Detail
- SCHWARZ Petr, MATĚJKA Pavel and ČERNOCKÝ Jan. Towards Lower Error Rates in Phoneme Recognition. In: Proceedings of 7th International Conference Text, Speech and Dialogue 2004. Brno: Springer Verlag, 2004, p. 8. ISBN 3-540-23049-1. Detail
- SCHWARZ Petr, MATĚJKA Pavel and ČERNOCKÝ Jan. Towards Lower Error Rates In Phoneme Recognition. Lecture Notes in Computer Science, vol. 2004, no. 3206, pp. 465-472. ISBN 3-540-23049-1. ISSN 0302-9743. Detail
- MOTLÍČEK Petr. Visual Feature Extraction for Phoneme Recognition of Meetings. Brno: Department of Computer Graphics and Multimedia FIT BUT, 2004. Detail
2003
- MOTLÍČEK Petr and ČERNOCKÝ Jan. All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: University of West Bohemia in Pilsen, 2003, pp. 295-300. ISBN 3-540-20024-X. ISSN 0302-9743. Detail
- MOTLÍČEK Petr and ČERNOCKÝ Jan. Autoregressive Modeling based Feature Extraction for Aurora3 DSR Task. In: Proc. EUROSPEECH 2003. Geneva: Institute for Perceptual Artificial Intelligence, 2003, pp. 1801-1804. ISSN 1018-4074. Detail
- MOTLÍČEK Petr. Derivation of TRAPs in Auditory Domain. In: Proceedings of 9th Conference and Competition STUDENT EEICT 2003. Brno: Dean Office of FEEC BUT, 2003, pp. 598-602. ISBN 80-214-2379-X. Detail
- MOTLÍČEK Petr. Derivation of TRAPs in Auditory Domain. In: Proceedings of the International Conference and Competition. Brno: Faculty of Electrical Engineering and Communication BUT, 2003, pp. 315-319. ISBN 80-214-2401-X. Detail
- GRÉZL František. Effect of normalization on TRAP based systems in ASR. In: Proc. 13th International scientific conference Radioelektronika 2003. Brno: Department of Radioelectronics FEEC BUT, 2003, pp. 128-131. ISBN 80-214-2383-8. Detail
- GRÉZL František. Local time-frequency operators in TRAPs for speech recognition. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: University of West Bohemia in Pilsen, 2003, pp. 269-274. ISBN 3-540-20024-X. ISSN 0302-9743. Detail
- MOTLÍČEK Petr. Modeling of Spectra and Temporal Trajectories in Speech Processing. In: Sborník příspěvků a prezentací akce Odborné semináře 2003. REL02V. Brno: Department of Radioelectronics FEEC BUT, 2003, p. 28. Detail
- HEŘMANSKÝ Hynek, MATĚJKA Pavel and SCHWARZ Petr. On Use of Temporal Dynamics of Speech for Language Identification. In: Proceedings of Language Recognition Workshop 2003. NIST Gaithersburg, MD USA, 2003, pp. 56-62. Detail
- MATĚJKA Pavel, SCHWARZ Petr, GRÉZL František and ČERNOCKÝ Jan. Phoneme Classification using Temporal Patterns. In: Proc. 13th International scientific conference Radioelektronika 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003, pp. 1-4. ISBN 80-214-2383-8. Detail
- MATĚJKA Pavel, SCHWARZ Petr, HEŘMANSKÝ Hynek and ČERNOCKÝ Jan. Phoneme Recognition using Temporal Patterns. In: Proc. 6th International Conference Text, Speech and Dialogue, TSD2003. České Budějovice: Springer Verlag, 2003, pp. 465-472. ISBN 3-540-20024-X. Detail
- SCHWARZ Petr, MATĚJKA Pavel and ČERNOCKÝ Jan. Recognition of Phoneme Strings using TRAP Technique. In: Proceedings of 8th International Conference Eurospeech. Geneva: International Speech Communication Association, 2003, pp. 1-4. ISSN 1018-4074. Detail
- BURGET Lukáš and ČERNOCKÝ Jan. Recognition of Speech with Non-random Attributes. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: Springer Verlag, 2003, p. 6. ISBN 3-540-20024-X. ISSN 0302-9743. Detail
- ČERNOCKÝ Jan. Temporal processing for feature extraction in speech recognition. Vědecké spisy VUT. Edice Habilitační a inaugurační spisy, sv. 112. Brno: Publishing house of Brno University of Technology VUTIUM, 2003, pp. 1-30. ISBN 80-214-2395-1. Detail
- MOTLÍČEK Petr and ČERNOCKÝ Jan. Time-domain based Temporal Processing with Application of. In: Proc. EUROSPEECH 2003. Geneva: Institute for Perceptual Artificial Intelligence, 2003, pp. 821-824. ISSN 1018-4074. Detail
- KARAFIÁT Martin and GRÉZL František. Using MATLAB for Analysis of TRAP system. Radioengineering, vol. 2003, no. 4, pp. 38-41. ISSN 1210-2512. Detail
- JENDERKA Petr and VÍCHA Tomáš. Voice Activity Detection in Multimodal Meeting Manager. In: Proceedings of 9th Conference and Competition STUDENT EEICT 2003 Volume 3. Brno: Faculty of Electrical Engineering and Communication BUT, 2003, pp. 588-592. ISBN 80-214-2379-X. Detail
- SCHWARZ Petr. Would You Like To Make Your Programs Understand Human Voice? In: Proceedings of 9th Conference STUDENT EEICT 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003, pp. 231-235. ISBN 80-214-2379-X. Detail
2002
- BAUDOIN Genevieve, CAPMAN Francois, ČERNOCKÝ Jan, EL CHAMI Fadi, CHARBIT Maurice, CHOLLET Gerard and PETROVSKA-DELACRETAZ Dijana. Advances in very low bit-rate speech coding using recognition and synthesis techniques. Lecture Notes in Computer Science, vol. 2002, no. 2448, pp. 269-276. ISBN 3-540-44129-8. ISSN 0302-9743. Detail
- MOTLÍČEK Petr. Application of Mel-scale Filter bank for Noise Estimation in Speech Processing. In: 12th International Czech-Slovak Scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2. Detail
- GRÉZL František. Classifiers in speech recognition systems based on TRAPS. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering and Communication BUT, 2002, pp. 74-77. ISBN 80-214-2116-9. Detail
- KARAFIÁT Martin and ČERNOCKÝ Jan. Context dependent Hidden Markov models in recognition of Czech. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2. Detail
- ČERNOCKÝ Jan and KARAFIÁT Martin. Differences between context dependent and context independent Hidden Markov Models for recognition of Czech. In: Proc. of 8th student conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering TUB, 2002, p. 5. ISBN 80-214-2116-9. Detail
- BURGET Lukáš, MOTLÍČEK Petr, GRÉZL František and JAIN Pratibha. Distributed speech recognition. Radioengineering, vol. 2002, no. 4, pp. 12-16. ISSN 1210-2512. Detail
- GARUDADRI Harinath, HEŘMANSKÝ Hynek, MORGAN Nelson, BENITEZ Carmen, BURGET Lukáš, KAJAREKAR Sachin, GRÉZL František, JAIN Pratibha and MOTLÍČEK Petr. Distributed Voice Recognition System Utilizing Multistream Network Feature Processing. San Diego: Qualcomm, 2002. Detail
- MOTLÍČEK Petr and BURGET Lukáš. Efficient Noise Estimation and its Application for Robust Speech Recognition. In: 5th International Conference, TSD 2002 Brno, Czech Republic, September 2002 Proceedings. Berlin: Springer Verlag, 2002, pp. 229-236. ISBN 3-540-44129-8. Detail
- MOTLÍČEK Petr. Feature Extraction in Speech Coding and Recognition. Portland: Oregon Graduate Institute of Science and Technology, 2002. Detail
- MATĚJKA Pavel and ČERNOCKÝ Jan. Feature gaussianization in speech recognition. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2. Detail
- GRÉZL František, BURGET Lukáš, JAIN Pratibha and ČERNOCKÝ Jan. Improving TRAPS features using LDA. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2. Detail
- SCHWARZ Petr and ČERNOCKÝ Jan. Keyword detection in Czech fluent speech. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2. Detail
- SCHWARZ Petr. Modifications of Viterbi algorithms for keyword detection. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering and Communication BUT, 2002, p. 4. ISBN 80-214-2116-9. Detail
- MOTLÍČEK Petr and BURGET Lukáš. Noise estimation for efficient speech enhancement and robust speech recognition. In: Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002, pp. 1033-1036. ISBN 1-876346-42-6. Detail
- MOTLÍČEK Petr. Noise Estimation for Spectral Subtraction in Speech Processing. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering and Communication BUT, 2002, p. 4. ISBN 80-214-2116-9. Detail
- BURGET Lukáš, DUPONT Stephane, GARUDADRI Harinath, GRÉZL František, HEŘMANSKÝ Hynek, JAIN Pratibha, KAJAREKAR Sachin and MORGAN Nelson. QUALCOMM-ICSI-OGI Features for ASR. In: Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002, p. 4. ISBN 1-876346-42-6. Detail
- MATĚJKA Pavel, SCHWARZ Petr, KARAFIÁT Martin and ČERNOCKÝ Jan. Some like it Gaussian... In: Proc. 5th International Conference Text, Speech and Dialogue, TSD2002. Lecture notes in artificial intelligence 2448. Berlin: Springer Verlag, 2002, pp. 321-324. ISBN 3-540-44129-8. Detail
- ČERNOCKÝ Jan. Temporal processing for feature extraction in speech recognition, habilitation thesis. Brno, 2002. Detail
- ČERNOCKÝ Jan. Units for automatic language independent speech processing. In: Proc. LREC 2002 - workshop on Portability issues in human language technologies. Las Palmas: European Language Resources Association, 2002, pp. 7-13. Detail
Products
2008
- Phoneme recognizer based on long temporal context, software, 2008
Authors: Schwarz Petr, Matějka Pavel, Burget Lukáš, Glembek Ondřej Detail