Project Details
AMIDA - Augmented Multi-party Interaction with Distance Access
Project Period: 1 October 2006 - 31 December 2009
Project Type: grant
Code: IST-033812-AMIDA
Agency: The Information Society Technologies (IST) 6th Framework programme
Program:
Keywords: speech recognition, video processing, teleconference
AMIDA will develop and expand the research vision that we initiated in the previous (still ongoing) EU-IST AMI Integrated Project, to understand better and build new support for human communication. The ground-breaking research that we shall undertake in AMIDA will span several traditionally separate disciplines, including:
- Qualitative human analysis and human factors;
- Audio-video processing, including unconstrained speech recognition and natural scene analysis;
- Multimodal structure and content analysis, including the modelling of individuals and groups, through the joint processing of multiple (multimodal) information channels (audio, visual, slides, handwriting, and white board activity);
- HCI, application prototyping, evaluation, and system integration.
The AMIDA research work will build directly upon the recognized achievements and large multimodal corpora (becoming a standard reference in the area of multimodal processing) resulting from AMI. However, there will also be a very challenging shift in emphasis to live meetings with remote participants, using affordable commodity sensors (such as webcams and cheaper microphones), and targeting the development of advanced videoconferencing systems featuring new functionalities such as (1) filtering, searching and browsing; (2) remote monitoring; (3) interactive accelerated playback; (4) meeting support; and (5) shared context and presence. While addressing additional scientific challenges (such as real-time processing and processing of lower quality audio and visual signals), AMIDA will also raise the exploitation transfer potential through genuine integration of the AMIDA industrial partners collaborating on common prototypes and applications. Finally, through its "Community of Interest" (CoI), AMIDA will also actively engage beyond the consortium to spread awareness and knowledge.
Burget Lukáš, doc. Ing., Ph.D. (UPGM FIT VUT), team leader
Černocký Jan, prof. Dr. Ing. (UPGM FIT VUT), team leader
Publications
2010
- BERAN Vítězslav, HEROUT Adam and ZEMČÍK Pavel. On-line Video Synchronization Based on Visual Vocabularies. In: Proceedings of WSCG'10. 18th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, WSCG 2010. Plzeň: University of West Bohemia in Pilsen, 2010, pp. 31-34. ISBN 978-80-86943-88-6.
- ROSE Richard, NOROUZIAN Atta, REDDY Aarthi, COY Andre, GUPTA Vishwa and KARAFIÁT Martin. Subword-based spoken term detection in audio course lectures. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 5282-5285. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
- HAIN Thomas, BURGET Lukáš, DINES John, GARNER Phillip N., EL HANNANI Asmaa, HUIJBREGTS Marijn, KARAFIÁT Martin, LINCOLN Mike and WAN Vincent. The AMIDA 2009 Meeting Transcription System. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 358-361. ISBN 978-1-61782-123-3. ISSN 1990-9772.
- SANTHOSH Kumar Chellappan Pillai, LI Haizhou, TONG Rong, MATĚJKA Pavel, BURGET Lukáš and ČERNOCKÝ Jan. Tuning phone decoders for language identification. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2010. Dallas: IEEE Signal Processing Society, 2010, pp. 5010-5013. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
2009
- CHMELAŘ Petr, BERAN Vítězslav, HEROUT Adam, HRADIŠ Michal, ŘEZNÍČEK Ivo and ZEMČÍK Pavel. Brno University of Technology at TRECVid 2009. In: TRECVID 2009: Participant Notebook Papers and Slides. Gaithersburg, MD: National Institute of Standards and Technology, 2009, pp. 1-11.
- KOCKMANN Marcel, BURGET Lukáš and ČERNOCKÝ Jan. Brno University of Technology System for Interspeech 2009 Emotion Challenge. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 348-351. ISSN 1990-9772.
- BURGET Lukáš, FAPŠO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIÁT Martin, KOCKMANN Marcel, MATĚJKA Pavel, SCHWARZ Petr and ČERNOCKÝ Jan. BUT system for NIST 2008 speaker recognition evaluation. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2335-2338. ISBN 978-1-61567-692-7. ISSN 1990-9772.
- BRÜMMER Niko, BURGET Lukáš, GLEMBEK Ondřej, HUBEIKA Valiantsina, JANČÍK Zdeněk, KARAFIÁT Martin, MATĚJKA Pavel, MIKOLOV Tomáš, PLCHOT Oldřich and STRASHEIM Albert. BUT-AGNITIO System Description for NIST Language Recognition Evaluation 2009. In: Proceedings NIST 2009 Language Recognition Evaluation Workshop. Baltimore, Maryland, USA: National Institute of Standards and Technology, 2009, pp. 1-7.
- GLEMBEK Ondřej, BURGET Lukáš, DEHAK Najim, BRÜMMER Niko and KENNY Patrick. Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis. In: Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009, p. 4. ISBN 978-1-4244-2354-5.
- GRÉZL František, KARAFIÁT Martin and BURGET Lukáš. Investigation into bottle-neck features for meeting speech recognition. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2947-2950. ISBN 978-1-61567-692-7. ISSN 1990-9772.
- BURGET Lukáš, MATĚJKA Pavel, HUBEIKA Valiantsina and ČERNOCKÝ Jan. Investigation into variants of Joint Factor Analysis for speaker recognition. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 1263-1266. ISBN 978-1-61567-692-7. ISSN 1990-9772.
- NIJHOLT Anton, ZWIERS Job and PEČIVA Jan. Mixed reality participants in smart meeting rooms and smart home environments. Personal and Ubiquitous Computing, vol. 13, no. 1, 2009, pp. 85-94. ISSN 1617-4909.
- BERAN Vítězslav, JURÁNEK Roman, MLÍCH Jozef, ŽÁK Pavel, HEROUT Adam and ZEMČÍK Pavel. On-Line Object Behaviour Analysis for Surveillance Systems. In: 10th Annual ICT Conference. Nairobi, 2009, p. 5.
- KOMBRINK Stefan, BURGET Lukáš, MATĚJKA Pavel, KARAFIÁT Martin and HEŘMANSKÝ Hynek. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 80-83. ISSN 1990-9772.
- GARNER Phillip N., DINES John, HAIN Thomas, EL HANNANI Asmaa, KARAFIÁT Martin, KORCHAGIN Danil, LINCOLN Mike, WAN Vincent and ZHANG Le. Real-Time ASR from Meetings. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2119-2122. ISSN 1990-9772.
- MLÍCH Jozef, ZEMČÍK Pavel and JIŘÍK Leoš. Trajectory classification using HMMs. In: WSCG 2009 Communication Papers. Plzeň: University of West Bohemia in Pilsen, 2009, pp. 67-72. ISBN 978-80-86943-94-7.
- MLÍCH Jozef. Wiimote Gesture Recognition. In: Proceedings of the 15th Conference and Competition STUDENT EEICT 2009 Volume 4. Brno: Faculty of Electrical Engineering and Communication BUT, 2009, pp. 344-349. ISBN 978-80-214-3870-5.
2008
- PLCHOT Oldřich, HUBEIKA Valiantsina, BURGET Lukáš, SCHWARZ Petr and MATĚJKA Pavel. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. In: Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008, pp. 477-483. ISBN 978-3-540-87390-7.
- GLEMBEK Ondřej, MATĚJKA Pavel, BURGET Lukáš and MIKOLOV Tomáš. Advances in Phonotactic Language Recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
- HEROUT Adam, KUBÍČEK Radek, ZEMČÍK Pavel and ŽÁK Pavel. Automatic Video Editing for Multimodal Meetings. In: Proceedings of International Conference on Computer Vision and Graphics 2008. Lecture Notes in Computer Science. Heidelberg: Springer Verlag, 2008, pp. 1-12. ISSN 0302-9743.
- CHMELAŘ Petr, BERAN Vítězslav, HEROUT Adam, HRADIŠ Michal, JURÁNEK Roman, LÁNÍK Aleš, MLÍCH Jozef, NAVRÁTIL Jan, ŘEZNÍČEK Ivo, ŽÁK Pavel and ZEMČÍK Pavel. Brno University of Technology at TRECVid 2008. In: Proceedings of TRECVID 2008. Gaithersburg: National Institute of Standards and Technology, 2008, pp. 1-16.
- MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej, SCHWARZ Petr, HUBEIKA Valiantsina, FAPŠO Michal, MIKOLOV Tomáš, PLCHOT Oldřich and ČERNOCKÝ Jan. BUT language recognition system for NIST 2007 evaluations. In: Proc. Interspeech 2008. Brisbane, Australia: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
- BURGET Lukáš, FAPŠO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIÁT Martin, KOCKMANN Marcel, MATĚJKA Pavel, SCHWARZ Petr and ČERNOCKÝ Jan. BUT system description: NIST SRE 2008. In: Proc. 2008 NIST Speaker Recognition Evaluation Workshop. Montreal: National Institute of Standards and Technology, 2008, pp. 1-4.
- BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, HANNEMANN Mirko, RASTROW Ariya, WHITE Christopher, KHUDANPUR Sanjeev, HEŘMANSKÝ Hynek and ČERNOCKÝ Jan. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008, p. 4. ISBN 1-4244-1484-9.
- KOCKMANN Marcel and BURGET Lukáš. Contour modeling of prosodic and acoustic features for speaker recognition. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, p. 4. ISBN 978-1-4244-3472-5.
- HUBEIKA Valiantsina, BURGET Lukáš, MATĚJKA Pavel and SCHWARZ Petr. Discriminative Training and Channel Compensation for Acoustic Language Recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
- KARAFIÁT Martin, BURGET Lukáš, HAIN Thomas and ČERNOCKÝ Jan. Discriminative training of narrow band - wide band adapted systems for meeting recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
- SZŐKE Igor, FAPŠO Michal, BURGET Lukáš and ČERNOCKÝ Jan. Hybrid word-subword decoding for spoken term detection. In: Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008, p. 4. ISBN 978-90-365-2697-5.
- SZŐKE Igor, BURGET Lukáš, ČERNOCKÝ Jan and FAPŠO Michal. Sub-word modeling of out of vocabulary words in spoken term detection. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, p. 4. ISBN 978-1-4244-3472-5.
- KOCKMANN Marcel and BURGET Lukáš. Syllable based Feature-Contours for Speaker Recognition. In: Proc. 14th International Workshop on Advances in Speech Technology. Maribor, 2008, p. 4.
- HEROUT Adam, BERAN Vítězslav, HRADIŠ Michal, POTÚČEK Igor, ZEMČÍK Pavel and CHMELAŘ Petr. TRECVID 2007 by the Brno Group. In: Proceedings of TRECVID 2007. Gaithersburg: National Institute of Standards and Technology, 2008, pp. 1-6. ISBN 978-1-59593-780-3.
2007
- BURGET Lukáš, MATĚJKA Pavel, SCHWARZ Petr, GLEMBEK Ondřej and ČERNOCKÝ Jan. Analysis of feature extraction and channel compensation in GMM speaker recognition system. IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 7, 2007, pp. 1979-1986. ISSN 1558-7916.
- KARAFIÁT Martin, BURGET Lukáš, ČERNOCKÝ Jan and HAIN Thomas. Application of CMLLR in narrow band wide band adapted systems. In: Proc. INTERSPEECH 2007. Antwerpen: International Speech Communication Association, 2007, p. 4. ISSN 1990-9772.
- MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej, SCHWARZ Petr, HUBEIKA Valiantsina, FAPŠO Michal, MIKOLOV Tomáš and PLCHOT Oldřich. BUT system description for NIST LRE 2007. In: Proc. 2007 NIST Language Recognition Evaluation Workshop. Orlando: National Institute of Standards and Technology, 2007, pp. 1-5.
- SZŐKE Igor, BURGET Lukáš and KARAFIÁT Martin. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno, 2007.
- POTÚČEK Igor, BERAN Vítězslav, SUMEC Stanislav and ZEMČÍK Pavel. Evaluation and comparison of tracking methods using meeting omnidirectional images. In: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI). Brno, 2007, p. 12.
- BRÜMMER Niko, BURGET Lukáš, ČERNOCKÝ Jan, GLEMBEK Ondřej, GRÉZL František, KARAFIÁT Martin, VAN LEEUWEN David, MATĚJKA Pavel, SCHWARZ Petr and STRASHEIM Albert. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 7, 2007, pp. 2072-2084. ISSN 1558-7916.
- GRANÁT Jiří, HEROUT Adam, HRADIŠ Michal and ZEMČÍK Pavel. Hardware Acceleration of AdaBoost Classifier. In: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI). Brno, 2007, pp. 1-12.
- HUBEIKA Valiantsina, BURGET Lukáš, MATĚJKA Pavel and ČERNOCKÝ Jan. Channel Compensation for Speaker Recognition. Brno, 2007.
- HUBEIKA Valiantsina, SZŐKE Igor, BURGET Lukáš and ČERNOCKÝ Jan. Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System. In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). Pilsen: Springer Verlag, 2007, pp. 1-6. ISBN 978-3-540-74627-0.
- GRÉZL František, KARAFIÁT Martin and ČERNOCKÝ Jan. Neural network topologies and bottle neck features in speech recognition. Brno, 2007.
- GRÉZL František, KARAFIÁT Martin, KONTÁR Stanislav and ČERNOCKÝ Jan. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, pp. 757-760. ISBN 1-4244-0728-1.
- ČERNOCKÝ Jan, SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, KOPECKÝ Jiří, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, OPARIN Ilya, SMRŽ Pavel and MATĚJKA Pavel. Search in speech for public security and defense. In: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07). Washington D.C.: IEEE Signal Processing Society, 2007, pp. 1-7. ISBN 1-4244-1226-9.
- FAPŠO Michal. Search in speech records. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 978-80-214-3410-3.
- ČERNOCKÝ Jan, BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, KARAFIÁT Martin, GLEMBEK Ondřej, KOPECKÝ Jiří, SZŐKE Igor, FAPŠO Michal, GRÉZL František, HUBEIKA Valiantsina and OPARIN Ilya. Search in speech, language identification and speaker recognition in Speech@FIT. In: Proc. 17th International Conference Radioelektronika, 2007. Brno: Department of Radioelectronics FEEC BUT, 2007, pp. 1-6. ISBN 978-80-214-3390-8.
- SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, MATĚJKA Pavel, KOPECKÝ Jiří and ČERNOCKÝ Jan. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno, 2007.
- MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr, GLEMBEK Ondřej, KARAFIÁT Martin, GRÉZL František, ČERNOCKÝ Jan, VAN LEEUWEN David, BRÜMMER Niko and STRASHEIM Albert. STBU system for the NIST 2006 speaker recognition evaluation. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, pp. 221-224. ISBN 1-4244-0728-1.
- HAIN Thomas, WAN Vincent, BURGET Lukáš, KARAFIÁT Martin, DINES John, VEPA Jithendra, GARAU Giulia and LINCOLN Mike. The AMI System for the Transcription of Speech in Meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, pp. 357-360. ISBN 1-4244-0728-1.
Software
2009
- A Compact Speech Recognition System for lectures in English, software, 2009
Authors: Karafiát Martin, Burget Lukáš, Glembek Ondřej
- Automatic SVM creator in SGE environment, software, 2009
Authors: Řezníček Ivo, Zemčík Pavel
- Automatic Video Editing Software, software, 2009
Authors: Sumec Stanislav, Zemčík Pavel, Kubíček Radek, Žák Pavel, Hradiš Michal, Navrátil Jan, Kajan Rudolf
- Camera Localization using RANSAC, software, 2009
Authors: Potúček Igor, Beran Vítězslav, Zemčík Pavel
- Lattice Spoken Term Detection toolkit (LatticeSTD), software, 2009
Authors: Szőke Igor, Fapšo Michal
- Object Detection Framework, software, 2009
Authors: Beran Vítězslav, Havel Jiří, Herout Adam, Hradiš Michal, Jošth Radovan, Juránek Roman, Polok Lukáš, Zemčík Pavel
- OmniView system, software, 2009
Authors: Potúček Igor, Sumec Stanislav, Polok Lukáš, Zemčík Pavel
- Video and Feature processing, software, 2009
Authors: Beran Vítězslav, Chmelař Petr, Řezníček Ivo, Herout Adam, Zemčík Pavel, Hradiš Michal, Juránek Roman, Bařina David
2007
- CVE Library, software, 2007
Authors: Pečiva Jan
2006
- System for Collaborative Data Sharing, software, 2006
Authors: Pečiva Jan