Výzkumná skupina dolování dat z řeči BUT Speech@FIT

2024

BENEŠ Karel, KOCOUR Martin a BURGET Lukáš. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 11276-11280. ISBN 979-8-3503-4485-1.
Detail

HAN Jiangyu, LANDINI Federico Nicolás, ROHDIN Johan A., DIEZ Sánchez Mireia, BURGET Lukáš, CAO Yuhang, LU Heng a ČERNOCKÝ Jan. Diacorrect: Error Correction Back-End for Speaker Diarization. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, s. 11181-11185. ISBN 979-8-3503-4485-1.
Detail

PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, PLCHOT Oldřich, ARAKI Shoko a ČERNOCKÝ Jan. Target Speech Extraction with Pre-Trained Self-Supervised Learning Models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 10421-10425. ISBN 979-8-3503-4485-1.
Detail

PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, ASHIHARA Takanori, PLCHOT Oldřich, ARAKI Shoko a ČERNOCKÝ Jan. Probing Self-Supervised Learning Models With Target Speech Extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 535-539. ISBN 979-8-3503-7451-3.
Detail

KLEMENT Dominik, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, SILNOVA Anna, DELCROIX Marc a TAWARA Naohiro. Discriminative Training of VBx Diarization. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, s. 11871-11875. ISBN 979-8-3503-4485-1.
Detail

LANDINI Federico Nicolás, DIEZ Sánchez Mireia, STAFYLAKIS Themos a BURGET Lukáš. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio, Speech, and Language Processing, roč. 32, č. 7, 2024, s. 3450-3465. ISSN 1558-7916.
Detail

WANNER Leo, ČERNOCKÝ Jan, EGOROVA Ekaterina, KLUSCH Matthias a MAVROPOULOS Athanasios a kol. Support of Migrant Reception, Integration, and Social Inclusion by Intelligent Technologies. Information, roč. 15, č. 11, 2024, s. 1-33. ISSN 2078-2489.
Detail

2023

PENG Junyi, PLCHOT Oldřich, STAFYLAKIS Themos, MOŠNER Ladislav, BURGET Lukáš a ČERNOCKÝ Jan. An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification. In: 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, s. 555-562. ISBN 978-1-6654-7189-3.
Detail

STAFYLAKIS Themos, MOŠNER Ladislav, KAKOUROS Sofoklis, PLCHOT Oldřich, BURGET Lukáš a ČERNOCKÝ Jan. Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations. In: 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, s. 1136-1143. ISBN 978-1-6654-7189-3.
Detail

SILNOVA Anna, SLAVÍČEK Josef, MOŠNER Ladislav, KLČO Michal, PLCHOT Oldřich, MATĚJKA Pavel, PENG Junyi, STAFYLAKIS Themos a BURGET Lukáš. ABC System Description for NIST LRE 2022. In: Proceedings of NIST LRE 2022 Workshop. Washington DC: National Institute of Standards and Technology, 2023, s. 1-5.
Detail

ZULUAGA-GOMEZ Juan, SARFJOO Seyyed Saeed, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLÍČEK Petr, ONDŘEJ Karel, OHNEISER Oliver a HELMKE Hartmut. BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications. In: IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, s. 633-640. ISBN 978-1-6654-7189-3.
Detail

ZULUAGA-GOMEZ Juan, PRASAD Amrutha, NIGMATULINA Iuliia, SARFJOO Seyyed Saeed, MOTLÍČEK Petr, KLEINERT Matthias, HELMKE Hartmut, OHNEISER Oliver a ZHAN Qingran. How Does Pre-Trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? an Extensive Benchmark on Air Traffic Control Communications. In: IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, s. 205-212. ISBN 978-1-6654-7189-3.
Detail

YUSUF Bolaji, GOURAV Aditya, GANDHE Ankur a BULYKO Ivan. On-the-Fly Text Retrieval for end-to-end ASR Adaptation. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, s. 1-5. ISBN 978-1-7281-6327-7.
Detail

LANDINI Federico Nicolás, DIEZ Sánchez Mireia, LOZANO Díez Alicia a BURGET Lukáš. Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, s. 1-5. ISBN 978-1-7281-6327-7.
Detail

SILNOVA Anna, BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez a BURGET Lukáš. Toroidal Probabilistic Spherical Discriminant Analysis. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, s. 1-5. ISBN 978-1-7281-6327-7.
Detail

PENG Junyi, STAFYLAKIS Themos, GU Rongzhi, PLCHOT Oldřich, MOŠNER Ladislav, BURGET Lukáš a ČERNOCKÝ Jan. Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023, s. 1-5. ISBN 978-1-7281-6327-7.
Detail

KAKOUROS Sofoklis, STAFYLAKIS Themos, MOŠNER Ladislav a BURGET Lukáš. Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, s. 1-5. ISBN 978-1-7281-6327-7.
Detail

KESIRAJU Santosh, BENEŠ Karel, TIKHONOV Maksim a ČERNOCKÝ Jan. BUT Systems for IWSLT 2023 Marathi - Hindi Low Resource Speech Translation Task. In: 20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference. Toronto (in-person and online): Association for Computational Linguistics, 2023, s. 227-234. ISBN 978-1-959429-84-5.
Detail

YUSUF Bolaji, ČERNOCKÝ Jan a SARAÇLAR Murat. End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 31, č. 08, 2023, s. 3070-3080. ISSN 2329-9290.
Detail

YU Dong, GONG Yifan, PICHENY Michael Alan, RAMABHADRAN Bhuvana, HAKKANI-TÜR Dilek, PRASAD Rohit, ZEN Heiga, SKOGLUND Jan, ČERNOCKÝ Jan, BURGET Lukáš a MOHAMED Abdelrahman. Twenty-Five Years of Evolution in Speech and Language Processing. IEEE Signal Processing Magazine, roč. 40, č. 5, 2023, s. 27-39. ISSN 1558-0792.
Detail

ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, OCHIAI Tsubasa, ČERNOCKÝ Jan, KINOSHITA Keisuke a YU Dong. Neural Target Speech Extraction: An overview. IEEE Signal Processing Magazine, roč. 40, č. 3, 2023, s. 8-29. ISSN 1558-0792.
Detail

SKOWRON Marcin, BACKFRIED Gerhard, NAVAS Eva, BERZINŠ Aivars, VAN Den Bogaert Joachim, DE Jong Franciska, DEMARCO Andrea, POLÁK Peter, KOVÁČ Marek, POLÁK Peter, ROHDIN Johan A., ROSNER Michael, SANCHEZ Jon, SARATXAGA Ibon a SCHWARZ Petr. Deep Dive Speech Technology. European Language Equality. Cham: Springer Nature Switzerland AG, 2023, s. 289-312. ISBN 978-3-031-28819-7.
Detail

MOŠNER Ladislav, PLCHOT Oldřich, PENG Junyi, BURGET Lukáš a ČERNOCKÝ Jan. Multi-Channel Speech Separation with Cross-Attention and Beamforming. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, s. 1693-1697. ISSN 1990-9772.
Detail

KESIRAJU Santosh, SARVAŠ Marek, PAVLÍČEK Tomáš, MACAIRE Cécile a CIUBA Alejandro. Strategies for Improving Low Resource Speech to Text Translation Relying on Pre-trained ASR Models. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, s. 2148-2152. ISSN 1990-9772.
Detail

DELCROIX Marc, TAWARA Naohiro, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, SILNOVA Anna, OGAWA Atsunori, NAKATANI Tomohiro, BURGET Lukáš a ARAKI Shoko. Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, s. 3477-3481. ISSN 1990-9772.
Detail

MATĚJKA Pavel, SILNOVA Anna, SLAVÍČEK Josef, MOŠNER Ladislav, PLCHOT Oldřich, KLČO Michal, PENG Junyi, STAFYLAKIS Themos a BURGET Lukáš. Description and Analysis of ABC Submission to NIST LRE 2022. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, s. 511-515. ISSN 1990-9772.
Detail

PENG Junyi, PLCHOT Oldřich, STAFYLAKIS Themos, MOŠNER Ladislav, BURGET Lukáš a ČERNOCKÝ Jan. Improving Speaker Verification with Self-Pretrained Transformer Models. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, s. 5361-5365. ISSN 1990-9772.
Detail

ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, PRASAD Amrutha, MOTLÍČEK Petr, KHALIL Driss, MADIKERI Srikanth, TART Allan, SZŐKE Igor, LENDERS Vincent, RIGAULT Mickael a CHOUKRI Khalid. Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding. Aerospace, roč. 2023, č. 10, s. 1-33. ISSN 2226-4310.
Detail

ŘIHÁČEK Tomáš, NEHYBA Jan, ČEVELÍČEK Michal, POLOK Alexander, MATĚJKA Pavel a DOLEŽAL Petr. DeePsy: Představení online nástroje pro zpětnou vazbu v psychoterapii. Psychoterapie. Masarykova univerzita AN FL, roč. 17, č. 1, 2023, s. 1-11. ISSN 1802-3983.
Detail

KHALIL Driss, PRASAD Amrutha, MOTLÍČEK Petr, ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, MADIKERI Srikanth a SCHUEPBACH Christof. An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain. Aerospace, roč. 10, č. 10, 2023, s. 1-14. ISSN 2226-4310.
Detail

NIGMATULINA Iuliia, MADIKERI Srikanth, VILLATORO-TELLO Esaú, MOTLÍČEK Petr, ZULUAGA-GOMEZ Juan, PANDIA Karthick a GANAPATHIRAJU Aravind. Implementing contextual biasing in GPU decoder for online ASR. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, s. 4494-4498. ISSN 1990-9772.
Detail

BURDISSO Sergio, VILLATORO-TELLO Esaú, MADIKERI Srikanth a MOTLÍČEK Petr. Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, s. 3617-3621. ISSN 1990-9772.
Detail

MAI Florian, ZULUAGA-GOMEZ Juan, PARCOLLET Titouan a MOTLÍČEK Petr. HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, s. 2213-2217. ISSN 1990-9772.
Detail

VILLATORO-TELLO Esaú, MADIKERI Srikanth, ZULUAGA-GOMEZ Juan, SHARMA Bidisha, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia, MOTLÍČEK Petr, IVANOV Alexei V. a GANAPATHIRAJU Aravind. Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023, s. 1-5. ISBN 978-1-7281-6327-7.
Detail

VANDERREYDT Geoffroy, PRASAD Amrutha, KHALIL Driss, MADIKERI Srikanth, DEMUYNCK Kris a MOTLÍČEK Petr. Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei: IEEE Signal Processing Society, 2023, s. 1-7. ISBN 979-8-3503-0689-7.
Detail

MOTLÍČEK Petr, PRASAD Amrutha, NIGMATULINA Iuliia, HELMKE Hartmut, OHNEISER Oliver a KLEINERT Matthias. Automatic Speech Analysis Framework for ATC Communication in HAAWAII. In: Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023, s. 1-9.
Detail

HELMKE Hartmut, KLEINERT Matthias, AHRENHOLD Nils, EHR Heiko, MÜHLHAUSEN Thorsten, PINSKA Chauvin Ella, OHNEISER Oliver, KLAMERT Lucas, MOTLÍČEK Petr, PRASAD Amrutha, ZULUAGA-GOMEZ Juan a DOKIC Jelena. Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers' Workload. In: Proceedings of ATM Seminar. Savannah, Georgia: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2023, s. 1-11.
Detail

BHATTACHARJEE Mrinmoy, MOTLÍČEK Petr, NIGMATULINA Iuliia, HELMKE Hartmut, OHNEISER Oliver, KLEINERT Matthias a EHR Heiko. Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training. In: Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023, s. 1-8.
Detail

ZULUAGA-GOMEZ Juan, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLÍČEK Petr a KLEINERT Matthias. A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers. Aerospace, roč. 10, č. 5, 2023, s. 1-25. ISSN 2226-4310.
Detail

2022

LANDINI Federico Nicolás, PROFANT Ján, DIEZ Sánchez Mireia a BURGET Lukáš. Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks. Computer Speech and Language, roč. 71, č. 101254, 2022, s. 1-16. ISSN 0885-2308.
Detail

BURGET Lukáš a BOJAR Ondřej. NEUREM3 Interim Research Report. Brno: Ústav počítačové grafiky a multimédií FIT VUT v Brně, 2022.
Detail

KIŠŠ Martin, KOHÚT Jan, BENEŠ Karel a HRADIŠ Michal. Importance of Textlines in Historical Document Classification. In: Uchida, S., Barney, E., Eglin, V. (eds) Document Analysis Systems. Lecture Notes in Computer Science, roč. 13237. La Rochelle: Springer Nature Switzerland AG, 2022, s. 158-170. ISBN 978-3-031-06554-5.
Detail

YUSUF Bolaji, GANDHE Ankur a SOKOLOV Alex. Usted: Improving ASR with a Unified Speech and Text Encoder-Decoder. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 8297-8301. ISBN 978-1-6654-0540-9.
Detail

MOŠNER Ladislav, PLCHOT Oldřich, BURGET Lukáš a ČERNOCKÝ Jan. Multisv: Dataset for Far-Field Multi-Channel Speaker Verification. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 7977-7981. ISBN 978-1-6654-0540-9.
Detail

MOŠNER Ladislav, PLCHOT Oldřich, BURGET Lukáš a ČERNOCKÝ Jan. Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 7982-7986. ISBN 978-1-6654-0540-9.
Detail

HAN Jiangyu, LONG Yanhua, BURGET Lukáš a ČERNOCKÝ Jan. DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 7292-7296. ISBN 978-1-6654-0540-9.
Detail

ONDEL Yang Lucas Antoine Francois, LAM-YEE-MUI L'ea-Marie, KOCOUR Martin, CORRO Caio Filippo a BURGET Lukáš. GPU-Accelerated Forward-Backward Algorithm with Application to Lattice-Free MMI. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 8417-8421. ISBN 978-1-6654-0540-9.
Detail

BLATT Alexander, KOCOUR Martin, VESELÝ Karel, SZŐKE Igor a KLAKOW Dietrich. Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 8357-8361. ISBN 978-1-6654-0540-9.
Detail

NIGMATULINA Iuliia, ZULUAGA-GOMEZ Juan, PRASAD Amrutha, SARFJOO Saeed a MOTLÍČEK Petr. A Two-Step Approach to Leverage Contextual Data: Speech Recognition in Air-Traffic Communications. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, s. 6282-6286. ISBN 978-1-6654-0540-9.
Detail

ONDEL Yang Lucas Antoine Francois, YUSUF Bolaji, BURGET Lukáš a SARAÇLAR Murat. Non-Parametric Bayesian Subspace Models for Acoustic Unit Discovery. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 30, č. 5, 2022, s. 1902-1917. ISSN 2329-9290.
Detail

EGOROVA Ekaterina, VYDANA Hari K., BURGET Lukáš a ČERNOCKÝ Jan. Spelling-Aware Word-Based End-to-End ASR. IEEE Signal Processing Letters, roč. 29, č. 29, 2022, s. 1729-1733. ISSN 1558-2361.
Detail

SILNOVA Anna, STAFYLAKIS Themos, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej a BRUMMER Johan Nikolaas Langenhoven. Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, s. 9-16.
Detail

PENG Junyi, ZHANG Chunlei, ČERNOCKÝ Jan a YU Dong. Progressive contrastive learning for self-supervised text-independent speaker verification. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, s. 17-24.
Detail

ALAM Jahangir, BURGET Lukáš, GLEMBEK Ondřej, MATĚJKA Pavel, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna a STAFYLAKIS Themos a kol. Development of ABC systems for the 2021 edition of NIST Speaker Recognition evaluation. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, s. 346-353.
Detail

SOLEWICZ Yosef, COHEN Noa, ROHDIN Johan A., MADIKERI Srikanth a ČERNOCKÝ Jan. Speaker recognition on mono-channel telephony recordings. In: Proceedings of Odyssey 2022. Beijing: International Speech Communication Association, 2022, s. 193-199.
Detail

BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez, MOŠNER Ladislav, SILNOVA Anna, PLCHOT Oldřich, STAFYLAKIS Themos a BURGET Lukáš. Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 1446-1450. ISSN 1990-9772.
Detail

LANDINI Federico Nicolás, LOZANO Díez Alicia, DIEZ Sánchez Mireia a BURGET Lukáš. From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 5095-5099. ISSN 1990-9772.
Detail

STAFYLAKIS Themos, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, BURGET Lukáš a ČERNOCKÝ Jan. Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 605-609. ISSN 1990-9772.
Detail

PENG Junyi, GU Rongzhi, MOŠNER Ladislav, PLCHOT Oldřich, BURGET Lukáš a ČERNOCKÝ Jan. Learnable Sparse Filterbank for Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 5110-5114. ISSN 1990-9772.
Detail

KOCOUR Martin, ŽMOLÍKOVÁ Kateřina, ONDEL Yang Lucas Antoine Francois, ŠVEC Ján, DELCROIX Marc, OCHIAI Tsubasa, BURGET Lukáš a ČERNOCKÝ Jan. Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 4955-4959. ISSN 1990-9772.
Detail

BASKAR Murali K., ROSENBERG Andrew, RAMABHADRAN Bhuvana a ZHANG Yu. Reducing Domain mismatch in Self-supervised speech pre-training. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 3028-3032. ISSN 1990-9772.
Detail

BASKAR Murali K., HERZIG Tim, NGUYEN Diana, DIEZ Sánchez Mireia, POLZEHL Tim, BURGET Lukáš a ČERNOCKÝ Jan. Speaker adaptation for Wav2vec2 based dysarthric ASR. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 3403-3407. ISSN 1990-9772.
Detail

DELCROIX Marc, KINOSHITA Keisuke, OCHIAI Tsubasa, ŽMOLÍKOVÁ Kateřina, SATO Hiroshi a NAKATANI Tomohiro. Listen only to me! How well can target speech extraction handle false alarms?. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, s. 216-220. ISSN 1990-9772.
Detail

ŠVEC Ján, ŽMOLÍKOVÁ Kateřina, KOCOUR Martin, DELCROIX Marc, OCHIAI Tsubasa, MOŠNER Ladislav a ČERNOCKÝ Jan. Analysis of impact of emotions on target speech extraction and speech separation. In: Proceedings of The 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022). Bamberg: IEEE Signal Processing Society, 2022, s. 1-5. ISBN 978-1-6654-6867-1.
Detail

DE Benito Gorron Diego, ŽMOLÍKOVÁ Kateřina a TORRE Toledano Doroteo. Source Separation for Sound Event Detection in domestic environments using jointly trained models. In: Proceedings of The 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022). Bamberg: IEEE Signal Processing Society, 2022, s. 1-5. ISBN 978-1-6654-6867-1.
Detail

KOCOUR Martin, UMESH Jahnavi, KARAFIÁT Martin, ŠVEC Ján, LOPEZ Fernando, BENEŠ Karel, DIEZ Sánchez Mireia, SZŐKE Igor, LUQUE Jordi, VESELÝ Karel, BURGET Lukáš a ČERNOCKÝ Jan. BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. In: Proceedings of IberSpeech 2022. Granada: International Speech Communication Association, 2022, s. 276-280.
Detail

DVOŘÁKOVÁ Martina, HRADIŠ Michal, ŽABIČKA Petr, KOHÚT Jan, KIŠŠ Martin a BENEŠ Karel. Využití PERO OCR při přepisu rukopisů. Archivní časopis, roč. 72, č. 1, 2022, s. 14-27. ISSN 0004-0398.
Detail

NADIMPALLI Vijaya Lakshmi V., KESIRAJU Santosh, BANKA Rohith, KETHIREDDY Rashmi a GANGASHETTY Suryakanth V. Resources and Benchmarks for Keyword Search in Spoken Audio From Low-Resource Indian Languages. IEEE Access, roč. 10, č. 2022, 2022, s. 34789-34799. ISSN 2169-3536.
Detail

BASKAR Murali K., ROSENBERG Andrew, RAMABHADRAN Bhuvana, ZHANG Yu a MORENO Pedro. Ask2Mask: Guided Data Selection for Masked Speech Modeling. IEEE Journal of Selected Topics in Signal Processing, roč. 16, č. 6, 2022, s. 1357-1366. ISSN 1932-4553.
Detail

PRASAD Amrutha, ZULUAGA-GOMEZ Juan, MOTLÍČEK Petr, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia a VESELÝ Karel. Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator. In: Proceedings of the 12th SESAR Innovation Days. Budapest, 2022, s. 1-9.
Detail

PRASAD Amrutha, ZULUAGA-GOMEZ Juan, MOTLÍČEK Petr, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia, OHNEISER Oliver a HELMKE Hartmut. Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition. In: Proceedings of the 12th SESAR Innovation Days. Budapest, 2022, s. 1-9.
Detail

BOITO Marcely Z., YUSUF Bolaji, ONDEL Yang Lucas Antoine Francois, VILLAVICENCIO Aline a BESACIER Laurent. Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. In: Proceedings of the the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages. Marseile: European Language Resources Association, 2022, s. 1-9. ISBN 979-10-95546-91-7.
Detail

2021

KIŠŠ Martin, BENEŠ Karel a HRADIŠ Michal. AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions. In: Lladós J., Lopresti D., Uchida S. (eds) Document Analysis and Recognition - ICDAR 2021. Lecture Notes in Computer Science, roč. 12824. Lausanne: Springer Nature Switzerland AG, 2021, s. 463-477. ISBN 978-3-030-86336-4.
Detail

LANDINI Federico Nicolás, LOZANO Díez Alicia, BURGET Lukáš, DIEZ Sánchez Mireia, SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, GLEMBEK Ondřej, MATĚJKA Pavel, STAFYLAKIS Themos a BRUMMER Johan Nikolaas Langenhoven. BUT System Description for The Third DIHARD Speech Diarization Challenge. In: Proceedings available at Dihard Challenge Github. on-line by LDC and University of Pennsylvania, 2021, s. 1-5.
Detail

DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke a NAKATANI Tomohiro. Speaker activity driven neural speech extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Toronto: IEEE Signal Processing Society, 2021, s. 6099-6103. ISBN 978-1-7281-7605-5.
Detail

LANDINI Federico Nicolás, GLEMBEK Ondřej, MATĚJKA Pavel, ROHDIN Johan A., BURGET Lukáš, DIEZ Sánchez Mireia a SILNOVA Anna. Analysis of the BUT Diarization System for Voxconverse Challenge. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 5819-5823. ISBN 978-1-7281-7605-5.
Detail

VYDANA Hari K., KARAFIÁT Martin, ŽMOLÍKOVÁ Kateřina, BURGET Lukáš a ČERNOCKÝ Jan. Jointly Trained Transformers Models for Spoken Language Translation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 7513-7517. ISBN 978-1-7281-7605-5.
Detail

YUSUF Bolaji, ONDEL Yang Lucas Antoine Francois, BURGET Lukáš, ČERNOCKÝ Jan a SARAÇLAR Murat. A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 3710-3714. ISBN 978-1-7281-7605-5.
Detail

BASKAR Murali K., BURGET Lukáš, WATANABE Shinji, ASTUDILLO Ramon a ČERNOCKÝ Jan. Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 6753-6757. ISBN 978-1-7281-7605-5.
Detail

KARAFIÁT Martin, VESELÝ Karel, ČERNOCKÝ Jan, PROFANT Ján, NYTRA Jiří, HLAVÁČEK Miroslav a PAVLÍČEK Tomáš. Analysis of X-Vectors for Low-Resource Speech Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, s. 6998-7002. ISBN 978-1-7281-7605-5.
Detail

ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, BURGET Lukáš, NAKATANI Tomohiro a ČERNOCKÝ Jan. Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation. In: 2021 IEEE Spoken Language Technology Workshop, SLT 2021 - Proceedings. Shenzhen - virtual : IEEE Signal Processing Society, 2021, s. 889-896. ISBN 978-1-7281-7066-4.
Detail

KOCOUR Martin, CÁMBARA Guillermo, LUQUE Jordi, BONET David, FARRÚS Mireia, KARAFIÁT Martin, VESELÝ Karel a ČERNOCKÝ Jan. BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge. In: Proceedings of IberSPEECH 2021. Vallaloid: International Speech Communication Association, 2021, s. 113-117.
Detail

STAFYLAKIS Themos, ROHDIN Johan A. a BURGET Lukáš. Speaker embeddings by modeling channel-wise correlations. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, s. 501-505. ISSN 1990-9772.
Detail

PENG Junyi, QU Xiaoyang, WANG Jianzong, GU Rongzhi, XIAO Jing, BURGET Lukáš a ČERNOCKÝ Jan. ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, s. 511-515. ISSN 1990-9772.
Detail

ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, RAJ Desh, WATANABE Shinji a ČERNOCKÝ Jan. Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics. In: Proceedings of 2021 Interspeech. Brno: International Speech Communication Association, 2021, s. 1464-1468. ISSN 1990-9772.
Detail

BENEŠ Karel a BURGET Lukáš. Text Augmentation for Language Models in High Error Recognition Scenario. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, s. 1872-1876. ISSN 1990-9772.
Detail

EGOROVA Ekaterina, VYDANA Hari K., BURGET Lukáš a ČERNOCKÝ Jan. Out-of-Vocabulary Words Detection with Attention and CTC Alignments in an End-to-End ASR System. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 2901-2905. ISSN 1990-9772.
Detail

SZŐKE Igor, KESIRAJU Santosh, NOVOTNÝ Ondřej, KOCOUR Martin, VESELÝ Karel a ČERNOCKÝ Jan. Detecting English Speech in the Air Traffic Control Voice Communication. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 3286-3290. ISSN 1990-9772.
Detail

KOCOUR Martin, VESELÝ Karel, BLATT Alexander, ZULUAGA-GOMEZ Juan, SZŐKE Igor, ČERNOCKÝ Jan, KLAKOW Dietrich a MOTLÍČEK Petr. Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 3301-3305. ISSN 1990-9772.
Detail

ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, PRASAD Amrutha, MOTLÍČEK Petr, VESELÝ Karel, KOCOUR Martin a SZŐKE Igor. Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 3296-3300. ISSN 1990-9772.
Detail

YUSUF Bolaji, GOK Alican, GUNDOGDU Batuhan a SARAÇLAR Murat. End-to-End Open Vocabulary Keyword Search. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, s. 4388-4392. ISSN 1990-9772.
Detail

WANNER Leo, KLUSCH Matthias, MAVROPOULOS Athanasios, JAMIN Emmanuel, MARIN Puchades Victor, CASAMAYOR Gerard, ČERNOCKÝ Jan a EGOROVA Ekaterina a kol. Towards a Versatile Intelligent Conversational Agent as Personal Assistant for Migrants. In: The PAAMS Collection. PAAMS 2021: Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. . Lecture Notes in Computer Science book series , roč. 12946. Salamanca: Springer International Publishing, 2021, s. 316-327. ISBN 978-3-030-85739-4. ISSN 0302-9743.
Detail

HELMKE Hartmut, KLEINERT Matthias, SHETTY Shruthi, OHNEISER Oliver, EHR Heiko, PRASAD Amrutha, MOTLÍČEK Petr, VESELÝ Karel, ONDŘEJ Karel, SMRŽ Pavel, HARFMANN Julia a WINDISCH Christian a kol. Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety. In: Proceedings of ATM Seminar. on-line: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2021, s. 1-10.
Detail

KLEINERT Matthias, HELMKE Hartmut, SHETTY Shruthi, OHNEISER Oliver, EHR Heiko, PRASAD Amrutha, MOTLÍČEK Petr a HARFMANN Julia. Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning. In: Proceedings of DASC 2021. San Antonio, Texas: Institute of Electrical and Electronics Engineers, 2021, s. 1-9. ISBN 978-1-6654-3420-1.
Detail

HELMKE Hartmut, SHETTY Shruthi, KLEINERT Matthias, OHNEISER Oliver, EHR Heiko, MOTLÍČEK Petr, PRASAD Amrutha a WINDISCH Christian a kol. Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates. In: Proceedings of 11th SESAR Innovation Days 2021. Belgie, 2021, s. 1-8.
Detail

KOCOUR Martin, VESELÝ Karel, SZŐKE Igor, KESIRAJU Santosh, ZULUAGA-GOMEZ Juan, BLATT Alexander, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLÍČEK Petr, KLAKOW Dietrich, TART Allan, KOLČÁREK Pavel, ČERNOCKÝ Jan, CEVENINI Claudia, CHOUKRI Khalid, RIGAULT Mickael, LANDIS Fabian a SARFJOO Saeed a kol. Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data. In: Proceedings of 9th OpenSky Symposium 2021, OpenSky Network, Brussels, Belgium. Brussels: MDPI, 2021, s. 1-10. ISSN 2504-3900.
Detail

VYDANA Hari K., KARAFIÁT Martin, BURGET Lukáš a ČERNOCKÝ Jan. The IWSLT 2021 BUT Speech Translation Systems. In: Proceedings of 18th International Conference on Spoken Language Translation (IWSLT) . Bangkok, on-line: Association for Computational Linguistics, 2021, s. 75-83. ISBN 978-1-7138-3378-9.
Detail

ŘIHÁČEK Tomáš a MATĚJKA Pavel. Deep learning v psychoterapii: Strojová analýza nahrávek terapeutických sezení. E-psychologie, roč. 15, č. 3, 2021, s. 35-37. ISSN 1802-8853.
Detail

2020

ROHDIN Johan A., SILNOVA Anna, DIEZ Sánchez Mireia, PLCHOT Oldřich, MATĚJKA Pavel, BURGET Lukáš a GLEMBEK Ondřej. End-to-end DNN based text-independent speaker recognition for long and short utterances. Computer Speech and Language, roč. 2020, č. 59, s. 22-35. ISSN 0885-2308.
Detail

DIEZ Sánchez Mireia, BURGET Lukáš, LANDINI Federico Nicolás a ČERNOCKÝ Jan. Analysis of Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 28, č. 1, 2020, s. 355-368. ISSN 2329-9290.
Detail

MATĚJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš, ROHDIN Johan A., ZEINALI Hossein, MOŠNER Ladislav, SILNOVA Anna, NOVOTNÝ Ondřej, DIEZ Sánchez Mireia a ČERNOCKÝ Jan. 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE. Computer Speech and Language, roč. 2020, č. 63, s. 1-15. ISSN 0885-2308.
Detail

WANG Shuai, ROHDIN Johan A., PLCHOT Oldřich, BURGET Lukáš, YU Kai a ČERNOCKÝ Jan. Investigation of Specaugment for Deep Speaker Embedding Learning. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, s. 7139-7143. ISBN 978-1-5090-6631-5.
Detail

DELCROIX Marc, OCHIAI Tsubasa, ŽMOLÍKOVÁ Kateřina, KINOSHITA Keisuke, TAWARA Naohiro, NAKATANI Tomohiro a ARAKI Shoko. Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, s. 691-695. ISBN 978-1-5090-6631-5.
Detail

LANDINI Federico Nicolás, WANG Shuai, DIEZ Sánchez Mireia, BURGET Lukáš, MATĚJKA Pavel, ŽMOLÍKOVÁ Kateřina, MOŠNER Ladislav, SILNOVA Anna, PLCHOT Oldřich, NOVOTNÝ Ondřej, ZEINALI Hossein a ROHDIN Johan A. But System for the Second Dihard Speech Diarization Challenge. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, s. 6529-6533. ISBN 978-1-5090-6631-5.
Detail

DIEZ Sánchez Mireia, BURGET Lukáš, LANDINI Federico Nicolás, WANG Shuai a ČERNOCKÝ Jan. Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, s. 6519-6523. ISBN 978-1-5090-6631-5.
Detail

ŽMOLÍKOVÁ Kateřina, KOCOUR Martin, LANDINI Federico Nicolás, BENEŠ Karel, KARAFIÁT Martin, VYDANA Hari K., LOZANO Díez Alicia, PLCHOT Oldřich, BASKAR Murali K., ŠVEC Ján, MOŠNER Ladislav, MALENOVSKÝ Vladimír, BURGET Lukáš, YUSUF Bolaji, NOVOTNÝ Ondřej, GRÉZL František, SZŐKE Igor a ČERNOCKÝ Jan. BUT System for CHiME-6 Challenge. In: Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020, s. 1-3.
Detail

SILNOVA Anna, BRUMMER Johan Nikolaas Langenhoven, ROHDIN Johan A., STAFYLAKIS Themos a BURGET Lukáš. Probabilistic embeddings for speaker diarization. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, s. 24-31. ISSN 2312-2846.
Detail

MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A. a ČERNOCKÝ Jan. Utilizing VOiCES dataset for multichannel speaker verification with beamforming. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, s. 187-193. ISSN 2312-2846.
Detail

ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, DAHMANE Mohamed, DIEZ Sánchez Mireia, GLEMBEK Ondřej, LALONDE Marc, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MOŠNER Ladislav, NOISEUX Cédric, MONTEIRO Joao, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, ST-CHARLES Pierre-Luc, WANG Shuai a ZEINALI Hossein. Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, s. 289-295. ISSN 2312-2846.
Detail

KESIRAJU Santosh, PLCHOT Oldřich, BURGET Lukáš a GANGASHETTY Suryakanth V. Learning Document Embeddings Along With Their Uncertainties. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 2020, č. 28, s. 2319-2332. ISSN 2329-9290.
Detail

KOSIBA Matěj a BURGET Lukáš a kol. Multiwavelength classification of X-ray selected galaxy cluster candidates using convolutional neural networks. Monthly Notices of the Royal Astronomical Society, roč. 496, č. 4, 2020, s. 4141-4153. ISSN 1365-2966.
Detail

LOZANO Díez Alicia, SILNOVA Anna, PULUGUNDLA Bhargav, ROHDIN Johan A., VESELÝ Karel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej, NOVOTNÝ Ondřej a MATĚJKA Pavel. BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, s. 761-765. ISSN 1990-9772.
Detail

ZEINALI Hossein, LEE Kong Aik, ALAM Jahangir a BURGET Lukáš. SdSV Challenge 2020: Large-Scale Evaluation of Short-duration Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, s. 731-735. ISSN 1990-9772.
Detail

DUNBAR Ewan, KARADAYI Julien, BERNARD Mathieu, CAO Xuan-Nga, ALGAYRES Robin, ONDEL Lucas Antoine Francois, BESACIER Laurent, SAKTI Sakriani a DUPOUX Emmanuel. The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, s. 4831-4835. ISSN 1990-9772.
Detail

ZULUAGA-GOMEZ Juan, VESELÝ Karel, BLATT Alexander, MOTLÍČEK Petr, KLAKOW Dietrich, TART Allan, SZŐKE Igor, PRASAD Amrutha, SARFJOO Saeed, KOLČÁREK Pavel, KOCOUR Martin, ČERNOCKÝ Jan, CEVENINI Claudia, CHOUKRI Khalid, RIGAULT Mickael a LANDIS Fabian. Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications. In: Proceedings of the 8th OpenSky Symposium 2020. Brusel: MDPI, 2020, s. 1-10. ISSN 2504-3900.
Detail

ZULUAGA-GOMEZ Juan, MOTLÍČEK Petr, ZHAN Qingran, VESELÝ Karel a BRAUN Rudolf. Automatic Speech Recognition Benchmark for Air-Traffic Communications. In: Proceedings of Interspeech 2020. Shanghai: International Speech Communication Association, 2020, s. 2297-2301. ISSN 1990-9772.
Detail

SCHARENBORG Odette, BESACIER Laurent, BLACK Alan, HASEGAWA-JOHNSON Mark, METZE Florian, NEUBIG Graham, STÜKER Sebastian, GODARD Pierre, MÜLLER Markus, ONDEL Yang Lucas Antoine Francois, PALASKAR Shruti, ARTHUR Philip, CIANNELLA Francesco, DU Mingxing, LARSEN Elin, MERKX Danny, RIAD Rachid, WANG Liming a DUPOUX Emmanuel. Speech Technology for Unwritten Languages. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 2020, č. 28, s. 964-975. ISSN 2329-9290.
Detail

BURGET Lukáš, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, NOVOTNÝ Ondřej, PLCHOT Oldřich, PULUGUNDLA Bhargav, ROHDIN Johan A., SILNOVA Anna a VESELÝ Karel. BUT System Description to SdSV Challenge 2020. In: Proceedings of Short-duration Speaker Verification Challenge 2020 Workshop. Shanghai, on-line event of Interspeech 2020 Conference, 2020, s. 1-5.
Detail

2019

CARTAS Alejandro, KOCOUR Martin, RAMAN Aravindh, LEONTIADIS Ilias, LUQUE Jordi, SASTRY Nishanth, NUNEZ-MARTINEZ Leon, PERINO Diego a PERALES Carlos Segura. A Reality Check on Inference at Mobile Networks Edge. In: Proceedings of the 2nd ACM International Workshop on Edge Systems, Analytics and Networking (EDGESYS '19). Dressden: Association for Computing Machinery, 2019, s. 54-59. ISBN 978-1-4503-6275-7.
Detail

SZŐKE Igor, SKÁCEL Miroslav, MOŠNER Ladislav, PALIESEK Jakub a ČERNOCKÝ Jan. Building and Evaluation of a Real Room Impulse Response Dataset. IEEE Journal of Selected Topics in Signal Processing, roč. 13, č. 4, 2019, s. 863-876. ISSN 1932-4553.
Detail

ROHDIN Johan A., STAFYLAKIS Themos, SILNOVA Anna, ZEINALI Hossein, BURGET Lukáš a PLCHOT Oldřich. Speaker Verification Using End-To-End Adversarial Language Adaptation. In: Proceedings of ICASSP 2019. Brighton: IEEE Signal Processing Society, 2019, s. 6006-6010. ISBN 978-1-5386-4658-8.
Detail

ZEINALI Hossein, BURGET Lukáš, ROHDIN Johan A., STAFYLAKIS Themos a ČERNOCKÝ Jan. How To Improve Your Speaker Embeddings Extractor in Generic Toolkits. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, s. 6141-6145. ISBN 978-1-5386-4658-8.
Detail

NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej, ČERNOCKÝ Jan a BURGET Lukáš. Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition. Computer Speech and Language, roč. 2019, č. 58, s. 403-421. ISSN 0885-2308.
Detail

MAGHSOODI Nooshin, SAMETI Hossein, ZEINALI Hossein a STAFYLAKIS Themos. Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 2019, č. 11, s. 1815-1825. ISSN 2329-9290.
Detail

ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, OCHIAI Tsubasa, NAKATANI Tomohiro, BURGET Lukáš a ČERNOCKÝ Jan. SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures. IEEE Journal of Selected Topics in Signal Processing, roč. 13, č. 4, 2019, s. 800-814. ISSN 1932-4553.
Detail

ONDEL Yang Lucas Antoine Francois, VYDANA Hari K., BURGET Lukáš a ČERNOCKÝ Jan. Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery. In: Proceedings of Interspeech 2019. Graz: International Speech Communication Association, 2019, s. 261-265. ISSN 1990-9772.
Detail

DIEZ Sánchez Mireia, BURGET Lukáš, WANG Shuai, ROHDIN Johan A. a ČERNOCKÝ Jan. Bayesian HMM based x-vector clustering for Speaker Diarization. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 346-350. ISSN 1990-9772.
Detail

ZEINALI Hossein, STAFYLAKIS Themos, ATHANASOPOULOU Georgia, ROHDIN Johan A., GKINIS Ioanis, BURGET Lukáš a ČERNOCKÝ Jan. Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 1073-1077. ISSN 1990-9772.
Detail

WANG Shuai, ROHDIN Johan A., BURGET Lukáš, PLCHOT Oldřich, QIAN Yanmin, YU Kai a ČERNOCKÝ Jan. On the Usage of Phonetic Information for Text-independent Speaker Embedding Extraction. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 1148-1152. ISSN 1990-9772.
Detail

KARAFIÁT Martin, BASKAR Murali K., WATANABE Shinji, HORI Takaaki, WIESNER Matthew a ČERNOCKÝ Jan. Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 2220-2224. ISSN 1990-9772.
Detail

BASKAR Murali K., WATANABE Shinji, ASTUDILLO Ramon, HORI Takaaki, BURGET Lukáš a ČERNOCKÝ Jan. Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 3790-3794. ISSN 1990-9772.
Detail

MATĚJKA Pavel, PLCHOT Oldřich, ZEINALI Hossein, MOŠNER Ladislav, SILNOVA Anna, BURGET Lukáš, NOVOTNÝ Ondřej a GLEMBEK Ondřej. Analysis of BUT Submission in Far-Field Scenarios of VOiCES 2019 Challenge. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 2448-2452. ISSN 1990-9772.
Detail

NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej a BURGET Lukáš. Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 4330-4334. ISSN 1990-9772.
Detail

STAFYLAKIS Themos, ROHDIN Johan A., PLCHOT Oldřich, MIZERA Petr a BURGET Lukáš. Self-supervised speaker embeddings. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, s. 2863-2867. ISSN 1990-9772.
Detail

NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš a MATĚJKA Pavel. Discriminatively Re-trained i-Vector Extractor For Speaker Recognition. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, s. 6031-6035. ISBN 978-1-5386-4658-8.
Detail

BASKAR Murali K., BURGET Lukáš, WATANABE Shinji, KARAFIÁT Martin, HORI Takaaki a ČERNOCKÝ Jan. Promising Accurate Prefix Boosting For Sequence-to-sequence ASR. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 5646-5650. ISBN 978-1-5386-4658-8.
Detail

INAGUMA Hirofumi, CHO Jaejin, BASKAR Murali K., KAWAHARA Tatsuya a WATANABE Shinji. Transfer Learning Of Language-independent End-to-end ASR With Language Model Fusion. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 6096-6100. ISBN 978-1-5386-4658-8.
Detail

DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke, ARAKI Shoko a NAKATANI Tomohiro. Compact Network for Speakerbeam Target Speaker Extraction. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 6965-6969. ISBN 978-1-5386-4658-8.
Detail

ONDEL Yang Lucas Antoine Francois, LI Ruizhi, SELL Gregory a HEŘMANSKÝ Hynek. Deriving Spectro-temporal Properties of Hearing from Speech Data. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 411-415. ISBN 978-1-5386-4658-8.
Detail

MOŠNER Ladislav, WU Minhua, RAJU Anirudh, PARTHASARATHI Sree Hari Krishnan, KUMATANI Kenichi, SUNDARAM Shiva, MAAS Roland a HOFFMEISTER Björn. Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 6475-6479. ISBN 978-1-5386-4658-8.
Detail

YANG Jinyi, ONDEL Yang Lucas Antoine Francois, MANOHAR Vimal a HEŘMANSKÝ Hynek. Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, s. 3747-3751. ISBN 978-1-5386-4658-8.
Detail

BENEŠ Karel, IRIE Kazuki, BECK Eugen, SCHLÜTER Ralf a NEY Hermann. Unsupervised Language Model Adaptation for Speech Recognition with no Extra Resources. In: Proceedings of DAGA 2019. Rostock: Deutsche Gesellschaft für Akustik (DEGA), DEGA Head office, 2019, s. 954-957. ISBN 978-3-939296-14-0.
Detail

DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke, ARAKI Shoko a NAKATANI Tomohiro. Evaluation of SpeakerBeam target speech extraction in real noisy and reverberant conditions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN, roč. 2019, č. 2, s. 1-2. ISSN 0369-4232.
Detail

MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., BURGET Lukáš a ČERNOCKÝ Jan. Speaker Verification with Application-Aware Beamforming. In: IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019, s. 411-418. ISBN 978-1-7281-0306-8.
Detail

ZEINALI Hossein, ČERNOCKÝ Jan a BURGET Lukáš. A multi purpose and large scale speech corpus in Persian and English for speaker and speech Recognition: the DeepMine database. In: IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019, s. 397-402. ISBN 978-1-7281-0306-8.
Detail

ALAM Jahangir, BOULIANNE Gilles, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MONTEIRO Joao, MOŠNER Ladislav, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, WANG Shuai a ZEINALI Hossein. ABC NIST SRE 2019 CTS System Description. In: Proceedings of NIST. Sentosa, Singapore: National Institute of Standards and Technology, 2019, s. 1-6.
Detail

ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MOŠNER Ladislav, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, WANG Shuai, ZEINALI Hossein, DAHMANE Mohamed, ST-CHARLES Pierre-Luc, LALONDE Marc, NOISEUX Cédric a MONTEIRO Joao. ABC System Description for NIST Multimedia Speaker Recognition Evaluation 2019. In: Proceedings of NIST 2019 SRE Workshop. Sentosa, Singapore: National Institute of Standards and Technology, 2019, s. 1-7.
Detail

ZEINALI Hossein, WANG Shuai, SILNOVA Anna, MATĚJKA Pavel a PLCHOT Oldřich. BUT System Description to VoxCeleb Speaker Recognition Challenge 2019. In: Proceedings of The VoxCeleb Challange Workshop 2019. Graz, 2019, s. 1-4.
Detail

CHO Jaejin, WATANABE Shinji, HORI Takaaki, BASKAR Murali K., INAGUMA Hirofumi, VILLALBA Lopez Jesus Antonio a DEHAK Najim. Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, s. 6191-6195. ISBN 978-1-5386-4658-8.
Detail

SUBRAMANIAN Aswin S., WANG Xiaofei, BASKAR Murali K., WATANABE Shinji, TANIGUCHI Toru, TRAN Dung a FUJITA Yuya. Speech Enhancement Using End-to-End Speech Recognition Objectives. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY: IEEE Signal Processing Society, 2019, s. 234-238. ISBN 978-1-7281-1123-0.
Detail

2018

BARTOS Anthony L., CIPR Tomáš, NELSON Douglas J., SCHWARZ Petr, BANOWETZ John a JERABEK Ladislav. Noise-robust speech triage. Journal of the Acoustical Society of America, roč. 143, č. 4, 2018, s. 2313-2320. ISSN 1520-8524.
Detail

ONDEL Yang Lucas Antoine Francois, GODARD Pierre, BESACIER Laurent, LARSEN Elin, HASEGAWA-JOHNSON Mark, SCHARENBORG Odette, DUPOUX Emmanuel, BURGET Lukáš, YVON Francois a KHUDANPUR Sanjeev. Bayesian Models for Unit Discovery on a Very Low Resource Language. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, s. 5939-5943. ISBN 978-1-5386-4658-8.
Detail

KARAFIÁT Martin, BASKAR Murali K., VESELÝ Karel, GRÉZL František, BURGET Lukáš a ČERNOCKÝ Jan. Analysis of Multilingual BLSTM Acoustic Model on Low and High Resource Languages. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, s. 5789-5793. ISBN 978-1-5386-4658-8.
Detail

DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, KINOSHITA Keisuke, OGAWA Atsunori a NAKATANI Tomohiro. Single Channel Target Speaker Extraction and Recognition with Speaker Beam. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, s. 5554-5558. ISBN 978-1-5386-4658-8.
Detail

ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, NAKATANI Tomohiro a ČERNOCKÝ Jan. Optimization of Speaker-aware Multichannel Speech Extraction with ASR Criterion. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, s. 6702-6706. ISBN 978-1-5386-4658-8.
Detail

LOZANO Díez Alicia, PLCHOT Oldřich, MATĚJKA Pavel a GONZALEZ-RODRIGUEZ Joaquin. DNN Based Embeddings for Language Recognition. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, s. 5184-5188. ISBN 978-1-5386-4658-8.
Detail

ROHDIN Johan A., SILNOVA Anna, DIEZ Sánchez Mireia, PLCHOT Oldřich, MATĚJKA Pavel a BURGET Lukáš. End-to-End DNN Based Speaker Recognition Inspired by i-Vector and PLDA. In: Proceedings of ICASSP. Calgary: IEEE Signal Processing Society, 2018, s. 4874-4878. ISBN 978-1-5386-4658-8.
Detail

EGOROVA Ekaterina a BURGET Lukáš. Out-of-Vocabulary Word Recovery Using FST-Based Subword Unit Clustering in a Hybrid ASR System. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, s. 5919-5923. ISBN 978-1-5386-4658-8.
Detail

RYANT Neville, BERGELSON Elika, CHURCH Kenneth, CRISTIA Alejandrina, DU Jun, GANAPATHY Sriram, KHUDANPUR Sanjeev, KOWALSKI Diana, KRISHNAMOORTHY Mahesh, KULSHRESHTA Rajat, LIBERMAN Mark, LU Yu-Ding, MACIEJEWSKI Matthew, METZE Florian, PROFANT Ján, SUN Lei, TSAO Yu a YU Zhou. Enhancement and Analysis of Conversational Speech: JSALT 2017. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, s. 5154-5158. ISBN 978-1-5386-4658-8.
Detail

LOZANO Díez Alicia, PLCHOT Oldřich, MATĚJKA Pavel, NOVOTNÝ Ondřej a GONZALEZ-RODRIGUEZ Joaquin. Analysis of DNN-based Embeddings for Language Recognition on the NIST LRE 2017. In: Proceedings of Odyssey 2018 The Speaker and Language Recognition Workshop. Les Sables d'Olonne: International Speech Communication Association, 2018, s. 39-46. ISSN 2312-2846.
Detail

PLCHOT Oldřich, MATĚJKA Pavel, NOVOTNÝ Ondřej, CUMANI Sandro, LOZANO Díez Alicia, SLAVÍČEK Josef, DIEZ Sánchez Mireia, GRÉZL František, GLEMBEK Ondřej, KAMSALI Veera Mounika, SILNOVA Anna, BURGET Lukáš, ONDEL Yang Lucas Antoine Francois, KESIRAJU Santosh a ROHDIN Johan A. Analysis of BUT-PT Submission for NIST LRE 2017. In: Proceedings of Odyssey 2018 The Speaker and Language Recognition Workshop. Les Sables d'Olonne: International Speech Communication Association, 2018, s. 47-53. ISSN 2312-2846.
Detail

DIEZ Sánchez Mireia, BURGET Lukáš a MATĚJKA Pavel. Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, s. 147-154. ISSN 2312-2846.
Detail

NOVOTNÝ Ondřej, PLCHOT Oldřich, MATĚJKA Pavel, MOŠNER Ladislav a GLEMBEK Ondřej. On the use of X-vectors for Robust Speaker Recognition. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, s. 168-175. ISSN 2312-2846.
Detail

SILNOVA Anna, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, NOVOTNÝ Ondřej, GRÉZL František, SCHWARZ Petr a ČERNOCKÝ Jan. BUT/Phonexia Bottleneck Feature Extractor. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, s. 283-287. ISSN 2312-2846.
Detail

BRUMMER Johan Nikolaas Langenhoven, SILNOVA Anna, BURGET Lukáš a STAFYLAKIS Themos. Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. In: Proceedings of Odyssey 2018. Les Sables d'Olonne: International Speech Communication Association, 2018, s. 349-356. ISSN 2312-2846.
Detail

ZEINALI Hossein, BURGET Lukáš, SAMETI Hossein a ČERNOCKÝ Jan. Spoken Pass-Phrase Verification in the i-vector Space. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, s. 372-377. ISSN 2312-2846.
Detail

SILNOVA Anna, BRUMMER Johan Nikolaas Langenhoven, GARCÍA-ROMERO Daniel, SNYDER David a BURGET Lukáš. Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, s. 72-76. ISSN 1990-9772.
Detail

KARAFIÁT Martin, BASKAR Murali K., SZŐKE Igor, MALENOVSKÝ Vladimír, VESELÝ Karel, GRÉZL František, BURGET Lukáš a ČERNOCKÝ Jan. BUT OpenSAT 2017 speech recognition system. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, s. 2638-2642. ISSN 1990-9772.
Detail

DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, ROHDIN Johan A., SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, NOVOTNÝ Ondřej, VESELÝ Karel, GLEMBEK Ondřej, PLCHOT Oldřich, MOŠNER Ladislav a MATĚJKA Pavel. BUT system for DIHARD Speech Diarization Challenge 2018. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, s. 2798-2802. ISSN 1990-9772.
Detail

PULUGUNDLA Bhargav, BASKAR Murali K., KESIRAJU Santosh, EGOROVA Ekaterina, KARAFIÁT Martin, BURGET Lukáš a ČERNOCKÝ Jan. BUT system for low resource Indian language ASR. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, s. 3182-3186. ISSN 1990-9772.
Detail

BENEŠ Karel, KESIRAJU Santosh a BURGET Lukáš. i-vectors in language modeling: An efficient way of domain adaptation for feed-forward models. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, s. 3383-3387. ISSN 1990-9772.
Detail

MOŠNER Ladislav, PLCHOT Oldřich, MATĚJKA Pavel, NOVOTNÝ Ondřej a ČERNOCKÝ Jan. Dereverberation and Beamforming in Robust Far-Field Speaker Recognition. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, s. 1334-1338. ISSN 1990-9772.
Detail

VESELÝ Karel, PERALES Carlos Segura, SZŐKE Igor, LUQUE Jordi a ČERNOCKÝ Jan. Lightly supervised vs. semi-supervised training of acoustic model on Luxembourgish for low-resource automatic speech recognition. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, s. 2883-2887. ISSN 1990-9772.
Detail

NOVOTNÝ Ondřej, MATĚJKA Pavel, PLCHOT Oldřich a GLEMBEK Ondřej. On the use of DNN Autoencoder for Robust Speaker Recognition. Brno: Fakulta informačních technologií VUT v Brně, 2018.
Detail

ZEINALI Hossein, BURGET Lukáš a ČERNOCKÝ Jan. Convolutional Neural Networks and X-Vector Embedding for DCASE2018 Acoustic Scene Classification Challenge. In: Proceedings of DCASE 2018 Workshop. Surrey: Tampere University of Technology, 2018, s. 1-5. ISBN 978-952-15-4262-6.
Detail

ALAM Jahangir, BHATTACHARYA Gautam, BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, DIEZ Sánchez Mireia, GLEMBEK Ondřej, KENNY Patrick, KLČO Michal, LANDINI Federico Nicolás, LOZANO Díez Alicia, MATĚJKA Pavel, MONTEIRO Joao, MOŠNER Ladislav, NOVOTNÝ Ondřej, PLCHOT Oldřich, PROFANT Ján, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos a ZEINALI Hossein. ABC NIST SRE 2018 SYSTEM DESCRIPTION. In: Proceedings of 2018 NIST SRE Workshop. Athens: National Institute of Standards and Technology, 2018, s. 1-10.
Detail

SZŐKE Igor. Souhrnná zpráva k výzkumnému projektu "Škoda auto - Digital Minutes". Brno: ŠKODA AUTO a.s., 2018.
Detail

WIESNER Matthew, LIU Chunxi, ONDEL Yang Lucas Antoine Francois, HARMAN Craig, MANOHAR Vimal, TRMAL Jan, HUANG Zhongqiang, DEHAK Najim a KHUDANPUR Sanjeev. Automatic Speech Recognition and Topic Identification for Almost-Zero-Resource Languages. In: Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018, s. 2052-2056. ISSN 1990-9772.
Detail

GODARD Pierre, BOITO Marcely Z., ONDEL Yang Lucas Antoine Francois, BERARD Alexandre, YVON Francois, VILLAVICENCIO Aline a BESACIER Laurent. Unsupervised Word Segmentation from Speech with Attention. In: Proceeding of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, s. 2678-2682. ISSN 1990-9772.
Detail

CHO Jaejin, BASKAR Murali K., LI Ruizhi, WIESNER Matthew, MALLIDI Sri Harish, YALTA Nelson, KARAFIÁT Martin, WATANABE Shinji a HORI Takaaki. Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling. In: Proceedings of 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018). Athens: IEEE Signal Processing Society, 2018, s. 521-527. ISBN 978-1-5386-4334-1.
Detail

DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, KINOSHITA Keisuke, ARAKI Shoko, OGAWA Atsunori a NAKATANI Tomohiro. SpeakerBeam: A New Deep Learning Technology for Extracting Speech of a Target Speaker Based on the Speaker's Voice Characteristics. NTT Technical Review, roč. 16, č. 11, 2018, s. 19-24. ISSN 1348-3447.
Detail

2017

ZEINALI Hossein, SAMETI Hossein a BURGET Lukáš. HMM-Based Phrase-Independent i-Vector Extractor for Text-Dependent Speaker Verification. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 25, č. 7, 2017, s. 1421-1435. ISSN 2329-9290.
Detail

BASKAR Murali K., KARAFIÁT Martin, BURGET Lukáš, VESELÝ Karel, GRÉZL František a ČERNOCKÝ Jan. Residual Memory Networks: Feed-forward approach to learn long-term temporal dependencies. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, s. 4810-4814. ISBN 978-1-5090-4117-6.
Detail

HANNEMANN Mirko, TRMAL Jan, ONDEL Yang Lucas Antoine Francois, KESIRAJU Santosh a BURGET Lukáš. Bayesian joint-sequence models for grapheme-to-phoneme conversion. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, s. 2836-2840. ISBN 978-1-5090-4117-6.
Detail

KESIRAJU Santosh, PAPPAGARI Raghavendra, ONDEL Yang Lucas Antoine Francois, BURGET Lukáš, DEHAK Najim, KHUDANPUR Sanjeev, ČERNOCKÝ Jan a GANGASHETTY Suryakanth V. Topic identification of spoken documents using unsupervised acoustic unit discovery. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, s. 5745-5749. ISBN 978-1-5090-4117-6.
Detail

LIU Chunxi, YANG Jinyi, SUN Ming, KESIRAJU Santosh, ROTT Alena, ONDEL Yang Lucas Antoine Francois, GHAHREMANI Pegah, DEHAK Najim, BURGET Lukáš a KHUDANPUR Sanjeev. An Empirical evaluation of zero resource acoustic unit discovery. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, s. 5305-5309. ISBN 978-1-5090-4117-6.
Detail

ONDEL Yang Lucas Antoine Francois, BURGET Lukáš, ČERNOCKÝ Jan a KESIRAJU Santosh. Bayesian phonotactic language model for Acoustic Unit Discovery. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, s. 5750-5754. ISBN 978-1-5090-4117-6.
Detail

FÉR Radek, MATĚJKA Pavel, GRÉZL František, PLCHOT Oldřich, VESELÝ Karel a ČERNOCKÝ Jan. Multilingually Trained Bottleneck Features in Spoken Language Recognition. Computer Speech and Language, roč. 2017, č. 46, s. 252-267. ISSN 0885-2308.
Detail

ZEINALI Hossein, SAMETI Hossein, BURGET Lukáš a ČERNOCKÝ Jan. Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models. Computer Speech and Language, roč. 2017, č. 46, s. 53-71. ISSN 0885-2308.
Detail

BENEŠ Karel, BASKAR Murali K. a BURGET Lukáš. Residual Memory Networks in Language Modeling: Improving the Reputation of Feed-Forward Networks. In: Proceedings of Interspeeech 2017. Stockholm: International Speech Communication Association, 2017, s. 284-288. ISSN 1990-9772.
Detail

KARAFIÁT Martin, BASKAR Murali K., MATĚJKA Pavel, VESELÝ Karel, GRÉZL František, BURGET Lukáš a ČERNOCKÝ Jan. 2016 BUT Babel system: Multilingual BLSTM acoustic model with i-vector based adaptation. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 719-723. ISSN 1990-9772.
Detail

MATĚJKA Pavel, NOVOTNÝ Ondřej, PLCHOT Oldřich, BURGET Lukáš, DIEZ Sánchez Mireia a ČERNOCKÝ Jan. Analysis of Score Normalization in Multilingual Speaker Recognition. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 1567-1571. ISSN 1990-9772.
Detail

PLCHOT Oldřich, MATĚJKA Pavel, SILNOVA Anna, NOVOTNÝ Ondřej, DIEZ Sánchez Mireia, ROHDIN Johan A., GLEMBEK Ondřej, BRÜMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, BUERA Luis, KENNY Patrick, ALAM Jahangir a BHATTACHARYA Gautam. Analysis and Description of ABC Submission to NIST SRE 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 1348-1352. ISSN 1990-9772.
Detail

SILNOVA Anna, BURGET Lukáš a ČERNOCKÝ Jan. Alternative Approaches to Neural Network based Speaker Verification. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 1572-1575. ISSN 1990-9772.
Detail

PAPADOPOULOS Pavlos, TRAVADI Ruchir, VAZ Colin, MALANDRAKIS Nikolaos, HERMJAKOB Ulf, POURDAMGHANI Nima, PUST Michael, ZHANG Boliang, PAN Xiaoman, LU Di, LIN Ying, GLEMBEK Ondřej, BASKAR Murali K., KARAFIÁT Martin, BURGET Lukáš, HASEGAWA-JOHNSON Mark, JI Heng, MAY Jonathan, KNIGHT Kevin a NARAYANAN Shrikanth. Team ELISA System for DARPA LORELEI Speech Evaluation 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 2053-2057. ISSN 1990-9772.
Detail

VESELÝ Karel, BURGET Lukáš a ČERNOCKÝ Jan. Semi-supervised DNN training with word selection for ASR. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 3687-3691. ISSN 1990-9772.
Detail

DAS Amit, HASEGAWA-JOHNSON Mark a VESELÝ Karel. Deep Auto-encoder Based Multi-task Learning Using Probabilistic Transcriptions. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 2073-2077. ISSN 1990-9772.
Detail

HIGUCHI Takuya, KINOSHITA Keisuke, DELCROIX Marc, ŽMOLÍKOVÁ Kateřina a NAKATANI Tomohiro. Deep clustering-based beamforming for separation with unknown number of sources. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 1183-1187. ISSN 1990-9772.
Detail

ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori a NAKATANI Tomohiro. Speaker-aware neural network based beamformer for speaker extraction in speech mixtures. In: Proceedings of Interspeech 2017. Stocholm: International Speech Communication Association, 2017, s. 2655-2659. ISSN 1990-9772.
Detail

VESELÝ Karel, BASKAR Murali K., DIEZ Sánchez Mireia a BENEŠ Karel. MGB-3 but system: Low-resource ASR on Egyptian YouTube data. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, s. 368-373. ISBN 978-1-5090-4788-8.
Detail

ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori a NAKATANI Tomohiro. Learning Speaker Representation for Neural Network Based Multichannel Speaker Extraction. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, s. 8-15. ISBN 978-1-5090-4788-8.
Detail

ŽMOLÍKOVÁ Kateřina. Summary report of project "Speech enhancement front-end for robust automatic speech recognition with large amount of training data" for Year 2017. Brno: NTT Corporation, 2017.
Detail

MATĚJKA Pavel, PLCHOT Oldřich, NOVOTNÝ Ondřej, CUMANI Sandro, LOZANO Díez Alicia, SLAVÍČEK Josef, DIEZ Sánchez Mireia, GRÉZL František, GLEMBEK Ondřej, KAMSALI Veera Mounika, SILNOVA Anna, BURGET Lukáš, ONDEL Yang Lucas Antoine Francois, KESIRAJU Santosh a ROHDIN Johan A. BUT- PT System Description for NIST LRE 2017. In: Proceedings of NIST Language Recognition Workshop 2017. Orlando, Florida: National Institute of Standards and Technology, 2017, s. 1-6.
Detail

MATĚJKA Pavel. Souhrnná zpráva k projektu "Speaker REcognition" za rok 2017. Brno: Phonexia s.r.o., 2017.
Detail

GLEMBEK Ondřej. Summary report for project Exploiting Language Information for Situational Awareness (ELISA) For year 2017. Brno: University of Southern California, 2017.
Detail

MATĚJKA Pavel. Summary report for project "Robust Automatic Speech Transcription" in Year 2017. Brno: Raytheon BBN Technologies Corp., 2017.
Detail

MALANDRAKIS Nikolaos, GLEMBEK Ondřej a NARAYANAN Shrikanth. Extracting Situation Frames from non-English Speech: Evaluation Framework and Pilot Results. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, s. 2123-2127. ISSN 1990-9772.
Detail

2016

PLCHOT Oldřich, BURGET Lukáš, ARONOWITZ Hagai a MATĚJKA Pavel. Audio Enhancing With DNN Autoencoder For Speaker Recognition. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, s. 5090-5094. ISBN 978-1-4799-9988-0.
Detail

MATĚJKA Pavel, GLEMBEK Ondřej, NOVOTNÝ Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš a ČERNOCKÝ Jan. Analysis Of DNN Approaches To Speaker Identification. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, s. 5100-5104. ISBN 978-1-4799-9988-0.
Detail

VESELÝ Karel, WATANABE Shinji, ŽMOLÍKOVÁ Kateřina, KARAFIÁT Martin, BURGET Lukáš a ČERNOCKÝ Jan. Sequence Summarizing Neural Network for Speaker Adaptation. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, s. 5315-5319. ISBN 978-1-4799-9988-0.
Detail

KARAFIÁT Martin, BURGET Lukáš, GRÉZL František, VESELÝ Karel a ČERNOCKÝ Jan. Multilingual Region-Dependent Transforms. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, s. 5430-5434. ISBN 978-1-4799-9988-0.
Detail

LOPEZ-MORENO Ignacio, GONZALEZ-DOMINGUEZ Javier, MARTÍNEZ González David, PLCHOT Oldřich, GONZALEZ-RODRIGUEZ Joaquin a MORENO Pedro. On the use of deep feedforward neural networks for automatic language identification. Computer Speech and Language, roč. 2016, č. 40, s. 46-59. ISSN 0885-2308.
Detail

GRÉZL František a KARAFIÁT Martin. Bottle-Neck Feature Extraction Structures for Multilingual Training and Porting. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, s. 144-151. ISSN 1877-0509.
Detail

LOZANO Díez Alicia, SILNOVA Anna, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, PEŠÁN Jan, BURGET Lukáš a GONZALEZ-RODRIGUEZ Joaquin. Analysis and Optimization of Bottleneck Features for Speaker Recognition. In: Proceedings of Odyssey 2016. Bilbao: International Speech Communication Association, 2016, s. 352-357. ISSN 2312-2846.
Detail

ZEINALI Hossein, BURGET Lukáš, SAMETI Hossein, GLEMBEK Ondřej a PLCHOT Oldřich. Deep Neural Networks and Hidden Markov Models in i-vector-based Text-Dependent Speaker Verification. In: Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Bilbao: International Speech Communication Association, 2016, s. 24-30. ISSN 2312-2846.
Detail

PLCHOT Oldřich, MATĚJKA Pavel, FÉR Radek, GLEMBEK Ondřej, NOVOTNÝ Ondřej, PEŠÁN Jan, VESELÝ Karel, ONDEL Yang Lucas Antoine Francois, KARAFIÁT Martin, GRÉZL František, KESIRAJU Santosh, BURGET Lukáš, BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez, CUMANI Sandro, MALLIDI Sri Harish a LI Ruizhi. BAT System Description for NIST LRE 2015. In: Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Bilbao: International Speech Communication Association, 2016, s. 166-173. ISSN 2312-2846.
Detail

GRÉZL František, EGOROVA Ekaterina a KARAFIÁT Martin. Study of Large Data Resources for Multilingual Training and System Porting. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, s. 15-22. ISSN 1877-0509.
Detail

EGOROVA Ekaterina a SERRANO Jordi Lugue. Semi-Supervised Training of Language Model on Spanish Conversational Telephone Speech Data. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, s. 114-120. ISSN 1877-0509.
Detail

ONDEL Yang Lucas Antoine Francois, BURGET Lukáš a ČERNOCKÝ Jan. Variational Inference for Acoustic Unit Discovery. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, s. 80-86. ISSN 1877-0509.
Detail

ZEINALI Hossein, SAMETI Hossein, BURGET Lukáš, ČERNOCKÝ Jan, MAGHSOODI Nooshin a MATĚJKA Pavel. i-vector/HMM Based Text-dependent Speaker Verification System for RedDots Challenge. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, s. 440-444. ISBN 978-1-5108-3313-5.
Detail

KESIRAJU Santosh, BURGET Lukáš, SZŐKE Igor a ČERNOCKÝ Jan. Learning document representations using subspace multinomial model. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, s. 700-704. ISBN 978-1-5108-3313-5.
Detail

NOVOTNÝ Ondřej, MATĚJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš a ČERNOCKÝ Jan. Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, s. 828-832. ISBN 978-1-5108-3313-5.
Detail

ŽMOLÍKOVÁ Kateřina, KARAFIÁT Martin, VESELÝ Karel, DELCROIX Marc, WATANABE Shinji, BURGET Lukáš a ČERNOCKÝ Jan. Data selection by sequence summarizing neural network in mismatch condition training. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, s. 2354-2358. ISBN 978-1-5108-3313-5.
Detail

LI Ruizhi, MALLIDI Sri Harish, PLCHOT Oldřich, BURGET Lukáš a DEHAK Najim. Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, s. 3265-3269. ISBN 978-1-5108-3313-5.
Detail

PEŠÁN Jan, BURGET Lukáš a ČERNOCKÝ Jan. Sequence Summarizing Neural Networks for Spoken Language Recognition. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, s. 3285-3289. ISBN 978-1-5108-3313-5.
Detail

NOVOTNÝ Ondřej, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš a ČERNOCKÝ Jan. Analysis of the DNN-Based SRE Systems in Multi-language Conditions. In: Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, s. 199-204. ISBN 978-1-5090-4903-5.
Detail

KARAFIÁT Martin, BASKAR Murali K., MATĚJKA Pavel, VESELÝ Karel, GRÉZL František a ČERNOCKÝ Jan. Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM. In: Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, s. 637-643. ISBN 978-1-5090-4903-5.
Detail

GRÉZL František a KARAFIÁT Martin. Boosting Performance on Low-resource Languages by Standard Corpora: AN ANALYSIS. In: Proceeding of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, s. 629-636. ISBN 978-1-5090-4903-5.
Detail

POVOLNÝ Filip, MATĚJKA Pavel, HRADIŠ Michal, POPKOVÁ Anna, OTRUSINA Lubomír, SMRŽ Pavel, WOOD Ian, ROBIN Cécile a LAMEL Lori. Multimodal Emotion Recognition for AVEC 2016 Challenge. In: AVEC '16 Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge. Amsterdam: Association for Computing Machinery, 2016, s. 75-82. ISBN 978-1-4503-4516-3.
Detail

GLEMBEK Ondřej. Summary report for project Exploiting Language Information for Situational Awareness (ELISA) For year 2016. Brno: University of Southern California, 2016.
Detail

MATĚJKA Pavel. Summary report for project "Robust Automatic Speech Transcription" in Year 2016. Brno: Raytheon BBN Technologies Corp., 2016.
Detail

SKÁCEL Miroslav, KARAFIÁT Martin, ONDEL Yang Lucas Antoine Francois, UCHYTIL Albert a SZŐKE Igor. BUT Zero-Cost Speech Recognition 2016 System Description. In: CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016, s. 1-3. ISSN 1613-0073.
Detail

POPKOVÁ Anna, POVOLNÝ Filip, MATĚJKA Pavel, GLEMBEK Ondřej, GRÉZL František a ČERNOCKÝ Jan. Investigation of Bottle-Neck Features for Emotion Recognition. In: 19th International Conference, TSD 2016, Brno , Czech Republic, September 12-16, 2016, Proceedings. Lecture Notes in Computer Science, Lecture Notes in Artificial Intelligence, roč. 9924. Brno: International Speech Communication Association, 2016, s. 426-434. ISSN 0302-9743.
Detail

SAGHA Hesam, MATĚJKA Pavel, GAVRYUOKOVA Maryna, POVOLNÝ Filip, MARCHI Erik a SCHULLER Björn W. Enhancing multilingual recognition of emotion in speech by language identification. In: 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION - Proceedings (INTERSPEECH 2016). San Francisco: International Speech Communication Association, 2016, s. 2949-2953. ISSN 1990-9772.
Detail

SZŐKE Igor a ANGUERA Xavier. Zero-Cost Speech Recognition Task at Mediaeval 2016. In: CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016, s. 1-3. ISSN 1613-0073.
Detail

2015

MOTLÍČEK Petr, DEY Subhadeep, MADIKERI Srikanth a BURGET Lukáš. EMPLOYMENT OF SUBSPACE GAUSSIAN MIXTURE MODELS IN SPEAKER RECOGNITION. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, s. 4445-4449. ISBN 978-1-4673-6997-8.
Detail

HEŘMANSKÝ Hynek, BURGET Lukáš, COHEN Jordan, DUPOUX Emmanuel, FELDMAN Naomi, GODFREY John, KHUDANPUR Sanjeev, MACIEJEWSKI Matthew, MALLIDI Sri Harish, MENON Anjali, OGAWA Tetsuji, PEDDINTI Vijayaditya, ROSE Richard, STERN Richard, WIESNER Matthew a VESELÝ Karel. TOWARDS MACHINES THAT KNOW WHEN THEY DO NOT KNOW: SUMMARY OF WORK DONE AT 2014 FREDERICK JELINEK MEMORIAL WORKSHOP. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, s. 5009-5013. ISBN 978-1-4673-6997-8.
Detail

SZŐKE Igor, SKÁCEL Miroslav, ČERNOCKÝ Jan a BURGET Lukáš. Coping with Channel Mismatch in Query-By-Example - BUT QUESST 2014. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, s. 5838-5842. ISBN 978-1-4673-6997-8.
Detail

ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., BUZO Andi, METZE Florian, SZŐKE Igor a PENAGARIKANO Mikel. QUESST2014: Evaluating Query-By-Example Speech Search in a Zero-Resource Setting with Real-Life Queries. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, s. 5833-5837. ISBN 978-1-4673-6997-8.
Detail

ONDEL Yang Lucas Antoine Francois, ANGUERA Xavier a LUQUE Jordi. MASK+: Data-driven regions selection for acoustic fingerprinting. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, s. 335-339. ISBN 978-1-4673-6997-8.
Detail

FÉR Radek, MATĚJKA Pavel, GRÉZL František, PLCHOT Oldřich a ČERNOCKÝ Jan. Multilingual Bottleneck Features for Language Recognition. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, s. 389-393. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

CUMANI Sandro, PLCHOT Oldřich a FÉR Radek. Exploiting i-vector posterior covariances for short-duration language recognition. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, s. 1002-1006. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

GLEMBEK Ondřej, MATĚJKA Pavel, PLCHOT Oldřich, PEŠÁN Jan, BURGET Lukáš a SCHWARZ Petr. Migrating i-vectors Between Speaker Recognition Systems Using Regression Neural Networks. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, s. 2327-2331. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

PEŠÁN Jan, BURGET Lukáš, HEŘMANSKÝ Hynek a VESELÝ Karel. DNN derived filters for processing of modulation spectrum of speech. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, s. 1908-1911. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

MALLIDI Sri Harish, OGAWA Tetsuji, VESELÝ Karel, NIDADAVOLU Phani S. a HEŘMANSKÝ Hynek. Autoencoder based multi-stream combination for noise robust speech recognition. In: Proceeding of Interspeech 2015. Dresden: International Speech Communication Association, 2015, s. 3551-3555. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

SILNOVA Anna, GLEMBEK Ondřej, KINNUNEN Tomi a MATĚJKA Pavel. Exploring ANN Back-Ends for i-Vector Based Speaker Age Estimation. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, s. 3036-3040. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

KARAFIÁT Martin, GRÉZL František, BURGET Lukáš, SZŐKE Igor a ČERNOCKÝ Jan. Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, s. 2454-2458. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

GLEMBEK Ondřej, MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr, PEŠÁN Jan a PLCHOT Oldřich. Voice-print transformation for migration between automatic speaker identification systems. Abstract book of the 7th European Academy of Forensic Science Conference. Praha: Kriminalistický ústav Praha, 2015. ISBN 978-80-260-8659-8.
Detail

HSIAO Roger, MA Jeff, HARTMANN William, KARAFIÁT Martin, GRÉZL František, BURGET Lukáš, SZŐKE Igor, ČERNOCKÝ Jan, WATANABE Shinji, CHEN Zhuo, MALLIDI Sri Harish, HEŘMANSKÝ Hynek, TSAKALIDIS Stavros a SCHWARTZ Richard. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In: Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015, s. 533-538. ISBN 978-1-4799-7291-3.
Detail

SKÁCEL Miroslav a SZŐKE Igor. BUT QUESST 2015 System Description. In: CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015, s. 1-3. ISSN 1613-0073.
Detail

GRÉZL František, KARAFIÁT Martin, VESELÝ Karel a ŽIŽKA Josef. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2015. Brno: ReplayWell, s. r. o., 2015.
Detail

GLEMBEK Ondřej, KESIRAJU Santosh a ONDEL Yang Lucas Antoine Francois. Summary report for project "ELISA" in Year 2015. Brno: University of Southern California, 2015.
Detail

MATĚJKA Pavel, PLCHOT Oldřich, NOVOTNÝ Ondřej a FÉR Radek. Summary report for project "Robust Automatic Speech Transcription" in Year 2015. Brno: Raytheon BBN Technologies Corp., 2015.
Detail

KARAFIÁT Martin, GRÉZL František, HANNEMANN Mirko a VESELÝ Karel. Summary report for project "Multilingual speech recognition" in Year 2015. Brno: Raytheon BBN Technologies Corp., 2015.
Detail

KARAFIÁT Martin a GRÉZL František. Souhrnná zpráva k projektu "ASR-FR" za rok 2015. Brno: Phonexia s.r.o., 2015.
Detail

KARAFIÁT Martin a GRÉZL František. Souhrnná zpráva k projektu "Dodání anotací akustických dat, akustického modelu, jazykového modelu a výslovnostního slovníku pro francouzský jazyk" za rok 2015. Brno: Phonexia s.r.o., 2015.
Detail

SZŐKE Igor, METZE Florian, RODRIGUEZ-FUENTES Luis J., PROENCA Jorge, BUZO Andi, LOJKA Martin, ANGUERA Xavier a XIONG Xiao. Query by Example Search on Speech at Mediaeval 2015. In: CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015, s. 1-3. ISSN 1613-0073.
Detail

2014

KARAFIÁT Martin, GRÉZL František, HANNEMANN Mirko a ČERNOCKÝ Jan. BUT Neural Network Features for Spontaneous Vietnamese in BABEL. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, s. 5659-5663. ISBN 978-1-4799-2892-7.
Detail

GLEMBEK Ondřej, MA Jeff, MATĚJKA Pavel, ZHANG Bing, PLCHOT Oldřich, BURGET Lukáš a MATSOUKAS Spyros. Domain adaptation via within-class covariance correction in I-vector based speaker recognition systems. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, s. 4060-4064. ISBN 978-1-4799-2892-7.
Detail

GRÉZL František, KARAFIÁT Martin a VESELÝ Karel. Adaptation of Multilingual Stacked Bottle-neck Neural Network Structure for New Language. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, s. 7704-7708. ISBN 978-1-4799-2892-7.
Detail

SZŐKE Igor, BURGET Lukáš, GRÉZL František, ČERNOCKÝ Jan a ONDEL Yang Lucas Antoine Francois. Calibration and Fusion of Query-by-example Systems - BUT SWS 2013. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, s. 7899-7903. ISBN 978-1-4799-2892-7.
Detail

LOPEZ-MORENO Ignacio, GONZALEZ-DOMINGUEZ Javier, MARTÍNEZ González David, PLCHOT Oldřich, GONZALEZ-RODRIGUEZ Joaquin a MORENO Pedro. Automatic Language Identification Using Deep Neural Networks. In: Proceeding of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, s. 5374-5378. ISBN 978-1-4799-2892-7.
Detail

GRÉZL František a KARAFIÁT Martin. Adapting Multilingual Neural Network Hierarchy to a New Language. In: Proceedings of the 4th International Workshop on Spoken Language Technologies for Under- resourced Languages SLTU-2014. St. Petersburg, Russia, 2014. St. Petersburg: International Speech Communication Association, 2014, s. 39-45. ISBN 978-5-8088-0908-6.
Detail

MARTÍNEZ González David, BURGET Lukáš, STAFYLAKIS Themos, LEI Yun, KENNY Patrick a LLEIDA Eduardo. Unscented Transform For Ivector-based Noisy Speaker Recognition. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, s. 4070-4074. ISBN 978-1-4799-2892-7.
Detail

ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., SZŐKE Igor, BUZO Andi a METZE Florian a kol. Query-by-example Spoken Term Detection Evaluation on Low-resource Languages. In: Proceedings of the 4th International Workshop on Spoken Language Technologies for Under- resourced Languages SLTU-2014. St. Petersburg, Russia. St. Petersburg: International Speech Communication Association, 2014, s. 24-31. ISBN 978-5-8088-0908-6.
Detail

EGOROVA Ekaterina. Multi-task Neural Networks For Speech Recognition. In: Proceedings of the 20th Student Conference, EEICT 2014. Volume 2. Brno: Vysoké učení technické v Brně, 2014, s. 24-26. ISBN 978-80-214-4923-7.
Detail

CUMANI Sandro, LAFACE Pietro a PLCHOT Oldřich. On the use of i-vector posterior distributions in Probabilistic Linear Discriminant Analysis. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 22, č. 4, 2014, s. 846-857. ISSN 2329-9290.
Detail

MATĚJKA Pavel, ZHANG Le, NG Tim, MALLIDI Sri Harish, GLEMBEK Ondřej, MA Jeff a ZHANG Bing. Neural Network Bottleneck Features for Language Identification. In: Proceedings of Odyssey 2014. Joensuu: International Speech Communication Association, 2014, s. 299-304. ISSN 2312-2846.
Detail

GRÉZL František a KARAFIÁT Martin. Combination of Multilingual and Semi-Supervised Training for Under-Resourced Languages. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, s. 820-824. ISBN 978-1-63439-435-2.
Detail

BAHARI Mohamad H., DEHAK Najim, VAN hamme Hugo, BURGET Lukáš, ALI Ahmed M. a GLASS Jim. Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, roč. 2014, č. 7, s. 1117-1129. ISSN 2329-9290.
Detail

NG Tim, HSIAO Roger, ZHANG Le, KARAKOS Damianos, MALLIDI Sri Harish, KARAFIÁT Martin, VESELÝ Karel, SZŐKE Igor, ZHANG Bing, NGUYEN Long a SCHWARTZ Richard. Progress in the BBN Keyword Search System for the DARPA RATS Program. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, s. 959-963. ISBN 978-1-63439-435-2.
Detail

KARAFIÁT Martin, GRÉZL František, VESELÝ Karel, HANNEMANN Mirko, SZŐKE Igor a ČERNOCKÝ Jan. BUT 2014 Babel System: Analysis of adaptation in NN based systems. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, s. 3002-3006. ISBN 978-1-63439-435-2.
Detail

PLCHOT Oldřich, DIEZ Sánchez Mireia, SOUFIFAR Mehdi a BURGET Lukáš. PLLR Features in Language Recognition System for RATS. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, s. 3048-3051. ISBN 978-1-63439-435-2.
Detail

GRÉZL František, EGOROVA Ekaterina a KARAFIÁT Martin. Further Investigation into Multilingual Training and Adaptation of Stacked Bottle-neck Neural Network Structure. In: Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014, s. 48-53. ISBN 978-1-4799-7129-9.
Detail

KARAFIÁT Martin, VESELÝ Karel, SZŐKE Igor, BURGET Lukáš, GRÉZL František, HANNEMANN Mirko a ČERNOCKÝ Jan. BUT ASR System for BABEL Surprise Evaluation 2014. In: Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014, s. 501-506. ISBN 978-1-4799-7129-9.
Detail

SZŐKE Igor, SKÁCEL Miroslav a BURGET Lukáš. BUT QUESST 2014 System Description. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014, s. 1-2. ISSN 1613-0073.
Detail

ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., SZŐKE Igor, BUZO Andi a METZE Florian. Query by Example Search on Speech at Mediaeval 2014. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014, s. 1-2. ISSN 1613-0073.
Detail

KARAFIÁT Martin a GRÉZL František. Souhrnná zpráva k projektu "Dodání anotací akustických dat, akustického modelu, jazykového modelu a výslovnostního slovníku pro španělský jazyk" za rok 2014. Brno: Phonexia s.r.o., 2014.
Detail

2013

CUMANI Sandro, PLCHOT Oldřich a LAFACE Pietro. Probabilistic Linear Discriminant Analysis Of I-Vector Posterior Distributions. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, s. 7644-7648. ISBN 978-1-4799-0355-9.
Detail

PLCHOT Oldřich, MATSOUKAS Spyros, MATĚJKA Pavel, DEHAK Najim, MA Jeff, CUMANI Sandro, GLEMBEK Ondřej, HEŘMANSKÝ Hynek, MESGARANI Nima, SOUFIFAR Mehdi Mohammad, THOMAS Samuel, ZHANG Bing a ZHOU Xinhui a kol. Developing A Speaker Identification System For The DARPA RATS Project. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, s. 6768-6772. ISBN 978-1-4799-0355-9.
Detail

EGOROVA Ekaterina, VESELÝ Karel, KARAFIÁT Martin, JANDA Miloš a ČERNOCKÝ Jan. Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, s. 7324-7328. ISBN 978-1-4799-0355-9.
Detail

HANNEMANN Mirko, POVEY Daniel a ZWEIG Geoffrey. Combining Forward and Backward Search in Decoding. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, s. 6739-6743. ISBN 978-1-4799-0355-9.
Detail

LEI Yun, BURGET Lukáš a SCHEFFER Nicolas. A Noise Robust I-Vector Extractor Using Vector Taylor Series For Speaker Recognition. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, s. 6788-6791. ISBN 978-1-4799-0355-9.
Detail

AKBACAK Murat, BURGET Lukáš, WENG Wan a VAN Hout Julien. Rich System Combination For Keyword Spotting In Noisy and Acoustically Heterogenous Audio Streams. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, s. 8267-8271. ISBN 978-1-4799-0355-9.
Detail

JANDA Miloš. Automatic Generation Of Pronunciation Dictionaries Based On Diarization. In: Proceedings of the 19th Conference Student EEICT 2013. Brno: Vysoké učení technické v Brně, 2013, s. 228-232. ISBN 978-80-214-4695-3.
Detail

MOTLÍČEK Petr, POVEY Daniel a KARAFIÁT Martin. Feature And Score Level Combination Of Subspace Gaussians In LVCSR Task. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, s. 7604-7608. ISBN 978-1-4799-0355-9.
Detail

VESELÝ Karel, GHOSHAL Arnab, BURGET Lukáš a POVEY Daniel. Sequence-discriminative Training of Deep Neural Networks. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, s. 2345-2349. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

KARAFIÁT Martin, GRÉZL František, HANNEMANN Mirko, VESELÝ Karel a ČERNOCKÝ Jan. BUT BABEL System for Spontaneous Cantonese. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, s. 2589-2593. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

RATH Shakti P., BURGET Lukáš, KARAFIÁT Martin, GLEMBEK Ondřej a ČERNOCKÝ Jan. A Region-specific Feature-space Transformation for Speaker Adaptation and Singularity Analysis of Jacobian Matrix. In: Proceedings of Interspeeech 2013. Lyon: International Speech Communication Association, 2013, s. 1228-1232. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

RATH Shakti P., POVEY Daniel, VESELÝ Karel a ČERNOCKÝ Jan. Improved Feature Processing for Deep Neural Networks. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, s. 109-113. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

SOUFIFAR Mehdi Mohammad, BURGET Lukáš, PLCHOT Oldřich, CUMANI Sandro a ČERNOCKÝ Jan. Regularized Subspace n-Gram Model for Phonotactic iVector Extraction. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, s. 74-78. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

CUMANI Sandro, BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, LAFACE Pietro, PLCHOT Oldřich a VASILAKAKIS Vasileios. Pairwise Discriminative Speaker Verification in the I -Vector Space. IEEE Transactions on Audio, Speech, and Language Processing, roč. 2013, č. 6, s. 1217-1227. ISSN 1558-7916.
Detail

TRESADERN Phil, COOTES Timothy F., POH Norman, MATĚJKA Pavel, HADID Abdenour, LÉVY Christophe, MCCOOL Christopher S. a MARCEL Sebastien. Mobile Biometrics: Combined Face and Voice Verification for a Mobile Platform. Pervasive Computing, roč. 12, č. 1, 2013, s. 79-87. ISSN 1536-1268.
Detail

GRÉZL František a KARAFIÁT Martin. Semi-Supervised Bootstrapping Approach For Neural Network Feature Extractor Training. In: Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013, s. 470-475. ISBN 978-1-4799-2755-5.
Detail

SZŐKE Igor, BURGET Lukáš, GRÉZL František a ONDEL Yang Lucas Antoine Francois. BUT SWS 2013 - Massive Parallel Approach. In: Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop. Barcelona: CEUR-WS.org, 2013, s. 1-2. ISSN 1613-0073.
Detail

ANGUERA Xavier, METZE Florian, BUZO Andi, SZŐKE Igor a RODRIGUEZ-FUENTES Luis J. The Spoken Web Search Task. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2013, s. 1-2. ISSN 1613-0073.
Detail

KARAKOS Damianos, SCHWARTZ Richard, TSAKALIDIS Stavros, ZHANG Le, RANJAN Shivesh, NG Tim, HSIAO Roger, NGUYEN Long, GRÉZL František, HANNEMANN Mirko, KARAFIÁT Martin, SZŐKE Igor a VESELÝ Karel a kol. Score Normalization and System Combination for Improved Keyword Spotting. In: Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013, s. 210-215. ISBN 978-1-4799-2755-5.
Detail

HSIAO Roger, NG Tim, GRÉZL František, KARAKOS Damianos, TSAKALIDIS Stavros, NGUYEN Long a SCHWARTZ Richard. Discriminative Semi-supervised Training for Keyword Search in Low Resource Languages. In: Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013, s. 440-445. ISBN 978-1-4799-2755-5.
Detail

VESELÝ Karel, HANNEMANN Mirko a BURGET Lukáš. Semi-supervised Training of Deep Neural Networks. In: Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013, s. 267-272. ISBN 978-1-4799-2755-5.
Detail

ZHILA Alisa, YIH Wen-tau, MEEK Christopher, MIKOLOV Tomáš a ZWEIG Geoffrey. Combining Heterogeneous Models for Measuring Relational Similarity. In: Proceedings of NAACL-HLT 2013. Atlanata, Georgia: Association for Computational Linguistics, 2013, s. 1000-1009. ISBN 978-1-937284-47-3.
Detail

KHOURY Elie S., VESNICER Boštjan, FRANCO-PEDROSO Javier, DIEZ Sánchez Mireia, CIPR Tomáš, SCHWARZ Petr, VAN Leeuwen David, PETROVSKA-DELACRETAZ Dijana, MATĚJKA Pavel, RODRIGUEZ-FUENTES Luis J., CHOLLET Gerard a MARCEL Sebastien a kol. The 2013 Speaker Recognition Evaluation in Mobile Environment. In: Proceedings of Biometrics (ICB), 2013 International Conference on. Madrid: IEEE Biometric Council, 2013, s. 1-8. ISBN 978-1-4799-0310-8.
Detail

GRÉZL František, CHALUPNÍČEK Kamil, KARAFIÁT Martin a VESELÝ Karel. Souhrnná zpráva k projektu "Dodání anotací akustických dat, akustického modelu, jazykového modelu a výslovnostního slovníku pro arabský jazyk" za rok 2013. Brno: Phonexia s.r.o., 2013.
Detail

GRÉZL František, KARAFIÁT Martin, VESELÝ Karel a ŽIŽKA Josef. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2013. Brno: ReplayWell, s. r. o., 2013.
Detail

BURGET Lukáš, PLCHOT Oldřich a SZŐKE Igor. 2013 Summary report of project "Processing and analysis of speech, automatic speaker identification". Brno: Raytheon BBN Technologies Corp., 2013.
Detail

MCLAREN Mitchell, ABRASH Victor, GRACIARENA Martin, LEI Yun a PEŠÁN Jan. Improving Robustness to Compressed Speech in Speaker Recognition. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, s. 3698-3702. ISBN 978-1-62993-443-3.
Detail

MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, SCHWARZ Milan, CIPR Tomáš, CUMANI Sandro, KUDLA Radim, SZŐKE Igor, SVOBODOVÁ Marie, MALÝ Květoslav a ČERNOCKÝ Jan. BUT HASR'12 Experience: Are developers of SRE Systems naive listeners?. Brno: Fakulta informačních technologií VUT v Brně, 2013.
Detail

2012

SOUFIFAR Mehdi Mohammad, CUMANI Sandro, BURGET Lukáš a ČERNOCKÝ Jan. Discriminative Classifiers for Phonotactic Language Recognition with iVectors. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012, s. 4853-4856. ISBN 978-1-4673-0044-5.
Detail

POVEY Daniel, HANNEMANN Mirko, BOULIANNE Gilles, BURGET Lukáš, GHOSHAL Arnab, JANDA Miloš, KARAFIÁT Martin, KOMBRINK Stefan, MOTLÍČEK Petr, QIAN Yanmin, RIEDHAMMER Korbinian, VESELÝ Karel a VU Ngoc Thang. Generating Exact Lattices in The WFST Framework. In: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012, s. 4213-4216. ISBN 978-1-4673-0044-5.
Detail

KOMBRINK Stefan, MIKOLOV Tomáš, KARAFIÁT Martin a BURGET Lukáš. Improving Language Models for ASR Using Translated In-domain Data. In: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012, s. 4405-4408. ISBN 978-1-4673-0044-5.
Detail

KARAFIÁT Martin, JANDA Miloš, ČERNOCKÝ Jan a BURGET Lukáš. Region Dependent Linear Transforms in Multilingual Speech Recognition. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012, s. 4885-4888. ISBN 978-1-4673-0044-5.
Detail

CUMANI Sandro, PLCHOT Oldřich a KARAFIÁT Martin. Independent Component Analysis and MLLR Transforms for Speaker Identification. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, s. 4365-4368. ISBN 978-1-4673-0044-5.
Detail

CUMANI Sandro, GLEMBEK Ondřej, BRUMMER Johan Nikolaas Langenhoven, DE Villiers Edward a LAFACE Pietro. Gender Independent Discriminative Speaker Recognition in I-Vector Space. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, s. 4361-4364. ISBN 978-1-4673-0044-5.
Detail

KOMBRINK Stefan, HANNEMANN Mirko a BURGET Lukáš. Out-of-Vocabulary Word Detection and Beyond. Detection and Identification of Rare Audiovisual Cues. Studies in Computational Intelligence, 384. Springer-Verlag Berlin Heidelberg: Springer Verlag, 2012, s. 57-65. ISBN 978-3-642-24033-1.
Detail

HAIN Thomas, BURGET Lukáš, DINES John, GARNER Phillip N., GRÉZL František, EL Hannani Asmaa, HUIJBREGTS Marijn, KARAFIÁT Martin, LINCOLN Mike a WAN Vincent. Transcribing Meetings with the AMIDA System. IEEE Transactions on Audio, Speech, and Language Processing, roč. 20, č. 2, 2012, s. 486-498. ISSN 1558-7916.
Detail

MOTLÍČEK Petr, VALENTE Fabio a SZŐKE Igor. Improving Acoustic Based Keyword Spotting Using LVCSR Lattices. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012, s. 4413-4416. ISBN 978-1-4673-0044-5.
Detail

METZE Florian, RAJPUT Nitendra, ANGUERA Xavier, DAVEL Marelie H., GRAVIER Guillaume, HEERDEN Charl van, MANTENA Gautam V., MUSCARIELLO Armando, PRAHALLAD Kishore, SZŐKE Igor a TEJEDOR Javier. The Spoken WEB Search Task At Mediaeval 2011. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, s. 5165-5168. ISBN 978-1-4673-0044-5.
Detail

LEI Yun, BURGET Lukáš, FERRER Luciana, GRACIARENA Martin a SCHEFFER Nicolas. Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, s. 4253-4256. ISBN 978-1-4673-0044-5.
Detail

MARTÍNEZ González David, BURGET Lukáš, FERRER Luciana a SCHEFFER Nicolas. Ivector-Based Prosodic System For Language Identification. In: Proc. International Conference on Acoustics, Speec. Kyoto: IEEE Signal Processing Society, 2012, s. 4861-4864. ISBN 978-1-4673-0044-5.
Detail

JANDA Miloš. Grapheme Based Speech Recognition. In: Proceedings of the 18th Conference STUDENT EEICT 2012. Brno: Vysoké učení technické v Brně, 2012, s. 441-445. ISBN 978-80-214-4460-7.
Detail

FERRER Luciana, BURGET Lukáš, PLCHOT Oldřich a SCHEFFER Nicolas. A Unified Approach for Audio Characterization and its Application to Speaker Recognition. In: Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, s. 317-323. ISBN 978-981-07-3093-2.
Detail

BOUSQUET Pierre-Michel, LARCHER Anthony, MATROUF Driss, BONASTRE Jean-Francois a PLCHOT Oldřich. Variance-Spectra based Normalization for I-vector Standard and Probabilistic Linear Discriminant Analysis. In: Proceedings of Odyssey 2012: The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, s. 157-164. ISBN 978-981-07-3093-2.
Detail

BRUMMER Johan Nikolaas Langenhoven, CUMANI Sandro, GLEMBEK Ondřej, KARAFIÁT Martin, MATĚJKA Pavel, PEŠÁN Jan, PLCHOT Oldřich, SOUFIFAR Mehdi Mohammad, DE Villiers Edward a ČERNOCKÝ Jan. Description and analysis of the Brno276 system for LRE2011. In: Proceedings of Odyssey 2012: The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, s. 216-223. ISBN 978-981-07-3093-2.
Detail

PLCHOT Oldřich, KARAFIÁT Martin, BRUMMER Johan Nikolaas Langenhoven, GLEMBEK Ondřej, MATĚJKA Pavel, DE Villiers Edward a ČERNOCKÝ Jan. Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification. In: Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, s. 330-333. ISBN 978-981-07-3093-2.
Detail

RATH Shakti P., KARAFIÁT Martin, GLEMBEK Ondřej a ČERNOCKÝ Jan. A factorized representation of FMLLR transform based on QR-decomposition. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, s. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
Detail

D'HARO Luis Fernando, GLEMBEK Ondřej, PLCHOT Oldřich, MATĚJKA Pavel, SOUFIFAR Mehdi Mohammad, CORDOBA Ricardo a ČERNOCKÝ Jan. Phonotactic Language Recognition using i-vectors and Phoneme Posteriogram Counts. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, s. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
Detail

MATĚJKA Pavel, PLCHOT Oldřich, SOUFIFAR Mehdi Mohammad, GLEMBEK Ondřej, D'HARO Luis Fernando, VESELÝ Karel, GRÉZL František, MA Jeff, MATSOUKAS Spyros a DEHAK Najim. Patrol Team Language Identification System for DARPA RATS P1 Evaluation. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, s. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
Detail

NG Tim, ZHANG Bing, NGUYEN Long, MATSOUKAS Spyros, ZHOU Xinhui, MESGARANI Nima, VESELÝ Karel a MATĚJKA Pavel. Developing a Speech Activity Detection System for the DARPA RATS Program. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, s. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
Detail

VESELÝ Karel, KARAFIÁT Martin, GRÉZL František, JANDA Miloš a EGOROVA Ekaterina. The Language-Independent Bottleneck Features. In: Proceedings of IEEE 2012 Workshop on Spoken Language Technology. Miami: IEEE Signal Processing Society, 2012, s. 336-341. ISBN 978-1-4673-5124-9.
Detail

JANDA Miloš, KARAFIÁT Martin a ČERNOCKÝ Jan. Dealing with Numbers in Grapheme-Based Speech Recognition. In: Proceedings of 15th International Conference on Text, Speech and Dialogue. Lecture Notes in Computer Science, 2012, Volume 7499, roč. 2012. Springer-Verlag Berlin Heidelberg 2012: Springer Verlag, 2012, s. 438-445. ISBN 978-3-642-32789-6. ISSN 0302-9743.
Detail

MCCOOL Christopher S., MARCEL Sebastien, MATĚJKA Pavel, ČERNOCKÝ Jan, KITTLER Joseph, LARCHER Anthony, LÉVY Christophe, MATROUF Driss a BONASTRE Jean-Francois a kol. Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data. In: 2012 IEEE International Conference on Multimedia and Expo Workshops. Melbourne, Victoria: IEEE Computer Society, 2012, s. 635-640. ISBN 978-1-4673-2027-6.
Detail

DEORAS Anoop, MIKOLOV Tomáš, KOMBRINK Stefan a CHURCH Kenneth. Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model. Speech Communication, roč. 2012, č. 8, s. 1-16. ISSN 0167-6393.
Detail

SZŐKE Igor, FAPŠO Michal, ŽIŽKA Josef, BERAN Vítězslav a ČERNOCKÝ Jan. Efektivní přístup ke znalostem v audio-vizuálních záznamech. In: Proceedings of the Annual Database Conference. Praha: Technická univerzita v Košiciach, 2012, s. 57-74. ISBN 978-80-553-1049-7.
Detail

SZŐKE Igor, FAPŠO Michal a VESELÝ Karel. BUT2012 Approaches for Spoken Web Search - MediaEval 2012. In: Working Notes Proceedings of the MediaEval 2012 Workshop. Pisa: CEUR-WS.org, 2012, s. 1-2. ISSN 1613-0073.
Detail

TEJEDOR Javier, FAPŠO Michal, SZŐKE Igor, ČERNOCKÝ Jan a GRÉZL František. Comparison of methods for language-dependent and language-independent query-by-example spoken term detection. ACM Transactions on Information Systems (TOIS), roč. 2012, č. 30, s. 1-34. ISSN 1046-8188.
Detail

ČERNOCKÝ Jan. Dolování informací z mluvené řeči v BUT Speech@FIT. In: Hovory s informatiky 2012. Praha: Akademie věd ČR, 2012, s. 113-114. ISBN 978-80-87136-14-0.
Detail

LEI Yun, BURGET Lukáš a SCHEFFER Nicolas. Bilinear Factor Analysis for iVector Based Speaker Verification. In: Proceedings of Interspeech. Portland, Oregon: International Speech Communication Association, 2012, s. 1-4. ISBN 978-1-62276-759-5.
Detail

2011

ČERNOCKÝ Jan. MOBIO D1.3 - Annual Report. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2011.
Detail

POVEY Daniel, KARAFIÁT Martin, GHOSHAL Arnab a SCHWARZ Petr. A Symmetrization of the Subspace Gaussian Mixture Model. In: Proceedings of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing. Praha: IEEE Signal Processing Society, 2011, s. 4504-4507. ISBN 978-1-4577-0537-3.
Detail

BURGET Lukáš, PLCHOT Oldřich, CUMANI Sandro, GLEMBEK Ondřej, MATĚJKA Pavel a BRÜMMER Niko. Discriminatively Trained Probabilistic Linear Discriminant Analysis for Speaker Verification. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, s. 4832-4835. ISBN 978-1-4577-0537-3.
Detail

CUMANI Sandro, BRÜMMER Niko, BURGET Lukáš a LAFACE Pietro. Fast Discriminative Speaker Verification in the I-Vector Space. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, s. 4852-4855. ISBN 978-1-4577-0537-3.
Detail

GLEMBEK Ondřej, BURGET Lukáš, KENNY Patrick, KARAFIÁT Martin a MATĚJKA Pavel. Simplification and optimization of I-Vector Extraction. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, s. 4516-4519. ISBN 978-1-4577-0537-3.
Detail

KOCKMANN Marcel, FERRER Luciana, BURGET Lukáš, SHRIBERG Elisabeth a ČERNOCKÝ Jan. Recent Progress in Prosodic Speaker Verification. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, s. 4556-4559. ISBN 978-1-4577-0537-3.
Detail

MATĚJKA Pavel, GLEMBEK Ondřej, CASTALDO Fabio, ALAM Jahangir, PLCHOT Oldřich, KENNY Patrick, BURGET Lukáš a ČERNOCKÝ Jan. Full-covariance UBM and Heavy-tailed PLDA in I-Vector Speaker Verification. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, s. 4828-4831. ISBN 978-1-4577-0537-3.
Detail

MIKOLOV Tomáš, KOMBRINK Stefan, BURGET Lukáš, ČERNOCKÝ Jan a KHUDANPUR Sanjeev. Extensions of Recurrent Neural Network Language Model. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, s. 5528-5531. ISBN 978-1-4577-0537-3.
Detail

DEORAS Anoop, MIKOLOV Tomáš, KOMBRINK Stefan, KARAFIÁT Martin a KHUDANPUR Sanjeev. Variational Approximation of Long-span Language Models for LVCSR. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, s. 5532-5535. ISBN 978-1-4577-0537-3.
Detail

PEŠÁN Jan. Rozpoznávání mluvčího na mobilním telefonu. In: Proceedings of the 17th Conference Student EEICT 2011. Volume 2. Brno: Vysoké učení technické v Brně, 2011, s. 341-343. ISBN 978-80-214-4272-6.
Detail

POVEY Daniel, BURGET Lukáš, AGARWAL Mohit, AKYAZI Pinar, GHOSHAL Arnab, GLEMBEK Ondřej, GOEL Nagendra K., KARAFIÁT Martin, RASTROW Ariya, ROSE Richard, SCHWARZ Petr a THOMAS Samuel a kol. The subspace Gaussian mixture model-A structured model for speech recognition. Computer Speech and Language, roč. 25, č. 2, 2011, s. 404-439. ISSN 0885-2308.
Detail

KOCKMANN Marcel, BURGET Lukáš a ČERNOCKÝ Jan. Application of speaker- and language identification state-of-the-art techniques for emotion recognition. Speech Communication, roč. 53, č. 9, 2011, s. 1172-1185. ISSN 0167-6393.
Detail

DEORAS Anoop, MIKOLOV Tomáš a CHURCH Kenneth. A Fast Re-scoring Strategy to Capture Long-Distance Dependencies. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing July 2011 Edinburgh, Scotland, UK. Edinburgh: Association for Computational Linguistics, 2011, s. 1116-1127. ISBN 978-1-937284-11-4.
Detail

KOMBRINK Stefan a MIKOLOV Tomáš. Recurrent Neural Network Language Modeling Applied to the Brno AMI/AMIDA 2009 Meeting Recognizer Setup. In: Proceedings of the 17th Conference STUDENT EEICT 2011. Volume 3. Brno: Vysoké učení technické v Brně, 2011, s. 527-531. ISBN 978-80-214-4273-3.
Detail

GRÉZL František. The Role of Neural Network Size in TRAP/HATS Feature Extraction. In: Proceedings Text, Speech and Dialogue 2011. LNAI 6836, roč. 2011. Plzeň: Springer Verlag, 2011, s. 315-322. ISBN 978-3-642-23537-5. ISSN 0302-9743.
Detail

GLEMBEK Ondřej, BURGET Lukáš, BRÜMMER Niko, PLCHOT Oldřich a MATĚJKA Pavel. Discriminatively Trained i-vector Extractor for Speaker Verification. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, s. 137-140. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

KOCKMANN Marcel, FERRER Luciana, BURGET Lukáš a ČERNOCKÝ Jan. iVector Fusion of Prosodic and Cepstral Features for Speaker Verification. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, s. 265-268. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

MARTÍNEZ González David, PLCHOT Oldřich, BURGET Lukáš, GLEMBEK Ondřej a MATĚJKA Pavel. Language Recognition in iVectors Space. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, s. 861-864. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

GRÉZL František a KARAFIÁT Martin. Integrating recent MLP feature extraction techniques into TRAP architecture. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, s. 1229-1232. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

BOŘIL Hynek, GRÉZL František a HANSEN John H. Front-End Compensation Methods for LVCSR Under Lombard Effect. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, s. 1257-1260. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

SOUFIFAR Mehdi, KOCKMANN Marcel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej a SVENDSEN Torbjorn. iVector Approach to Phonotactic Language Recognition. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, s. 2913-2916. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

MIKOLOV Tomáš, DEORAS Anoop, KOMBRINK Stefan, BURGET Lukáš a ČERNOCKÝ Jan. Empirical Evaluation and Combination of Advanced Language Modeling Techniques. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, s. 605-608. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

KOMBRINK Stefan, MIKOLOV Tomáš, KARAFIÁT Martin a BURGET Lukáš. Recurrent Neural Network based Language Modeling in Meeting Recognition. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, s. 2877-2880. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

KARAFIÁT Martin, BURGET Lukáš, MATĚJKA Pavel, GLEMBEK Ondřej a ČERNOCKÝ Jan. iVector-Based Discriminative Adaptation for Automatic Speech Recognition. In: Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011, s. 152-157. ISBN 978-1-4673-0366-8.
Detail

VESELÝ Karel, KARAFIÁT Martin a GRÉZL František. Convolutive Bottleneck Network Features for LVCSR. In: Proceedings of ASRU 2011. Big Island, Hawaii: IEEE Signal Processing Society, 2011, s. 42-47. ISBN 978-1-4673-0366-8.
Detail

GRÉZL František, KARAFIÁT Martin a JANDA Miloš. Study of Probabilistic and Bottle-Neck Features in Multilingual Environment. In: Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011, s. 359-364. ISBN 978-1-4673-0366-8.
Detail

MIKOLOV Tomáš, DEORAS Anoop, POVEY Daniel, BURGET Lukáš a ČERNOCKÝ Jan. Strategies for Training Large Scale Neural Network Language Models. In: Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011, s. 196-201. ISBN 978-1-4673-0366-8.
Detail

ČERNOCKÝ Jan, SZŐKE Igor, HANNEMANN Mirko, KOMBRINK Stefan a FAPŠO Michal. Hybrid Word-Subword Speech Recognition - a Powerful Tool to Search in Speech. Proceedings of 21st International Conference Radioelektronika 2011. Brno: Ústav radioelektroniky FEKT VUT v Brně, 2011. ISBN 978-1-61284-322-3.
Detail

MIKOLOV Tomáš, KOMBRINK Stefan, DEORAS Anoop, BURGET Lukáš a ČERNOCKÝ Jan. RNNLM - Recurrent Neural Network Language Modeling Toolkit. In: Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011, s. 1-4. ISBN 978-1-4673-0366-8.
Detail

FERRER Luciana, BRATT Harry, BURGET Lukáš, ČERNOCKÝ Jan, GLEMBEK Ondřej, GRACIARENA Martin, LAWSON Aaron, LEI Yun, MATĚJKA Pavel, PLCHOT Oldřich a SCHEFFER Nicolas. Promoting robustness for speaker modeling in the community: the PRISM evaluation set. In: Proceedings of SRE11 Analysis Workshop in 2011. Atlanta, Georga, 2011, s. 1-7.
Detail

POVEY Daniel, GHOSHAL Arnab, BOULIANNE Gilles, BURGET Lukáš, GLEMBEK Ondřej, GOEL Nagendra K., HANNEMANN Mirko, MOTLÍČEK Petr, QIAN Yanmin, SCHWARZ Petr, SILOVSKÝ Jan, STEMMER Georg a VESELÝ Karel. The Kaldi Speech Recognition Toolkit. In: Proceedings of ASRU 2011. Hilton Waikoloa Village Resort, Hawaii: IEEE Signal Processing Society, 2011, s. 1-4. ISBN 978-1-4673-0366-8.
Detail

2010

ČERNOCKÝ Jan a ŠEVEČKOVÁ Michaela. Korpusové a hlasové technologie v nové generaci elektronických slovníků - závěrečná technická zpráva. Brno: Ministerstvo průmyslu a obchodu ČR, 2010.
Detail

ŽIŽKA Josef, ČERNOCKÝ Jan, FAPŠO Michal a SZŐKE Igor. Web-Based Lecture Browser with Speech Search. In: Znalosti 2010. Sborník příspěvků 9. ročníku konference. Jindřichův Hradec: Fakulta managementu a informací VŠE, 2010, s. 287-290. ISBN 978-80-245-1636-3.
Detail

SANTHOSH Kumar Chellappan Pillai, LI Haizhou, TONG Rong, MATĚJKA Pavel, BURGET Lukáš a ČERNOCKÝ Jan. Tuning phone decoders for language identification. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2010. Dallas: IEEE Signal Processing Society, 2010, s. 5010-5013. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

BURGET Lukáš, SCHWARZ Petr, AGARWAL Mohit, AKYAZI Pinar, FENG Kai, GHOSHAL Arnab, GLEMBEK Ondřej, GOEL Nagendra K., KARAFIÁT Martin, POVEY Daniel, RASTROW Ariya, ROSE Richard a THOMAS Samuel. Multilingual acoustic modeling for speech recognition based on Subspace Gaussian Mixture Models. In: Proc. International Conference on Acoustictics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, s. 4334-4337. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

GHOSHAL Arnab, POVEY Daniel, AGARWAL Mohit, AKYAZI Pinar, BURGET Lukáš, FENG Kai, GLEMBEK Ondřej, GOEL Nagendra K., KARAFIÁT Martin, RASTROW Ariya, ROSE Richard, SCHWARZ Petr a THOMAS Samuel. A novel estimation of feature-space MLLR for full-covariance models. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, s. 4310-4313. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

GOEL Nagendra K., THOMAS Samuel, AGARWAL Mohit, AKYAZI Pinar, BURGET Lukáš, FENG Kai, GHOSHAL Arnab, GLEMBEK Ondřej, KARAFIÁT Martin, POVEY Daniel, RASTROW Ariya, ROSE Richard a SCHWARZ Petr. Approaches to automatic LEXICON learning with limited training examples. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, s. 5094-5097. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

KOCKMANN Marcel, BURGET Lukáš a ČERNOCKÝ Jan. Investigations into prosodic syllable contour features for speaker recognition. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, s. 4418-4421. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

POVEY Daniel, BURGET Lukáš, AGARWAL Mohit, AKYAZI Pinar, FENG Kai, GHOSHAL Arnab, GLEMBEK Ondřej, GOEL Nagendra K., KARAFIÁT Martin, RASTROW Ariya, ROSE Richard, SCHWARZ Petr a THOMAS Samuel. Subspace Gaussian mixture models for speech recognition. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, s. 4330-4333. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

ROSE Richard, NOROUZIAN Atta, REDDY Aarthi, COY Andre, GUPTA Vishwa a KARAFIÁT Martin. Subword-based spoken term detection in audio course lectures. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, s. 5282-5285. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

MIKOLOV Tomáš, PLCHOT Oldřich, GLEMBEK Ondřej, MATĚJKA Pavel, BURGET Lukáš a ČERNOCKÝ Jan. PCA-based Feature Extraction for Phonotactic Language Recognition. In: Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010, s. 251-255. ISBN 978-80-214-4114-9.
Detail

JANČÍK Zdeněk, PLCHOT Oldřich, BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, GLEMBEK Ondřej, HUBEIKA Valiantsina, KARAFIÁT Martin, MATĚJKA Pavel, MIKOLOV Tomáš, STRASHEIM Albert a ČERNOCKÝ Jan. Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system. In: Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010, s. 215-221. ISBN 978-80-214-4114-9.
Detail

VESELÝ Karel, BURGET Lukáš a GRÉZL František. Parallel Training of Neural Networks for Speech Recognition. In: Prof. Text, Speech and Dialogue 2010. LNAI 6231, roč. 2010. Brno: Springer Verlag, 2010, s. 439-446. ISBN 978-3-642-15759-2. ISSN 0302-9743.
Detail

KARAFIÁT Martin, SZŐKE Igor a ČERNOCKÝ Jan. Using Gradient Descent Optimization for Acoustic Training from Heterogeneous Data. In: Proc. Text, Speech and Dialog 2010. LNAI 6231, roč. 2010. Brno: Springer Verlag, 2010, s. 322-329. ISBN 978-3-642-15759-2. ISSN 0302-9743.
Detail

KOMBRINK Stefan, HANNEMANN Mirko, BURGET Lukáš a HEŘMANSKÝ Hynek. Recovery of Rare Words in Lecture Speech. In: Proc. Text, Speech and Dialogue 2010. Brno: Springer Verlag, 2010, s. 330-337. ISBN 978-3-642-15759-2. ISSN 0302-9743.
Detail

VESELÝ Karel. Parallel training of neural networks for speech recognition. In: Proceedings of the 16th Conference STUDENT EEICT 2010. Volume 3. Brno: Vysoké učení technické v Brně, 2010, s. 74-76. ISBN 978-80-214-4078-4.
Detail

BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, KENNY Patrick, MATĚJKA Pavel, DE Villiers Edward, KARAFIÁT Martin, KOCKMANN Marcel, GLEMBEK Ondřej, PLCHOT Oldřich, BAUM Doris a SENOUSSAUOI Mohammed. ABC System description for NIST SRE 2010. In: Proc. NIST 2010 Speaker Recognition Evaluation. Brno: National Institute of Standards and Technology, 2010, s. 1-20.
Detail

HANNEMANN Mirko, KOMBRINK Stefan, KARAFIÁT Martin a BURGET Lukáš. Similarity Scoring for Recognizing Repeated Out-of-VocabularyWords. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, s. 897-900. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

KOCKMANN Marcel, BURGET Lukáš, GLEMBEK Ondřej, FERRER Luciana a ČERNOCKÝ Jan. Prosodic Speaker Verification using Subspace Multinomial Models with Intersession Compensation. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba, Japan: International Speech Communication Association, 2010, s. 1061-1064. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

KOCKMANN Marcel, BURGET Lukáš a ČERNOCKÝ Jan. Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, s. 2822-2825. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

MIKOLOV Tomáš, KARAFIÁT Martin, BURGET Lukáš, ČERNOCKÝ Jan a KHUDANPUR Sanjeev. Recurrent neural network based language model. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, s. 1045-1048. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

GRÉZL František a KARAFIÁT Martin. Hierarchical Neural Net Architectures for Feature Extraction in ASR. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, s. 1201-1204. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

VESELÝ Karel, BURGET Lukáš a GRÉZL František. Parallel Training of Neural Networks for Speech Recognition. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, s. 2934-2937. ISSN 1990-9772.
Detail

HAIN Thomas, BURGET Lukáš, DINES John, GARNER Phillip N., EL Hannani Asmaa, HUIJBREGTS Marijn, KARAFIÁT Martin, LINCOLN Mike a WAN Vincent. The AMIDA 2009 Meeting Transcription System. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, s. 358-361. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

KOMBRINK Stefan, HANNEMANN Mirko a BURGET Lukáš. Out-of-vocabulary word detection and beyond. In: ECML PKDD 2010 Proceedings and Journal Content. Barcelona, 2010, s. 1-8.
Detail

ČERNOCKÝ Jan, SZŐKE Igor, HANNEMANN Mirko a KOMBRINK Stefan. Word-subword based keyword spotting with implications in OOV detection. Pacific Grove: Institute of Electrical and Electronics Engineers, 2010.
Detail

MARCEL Sebastien, MCCOOL Christopher S., MATĚJKA Pavel, ČERNOCKÝ Jan, KITTLER Joseph, GLEMBEK Ondřej, PLCHOT Oldřich, JANČÍK Zdeněk, LARCHER Anthony a LÉVY Christophe a kol. On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation. In: Recognizing Patterns in Signals, Speech, Images, and Videos. Lecture Notes in Computer Science, roč. 6388. Istanbul: Springer Verlag, 2010, s. 210-225. ISBN 978-3-642-17710-1. ISSN 0302-9743.
Detail

SZŐKE Igor, GRÉZL František, ČERNOCKÝ Jan a FAPŠO Michal. Acoustic keyword spotter - optimization from end-user perspective. In: Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010, s. 177-181. ISBN 978-1-4244-7902-3.
Detail

SZŐKE Igor, ČERNOCKÝ Jan, FAPŠO Michal a ŽIŽKA Josef. SPEECH@FIT LECTURE BROWSER. In: Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010, s. 157-158. ISBN 978-1-4244-7902-3.
Detail

TEJEDOR Javier, SZŐKE Igor a FAPŠO Michal. Novel Methods for Query Selection and Query Combination in Query-By-Example Spoken Term Detection. In: Proceedings of the ACM Multimedia 2010 International Conference. Copyright 2010 ACM 978-1-4503-0162-6/10/10. Florencie: Association for Computing Machinery, 2010, s. 15-20. ISBN 978-1-60558-933-6.
Detail

KOMBRINK Stefan a HANNEMANN Mirko. DIRAC D2.16 - Final system for identifying unexpected acoustic inputs (BUT). Brno: The Information Society Technologies (IST) 6th Framework programme, 2010.
Detail

MARCEL Sebastien a MATĚJKA Pavel. MOBIO D6.6: Report on the MOBIO Final Prototypes. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2010.
Detail

ČERNOCKÝ Jan. MOBIO D7.4: Second report on dissemination activities. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2010.
Detail

MARCEL Sebastien, MCCOOL Christopher S., ČERNOCKÝ Jan, LÉVY Christophe a LARCHER Anthony a kol. MOBIO D1.2: Annual Report. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2010.
Detail

2009

DEHAK Najim, KENNY Patrick, DEHAK Reda, GLEMBEK Ondřej, DUMOUCHEL Pierre, BURGET Lukáš, HUBEIKA Valiantsina a CASTALDO Fabio. Support vector machines and joint factor analysis for speaker verification. In: Proc. ICASSP 2009. Taiwan: IEEE Signal Processing Society, 2009, s. 1-4. ISBN 978-1-4244-2354-5.
Detail

MIKOLOV Tomáš, KOPECKÝ Jiří, BURGET Lukáš, GLEMBEK Ondřej a ČERNOCKÝ Jan. Neural network based language models for highly inflective languages. In: Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009, s. 4. ISBN 978-1-4244-2354-5.
Detail

GLEMBEK Ondřej, BURGET Lukáš, DEHAK Najim, BRÜMMER Niko a KENNY Patrick. Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis. In: Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009, s. 4. ISBN 978-1-4244-2354-5.
Detail

HUBEIKA Valiantsina. Speaker verification as a target-nontarget trial task. In: Proceedings of the 15th Conference and Competition STUDENT EEICT 2009. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2009, s. 5. ISBN 978-80-214-3870-5.
Detail

KOMBRINK Stefan, BURGET Lukáš, MATĚJKA Pavel, KARAFIÁT Martin a HEŘMANSKÝ Hynek. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, s. 80-83. ISSN 1990-9772.
Detail

GRÉZL František, KARAFIÁT Martin a BURGET Lukáš. Investigation into bottle-neck features for meeting speech recognition. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, s. 2947-2950. ISBN 978-1-61567-692-7. ISSN 1990-9772.
Detail

GARNER Phillip N., DINES John, HAIN Thomas, EL Hannani Asmaa, KARAFIÁT Martin, KORCHAGIN Danil, LINCOLN Mike, WAN Vincent a ZHANG Le. Real-Time ASR from Meetings. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, s. 2119-2122. ISSN 1990-9772.
Detail

BURGET Lukáš, MATĚJKA Pavel, HUBEIKA Valiantsina a ČERNOCKÝ Jan. Investigation into variants of Joint Factor Analysis for speaker recognition. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, s. 1263-1266. ISBN 978-1-61567-692-7. ISSN 1990-9772.
Detail

BURGET Lukáš, FAPŠO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIÁT Martin, KOCKMANN Marcel, MATĚJKA Pavel, SCHWARZ Petr a ČERNOCKÝ Jan. BUT system for NIST 2008 speaker recognition evaluation. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, s. 2335-2338. ISBN 978-1-61567-692-7. ISSN 1990-9772.
Detail

BRÜMMER Niko, STRASHEIM Albert, HUBEIKA Valiantsina, MATĚJKA Pavel, BURGET Lukáš a GLEMBEK Ondřej. Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, s. 2187-2190. ISBN 978-1-61567-692-7. ISSN 1990-9772.
Detail

KOCKMANN Marcel, BURGET Lukáš a ČERNOCKÝ Jan. Brno University of Technology System for Interspeech 2009 Emotion Challenge. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, s. 348-351. ISSN 1990-9772.
Detail

VILLALBA Lopez Jesus Antonio. Segmentation Experiments for NIST SRE. Brno: Fakulta informačních technologií VUT v Brně, 2009.
Detail

GRÉZL František a ČERNOCKÝ Jan. Audio Surveillance through Known Event Classification. Radioengineering, roč. 18, č. 4, 2009, s. 671-675. ISSN 1210-2512.
Detail

KAŠPAR Michal, ŠEVEČKOVÁ Michaela, CHALUPNÍČEK Kamil a ČERNOCKÝ Jan. Textové a řečové korpusy. Brno, 2009.
Detail

FAPŠO Michal, SZŐKE Igor a ČERNOCKÝ Jan. Hlasový přístup ke korpusům - experimenty. Brno: Ministerstvo průmyslu a obchodu ČR, 2009.
Detail

KAŠPAR Michal, PEŠÁN Jan, SZŐKE Igor, CHALUPNÍČEK Kamil a ČERNOCKÝ Jan. Technická zpráva k MPO projektu FT-TA3/006: Práce na Etapě 6: "Integrace". Brno: Ministerstvo průmyslu a obchodu ČR, 2009.
Detail

ČERNOCKÝ Jan, MATĚJKA Pavel a GLEMBEK Ondřej. MOBIO D3.4: Description and evaluation of advanced algorithms for uni-modal authentication. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2009.
Detail

ČERNOCKÝ Jan. MOBIO D7.3: First report on dissemination activities. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2009.
Detail

BRÜMMER Niko, BURGET Lukáš, GLEMBEK Ondřej, HUBEIKA Valiantsina, JANČÍK Zdeněk, KARAFIÁT Martin, MATĚJKA Pavel, MIKOLOV Tomáš, PLCHOT Oldřich a STRASHEIM Albert. BUT-AGNITIO System Description for NIST Language Recognition Evaluation 2009. In: Proceedings NIST 2009 Language Recognition Evaluation Workshop. Baltimore, Maryland, USA: National Institute of Standards and Technology, 2009, s. 1-7.
Detail

2008

BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, HANNEMANN Mirko, RASTROW Ariya, WHITE Christopher, KHUDANPUR Sanjeev, HEŘMANSKÝ Hynek a ČERNOCKÝ Jan. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008, s. 4. ISBN 1-4244-1484-9.
Detail

GRÉZL František a FOUSEK Petr. Optimizing bottle-neck features for LVCSR. In: 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, Nevada: IEEE Signal Processing Society, 2008, s. 4729-4732. ISBN 1-4244-1484-9.
Detail

PINTO Joel, SZŐKE Igor, PRASANNA S.R.M. a HEŘMANSKÝ Hynek. Fast Approximate Spoken Term Detection from Sequence of Phonemes. In: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore. Singapore: Association for Computing Machinery, 2008, s. 28-33. ISBN 978-90-365-2697-5.
Detail

JANČÍK Zdeněk. Modelování dynamiky prosodie pro rozpoznání řečníka. In: Proceedings of the 14th Conference STUDENT EEICT 2008. Volume 2. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2008, s. 67-69. ISBN 978-80-214-3615-2.
Detail

WHITE Christopher, ZWEIG Geoffrey, BURGET Lukáš, SCHWARZ Petr a HEŘMANSKÝ Hynek. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments. In: Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas: IEEE Signal Processing Society, 2008, s. 4. ISBN 1-4244-1484-9.
Detail

PLCHOT Oldřich, HUBEIKA Valiantsina, BURGET Lukáš, SCHWARZ Petr a MATĚJKA Pavel. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. In: Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008, s. 477-483. ISBN 978-3-540-87390-7.
Detail

KOPECKÝ Jiří, GLEMBEK Ondřej a KARAFIÁT Martin. Advances in Acoustic Modeling for the Recognition of Czech. In: Proc. 11th International Conference on Text, Speech and Dialogue. Lecture Notes in Computer Science, roč. 5246. Berlin: Springer Verlag, 2008, s. 357-363. ISBN 978-3-540-87390-7.
Detail

MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej, SCHWARZ Petr, HUBEIKA Valiantsina, FAPŠO Michal, MIKOLOV Tomáš, PLCHOT Oldřich a ČERNOCKÝ Jan. BUT language recognition system for NIST 2007 evaluations. In: Proc. Interspeech 2008. Brisbane, Australia: International Speech Communication Association, 2008, s. 4. ISSN 1990-9772.
Detail

HUBEIKA Valiantsina, BURGET Lukáš, MATĚJKA Pavel a SCHWARZ Petr. Discriminative Training and Channel Compensation for Acoustic Language Recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, s. 4. ISSN 1990-9772.
Detail

GLEMBEK Ondřej, MATĚJKA Pavel, BURGET Lukáš a MIKOLOV Tomáš. Advances in Phonotactic Language Recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, s. 4. ISSN 1990-9772.
Detail

KARAFIÁT Martin, BURGET Lukáš, HAIN Thomas a ČERNOCKÝ Jan. Discrimininative training of narrow band - wide band adaptated systems for meeting recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, s. 4. ISSN 1990-9772.
Detail

SZŐKE Igor, FAPŠO Michal, BURGET Lukáš a ČERNOCKÝ Jan. Hybrid word-subword decoding for spoken term detection. In: Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008, s. 4. ISBN 978-90-365-2697-5.
Detail

KOCKMANN Marcel a BURGET Lukáš. Syllable based Feature-Contours for Speaker Recognition. In: Proc. 14th International Workshop on Advances in Speech Technology. Maribor, 2008, s. 4.
Detail

BURGET Lukáš, FAPŠO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIÁT Martin, KOCKMANN Marcel, MATĚJKA Pavel, SCHWARZ Petr a ČERNOCKÝ Jan. BUT system description: NIST SRE 2008. In: Proc. 2008 NIST Speaker Recognition Evaluation Workshop. Montreal: National Institute of Standards and Technology, 2008, s. 1-4.
Detail

BURGET Lukáš, FAPŠO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIÁT Martin, KOCKMANN Marcel, MATĚJKA Pavel, SCHWARZ Petr a ČERNOCKÝ Jan. Brno University Of Technology - NIST 2008 SRE. Montreal, 2008.
Detail

MIKOLOV Tomáš. Language models for automatic speech recognition of Czech lectures. In: Proc. STUDENT EEICT 2008. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2008, s. 1-5. ISBN 978-80-214-3617-6.
Detail

SZŐKE Igor, BURGET Lukáš, ČERNOCKÝ Jan a FAPŠO Michal. Sub-word modeling of out of vocabulary words in spoken term detection. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, s. 4. ISBN 978-1-4244-3472-5.
Detail

KOCKMANN Marcel a BURGET Lukáš. Contour modeling of prosodic and acoustic features for speaker recognition. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, s. 4. ISBN 978-1-4244-3472-5.
Detail

OPARIN Ilya, GLEMBEK Ondřej, BURGET Lukáš a ČERNOCKÝ Jan. Morphological random forests for language modeling of inflectional languages. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, s. 4. ISBN 978-1-4244-3472-5.
Detail

BURGET Lukáš, BRÜMMER Niko, REYNOLDS Douglas, KENNY Patrick, PELECANOS Jason, VOGT Robbie, CASTALDO Fabio, DEHAK Najim, DEHAK Reda, GLEMBEK Ondřej, KARAM Zahi, NOECKER John Jr., NA Hye Young, COSTIN Ciprian C., HUBEIKA Valiantsina, KAJAREKAR Sachin, SCHEFFER Nicolas a ČERNOCKÝ Jan. Robust Speaker Recognition Over Varying Channels. Baltimore: Johns Hopkins University, 2008.
Detail

KOMBRINK Stefan. OOV detection in LVCSR using neural networks. In: Proc. STUDENT EEICT 2008. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2008, s. 3. ISBN 978-80-214-3617-6.
Detail

SZŐKE Igor, FAPŠO Michal a ČERNOCKÝ Jan. Hlasový přístup ke korpusům - studie. Brno: Ministerstvo průmyslu a obchodu ČR, 2008.
Detail

ČERNOCKÝ Jan a MATĚJKA Pavel. MOBIO D3.2: Report on the description and evaluation of baseline algorithms for unimodal authentication. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2008.
Detail

ČERNOCKÝ Jan a MATĚJKA Pavel. MOBIO D7.1: Planning of evaluation campaigns. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2008.
Detail

2007

GRÉZL František, KARAFIÁT Martin, KONTÁR Stanislav a ČERNOCKÝ Jan. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, s. 757-760. ISBN 1-4244-0728-1.
Detail

MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr, GLEMBEK Ondřej, KARAFIÁT Martin, GRÉZL František, ČERNOCKÝ Jan, VAN Leeuwen David, BRÜMMER Niko a STRASHEIM Albert. STBU system for the NIST 2006 speaker recognition evaluation. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, s. 221-224. ISBN 1-4244-0728-1.
Detail

BURGET Lukáš, MATĚJKA Pavel, SCHWARZ Petr, GLEMBEK Ondřej a ČERNOCKÝ Jan. Analysis of feature extraction and channel compensation in GMM speaker recognition system. IEEE Transactions on Audio, Speech, and Language Processing, roč. 15, č. 7, 2007, s. 1979-1986. ISSN 1558-7916.
Detail

GRÉZL František, KARAFIÁT Martin a ČERNOCKÝ Jan. Neural network topologies and bottle neck features in speech recognition. Brno, 2007.
Detail

GRÉZL František a ČERNOCKÝ Jan. TRAP-based Techniques for Recognition of Noisy Speech. In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). LNCS. Berlin: Springer Verlag, 2007, s. 270-277. ISBN 978-3-540-74627-0.
Detail

KARAFIÁT Martin, BURGET Lukáš, ČERNOCKÝ Jan a HAIN Thomas. Application of CMLLR in narrow band wide band adapted systems. In: Proc. INTERSPEECH 2007. Antwerpen: International Speech Communication Association, 2007, s. 4. ISSN 1990-9772.
Detail

SINISCALCHI Sabato M., SCHWARZ Petr a LEE Chin-Hui. High-accuracy phone recognition by combining high performance lattice generation and knowledge based rescoring. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, s. 869-872. ISBN 1-4244-0728-1.
Detail

HAIN Thomas, WAN Vincent, BURGET Lukáš, KARAFIÁT Martin, DINES John, VEPA Jithendra, GARAU Giulia a LINCOLN Mike. The AMI System for the Transcription of Speech in Meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, s. 357-360. ISBN 1-4244-0728-1.
Detail

ČERNOCKÝ Jan, SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, KOPECKÝ Jiří, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, OPARIN Ilya, SMRŽ Pavel a MATĚJKA Pavel. Search in speech for public security and defense. In: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07). Washington D.C.: IEEE Signal Processing Society, 2007, s. 1-7. ISBN 1-4244-1226-9.
Detail

ČERNOCKÝ Jan, BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, KARAFIÁT Martin, GLEMBEK Ondřej, KOPECKÝ Jiří, SZŐKE Igor, FAPŠO Michal, GRÉZL František, HUBEIKA Valiantsina a OPARIN Ilya. Search in speech, language identification and speaker recognition in Speech@FIT. In: Proc. 17th International Conference Radioelektronika, 2007. Brno: Ústav radioelektroniky FEKT VUT v Brně, 2007, s. 1-6. ISBN 978-80-214-3390-8.
Detail

SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, MATĚJKA Pavel, KOPECKÝ Jiří a ČERNOCKÝ Jan. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno, 2007.
Detail

SZŐKE Igor, BURGET Lukáš a KARAFIÁT Martin. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno, 2007.
Detail

HUBEIKA Valiantsina, BURGET Lukáš, MATĚJKA Pavel a ČERNOCKÝ Jan. Channel Compensation for Speaker Recognition. Brno, 2007.
Detail

HUBEIKA Valiantsina, SZŐKE Igor, BURGET Lukáš a ČERNOCKÝ Jan. Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System. In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). Pilsen: Springer Verlag, 2007, s. 1-6. ISBN 978-3-540-74627-0.
Detail

BRÜMMER Niko, BURGET Lukáš, ČERNOCKÝ Jan, GLEMBEK Ondřej, GRÉZL František, KARAFIÁT Martin, VAN Leeuwen David, MATĚJKA Pavel, SCHWARZ Petr a STRASHEIM Albert. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech, and Language Processing, roč. 15, č. 7, 2007, s. 2072-2084. ISSN 1558-7916.
Detail

HUBEIKA Valiantsina. Estimation of gender and age. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2007, s. 1-3. ISBN 9788021434103.
Detail

FAPŠO Michal. Search in speech records. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2007, s. 1-3. ISBN 978-80-214-3410-3.
Detail

VESELÝ Karel. Hybrid recognizer of isolated words. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2007, s. 1-3. ISBN 9788021434103.
Detail

HRDLIČKA Pavel. Rozpoznávání izolovaných slov. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2007, s. 1-3. ISBN 9788021434103.
Detail

MIKOLOV Tomáš. Language modeling of Czech using neural networks. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2007, s. 1-3. ISBN 9788021434103.
Detail

MIKOLOV Tomáš, OPARIN Ilya, GLEMBEK Ondřej, BURGET Lukáš, KARAFIÁT Martin a ČERNOCKÝ Jan. Použití mluvených korpusů ve vývoji systému pro rozpoznávání českých přednášek. Praha: Univerzita Karlova, 2007.
Detail

MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej, SCHWARZ Petr, HUBEIKA Valiantsina, FAPŠO Michal, MIKOLOV Tomáš a PLCHOT Oldřich. BUT system description for NIST LRE 2007. In: Proc. 2007 NIST Language Recognition Evaluation Workshop. Orlando: National Institute of Standards and Technology, 2007, s. 1-5.
Detail

HEŘMANSKÝ Hynek, BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, HANNEMANN Mirko, RASTROW Ariya, WHITE Christopher, KHUDANPUR Sanjeev a ČERNOCKÝ Jan. Recovery from Model Inconsistency in Multilingual Speech Recognition. Baltimore: Johns Hopkins University, 2007.
Detail

CHALUPNÍČEK Kamil, ČERNOCKÝ Jan, KOSTKA Martin, PAVELEK Tomáš a VŠIANSKÝ Jan. Automatické hodnocení výslovnosti. Brno: Ministerstvo průmyslu a obchodu ČR, 2007.
Detail

GRÉZL František, HRDLIČKA Pavel, VESELÝ Karel, CHALUPNÍČEK Kamil, ČERNOCKÝ Jan, KOSTKA Martin, PAVELEK Tomáš a VŠIANSKÝ Jan. Vyhledávání slovníkových hesel hlasem. Brno: Ministerstvo průmyslu a obchodu ČR, 2007.
Detail

2006

FAPŠO Michal, SMRŽ Pavel, SCHWARZ Petr, SZŐKE Igor, SCHWARZ Milan, ČERNOCKÝ Jan, KARAFIÁT Martin a BURGET Lukáš. Information Retrieval from Spoken Documents. In: Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006, s. 410-416. ISBN 3-540-32205-1.
Detail

FAPŠO Michal, SCHWARZ Petr, SZŐKE Igor, SMRŽ Pavel, SCHWARZ Milan, ČERNOCKÝ Jan, KARAFIÁT Martin a BURGET Lukáš. Search Engine for Information Retrieval from Speech Records. In: Proceedings of the Third International Seminar on Computer Treatment of Slavic and East European Languages. Bratislava, 2006, s. 100-101.
Detail

SZŐKE Igor. Keyword Spotting in Meeting Data. In: Proceedings of the 12th Conference Student EEICT 2006 Volume 4. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2006, s. 440-444. ISBN 80-214-3163-6.
Detail

BURGET Lukáš, ČERNOCKÝ Jan, FAPŠO Michal, KARAFIÁT Martin, MATĚJKA Pavel, SCHWARZ Petr, SMRŽ Pavel a SZŐKE Igor. Indexing and search methods for spoken documents. In: Proceedings of the Ninth International Conference on Text, Speech and Dialogue, TSD 2006. LNCS. Berlin: Springer Verlag, 2006, s. 351-358. ISSN 0302-9743.
Detail

MATĚJKA Pavel, SCHWARZ Petr, BURGET Lukáš a ČERNOCKÝ Jan. Use of anti-models to furher improve state-of-the-art PRLM language recognition system. In: Proceedings of ICASSP 2006. Toulouse, 2006, s. 197-200.
Detail

BURGET Lukáš, MATĚJKA Pavel a ČERNOCKÝ Jan. Discriminative Training Techniques for Acoustic Language Identification. In: Proceedings of ICASSP 2006. Toulouse, 2006, s. 209-212.
Detail

SCHWARZ Petr, MATĚJKA Pavel a ČERNOCKÝ Jan. Hierarchical structures of neural networks for phoneme recognition. In: Proceedings of ICASSP 2006. Toulouse, 2006, s. 325-328.
Detail

MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr a ČERNOCKÝ Jan. Brno University of Technology System for NIST 2005 Language Recognition Evaluation. In: Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop. San Juan, 2006, s. 57-64. ISBN 1-4244-0472-X.
Detail

MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr a ČERNOCKÝ Jan. NIST Language Recognition Evaluation 2005. In: Proceedings of NIST LRE 2005. Washington DC: National Institute of Standards and Technology, 2006, s. 1-37.
Detail

MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr a ČERNOCKÝ Jan. NIST Speaker Recognition Evaluation 2006. In: Proceedings of NIST Speaker Recognition Evaluation 2006. San Juan: National Institute of Standards and Technology, 2006, s. 1-40.
Detail

KONTÁR Stanislav. Parallel training of neural networks for speech recognition. In: Proc. 12th International Conference on Soft Computing MENDEL'06. Brno: Vysoké učení technické v Brně, 2006, s. 6. ISBN 80-214-3195-4.
Detail

GLEMBEK Ondřej, KARAFIÁT Martin, BURGET Lukáš a ČERNOCKÝ Jan. Czech Speech Recognizer for Multiple Environments. In: Radioeletronika 2006. Bratislava, 2006, s. 1-4.
Detail

ČERNOCKÝ Jan, MATĚJKA Pavel, BURGET Lukáš a SCHWARZ Petr. Automatic Language Identification System. In: Sborník příspěvků z odborného semináře "Nové technologie v radiokomunikacích". Brno: Univerzita Obrany, 2006, s. 1-6.
Detail

HUBEIKA Valiantsina. Estimation of Gender and Age from Recorded Speech. In: Proc. ACM Student Research competition 2006. Prague: České vysoké učení technické, 2006, s. 25-32. ISBN 80-01-03595-6.
Detail

KARAFIÁT Martin, GRÉZL František, SCHWARZ Petr, BURGET Lukáš a ČERNOCKÝ Jan. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition. In: Proc. Fifth Slovenian and First International Language Technologies Conference. Ljubljana, 2006, s. 1-4.
Detail

SMRŽ Pavel. Uncertainty Extensions to Ontologies as a Tool for Semantic Interpretation in Audiovisual Systems. In: Proceedings of the 1st International Conference on Semantic and Digital Media Technologies, Poster and Demo. Athens, 2006, s. 27-28.
Detail

KARAFIÁT Martin, GRÉZL František, SCHWARZ Petr, BURGET Lukáš a ČERNOCKÝ Jan. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition. In: Proc. 3nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). Lecture Notes in Computer Science, roč. 4299. Berlin: Springer Verlag, 2006, s. 275-284. ISBN 3-540-69267-3.
Detail

AL-HAMES Marc, HAIN Thomas, ČERNOCKÝ Jan, SCHREIBER Sascha, POEL Mannes, MÜLLER Ronald, MARCEL Sebastien, VAN Leeuwen David, ODOBEZ Jean-Marc, BA Sileye, BOURLARD Herve, CARDINAUX Fabien, GATICA-PEREZ Daniel, JANIN Adam, MOTLÍČEK Petr, REITER Stephan, RENALS Steve, VAN Rest Jeroen, RIENKS Rutger, RIGOLL Gerhard, SMITH Kevin, THEAN Andrew a ZEMČÍK Pavel. Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. In: Proc. 3nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). Washington D.C., 2006, s. 12.
Detail

HAIN Thomas, BURGET Lukáš, DINES John, GARAU Giulia, KARAFIÁT Martin, LINCOLN Mike a WAN Vincent. The AMI Meeting Transcription System. In: Proc. NIST Rich Transcription 2006 Spring Meeting Recognition Evaluation Worskhop. Washington D.C.: National Institute of Standards and Technology, 2006, s. 12.
Detail

SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, MATĚJKA Pavel, KONTÁR Stanislav a ČERNOCKÝ Jan. BUT System for NIST STD 2006 - English. In: Proc. NIST SPoken Term Detection Evaluation workshop (STD 2006). Washington D.C.: National Institute of Standards and Technology, 2006, s. 26.
Detail

KOPECKÝ Jiří, SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, OPARIN Ilya, SCHWARZ Petr, MATĚJKA Pavel, ČERNOCKÝ Jan a GLEMBEK Ondřej. BUT System for NIST STD 2006 - Arabic. In: Proc. NIST SPoken Term Detection Evaluation workshop (STD 2006). Washington D.C.: National Institute of Standards and Technology, 2006, s. 15.
Detail

ČERNOCKÝ Jan, POTÚČEK Igor, SUMEC Stanislav a ZEMČÍK Pavel a kol. AMI Mobile Meeting Capture and Analysis System. Washington, 2006.
Detail

STOLCKE Andreas, GRÉZL František, HWANG Mei-Yuh, LEI Xin, MORGAN Nelson a VERGYRI Dimitra. Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons. In: 2006 IEEE International Conference on Acoustic, Speech, and Signal Processing. Toulouse: IEEE Signal Processing Society, 2006, s. 321-324. ISBN 978-3-540-74627-0.
Detail

2005

MATĚJKA Pavel, SCHWARZ Petr, ČERNOCKÝ Jan a CHYTIL Pavel. Tuning Phonotactic Language Identificaion System. Brno: Fakulta informačních technologií VUT v Brně, 2005.
Detail

MATĚJKA Pavel. Phoneme Recognition Tuning for Language Identification System. In: Proceedings of the 11th conference STUDENT EEICT 2005. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2005, s. 658-653. ISBN 80-214-2890-2.
Detail

MATĚJKA Pavel, SCHWARZ Petr, ČERNOCKÝ Jan a CHYTIL Pavel. Phonotactic Language Identification. In: Proceedings of Radioelektronika 2005. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2005, s. 140-143. ISBN 80-214-2904-6.
Detail

MATĚJKA Pavel, SCHWARZ Petr, ČERNOCKÝ Jan a CHYTIL Pavel. Phonotactic Language Identification using High Quality Phoneme Recognition. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisbon: International Speech Communication Association, 2005, s. 2237-2240. ISSN 1018-4074.
Detail

SZŐKE Igor. Smooth Pitch Tracker Based on Harmonic and Noise Model. In: STUDENT EEICT 2005. Brno: Fakulta informačních technologií VUT v Brně, 2005, s. 673-677. ISBN 80-214-2890-2.
Detail

SZŐKE Igor, SCHWARZ Petr, BURGET Lukáš, KARAFIÁT Martin a ČERNOCKÝ Jan. Phoneme based acoustics keyword spotting in informal continuous speech. In: Radioelektronika 2005. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2005, s. 195-198. ISBN 80-214-2904-6.
Detail

MOTLÍČEK Petr, BURGET Lukáš a ČERNOCKÝ Jan. VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION. In: Radioelektronika 2005. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2005, s. 187-190. ISBN 80-214-2904-6.
Detail

SMRŽ Pavel a FAPŠO Michal. Vyhledávání v záznamech přednášek. In: Sborník semináře Technologie pro e-vzdělávání. Praha: České vysoké učení technické, 2005, s. 21-26. ISBN 80-01-03274-4.
Detail

SZŐKE Igor, SCHWARZ Petr, BURGET Lukáš, KARAFIÁT Martin, MATĚJKA Pavel a ČERNOCKÝ Jan. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, roč. 2005, č. 3658, s. 8. ISSN 0302-9743.
Detail

SZŐKE Igor, SCHWARZ Petr, BURGET Lukáš, FAPŠO Michal, KARAFIÁT Martin, ČERNOCKÝ Jan a MATĚJKA Pavel. Comparison of Keyword Spotting Approaches for Informal Continuous Speech. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisabon, 2005, s. 633-636. ISSN 1018-4074.
Detail

SZŐKE Igor, SCHWARZ Petr, MATĚJKA Pavel, BURGET Lukáš, FAPŠO Michal, KARAFIÁT Martin a ČERNOCKÝ Jan. Comparison of Keyword Spotting Approaches for Informal Continuous Speech. In: 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Edinburgh, 2005, s. 12.
Detail

ZHU Qifeng, CHEN Barry, GRÉZL František a MORGAN Nelson. Improved MLP Structures for Data-Driven Feature Extraction for ASR. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisabon, 2005, s. 4. ISSN 1018-4074.
Detail

STOLCKE Andreas, ANGUERA Xavier, BOAKYE Kofi, CETIN Özgür, GRÉZL František, JANIN Adam, MANDAL Arindam, PESKIN Barbara, WOOTERS Chuck a ZHENG Jing. Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System. In: Machine Learning for Multimodal Interaction, Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers. Lecture Notes in Computer Science 3869, Springer 2006. Edinburgh, Scotland: University of Edinburgh, 2005, s. 463-475. ISBN 978-3-540-32549-9.
Detail

GRÉZL František. Spectral plane investigation for probabilistic features for ASR. Edinburgh, 2005.
Detail

FAPŠO Michal, SCHWARZ Petr, SZŐKE Igor, ČERNOCKÝ Jan, SMRŽ Pavel, BURGET Lukáš a KARAFIÁT Martin. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh, 2005.
Detail

KARAFIÁT Martin, BURGET Lukáš a ČERNOCKÝ Jan. Using Smoothed Heteroscedastic Linear Discriminant Analysis in Large Vocabulary Continuous Speech Recognition System. In: 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms. tento článek nebyl zařazen mezi Revised Selected Papers, nevyšel v LNCS 3869. Edinbourgh, Scotland: University of Edinburgh, 2005, s. 8.
Detail

HAIN Thomas, KARAFIÁT Martin, DINES John, MCCOWAN Iain, LINCOLN Mike, GARAU Giulia, WAN Vincent, ORDELMAN Roeland a RENALS Steve. The Development of the AMI System for the Transcription of Speech in Meetings. In: Machine Learning for Multimodal Interaction, Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers. Lecture Notes in Computer Science Volume 3869, Springer 2006. Edinburgh: University of Edinburgh, 2005, s. 344-356. ISBN 978-3-540-32549-9.
Detail

HAIN Thomas, KARAFIÁT Martin, GARAU Giulia, MOORE Darren, WAN Vincent, ORDELMAN Roeland a RENALS Steve. Transcription of Conference Room Meetings: an Investigation. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisabon: International Speech Communication Association, 2005, s. 4. ISSN 1018-4074.
Detail

HAIN Thomas, BURGET Lukáš, DINES John, GARAU Giulia, KARAFIÁT Martin, LINCOLN Mike, MCCOWAN Iain, MOORE Darren, WAN Vincent, ORDELMAN Roeland a RENALS Steve. The 2005 AMI System for the Transcription of Speech in Meetings. In: Machine Learning for Multimodal Interaction, Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers. Lecture Notes in Computer Science Volume 3869, Springer 2006. Edinburgh: University of Edinburgh, 2005, s. 450-462. ISBN 978-3-540-32549-9.
Detail

ASHBY Simone, BOURBAN Sebastien, CARLETTA Jean, FLYNN Mike, GUILLEMOT Mael, HAIN Thomas, KADLEC Jaroslav, KARAISKOS Vasilis, KRAAIJ Wessel, KRONENTHAL Melissa, LATHOUD Guillaume, LINCOLN Mike, LISOWSKA Agnes, MCCOWAN Iain, POST Wilfried, REIDSMA Dennis a WELLNER Pierre. The AMI Meeting Corpus: A Pre-Announcement. In: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI). Edinburgh, 2005, s. 4.
Detail

MOTLÍČEK Petr, BURGET Lukáš a ČERNOCKÝ Jan. Non-parametric Speaker Turn Segmentation of Meeting Data. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisabon: International Speech Communication Association, 2005, s. 657-660. ISSN 1018-4074.
Detail

ASHBY Simone, BOURBAN Sebastien, CARLETTA Jean, FLYNN Mike, GUILLEMOT Mael, HAIN Thomas, KADLEC Jaroslav, KARAISKOS Vasilis, KRAAIJ Wessel, KRONENTHAL Melissa, LATHOUD Guillaume, LINCOLN Mike, LISOWSKA Agnes, MCCOWAN Iain, POST Wilfried, REIDSMA Dennis a WELLNER Pierre. The AMI Meeting Corpus. In: Measuring Behavior 2005 Proceedings Book. Wageningen, 2005, s. 4.
Detail

ČERNOCKÝ Jan a LAMPA Petr. Teaching signals - making it automatic, making it fun. In: Proc. Radioelektronika 2005. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2005, s. 4. ISBN 80-214-2904-6.
Detail

GRÉZL František. Adaptation of Unknown Data to Already Trained Speech Recognition System. In: Sborník prací konference a souteze Student EEICT 2005. Brno: Fakulta informačních technologií VUT v Brně, 2005, s. 4. ISBN 80-214-2890-2.
Detail

ČERNOCKÝ Jan a CHALUPNÍČEK Kamil. Checks of speech transcriptions for AMI meeting database. In: 10th International Conference SPEECH and COMPUTER. Moscow, 2005, s. 587-590. ISBN 5-7452-0110-1.
Detail

CHALUPNÍČEK Kamil. Checks of Speech Annotation of AMI Meetings. In: Sborník prací konference a souteze Student EEICT 2005. Brno: Fakulta informačních technologií VUT v Brně, 2005, s. 612-616. ISBN 80-214-2890-2.
Detail

2004

GRÉZL František. Combinations of TRAP based systems. In: Proc. Seventh International conference on Text, Speech and Dialogue. Brno: Fakulta informatiky MU, 2004, s. 323-330. ISBN 3-540-23049-1.
Detail

MOTLÍČEK Petr. Modelování spektra a časových trajektorií v rozpoznávání řeči. GACR 102/02/0124 "Hlasové technologie v podpoře informační společnosti", souhrnný přehled aktivit řešitelských kolektivů. Praha, 2004. ISBN 80-01-02957-3.
Detail

SZŐKE Igor a MOTLÍČEK Petr. Kódování řeči na velmi nízkých bitových rychlostech. GACR 102/02/0124 "Hlasové technologie v podpoře informační společnosti", souhrnný přehled aktivit řešitelských klektivů. Praha: Fakulta elektrotechniky ČVUT, 2004. ISBN 80-01-02957-3.
Detail

SZŐKE Igor. Speech units automatically generated by ergodic hidden Markov model. In: Proceedings of 10th Conference and Competition STUDENT EEICT 2004. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2004, s. 5.
Detail

MATĚJKA Pavel, SZŐKE Igor, SCHWARZ Petr a ČERNOCKÝ Jan. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. In: Proceedings of 7th International Conference Text,Speech and Dialoque 2004. Brno: Springer Verlag, 2004, s. 8. ISBN 3-540-23049-1.
Detail

SCHWARZ Petr, MATĚJKA Pavel a ČERNOCKÝ Jan. Towards Lower Error Rates In Phoneme Recognition. Lecture Notes in Computer Science, roč. 2004, č. 3206, s. 465-472. ISBN 3-540-23049-1. ISSN 0302-9743.
Detail

MOTLÍČEK Petr, BURGET Lukáš a ČERNOCKÝ Jan. Phoneme Recognition of Meetings using Audio-Visual Data. AMI Workshop. Martigny, 2004.
Detail

KARAFIÁT Martin, GRÉZL František a ČERNOCKÝ Jan. TRAP based features for LVCSR of meeting data. In: Proc. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co,, 2004, s. 437-440. ISSN 1225-4111.
Detail

BURGET Lukáš. Combination of Speech Features Using Smoothed Heteroscedastic Linear Discriminant Analysis. In: Proc. 8th International Conference on Spoken Language Processing. Jeju island: Sunjin Printing Co,, 2004, s. 2549-2552.
Detail

MOTLÍČEK Petr a ČERNOCKÝ Jan. Multimodal Phoneme Recognition of Meeting Data. In: 7th International Conference, TSD 2004 Brno, Czech Republic, September 2004 Proceedings. Brno: Springer Verlag, 2004, s. 379-384. ISBN 3-540-23049-1. ISSN 0302-9743.
Detail

BURGET Lukáš. Measurement of Complementarity of Recognition Systems. In: Proc. Seventh International conference on Text, Speech and Dialogue. Lecture Notes in Artificial Intelligence (LNAI) subseries of LNCS series as Volume 3206. Brno: Springer Verlag, 2004, s. 283-290. ISBN 3-540-23049-1.
Detail

FOUSEK Petr, SVOJANOVSKÝ Petr, GRÉZL František a HEŘMANSKÝ Hynek. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. In: Proc. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co,, 2004, s. 348-351. ISSN 1225-4111.
Detail

MOTLÍČEK Petr. Visual Feature Extreaction for Phoneme Recognition of Meetings. Brno: Ústav počítačové grafiky a multimédií FIT VUT v Brně, 2004.
Detail

MATĚJKA Pavel, SZŐKE Igor, SCHWARZ Petr a ČERNOCKÝ Jan. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Lecture Notes in Computer Science, roč. 2004, č. 3206, s. 8. ISSN 0302-9743.
Detail

MATĚJKA Pavel, ČERNOCKÝ Jan a SIGMUND Milan. Introduction to Automatic Language Identification. In: Conference Proceedings of Radioelektronika 2004. Brno: Slovenská technická univerzita v Bratislavě, 2004, s. 4. ISBN 80-227-2017-8.
Detail

MATĚJKA Pavel. Review of Automatic Language Identification. In: Proceedings of 10th Conference and Competition STUDENT EEICTT 2004 Volume 2. Brno, 2004, s. 5. ISBN 80-214-2635-7.
Detail

MOTLÍČEK Petr. Segmentace nahrávek živých jednání podle mluvčího. In: Sborník příspěvků a prezentací akce Odborné semináře 2004. REL03V. Brno: Ústav radioelektroniky FEKT VUT v Brně, 2004, s. 28.
Detail

SCHWARZ Petr, MATĚJKA Pavel a ČERNOCKÝ Jan. Towards Lower Error Rates in Phoneme Recognition. In: Proceedings of 7th International Conference Text,Speech and Dialoque 2004. Brno: Springer Verlag, 2004, s. 8. ISBN 3-540-23049-1.
Detail

SCHWARZ Petr, MATĚJKA Pavel a ČERNOCKÝ Jan. Towards Lower Error Rates in Phoneme Recognition. Lecture Notes in Computer Science, roč. 2004, č. 3206, s. 8. ISSN 0302-9743.
Detail

SCHWARZ Petr, MATĚJKA Pavel a ČERNOCKÝ Jan. Phoneme Recognition from a Long Temporal Context. In: poster at JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Martigny: Insititut Dalle Molle d'Intelligence Artificielle Perceptive, 2004, s. 1-1.
Detail

MOTLÍČEK Petr a ČERNOCKÝ Jan. Multimodal Phoneme Recognition of Meeting Data. Lecture Notes in Computer Science, roč. 2004, č. 3206, s. 6. ISSN 0302-9743.
Detail

2003

MOTLÍČEK Petr. Derivation of TRAPs in Auditory Domain. In: Proceedings of 9th Conference and Competition STUDENT EEICT 2003. Brno: Děkanát FEKT VUT, 2003, s. 598-602. ISBN 80-214-2379-X.
Detail

JENDERKA Petr a VÍCHA Tomáš. Voice Activity Detection in Multimodal Meeting Manager. In: Proceedings of 9th Conference and Competition STUDENT EEICT 2003 Volume 3. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2003, s. 588-592. ISBN 80-214-2379-X.
Detail

SCHWARZ Petr, MATĚJKA Pavel a ČERNOCKÝ Jan. Recognition of Phoneme Strings using TRAP Technique. In: Proceedings of 8th International Conference Eurospeech. Geneve: International Speech Communication Association, 2003, s. 1-4. ISSN 1018-4074.
Detail

MOTLÍČEK Petr. Derivation of TRAPs in Auditory Domain. In: Proceedings of the International Conference and Competition. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2003, s. 315-319. ISBN 80-214-2401-X.
Detail

MOTLÍČEK Petr a ČERNOCKÝ Jan. Time-domain based Temporal Processing with Application of. In: Proc. EUROSPEECH 2003. Geneva: Insititut Dalle Molle d'Intelligence Artificielle Perceptive, 2003, s. 821-824. ISSN 1018-4074.
Detail

MOTLÍČEK Petr a ČERNOCKÝ Jan. Autoregressive Modeling based Feature Extraction for Aurora3 DSR Task. In: Proc. EUROSPEECH 2003. Geneva: Insititut Dalle Molle d'Intelligence Artificielle Perceptive, 2003, s. 1801-1804. ISSN 1018-4074.
Detail

MOTLÍČEK Petr a ČERNOCKÝ Jan. All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: Západočeská univerzita v Plzni, 2003, s. 295-300. ISBN 3-540-20024-X. ISSN 0302-9743.
Detail

SCHWARZ Petr. Would You Like To Make Your Programs Understand Human Voice?. In: Proceedings of 9th Conference STUDENT EEICT 2003. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2003, s. 231-235. ISBN 80-214-2379-X.
Detail

ČERNOCKÝ Jan. Temporal processing for feature extraction in speech recognition, shortened version of habilitation thesis. Vědecké spisy VUT. Edice Habilitační a inaugurační spisy, sv. 112. Brno: Nakladatelství Vysokého učení technického v Brně VUTIUM, 2003, s. 1-30. ISBN 80-214-2395-1.
Detail

MATĚJKA Pavel, SCHWARZ Petr, HEŘMANSKÝ Hynek a ČERNOCKÝ Jan. Phoneme Recognition using Temporal Patterns. In: Proc. 6th International Conference Text, Speech and Dialogue, TSD2003. Ceske Budejovice: Springer Verlag, 2003, s. 465-472. ISBN 3-540-20024-X.
Detail

MATĚJKA Pavel, SCHWARZ Petr, GRÉZL František a ČERNOCKÝ Jan. Phoneme Classification using Temporal Patterns. In: Proc. 13th International scientific conference Radioelektronika 2003. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2003, s. 1-4. ISBN 80-214-2383-8.
Detail

GRÉZL František. Local Time-Frequency Operators in TRAPs For Speech Recognition. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: Západočeská univerzita v Plzni, 2003, s. 269-274. ISBN 3-540-20024-X. ISSN 0302-9743.
Detail

GRÉZL František a HEŘMANSKÝ Hynek. Local averaging and differentiating of spectral plane for TRAP-based ASR. In: Proc. EUROSPEECH 2003. Geneva: Insititut Dalle Molle d'Intelligence Artificielle Perceptive, 2003, s. 4. ISSN 1018-4074.
Detail

GRÉZL František. Effect of normalization on TRAP based systems in ASR. In: Proc. 13th International scientific conference Radioelektronika 2003. Brno: Ústav radioelektroniky FEKT VUT v Brně, 2003, s. 128-131. ISBN 80-214-2383-8.
Detail

KARAFIÁT Martin a GRÉZL František. Using MATLAB for Analysis of TRAP system. Radioengineering, roč. 2003, č. 4, s. 38-41. ISSN 1210-2512.
Detail

MOTLÍČEK Petr. Modeling of Spectra and Temporal Trajectories in Speech Processing. In: Sborník příspěvků a prezentací akce Odborné semináře 2003 . REL02V. Brno: Ústav radioelektroniky FEKT VUT v Brně, 2003, s. 28.
Detail

BURGET Lukáš a ČERNOCKÝ Jan. Recognition of Speech with Non-random Attributes. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: Springer Verlag, 2003, s. 6. ISBN 3-540-20024-X. ISSN 0302-9743.
Detail

2002

BAUDOIN Genevieve, CAPMAN Francois, ČERNOCKÝ Jan, EL Chami Fadi, CHARBIT Maurice, CHOLLET Gerard a PETROVSKA-DELACRETAZ Dijana. Advances in very low bit-rate speech coding using recognition and synthesis techniques. Lecture Notes in Computer Science, roč. 2002, č. 2448, s. 269-276. ISBN 3-540-44129-8. ISSN 0302-9743.
Detail

MATĚJKA Pavel, SCHWARZ Petr, KARAFIÁT Martin a ČERNOCKÝ Jan. Some like it Gaussian... In: Proc. 5th International Conference Text, Speech and Dialogue, TSD2002. Lecture notes in artificial intelligence 2448. Berlin: Springer Verlag, 2002, s. 321-324. ISBN 3-540-44129-8.
Detail

SCHWARZ Petr a ČERNOCKÝ Jan. Keyword detection in Czech fluent speech. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovenská technická univerzita v Bratislavě, 2002, s. 4. ISBN 80-227-1700-2.
Detail

KARAFIÁT Martin a ČERNOCKÝ Jan. Context dependent Hidden Markov models in recognition of Czech. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovenská technická univerzita v Bratislavě, 2002, s. 4. ISBN 80-227-1700-2.
Detail

GRÉZL František, BURGET Lukáš, JAIN Pratibha a ČERNOCKÝ Jan. Improving TRAPS features using LDA. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovenská technická univerzita v Bratislavě, 2002, s. 4. ISBN 80-227-1700-2.
Detail

ČERNOCKÝ Jan. Units for automatic language independent speech processing. In: Proc. LREC 2002 - workshop on Portability issues in human language technologies. Las Palmas: European Language Resources Association, 2002, s. 7-13.
Detail

SCHWARZ Petr. Modifications of Viterbi algorithms for keyword detection. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2002, s. 4. ISBN 80-214-2116-9.
Detail

MOTLÍČEK Petr a BURGET Lukáš. Noise estimation for efficient speech enhancement and robust speech recognition. In: Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002, s. 1033-1036. ISBN 1-876346-42-6.
Detail

MOTLÍČEK Petr. Application of Mel-scale Filter bank for Noise Estimation in Speech Processing. In: 12th International Czech-Slovak Scientific conference Radioelektronika 2002. Bratislava: Slovenská technická univerzita v Bratislavě, 2002, s. 4. ISBN 80-227-1700-2.
Detail

MOTLÍČEK Petr a BURGET Lukáš. Efficient Noise Estimation and its Application for Robust Speech Recognition. In: 5th International Conference, TSD 2002 Brno, Czech Republic, September 2002 Proceedings. Berlin: Springer Verlag, 2002, s. 229-236. ISBN 3-540-44129-8.
Detail

MOTLÍČEK Petr. Noise Estimation for Spectral Subtraction in Speech Processing. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2002, s. 4. ISBN 80-214-2116-9.
Detail

BURGET Lukáš, MOTLÍČEK Petr, GRÉZL František a JAIN Pratibha. Distributed speech recognition. Radioengineering, roč. 2002, č. 4, s. 12-16. ISSN 1210-2512.
Detail

GARUDADRI Harinath, HEŘMANSKÝ Hynek, MORGAN Nelson, BENITEZ Carmen, BURGET Lukáš, KAJAREKAR Sachin, GRÉZL František, JAIN Pratibha a MOTLÍČEK Petr. Distributed Voice Recognition System Utilizing Multistream Network Feature Processing. San Diego: Qualcomm, 2002.
Detail

ČERNOCKÝ Jan a KARAFIÁT Martin. Differences between context dependent and context independent Hidden Markov Models for recognition of Czech. In: Proc. of 8th student conference STUDENT EEICT 2002. Brno: Fakulta elektrotechniky VUT, 2002, s. 5. ISBN 80-214-2116-9.
Detail

GRÉZL František. Classifiers in speech recognition systems based on TRAPS. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2002, s. 74-77. ISBN 80-214-2116-9.
Detail

MOTLÍČEK Petr. Feature Extraction in Speech Coding and Recognition. Portland: Oregon Graduate Institute of Science and Technology, 2002.
Detail

BURGET Lukáš, DUPONT Stephane, GARUDADRI Harinath, GRÉZL František, HEŘMANSKÝ Hynek, JAIN Pratibha, KAJAREKAR Sachin a MORGAN Nelson. QUALCOMM-ICSI-OGI Features for ASR. In: Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002, s. 4. ISBN 1-876346-42-6.
Detail

MATĚJKA Pavel a ČERNOCKÝ Jan. Feature gaussianization in speech recognition. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovenská technická univerzita v Bratislavě, 2002, s. 4. ISBN 80-227-1700-2.
Detail

ČERNOCKÝ Jan. Temporal processing for feature extraction in speech recognition, habilitation thesis. Brno, 2002.
Detail

2001

ČERNOCKÝ Jan, BAUDOIN Genevieve, PETROVSKA-DELACRETAZ Dijana a CHOLLET Gerard. Vers une analyse acoustico-phonetique de la parole independante de la langue, basee sur ALISP. Revue Parole, roč. 2001, č. 17, s. 191-226. ISSN 1373-1955.
Detail

HEUVEL Henk, BOUDY Jerome, BAKCSI Zoltan, ČERNOCKÝ Jan, GALUNOV Valerij, KOCHANINA Julia, MAJEWSKI Wojciech, POLLÁK Petr, RUSKO Milan, SADOWSKI Jerzy, STARONIEWICZ Piotr a TROPF Herbert. SpeechDat-East: Five multilingual speech databases for voice-operated teleservices completed. In: Proc. EUROSPEECH 2001. Aalborg: International Speech Communication Association, 2001, s. 4. ISBN 87-90834-09-7.
Detail

MOTLÍČEK Petr, BAUDOIN Genevieve, ČERNOCKÝ Jan a CHOLLET Gerard. Minimization of transition noise and HNM synthesis in very low bit rate speech coding. In: 4th International Conference, TSD 2001 Železná Ruda, Czech Republic, September 2001 Proceedings. Berlin: Springer Verlag, 2001, s. 305-312. ISBN 3-540-42557-8.
Detail

MOTLÍČEK Petr. Application of Re-segmentation in Very Low Bit Rate Speech Coding. In: Proceedings of 7th Conference STUDENT EEICT 2001. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2001, s. 269-274. ISBN 80-214-1860-5.
Detail

MOTLÍČEK Petr, GOURNAY Philipe, CHOLLET Gerard a BAUDOIN Genevieve. Codeur tres bas debit par indexation d'unites de parole de taille variable. In: GRETSI'01 on signal and image processing. Toulouse, 2001, s. 4.
Detail

2000

PETROVSKA-DELACRETAZ Dijana, ČERNOCKÝ Jan, HENNEBERT Jean a CHOLLET Gerard. Segmental Approaches for Automatic Speaker Verification. Digital signal processing, roč. 2000, č. 1, s. 198-212. ISSN 1052-2004.
Detail

BAUDOIN Genevieve, ČERNOCKÝ Jan, GOURNAY Philipe a CHOLLET Gerard. Codage de la parole a bas et tres bas debits. Annales des Telecommunications, roč. 2000, č. 9, s. 1-19. ISSN 0003-4347.
Detail

MOTLÍČEK Petr a ČERNOCKÝ Jan. Optimal Pitch Path Tracking for more reliable Pitch Detection. In: 3th International Conference, TSD 2000 Brno, Czech Republic, September 2000 Proceedings. Berlin: Springer Verlag, 2000, s. 183-188. ISBN 3-540-41042-2.
Detail

MOTLÍČEK Petr a BURGET Lukáš. RELIABILITY IMPROVEMENT OF SPEECH PITCH DETECTION USING PATHS. In: Volume of the Works written by Students and Postgraduate Students. Brno: Fakulta elektrotechniky a komunikačních technologií VUT v Brně, 2000, s. 348-351. ISBN 80-7204-155-X.
Detail

1999

ČERNOCKÝ Jan, POLLÁK Petr, HANŽL Václav, RUSKO Milan a TRNKA Marián. Recording of Czech and Slovak telephone databases within SpeechDat-E. Proc. Workshop on TEXT, SPEECH and DIALOG (TSD'99). Lecture Notes in Artificial Intelligence No. 1692. Berlin: Springer Verlag, 1999, s. 388-391. ISBN 3-540-66494-7.
Detail

1996

ČERNOCKÝ Jan. Multigram-based speech coding - concepts of the dissertation. Brno: Fakulta elektrotechniky a informatiky VUT, 1996.
Detail

Studijní oddělení

Výzkumná skupina dolování dat z řeči BUT Speech@FIT

https://speech.fit.vutbr.cz/

Publikace