Project Details
VVI e-INFRA CZ (ORJ 90254, kód MŠMT MSMT-54/2023)
Project Period: 1. 1. 2023 - 31. 12. 2026
Project Type: grant
Code: e-INFRA CZ
Agency: Ministry of Education, Youth and Sports Czech Republic
Program:
Type
grant
Team members
Publications
2024
- KUNEŠOVÁ Marie, ZAJÍC Zbyněk, ŠMÍDL Luboš and KARAFIÁT Martin. Comparison of wav2vec 2.0 models on three speech processing tasks. International Journal of Speech Technology, vol. 27, no. 4, 2024, pp. 1-13. ISSN 1572-8110. Detail
- HAN Jiangyu, LANDINI Federico Nicolás, ROHDIN Johan A., DIEZ Sánchez Mireia, BURGET Lukáš, CAO Yuhang, LU Heng and ČERNOCKÝ Jan. Diacorrect: Error Correction Back-End for Speaker Diarization. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 11181-11185. ISBN 979-8-3503-4485-1. Detail
- LANDINI Federico Nicolás, DIEZ Sánchez Mireia, STAFYLAKIS Themos and BURGET Lukáš. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio, Speech, and Language Processing, vol. 32, no. 7, 2024, pp. 3450-3465. ISSN 1558-7916. Detail
- KLEMENT Dominik, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, SILNOVA Anna, DELCROIX Marc and TAWARA Naohiro. Discriminative Training of VBx Diarization. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11871-11875. ISBN 979-8-3503-4485-1. Detail
- ZHANG Lin, STAFYLAKIS Themos, LANDINI Federico Nicolás, DIEZ Sánchez Mireia, SILNOVA Anna and BURGET Lukáš. Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, pp. 123-130. Detail
- ČEGIŇ Ján, PECHER Branislav, ŠIMKO Jakub, SRBA Ivan and BIELIKOVÁ Mária. Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024, pp. 13148-13171. ISBN 979-8-8917-6094-3. Detail
- PECHER Branislav, ČEGIŇ Ján, BELANEC Róbert, SRBA Ivan, ŠIMKO Jakub and BIELIKOVÁ Mária. Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation. In: Findings of the Association for Computational Linguistics: EMNLP 2024. Miami: Association for Computational Linguistics, 2024, pp. 11005-11044. ISBN 979-8-8917-6168-1. Detail
- BENEŠ Karel, KOCOUR Martin and BURGET Lukáš. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11276-11280. ISBN 979-8-3503-4485-1. Detail
- STAFYLAKIS Themos, SILNOVA Anna, ROHDIN Johan A., PLCHOT Oldřich and BURGET Lukáš. Challenging margin-based speaker embedding extractors by using the variational information bottleneck. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 3220-3224. ISSN 1990-9772. Detail
- MOŠNER Ladislav, SERIZEL Romain, BURGET Lukáš, PLCHOT Oldřich, VINCENT Emmanuel, PENG Junyi and ČERNOCKÝ Jan. Multi-Channel Extension of Pre-trained Models for Speaker Verification. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 2135-2139. ISSN 1990-9772. Detail
- YUSUF Bolaji, ČERNOCKÝ Jan and SARAÇLAR Murat. Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 5068-5072. ISSN 1990-9772. Detail
- PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, ASHIHARA Takanori, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKÝ Jan. Probing Self-Supervised Learning Models With Target Speech Extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 535-539. ISBN 979-8-3503-7451-3. Detail
- PEŠÁN Jan, JUŘÍK Vojtěch, RŮŽIČKOVÁ Alexandra, SVOBODA Vojtěch, JANOUŠEK Oto, NĚMCOVÁ Andrea, BOJANOVSKÁ Hana, ALDABAGHOVÁ Jasmína, KYSLÍK Filip, VODIČKOVÁ Kateřina, SODOMOVÁ Adéla, BARTYS Patrik, CHUDÝ Peter and ČERNOCKÝ Jan. Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals. Nature Scientific Data, vol. 11, no. 1, 2024, pp. 1-9. ISSN 2052-4463. Detail
- PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKÝ Jan. Target Speech Extraction with Pre-Trained Self-Supervised Learning Models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 10421-10425. ISBN 979-8-3503-4485-1. Detail
- YUSUF Bolaji and SARAÇLAR Murat. Written Term Detection Improves Spoken Term Detection. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 32, no. 06, 2024, pp. 3213-3223. ISSN 2329-9290. Detail