Department of Computer Graphics and Multimedia
2025
- LOJDA Jakub, STRNADEL Josef, SMRŽ Pavel and ŠIMEK Václav. Multi-Partner Project: LoLiPoP-IoT - Design and Simulation of Energy-Efficient Devices for the Internet of Things. In: Lyon: Institute of Electrical and Electronics Engineers, 2025, p. 7. Detail
2024
- PECHER Branislav, SRBA Ivan and BIELIKOVÁ Mária. A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness. ACM Computing Surveys, vol. 57, no. 1, 2024, pp. 1-40. ISSN 0360-0300. Detail
- ALAM Jahangir, BARAHONA Quirós Sara, BOBOŠ Dominik, BURGET Lukáš, CUMANI Sandro, DAHMANE Mohamed, HAN Jiangyu, HLAVÁČEK Miroslav, KODOVSKÝ Martin, LANDINI Federico Nicolás, MOŠNER Ladislav, PÁLKA Petr, PAVLÍČEK Tomáš, PENG Junyi, PLCHOT Oldřich, RAJASEKHAR Gnana Praveen, ROHDIN Johan A., SILNOVA Anna, STAFYLAKIS Themos and ZHANG Lin. ABC SYSTEM DESCRIPTION FOR NIST SRE 2024. In: Proceedings of NIST SRE 2024. San Juan: National Institute of Standards and Technology, 2024, pp. 1-9. Detail
- WANG Shuai, CHEN Zhengyang, HAN Bing, WANG Hongji, XIANG Xu, ROHDIN Johan A., SILNOVA Anna, QIAN Yanmin and LI Haizhou et al. Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Communication, vol. 162, no. 103104, 2024, pp. 1-12. ISSN 0167-6393. Detail
- KAPINUS Michal, BERAN Vítězslav, MATERNA Zdeněk and BAMBUŠEK Daniel. Augmented Reality Spatial Programming Paradigm Applied to End-User Robot Programming. Robotics and Computer-Integrated Manufacturing, vol. 89, no. 89, 2024, pp. 1-13. ISSN 0736-5845. Detail
- CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. Automatic 3D-Display-Friendly Scene Extraction from Video Sequences and Optimal Focusing Distance Identification. Multimedia Tools and Applications, vol. 83, no. 7, 2024, pp. 1-29. ISSN 1573-7721. Detail
- PEŠÁN Jan, JUŘÍK Vojtěch, KARAFIÁT Martin and ČERNOCKÝ Jan. BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 1355-1359. ISSN 1990-9772. Detail
- ROHDIN Johan A., ZHANG Lin, PLCHOT Oldřich, STANĚK Vojtěch, MIHOLA David, PENG Junyi, STAFYLAKIS Themos, BEVERAKI Dmitriy, SILNOVA Anna, BRUKNER Jan and BURGET Lukáš. BUT systems and analyses for the ASVspoof 5 Challenge. In: Proceedings of ASV spoof 2024 Workshop. Kos Island: International Speech Communication Association, 2024, pp. 24-31. Detail
- POLOK Alexander, KLEMENT Dominik, HAN Jiangyu, SEDLÁČEK Šimon, YUSUF Bolaji, MACIEJEWSKI Matthew, WIESNER Matthew and BURGET Lukáš. BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge. In: Proceedings of CHiME 2024 Workshop. Kos Island: International Speech Communication Association, 2024, pp. 18-22. Detail
- HANÁK Jiří, NOVÁK Jiří and CHUDÝ Peter. Cognitive Modeling Approach for Generating Authentic Tactical Agent Behavior. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024, pp. 1-15. ISBN 979-8-3503-4961-0. Detail
- KUNEŠOVÁ Marie, ZAJÍC Zbyněk, ŠMÍDL Luboš and KARAFIÁT Martin. Comparison of wav2vec 2.0 models on three speech processing tasks. International Journal of Speech Technology, vol. 27, no. 4, 2024, pp. 847-859. ISSN 1572-8110. Detail
- BHATTACHARJEE Mrinmoy, NIGMATULINA Iuliia, PRASAD Amrutha, RANGAPPA Pradeep, MADIKERI Srikanth, MOTLÍČEK Petr, HELMKE Hartmut and KLEINERT Matthias. Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 12652-12656. ISBN 979-8-3503-4485-1. Detail
- HAN Jiangyu, LANDINI Federico Nicolás, ROHDIN Johan A., DIEZ Sánchez Mireia, BURGET Lukáš, CAO Yuhang, LU Heng and ČERNOCKÝ Jan. Diacorrect: Error Correction Back-End for Speaker Diarization. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 11181-11185. ISBN 979-8-3503-4485-1. Detail
- LANDINI Federico Nicolás, DIEZ Sánchez Mireia, STAFYLAKIS Themos and BURGET Lukáš. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio, Speech, and Language Processing, vol. 32, no. 7, 2024, pp. 3450-3465. ISSN 1558-7916. Detail
- KLEMENT Dominik, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, SILNOVA Anna, DELCROIX Marc and TAWARA Naohiro. Discriminative Training of VBx Diarization. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11871-11875. ISBN 979-8-3503-4485-1. Detail
- ZHANG Lin, STAFYLAKIS Themos, LANDINI Federico Nicolás, DIEZ Sánchez Mireia, SILNOVA Anna and BURGET Lukáš. Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, pp. 123-130. Detail
- NOVÁK Jiří and CHUDÝ Peter. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In: Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024, pp. 104-115. ISBN 978-3-031-53968-8. ISSN 0302-9743. Detail
- ČEGIŇ Ján, PECHER Branislav, ŠIMKO Jakub, SRBA Ivan and BIELIKOVÁ Mária. Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024, pp. 13148-13171. ISBN 979-8-8917-6094-3. Detail
- CHLUBNA Tomáš, ZEMČÍK Pavel and MILET Tomáš. Efficient Random-Access GPU Video Decoding for Light-Field Rendering. Journal of Visual Communication and Image Representation, vol. 2024, no. 102, pp. 1-14. ISSN 1047-3203. Detail
- DEKEL Shay, KELLER Yosi and ČADÍK Martin. Estimating Extreme 3D Image Rotations using Cascaded Attention. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE Computer Society, 2024, pp. 2588-2598. ISBN 979-8-3503-5301-3. Detail
- PECHER Branislav, ČEGIŇ Ján, BELANEC Róbert, SRBA Ivan, ŠIMKO Jakub and BIELIKOVÁ Mária. Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation. In: Findings of the Association for Computational Linguistics: EMNLP 2024. Miami: Association for Computational Linguistics, 2024, pp. 11005-11044. ISBN 979-8-8917-6168-1. Detail
- PRASAD Amrutha, CAROFILIS Andrés, VANDERREYDT Geoffroy, KHALIL Driss, MADIKERI Srikanth, MOTLÍČEK Petr and SCHUEPBACH Christof. Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11921-11925. ISBN 979-8-3503-4485-1. Detail
- LOJDA Jakub, STRNADEL Josef, SMRŽ Pavel and ŠIMEK Václav. First Steps Towards Unified Low-Power IoT Design: The "DYNAMIC" Framework. In: 2024 IEEE East-West Design and Test Symposium, EWDTS 2024 - Proceedings. Yerevan: Institute of Electrical and Electronics Engineers, 2024, pp. 1-6. ISBN 979-8-3315-1576-8. Detail
- CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. Multimedia Tools and Applications, vol. 2024, no. 83, pp. 20265-20287. ISSN 1573-7721. Detail
- NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Hybrid Modeling Approach for Optimization Based Control of Multirotor Unmanned Aerial Vehicles. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-10. ISSN 2958-4647. Detail
- BENEŠ Karel, KOCOUR Martin and BURGET Lukáš. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11276-11280. ISBN 979-8-3503-4485-1. Detail
- STAFYLAKIS Themos, SILNOVA Anna, ROHDIN Johan A., PLCHOT Oldřich and BURGET Lukáš. Challenging margin-based speaker embedding extractors by using the variational information bottleneck. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 3220-3224. ISSN 1990-9772. Detail
- ČIEF Matej. Learning Action Embeddings for Off-Policy Evaluation. In: ECIR 2024: Advances in Information Retrieval. Advances in Information Retrieval. Glasgow: Springer Nature Switzerland AG, 2024, pp. 108-122. Detail
- CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. Lightweight All-Focused Light Field Rendering. Computer Vision and Image Understanding, vol. 244, no. 7, 2024, pp. 7-8. ISSN 1077-3142. Detail
- KUBÍK Tibor and ŠPANĚL Michal. LMVSegRNN and Poseidon3D: Addressing Challenging Teeth Segmentation Cases in 3D Dental Surface Orthodontic Scans. Bioengineering, vol. 11, no. 10, 2024, pp. 1-18. ISSN 2306-5354. Detail
- STRNADEL Josef, LOJDA Jakub, SMRŽ Pavel and ŠIMEK Václav. Machine Learning in Context of IoT/Edge Devices and LoLiPoP-IoT Project. In: Proceedings of 32nd Austrian Workshop on Microelectronics (Austrochip 2024). Vienna: Institute of Electrical and Electronics Engineers, 2024, pp. 1-4. ISBN 979-8-3315-1617-8. Detail
- NOVÁK Jiří, CHUDÝ Peter and HANÁK Jiří. Model Predictive Control Driven Aerial Grasping with Soft Operational Constraints. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-15. ISSN 2958-4647. Detail
- MOŠNER Ladislav, SERIZEL Romain, BURGET Lukáš, PLCHOT Oldřich, VINCENT Emmanuel, PENG Junyi and ČERNOCKÝ Jan. Multi-Channel Extension of Pre-trained Models for Speaker Verification. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 2135-2139. ISSN 1990-9772. Detail
- ESPUNA Fontcuberta Aleix, PRASAD Amrutha, MOTLÍČEK Petr, MADIKERI Srikanth and SCHUEPBACH Christof. Normalising Flows for Speaker and Language Recognition Backend. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Quebec: International Speech Communication Association, 2024, pp. 74-80. Detail
- PECHER Branislav, SRBA Ivan and BIELIKOVÁ Mária. On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami: Association for Computational Linguistics, 2024, pp. 522-556. ISBN 979-8-8917-6164-3. Detail
- STRNADEL Josef, LOJDA Jakub, SMRŽ Pavel and ŠIMEK Václav. On SMC-Based Dependability Analysis in LoLiPoP-IoT Project. In: Steffen, B. (eds) Bridging the Gap Between AI and Reality (AISolA 2024). Lecture Notes in Computer Science, vol. 15217. Limenas Hersonissou: Springer Nature Switzerland AG, 2024, pp. 420-445. ISBN 978-3-031-75434-0. ISSN 0302-9743. Detail
- ČIEF Matej and KOMPAN Michal. Pessimistic Off-Policy Optimization for Learning to Rank. In: 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE. Frontiers in Artificial Intelligence and Applications. Santiago de Compostela, 2024, pp. 1896-1903. ISBN 978-1-64368-548-9. Detail
- NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Predictive Control Driven Tactical Maneuvering. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-12. ISSN 2958-4647. Detail
- YUSUF Bolaji, ČERNOCKÝ Jan and SARAÇLAR Murat. Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 5068-5072. ISSN 1990-9772. Detail
- PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, ASHIHARA Takanori, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKÝ Jan. Probing Self-Supervised Learning Models With Target Speech Extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 535-539. ISBN 979-8-3503-7451-3. Detail
- KAŠPÁREK Tomáš and CHUDÝ Peter. Pulsar Signal Adaptive Surrogate Modeling. Aerospace, vol. 11, no. 10, 2024, pp. 1-22. ISSN 2226-4310. Detail
- BOBÁK Petr, ČMOLÍK Ladislav and ČADÍK Martin. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 9, 2024, pp. 5908-5922. ISSN 1077-2626. Detail
- NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Reliability-Based Control System Optimization in Uncertain Conditions. In: AIAA Aviation Forum and ASCEND, 2024. Las Vegas: American Institute of Aeronautics and Astronautics, 2024, pp. 1-15. ISBN 978-1-62410-716-0. Detail
- MOTLÍČEK Petr, DIKICI Erinç, MADIKERI Srikanth, RANGAPPA Pradeep, BACKFRIED Gerhard, ROHDIN Johan A., SCHWARZ Petr, KOVÁČ Marek, MALÝ Květoslav, BOBOŠ Dominik, KLAKOW Dietrich and SERGIDOU Eleni Konstantina et al. ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, pp. 17-24. Detail
- KIŠŠ Martin and HRADIŠ Michal. Self-supervised Pre-training of Text Recognizers. In: Barney Smith, E.H., Liwicki, M., Peng, L. (eds) Document Analysis and Recognition - ICDAR 2024. Lecture Notes in Computer Science, vol. 14807. Atény: Springer Nature Switzerland AG, 2024, pp. 218-235. ISBN 978-3-031-70545-8. Detail
- YUSUF Bolaji, BASKAR Karthick Murali, ROSENBERG Andrew and RAMABHADRAN Bhuvana. Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 792-796. ISSN 1990-9772. Detail
- PRASAD Amrutha, MADIKERI Srikanth, KHALIL Driss, MOTLÍČEK Petr and SCHUEPBACH Christof. Speech and Language Recognition with Low-rank Adaptation of Pretrained Models. In: Proceedings of Interspeech. Kos Island: International Speech Communication Association, 2024, pp. 2825-2829. ISSN 1990-9772. Detail
- PEŠÁN Jan, JUŘÍK Vojtěch, RŮŽIČKOVÁ Alexandra, SVOBODA Vojtěch, JANOUŠEK Oto, NĚMCOVÁ Andrea, BOJANOVSKÁ Hana, ALDABAGHOVÁ Jasmína, KYSLÍK Filip, VODIČKOVÁ Kateřina, SODOMOVÁ Adéla, BARTYS Patrik, CHUDÝ Peter and ČERNOCKÝ Jan. Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals. Nature Scientific Data, vol. 11, no. 1, 2024, pp. 1-9. ISSN 2052-4463. Detail
- ZHANG Lin, WANG Xin, COOPER Erica, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, EVANS Nicholas and YAMAGISHI Junichi. Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 502-506. ISSN 1990-9772. Detail
- WANNER Leo, ČERNOCKÝ Jan, EGOROVA Ekaterina, KLUSCH Matthias and MAVROPOULOS Athanasios et al. Support of Migrant Reception, Integration, and Social Inclusion by Intelligent Technologies. Information, vol. 15, no. 11, 2024, pp. 1-33. ISSN 2078-2489. Detail
- HANÁK Jiří, NOVÁK Jiří and CHUDÝ Peter. Tactical Scenario Adaptation for Pilot Training. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024, pp. 1-7. ISBN 979-8-3503-4961-0. Detail
- PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKÝ Jan. Target Speech Extraction with Pre-Trained Self-Supervised Learning Models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 10421-10425. ISBN 979-8-3503-4485-1. Detail
- HANÁK Jiří, NOVÁK Jiří, CHUDÝ Peter and BEN-ASHER Joseph Z. The Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, vol. 25, no. 10, 2024, pp. 1-14. ISSN 2327-3097. Detail
- LOJDA Jakub, STRNADEL Josef, ŠIMEK Václav, SMRŽ Pavel, HAYES Michael and POPP Ralf. The LoLiPoP-IoT Project: Long Life Power Platforms for Internet of Things. In: Proceedings - 2024 27th Euromicro Conference on Digital System Design, DSD 2024. Paris: Institute of Electrical and Electronics Engineers, 2024, pp. 604-611. ISBN 979-8-3503-8038-5. Detail
- NOVÁK Jiří, CHUDÝ Peter and HANÁK Jiří. Weight-varying Model Predictive Control for Coupled Cyber-Physical Systems: Aerial Grasping Study. In: Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Castiglione della Pescaia: Springer Nature Switzerland AG, 2024, pp. 1-15. ISSN 0302-9743. Detail
- YUSUF Bolaji and SARAÇLAR Murat. Written Term Detection Improves Spoken Term Detection. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 32, no. 06, 2024, pp. 3213-3223. ISSN 2329-9290. Detail