Project Details
Contemporary methods of processing, analysis and display of multimedia and 3D data (Soudobé metody zpracování, analýzy a zobrazování multimediálních a 3D dat)
Project Period: 1. 3. 2023 - 31. 12. 2025
Project Type: grant
Code: FIT-S-23-8278
Agency: Brno University of Technology
Program: BUT Internal Projects (Vnitřní projekty VUT)
Keywords: multimedia data, 3D data, data processing, data analysis, data display
Multimedia and 3D data are essential to a growing number of applications of modern computer systems, where they cannot be replaced by other kinds of data. Processing such data, as well as displaying and analysing it, is known to be difficult and computationally demanding, which makes research in this area both challenging and important. The project continues the earlier project "Modern methods of processing, analysis and display of multimedia and 3D data".
Team members
Bambušek Daniel, Ing. (DCGM FIT BUT)
Bartl Vojtěch, Ing., Ph.D. (DCGM FIT BUT)
Bažout David, Ing. (DCGM FIT BUT)
Beneš Karel, Ing. (DCGM FIT BUT)
Beran Vítězslav, doc. Ing., Ph.D. (DCGM FIT BUT)
Bobák Petr, Ing. (DCGM FIT BUT)
Brukner Jan, Ing. (DCGM FIT BUT)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Čadík Martin, doc. Ing., Ph.D. (DCGM FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
Dobeš Petr, Ing. (DCGM FIT BUT)
Dočekal Martin, Ing. (DCGM FIT BUT)
Dubovec Pavol, Ing. (FIT BUT)
Fajčík Martin, Ing., Ph.D. (DCGM FIT BUT)
Hanák Jiří, Ing. (DCGM FIT BUT)
Herout Adam, prof. Ing., Ph.D. (DCGM FIT BUT)
Hříbek David, Ing. (DCGM FIT BUT)
Chlubna Tomáš, Ing., Ph.D. (DCGM FIT BUT)
Chudý Peter, doc. Ing., Ph.D. MBA (DCGM FIT BUT)
Kapinus Michal, Ing. (DCGM FIT BUT)
Karas Matej, Ing. (DCGM FIT BUT)
Kišš Martin, Ing. (DCGM FIT BUT)
Klem Richard, Ing. (FIT BUT)
Klepárník Petr, Ing., Ph.D. (DCGM FIT BUT)
Kocour Martin, Ing. (DCGM FIT BUT)
Kohút Jan, Ing. (DCGM FIT BUT)
Landini Federico Nicolás (DCGM FIT BUT)
Liška Jakub, Ing. (FIT BUT)
Maršík Lukáš, Ing. (DCGM FIT BUT)
Mošner Ladislav, Ing. (DCGM FIT BUT)
Munzar Milan, Ing. (DCGM FIT BUT)
Nguyen Son Hai, Ing. (DCGM FIT BUT)
Nosko Svetozár, Ing., Ph.D. (DCGM FIT BUT)
Novák Jiří, Ing., Ph.D. (DCGM FIT BUT)
Omachtová Alena, Ing. (DCGM FIT BUT)
Ondřej Karel, Ing. (DCGM FIT BUT)
Pavlus Ján, Ing. (DCGM FIT BUT)
Peng Junyi, Msc. Eng. (DCGM FIT BUT)
Polášek Tomáš, Ing. (DCGM FIT BUT)
Reich Bořek, Ing. (DCGM FIT BUT)
Sedláček Šimon, Ing. (FIT BUT)
Smrž Pavel, doc. RNDr., Ph.D. (DCGM FIT BUT)
Strýček Šimon, Ing. (FIT BUT)
Španěl Michal, doc. Ing., Ph.D. (DCGM FIT BUT)
Špaňhel Jakub, Ing., Ph.D. (DCGM FIT BUT)
Šůstek Martin, Ing. (FIT BUT)
Švec Ján, Ing. (DCGM FIT BUT)
Švec Tomáš, Ing. (DCGM FIT BUT)
Vendrame Katia, Ing. (FIT BUT)
Vlnas Michal, Ing. (DCGM FIT BUT)
Publications
- HANÁK Jiří, NOVÁK Jiří, CHUDÝ Peter and BEN-ASHER Joseph Z. Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, vol. 22, no. 1, 2025, pp. 53-58. ISSN 2327-3097.
- PEŠÁN Jan, JUŘÍK Vojtěch, KARAFIÁT Martin and ČERNOCKÝ Jan. BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 1355-1359. ISSN 1990-9772.
- BHATTACHARJEE Mrinmoy, NIGMATULINA Iuliia, PRASAD Amrutha, RANGAPPA Pradeep, MADIKERI Srikanth, MOTLÍČEK Petr, HELMKE Hartmut and KLEINERT Matthias. Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 12652-12656. ISBN 979-8-3503-4485-1.
- BURDISSO Sergio, RAMIREZ Reyes Ernesto Antonio, VILLATORO-TELLO Esaú, SÁNCHEZ-VEGA Fernando, LÓPEZ-MONROY A. Pastor and MOTLÍČEK Petr. DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews. In: Proceedings of the 6th Clinical Natural Language Processing Workshop. Mexico City: Association for Computational Linguistics, 2024, pp. 82-90.
- RANGAPPA Pradeep, MUSCAT Amanda, SANCHEZ-LARA Alejandra, MOTLÍČEK Petr, ANTONOPOULOU Michaela, FOURFOURIS Ioannis, SKARLATOS Antonios, AVGERINOS Nikos, TSANGARIS Manolis and KOSTKA Kasia. Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project. In: Proceedings of the 15th EAI International Conference on Digital Forensics & Cyber Crime (EAI-ICDF2C24). Dubrovnik, 2024, pp. 1-15.
- NOVÁK Jiří and CHUDÝ Peter. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In: Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024, pp. 104-115. ISBN 978-3-031-53968-8. ISSN 0302-9743.
- CHLUBNA Tomáš, ZEMČÍK Pavel and MILET Tomáš. Efficient Random-Access GPU Video Decoding for Light-Field Rendering. Journal of Visual Communication and Image Representation, vol. 2024, no. 102, pp. 1-14. ISSN 1047-3203.
- MACIEJEWSKI Matthew, KLEMENT Dominik, HUANG Ruizhe, WIESNER Matthew and KHUDANPUR Sanjeev. Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 2155-2160. ISSN 1990-9772.
- PRASAD Amrutha, CAROFILIS Andrés, VANDERREYDT Geoffroy, KHALIL Driss, MADIKERI Srikanth, MOTLÍČEK Petr and SCHUEPBACH Christof. Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11921-11925. ISBN 979-8-3503-4485-1.
- CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. Multimedia Tools and Applications, vol. 2024, no. 83, pp. 20265-20287. ISSN 1573-7721.
- NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Hybrid Modeling Approach for Optimization Based Control of Multirotor Unmanned Aerial Vehicles. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-10. ISSN 2958-4647.
- BENEŠ Karel, KOCOUR Martin and BURGET Lukáš. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11276-11280. ISBN 979-8-3503-4485-1.
- CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. Lightweight All-Focused Light Field Rendering. Computer Vision and Image Understanding, vol. 244, no. 7, 2024, pp. 7-8. ISSN 1077-3142.
- KUBÍK Tibor and ŠPANĚL Michal. LMVSegRNN and Poseidon3D: Addressing Challenging Teeth Segmentation Cases in 3D Dental Surface Orthodontic Scans. Bioengineering, vol. 11, no. 10, 2024, pp. 1-18. ISSN 2306-5354.
- NOVÁK Jiří, CHUDÝ Peter and HANÁK Jiří. Model Predictive Control Driven Aerial Grasping with Soft Operational Constraints. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-15. ISSN 2958-4647.
- KUMAR Sashi, MADIKERI Srikanth, NIGMATULINA Iuliia, VILLATORO-TELLO Esaú, MOTLÍČEK Petr, PANDIA Karthick, DUBAGUNTA S. Pavankumar and GANAPATHIRAJU Aravind. Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 12592-12596. ISBN 979-8-3503-4485-1.
- ESPUNA Fontcuberta Aleix, PRASAD Amrutha, MOTLÍČEK Petr, MADIKERI Srikanth and SCHUEPBACH Christof. Normalising Flows for Speaker and Language Recognition Backend. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Quebec: International Speech Communication Association, 2024, pp. 74-80.
- NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Predictive Control Driven Tactical Maneuvering. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-12. ISSN 2958-4647.
- BOBÁK Petr, ČMOLÍK Ladislav and ČADÍK Martin. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 9, 2024, pp. 5908-5922. ISSN 1077-2626.
- NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Reliability-Based Control System Optimization in Uncertain Conditions. In: AIAA Aviation Forum and ASCEND, 2024. Las Vegas: American Institute of Aeronautics and Astronautics, 2024, pp. 1-15. ISBN 978-1-62410-716-0.
- KIŠŠ Martin and HRADIŠ Michal. Self-supervised Pre-training of Text Recognizers. In: Barney Smith, E.H., Liwicki, M., Peng, L. (eds) Document Analysis and Recognition - ICDAR 2024. Lecture Notes in Computer Science, vol. 14807. Athens: Springer Nature Switzerland AG, 2024, pp. 218-235. ISBN 978-3-031-70545-8.
- YUSUF Bolaji, BASKAR Karthick Murali, ROSENBERG Andrew and RAMABHADRAN Bhuvana. Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 792-796. ISSN 1990-9772.
- PRASAD Amrutha, MADIKERI Srikanth, KHALIL Driss, MOTLÍČEK Petr and SCHUEPBACH Christof. Speech and Language Recognition with Low-rank Adaptation of Pretrained Models. In: Proceedings of Interspeech. Kos Island: International Speech Communication Association, 2024, pp. 2825-2829. ISSN 1990-9772.
- HANÁK Jiří, NOVÁK Jiří and CHUDÝ Peter. Tactical Scenario Adaptation for Pilot Training. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024, pp. 1-7. ISBN 979-8-3503-4961-0. ISSN 2155-7195.
- ZULUAGA-GOMEZ Juan, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLÍČEK Petr and KLEINERT Matthias. A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers. Aerospace, vol. 10, no. 5, 2023, pp. 1-25. ISSN 2226-4310.
- KHALIL Driss, PRASAD Amrutha, MOTLÍČEK Petr, ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, MADIKERI Srikanth and SCHUEPBACH Christof. An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain. Aerospace, vol. 10, no. 10, 2023, pp. 1-14. ISSN 2226-4310.
- MOTLÍČEK Petr, PRASAD Amrutha, NIGMATULINA Iuliia, HELMKE Hartmut, OHNEISER Oliver and KLEINERT Matthias. Automatic Speech Analysis Framework for ATC Communication in HAAWAII. In: SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023, pp. 1-9. ISSN 0770-1268.
- HELMKE Hartmut, KLEINERT Matthias, AHRENHOLD Nils, EHR Heiko, MÜHLHAUSEN Thorsten, PINSKA Chauvin Ella, OHNEISER Oliver, KLAMERT Lucas, MOTLÍČEK Petr, PRASAD Amrutha, ZULUAGA-GOMEZ Juan and DOKIC Jelena. Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers' Workload. In: Proceedings of ATM Seminar. Savannah, Georgia: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2023, pp. 1-11.
- HANÁK Jiří, CHUDÝ Peter and VLK Jan. Collaborative Agents for Synthetic Tactical Training. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. Barcelona: Institute of Electrical and Electronics Engineers, 2023, pp. 1-9. ISBN 979-8-3503-3357-2. ISSN 2155-7195.
- BHATTACHARJEE Mrinmoy, MOTLÍČEK Petr, NIGMATULINA Iuliia, HELMKE Hartmut, OHNEISER Oliver, KLEINERT Matthias and EHR Heiko. Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training. In: 13th SESAR Innovation Days 2023, SIDS 2023. Seville: SESAR Joint Undertaking, 2023, pp. 1-8. ISSN 0770-1268.
- SKOWRON Marcin, BACKFRIED Gerhard, NAVAS Eva, BERZINŠ Aivars, VAN Den Bogaert Joachim, DE Jong Franciska, DEMARCO Andrea, POLÁK Peter, KOVÁČ Marek, POLÁK Peter, ROHDIN Johan A., ROSNER Michael, SANCHEZ Jon, SARATXAGA Ibon and SCHWARZ Petr. Deep Dive Speech Technology. European Language Equality. Cham: Springer Nature Switzerland AG, 2023, pp. 289-312. ISBN 978-3-031-28819-7.
- VILLATORO-TELLO Esaú, MADIKERI Srikanth, ZULUAGA-GOMEZ Juan, SHARMA Bidisha, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia, MOTLÍČEK Petr, IVANOV Alexei V. and GANAPATHIRAJU Aravind. Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
- BAŘINA David. Experimental lossless data compressor. Microprocessors and Microsystems, vol. 98, no. 4, 2023, pp. 104803-104803. ISSN 0141-9331.
- APAROVICH Maksim, KESIRAJU Santosh, DUFKOVÁ Aneta and SMRŽ Pavel. FIT BUT at SemEval-2023 Task 12: Sentiment Without Borders - Multilingual Domain Adaptation for Low-Resource Sentiment Classification. In: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023). Toronto (online): Association for Computational Linguistics, 2023, pp. 1518-1524. ISBN 978-1-959429-99-9.
- BAMBUŠEK Daniel, MATERNA Zdeněk, KAPINUS Michal, BERAN Vítězslav and SMRŽ Pavel. How Do I Get There? Overcoming Reachability Limitations of Constrained Industrial Environments in Augmented Reality Applications. In: 2023 IEEE Conference on Virtual Reality and 3D User Interfaces (VR). Shanghai: Institute of Electrical and Electronics Engineers, 2023, pp. 115-122. ISBN 979-8-3503-4815-6.
- OMACHTOVÁ Alena, HEROUT Adam, BAMBUŠEK Daniel and JUŘÍK Vojtěch. How to shoot yourself right with a smartphone? Virtual Reality, vol. 2023, no. 1, pp. 1-13. ISSN 1434-9957.
- MAI Florian, ZULUAGA-GOMEZ Juan, PARCOLLET Titouan and MOTLÍČEK Petr. HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 2213-2217. ISSN 1990-9772.
- NIGMATULINA Iuliia, MADIKERI Srikanth, VILLATORO-TELLO Esaú, MOTLÍČEK Petr, ZULUAGA-GOMEZ Juan, PANDIA Karthick and GANAPATHIRAJU Aravind. Implementing contextual biasing in GPU decoder for online ASR. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 4494-4498. ISSN 1990-9772.
- GAVRIELIDES Andreas, SOPHOCLEOUS Marios, AGAPIOU George, LESSI Christina, ŠPAŇHEL Jakub, LENDINEZ Adrian, QIU Renxi and LI Dayou. Implementing Network Applications for 5G-Enabled Robots Through the 5G-ERA Platform. In: IFIP Advances in Information and Communication Technology. Artificial Intelligence Applications and Innovations, vol. 677. Cham: Springer Nature Switzerland AG, 2023, pp. 55-65. ISBN 978-3-031-34170-0. ISSN 1868-422X.
- BURDISSO Sergio, VILLATORO-TELLO Esaú, MADIKERI Srikanth and MOTLÍČEK Petr. Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 3617-3621. ISSN 1990-9772.
- YUSUF Bolaji, GOURAV Aditya, GANDHE Ankur and BULYKO Ivan. On-the-Fly Text Retrieval for end-to-end ASR Adaptation. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
- VANDERREYDT Geoffroy, PRASAD Amrutha, KHALIL Driss, MADIKERI Srikanth, DEMUYNCK Kris and MOTLÍČEK Petr. Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei: IEEE Signal Processing Society, 2023, pp. 1-7. ISBN 979-8-3503-0689-7.
- POLÁŠEK Tomáš and ČADÍK Martin. Predicting Photovoltaic Power Production using High-Uncertainty Weather Forecasts. Applied Energy, vol. 2023, no. 339, pp. 120989-121004. ISSN 0306-2619.
- CHLUBNA Tomáš, MILET Tomáš, ZEMČÍK Pavel and KULA Michal. Real-Time Light Field Video Focusing and GPU Accelerated Streaming. Journal of Signal Processing Systems, vol. 95, no. 6, 2023, pp. 703-719. ISSN 1939-8115.
- KIŠŠ Martin, HRADIŠ Michal, BENEŠ Karel, BUCHAL Petr and KULA Michal. SoftCTC-semi-supervised learning for text recognition using soft pseudo-labels. International Journal on Document Analysis and Recognition (IJDAR), vol. 2024, no. 27, 2023, pp. 177-193. ISSN 1433-2825.
- NOVÁK Jiří and CHUDÝ Peter. Surrogate Modeling of Optimal Control Based Collision Avoidance System for Multirotor Unmanned Aerial Vehicles. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. Barcelona: Institute of Electrical and Electronics Engineers, 2023, pp. 1-7. ISBN 979-8-3503-3357-2. ISSN 2155-7195.
- POLÁŠEK Tomáš, ČADÍK Martin, KELLER Yosi and BENEŠ Bedřich. Vision UFormer: Long-Range Monocular Absolute Depth Estimation. Computers and Graphics, vol. 111, no. 4, 2023, pp. 180-189. ISSN 0097-8493.
- BOITO Marcely Z., YUSUF Bolaji, ONDEL Yang Lucas Antoine Francois, VILLAVICENCIO Aline and BESACIER Laurent. Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. In: Proceedings of the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages. Marseille: European Language Resources Association, 2022, pp. 1-9. ISBN 979-10-95546-91-7.
Products
- Convergence verification of the Collatz problem, software, 2024
Authors: Bařina David
- Data processing methods for transport purposes, software, 2024
Authors: Špaňhel Jakub, Beran Vítězslav, Herout Adam, Zemčík Pavel
- Minimalist JPEG decoder & encoder, software, 2024
Authors: Bařina David
- x3: Experimental Data Compressor, software, 2024
Authors: Bařina David
- KSPredict: Software for predicting the development of emergency events and crisis situations, software, 2023
Authors: Klíma Ondřej, Neubauer Jiří, Polcerová Lenka, Králík Miroslav, Zeman Tomáš