Project Details
ATCO2 - Automatic collection and processing of voice data from air-traffic communications
Project Period: 1. 11. 2019 - 28. 2. 2022
Project Type: grant
Code: 864702
Agency: European Comission EU
Program: Horizon 2020
air-traffic management, automatic speech recognition, signal processing, legal and ethical framework
Developing machine learning solutions for air-traffic control applications is a challenging task. Besides an expert knowledge, large amount of data for robust performance as well as for validation and verification is typically required. If funded, ATCO2 will deliver a unique platform enabling to collect, store, pre-process and share voice communications data recorded from real world air-traffic control data. The project aims at accessing data from two sources: (a) from certified ADS-B datalinks aligned with a surveillance technology, and (b) directly from air-traffic controllers offered to the project by several air navigation service providers. The technical development will be centred around the ATCO2 platform, built on an existing and extensively used solution of opensky-network partner, ensuring sustainability of the platform after the end of the project. Current platform collects periodically broadcasted aircraft information through a network of ADS-B receivers operated around the globe, further stored at a server. In ATCO2, existing platform will be extended to allow collection, storage and pre-processing of voice communications, and time/position aligned with other aircraft information. Unlike previous works, we will target both channels, i.e. spoken commands issued by air-traffic controllers, and confirmation provided by pilots. In addition to broadcasted data, ATCO2 will have an access to voice recordings from air navigation service providers, namely Austrocontrol. This data will simulate other source of speech recordings (specifically archives), complementing real-time voice communication. The ATCO2 platform will be enhanced by the latest speech pre-processing and machine learning technologies, mostly based on deep learning. Besides automatic segmentation (e.g. er speaker, accent, specific command), robust automatic speech recognition system will be implemented and integrated through RESTful API allowing to automatically transcribe voice communications.
Veselý Karel, Ing., Ph.D. (UPGM FIT VUT) , team leader
Kocour Martin, Ing. (UPGM FIT VUT)
Pulugundla Bhargav, M.Sc. (UPGM FIT VUT)
Žižka Josef, Ing. (UPGM FIT VUT)
This project has received funding from the European Unions Horizon 2020 research and innovation programme under grant agreement No 864702.
2023
- ZULUAGA-GOMEZ Juan, PRASAD Amrutha, NIGMATULINA Iuliia, SARFJOO Seyyed Saeed, MOTLÍČEK Petr, KLEINERT Matthias, HELMKE Hartmut, OHNEISER Oliver and ZHAN Qingran. How Does Pre-Trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? an Extensive Benchmark on Air Traffic Control Communications. In: IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, pp. 205-212. ISBN 978-1-6654-7189-3. Detail
- ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, PRASAD Amrutha, MOTLÍČEK Petr, KHALIL Driss, MADIKERI Srikanth, TART Allan, SZŐKE Igor, LENDERS Vincent, RIGAULT Mickael and CHOUKRI Khalid. Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding. Aerospace, vol. 2023, no. 10, pp. 1-33. ISSN 2226-4310. Detail
2022
- NIGMATULINA Iuliia, ZULUAGA-GOMEZ Juan, PRASAD Amrutha, SARFJOO Saeed and MOTLÍČEK Petr. A Two-Step Approach to Leverage Contextual Data: Speech Recognition in Air-Traffic Communications. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 6282-6286. ISBN 978-1-6654-0540-9. Detail
- BLATT Alexander, KOCOUR Martin, VESELÝ Karel, SZŐKE Igor and KLAKOW Dietrich. Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 8357-8361. ISBN 978-1-6654-0540-9. Detail
- PRASAD Amrutha, ZULUAGA-GOMEZ Juan, MOTLÍČEK Petr, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia, OHNEISER Oliver and HELMKE Hartmut. Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition. In: Proceedings of the 12th SESAR Innovation Days. Budapest, 2022, pp. 1-9. Detail
- KOCOUR Martin, ŽMOLÍKOVÁ Kateřina, ONDEL Yang Lucas Antoine Francois, ŠVEC Ján, DELCROIX Marc, OCHIAI Tsubasa, BURGET Lukáš and ČERNOCKÝ Jan. Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 4955-4959. ISSN 1990-9772. Detail
- PRASAD Amrutha, ZULUAGA-GOMEZ Juan, MOTLÍČEK Petr, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia and VESELÝ Karel. Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator. In: Proceedings of the 12th SESAR Innovation Days. Budapest, 2022, pp. 1-9. Detail
2021
- KOCOUR Martin, VESELÝ Karel, SZŐKE Igor, KESIRAJU Santosh, ZULUAGA-GOMEZ Juan, BLATT Alexander, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLÍČEK Petr, KLAKOW Dietrich, TART Allan, KOLČÁREK Pavel, ČERNOCKÝ Jan, CEVENINI Claudia, CHOUKRI Khalid, RIGAULT Mickael, LANDIS Fabian and SARFJOO Saeed et al. Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data. In: Proceedings of 9th OpenSky Symposium 2021, OpenSky Network, Brussels, Belgium. Brussels: MDPI, 2021, pp. 1-10. ISSN 2504-3900. Detail
- KOCOUR Martin, CÁMBARA Guillermo, LUQUE Jordi, BONET David, FARRÚS Mireia, KARAFIÁT Martin, VESELÝ Karel and ČERNOCKÝ Jan. BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge. In: Proceedings of IberSPEECH 2021. Vallaloid: International Speech Communication Association, 2021, pp. 113-117. Detail
- KOCOUR Martin, VESELÝ Karel, BLATT Alexander, ZULUAGA-GOMEZ Juan, SZŐKE Igor, ČERNOCKÝ Jan, KLAKOW Dietrich and MOTLÍČEK Petr. Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, pp. 3301-3305. ISSN 1990-9772. Detail
- ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, PRASAD Amrutha, MOTLÍČEK Petr, VESELÝ Karel, KOCOUR Martin and SZŐKE Igor. Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, pp. 3296-3300. ISSN 1990-9772. Detail
- SZŐKE Igor, KESIRAJU Santosh, NOVOTNÝ Ondřej, KOCOUR Martin, VESELÝ Karel and ČERNOCKÝ Jan. Detecting English Speech in the Air Traffic Control Voice Communication. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, pp. 3286-3290. ISSN 1990-9772. Detail
- VYDANA Hari K., KARAFIÁT Martin, ŽMOLÍKOVÁ Kateřina, BURGET Lukáš and ČERNOCKÝ Jan. Jointly Trained Transformers Models for Spoken Language Translation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 7513-7517. ISBN 978-1-7281-7605-5. Detail
2020
- ZULUAGA-GOMEZ Juan, VESELÝ Karel, BLATT Alexander, MOTLÍČEK Petr, KLAKOW Dietrich, TART Allan, SZŐKE Igor, PRASAD Amrutha, SARFJOO Saeed, KOLČÁREK Pavel, KOCOUR Martin, ČERNOCKÝ Jan, CEVENINI Claudia, CHOUKRI Khalid, RIGAULT Mickael and LANDIS Fabian. Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications. In: Proceedings of the 8th OpenSky Symposium 2020. Brusel: MDPI, 2020, pp. 1-10. ISSN 2504-3900. Detail
- ZULUAGA-GOMEZ Juan, MOTLÍČEK Petr, ZHAN Qingran, VESELÝ Karel and BRAUN Rudolf. Automatic Speech Recognition Benchmark for Air-Traffic Communications. In: Proceedings of Interspeech 2020. Shanghai: International Speech Communication Association, 2020, pp. 2297-2301. ISSN 1990-9772. Detail