Project Details
ROXANNE - Real time network, text, and speaker analytics for combating organized crime
Project Period: 1. 9. 2019 - 31. 12. 2022
Project Type: grant
Code: 833635
Agency: European Commission EU
Program: Horizon 2020
Keywords: Fight against criminality, Fight against trafficking, Speech analytics, Criminal network analysis, Organised crime, Counter-terrorism, Analysis platform, Legal and ethical framework
Discovering criminal networks and identifying their members is one of the primary aspects of the mission of law enforcement agencies (LEAs). ROXANNE will contribute towards this goal by bridging the strengths of speech and language technologies (SLTs), visual analysis (VA) and network analysis (NA). If funded, ROXANNE will achieve a significant increase in the speed of investigation processes and an improvement in the identification of individuals by means of speech, in the scope of criminal cases where large amounts of lawfully intercepted communications (with multilingual attributes) are analysed. The technical development will be centred around the ROXANNE platform, which will enhance criminal network analysis capabilities by providing a framework for extracting evidence and actionable intelligence based on speech, language and video technologies. The intention is not to replace humans but to automate time-consuming tasks and support LEA decision-making. The early version of the platform will offer preliminary SLT, VA and NA capabilities to collect end-user feedback. The final version will provide multilingual, probabilistic tools interfacing SLT and NA technologies, boosted by natural language processing (NLP) and relation analysis in the synoptic criminal activity graph. ROXANNE will achieve full compliance with relevant INTERPOL and EU legal and ethical frameworks, including innovative approaches to data protection management such as privacy by design. Special efforts will be made to ensure that ROXANNE outcomes achieve widespread adoption by law enforcement. The effort will be enhanced through a series of education and awareness campaigns and the direct involvement of LEAs from nine European countries, which will test our solutions on real case data. In addition, INTERPOL (a ROXANNE partner) and EUROPOL (a member of the External Advisory Board) will provide advice and guidance. The consortium has 24 partners with complementary skills, including leaders in key technology areas impacting criminal investigations.
Szőke Igor, Ing., Ph.D. (UPGM FIT VUT), team leader
Černá Aneta, DiS
Karafiát Martin, Ing., Ph.D. (UPGM FIT VUT)
Veselý Karel, Ing., Ph.D. (UPGM FIT VUT)
Žižka Josef, Ing. (UPGM FIT VUT)
Žmolíková Kateřina, Ing., Ph.D. (UPGM FIT VUT)
This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 833635.
2023
- SKOWRON Marcin, BACKFRIED Gerhard, NAVAS Eva, BERZINŠ Aivars, VAN Den Bogaert Joachim, DE Jong Franciska, DEMARCO Andrea, POLÁK Peter, KOVÁČ Marek, POLÁK Peter, ROHDIN Johan A., ROSNER Michael, SANCHEZ Jon, SARATXAGA Ibon and SCHWARZ Petr. Deep Dive Speech Technology. European Language Equality. Cham: Springer Nature Switzerland AG, 2023, pp. 289-312. ISBN 978-3-031-28819-7.
- NIGMATULINA Iuliia, MADIKERI Srikanth, VILLATORO-TELLO Esaú, MOTLÍČEK Petr, ZULUAGA-GOMEZ Juan, PANDIA Karthick and GANAPATHIRAJU Aravind. Implementing contextual biasing in GPU decoder for online ASR. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 4494-4498. ISSN 1990-9772.
2022
- SILNOVA Anna, STAFYLAKIS Themos, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej and BRUMMER Johan Nikolaas Langenhoven. Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, pp. 9-16.
- LANDINI Federico Nicolás, PROFANT Ján, DIEZ Sánchez Mireia and BURGET Lukáš. Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks. Computer Speech and Language, vol. 71, no. 101254, 2022, pp. 1-16. ISSN 0885-2308.
- ALAM Jahangir, BURGET Lukáš, GLEMBEK Ondřej, MATĚJKA Pavel, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna and STAFYLAKIS Themos et al. Development of ABC systems for the 2021 edition of NIST Speaker Recognition evaluation. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, pp. 346-353.
- SOLEWICZ Yosef, COHEN Noa, ROHDIN Johan A., MADIKERI Srikanth and ČERNOCKÝ Jan. Speaker recognition on mono-channel telephony recordings. In: Proceedings of Odyssey 2022. Beijing: International Speech Communication Association, 2022, pp. 193-199.
- STAFYLAKIS Themos, MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, BURGET Lukáš and ČERNOCKÝ Jan. Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 605-609. ISSN 1990-9772.
2021
- LANDINI Federico Nicolás, GLEMBEK Ondřej, MATĚJKA Pavel, ROHDIN Johan A., BURGET Lukáš, DIEZ Sánchez Mireia and SILNOVA Anna. Analysis of the BUT Diarization System for Voxconverse Challenge. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 5819-5823. ISBN 978-1-7281-7605-5.
- KARAFIÁT Martin, VESELÝ Karel, ČERNOCKÝ Jan, PROFANT Ján, NYTRA Jiří, HLAVÁČEK Miroslav and PAVLÍČEK Tomáš. Analysis of X-Vectors for Low-Resource Speech Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 6998-7002. ISBN 978-1-7281-7605-5.
- LANDINI Federico Nicolás, LOZANO Díez Alicia, BURGET Lukáš, DIEZ Sánchez Mireia, SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, GLEMBEK Ondřej, MATĚJKA Pavel, STAFYLAKIS Themos and BRUMMER Johan Nikolaas Langenhoven. BUT System Description for The Third DIHARD Speech Diarization Challenge. In: Proceedings available at Dihard Challenge Github. on-line by LDC and University of Pennsylvania, 2021, pp. 1-5.
- STAFYLAKIS Themos, ROHDIN Johan A. and BURGET Lukáš. Speaker embeddings by modeling channel-wise correlations. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, pp. 501-505. ISSN 1990-9772.
2020
- ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, DAHMANE Mohamed, DIEZ Sánchez Mireia, GLEMBEK Ondřej, LALONDE Marc, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MOŠNER Ladislav, NOISEUX Cédric, MONTEIRO Joao, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, ST-CHARLES Pierre-Luc, WANG Shuai and ZEINALI Hossein. Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, pp. 289-295. ISSN 2312-2846.
- LOZANO Díez Alicia, SILNOVA Anna, PULUGUNDLA Bhargav, ROHDIN Johan A., VESELÝ Karel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej, NOVOTNÝ Ondřej and MATĚJKA Pavel. BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, pp. 761-765. ISSN 1990-9772.
- SILNOVA Anna, BRUMMER Johan Nikolaas Langenhoven, ROHDIN Johan A., STAFYLAKIS Themos and BURGET Lukáš. Probabilistic embeddings for speaker diarization. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, pp. 24-31. ISSN 2312-2846.
- MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A. and ČERNOCKÝ Jan. Utilizing VOiCES dataset for multichannel speaker verification with beamforming. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, pp. 187-193. ISSN 2312-2846.