Project Details
NTT - Speech enhancement front-end for robust automatic speech recognition with large amount of training data
Project Period: 1. 1. 2021 - 31. 12. 2021
Project Type: contract
Partner: NTT Corporation
Czech title
Parametrizace s obohacováním řeči pro robustní automatické rozpoznávání řeči s velkým objemem trénovacích dat
Type
contract
Keywords
speech recognition, robustness, large data, DNN embeddings
Abstract
The purpose of the Joint Research is to develop Speech enhancement front-end for robust automatic speech recognition with large amount of training data through the cooperation of NTT and BUT. The work is relying on embeddings produced by neural networks in various places of the processing chain.
Team members
Žmolíková Kateřina, Ing., Ph.D.
(UPGM FIT VUT)
, research leader
Černocký Jan, prof. Dr. Ing. (UPGM FIT VUT) , team leader
Kocour Martin, Ing. (UPGM FIT VUT)
Švec Ján, Ing. (UPGM FIT VUT)
Černocký Jan, prof. Dr. Ing. (UPGM FIT VUT) , team leader
Kocour Martin, Ing. (UPGM FIT VUT)
Švec Ján, Ing. (UPGM FIT VUT)
Publications
2021
- DELCROIX Marc, ŽMOLÍKOVÁ Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke and NAKATANI Tomohiro. Speaker activity driven neural speech extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Toronto: IEEE Signal Processing Society, 2021, pp. 6099-6103. ISBN 978-1-7281-7605-5. Detail