Τρίτη 27 Νοεμβρίου 2018

An Iterative Mask Estimation Approach to Deep Learning Based Multi-Channel Speech Recognition

Publication date: Available online 26 November 2018Source: Speech CommunicationAuthor(s): Yan-Hui Tu, Jun Du, Lei Sun, Feng Ma, Hai-Kun Wang, Jing-Dong Chen, Chin-Hui LeeAbstractWe propose a novel iterative mask estimation (IME) framework to improve the state-of-the-art complex Gaussian mixture model (CGMM)-based beamforming approach in an iterative manner by leveraging upon the complementary information obtained from different deep models. Although CGMM has been recently demonstrated to be quite effective for multi-channel, automation speech recognition (ASR) in operational scenarios, the corresponding mask estimation, however, is not always accurate in adverse environments due to the lack of prior or context information. To address this problem, in this study, a neural-network-based idea...

from #Head and Neck by Sfakianakis via simeraentaxei on Inoreader https://ift.tt/2RhNGKR

Δεν υπάρχουν σχόλια:

Δημοσίευση σχολίου

Σημείωση: Μόνο ένα μέλος αυτού του ιστολογίου μπορεί να αναρτήσει σχόλιο.