Facebook taught to recognize voices and translate them into texts

Facebook taught to recognize voices and translate them into texts

11 July 2020, 21:53
A source: © hightech.fm
234
Facebook engineers introduced a new model that can identify up to five different voices, then translate them into text or split them into different tracks.

Facebook’s Artificial Intelligence (AI) taught you how to identify up to five different voices in one conversation, translate them into text, or split them into five different tracks. The team claims that the new method is superior to all analogues in the quality and speed of separation of speech sources, noise reduction and reverb.

Facebook used a new recurrent neural network to create a new class of algorithms using an internal state similar to memory to process sequences of variable inputs. In this case, the model can automatically identify speakers and select a speech model.

Speech separation is a critical step towards improving communication in a variety of applications — using voice messaging or streaming audio. In addition, the methods of speech separation proposed by the researchers can be used to suppress background noise, for example, when recording musical instruments.
Search for lots
* Select a section
Search section
Search:
Search results in:
Cookies
We use essential cookies for the proper functioning of the website and additional ones to make interaction with the site as convenient as possible. It helps us personalize your user experience as well as obtain analytical information to improve the service. If you agree to accept all cookies, click "Accept all"; if not, click "Only essential". To learn more, view the Cookie Policy.