Login

ankush gawande · 08-16-2017, 08:32 PM

Voice recognition (SR) is the interdisciplinary sub-field of computational linguistics that develops methodologies and technologies that allow the recognition and translation of spoken language in the text by computers. It is also known as "automatic speech recognition" (ASR), "computer voice recognition", or simply "speech to text" (STT). It incorporates knowledge and research in the fields of linguistics, computer science and electrical engineering.

Some SR systems use "training" (also called "inscription") where an individual speaker reads isolated text or vocabulary in the system. The system analyzes the person's specific voice and uses it to fine-tune that person's speech recognition, resulting in greater accuracy. Systems that do not use training are called "speaker independent" systems. Systems that use training are called "speaker dependent".

Voice recognition applications include voice user interfaces such as voice dialing (eg, "Call home"), call routing (eg, "I would like to make a collective call"), domotic device control, paging (For example, entering a credit card number), preparation of structured documents (for example, a radiology report), voice-to-text processing (eg word processors or e-mails).

The term speech recognition or speaker identification refers to the speaker's identification, rather than what they are saying. Recognizing the speaker can simplify the task of translating speech into systems that have been trained in the voice of a specific person or can be used to authenticate or verify the identity of a speaker as part of a security process.

From the technological point of view, speech recognition has a long history with several waves of important innovations. More recently, the field has benefited from advances in deep learning and great data. Advances are evidenced not only by the wave of scholarly articles published in the field, but more importantly by the adoption in the global industry of a variety of deep learning methods in the design and deployment of speech recognition systems. These players in the speech industry include Google, Microsoft, IBM, Baidu, Apple, Amazon, Nuance, SoundHound, IflyTek, CDAC many of which have unveiled the core technology in their voice recognition systems as based on deep learning .

anjith · 08-16-2017, 08:32 PM

sir, I need speech to text conversion code(program) in matlab .. i request you to send the code.. thank you sir..