Microsoft Speech To Text Service

For customers using Learning Tools for OneNote, Word, or Read Aloud in the Microsoft Edge browser, this article documents ways to download new languages for the Text-to-Speech feature in different versions of Windows. Install a new Text-to-Speech language in Windows 10. On any edition of Windows 10, click Start Settings, and then follow these. As an alternative to the Speech SDK, Speech Services allow you to convert speech-to-text using a REST API. Each accessible endpoint is associated with a region. Each accessible endpoint is associated with a region.

Microsoft Speech To Text Service Online
Microsoft Edge Text To Speech
Microsoft Speech To Text Cognitive Services
Microsoft Speech To Text Download
Microsoft Speech To Text Service For Children
Microsoft Text To Speech Voice Downloads

Aug 12, 2013 In my windows service.NET 4.0 application I use SpeechSynthesizer to speech some text. In Windows XP works great, but on Windows 7 it doesn't play anything. (Microsoft Server Speech. Oct 22, 2017 Speech to Text Cortana on Windows 10 can utilize the onboard microphone to recognize speech and convert to text. It seems that I must have an external microphone to set-up and utilize the speech recognition (Control Panel - Ease of Access- Speech Recognition).

This library is now deprecated due to Microsoft's release of an official websocket supported NodeJS/JavaScript SDK for Microsoft Speech Service. Please use that instead of this. Thanks! 🙇🏼‍♀️

(Unofficial) JavaScript service wrapper for Microsoft Speech API. It is an implementation of the Speech Websocket API specifically, which supports long speech recognition up to 10 minutes in length. Are you looking for Microsoft Speech HTTP API (short speech) support instead? This SDK can help you out :)

npm install ms-bing-speech-service

Installation

Install NodeJS on your computer
Create a new directory for your code project if you haven't already
Open a terminal and run npm install ms-bing-speech-service from your project directory

Usage

✨ This library works in both browsers and NodeJS runtime environments ✨ Please see the examples directory in this repo for more in depth examples than those below.

Microsoft Speech API

You'll first need to create a Microsoft Speech API key. You can do this while logged in to the Azure Portal.

The following code will get you up and running with the essentials in Node:

You can also use this library with the async/await pattern!

And in the browser (a global window distribution is also available in dist directory). Use an ArrayBuffer instance in place of a file path:

Live TV may vary by subscription and location.

The above examples will use your subscription key to create an access token with Microsoft's service.

In some instances you may not want to share your subscription key directly with your application. If you're creating an app with multiple users, you may want to issue access tokens from an external API so each user can connect to the speech service without exposing your subscription key.

To do this, replace 'subscriptionKey' in the above code example with 'accessToken' and pass in the provided token.

Custom Speech Service

Yes! You can totally use this with Custom Speech Service. You'll need a few more details in your options object, though.

Your subscriptionKey will be the key displayed on your custom endpoint deployment page in the Custom Speech Management Portal. There, you can also find your websocket endpoint of choice to use.

The following code will get you up and running with the Custom Speech Service:

See the API section of these docs for details on configuration and methods.

API Reference

Methods

SpeechService(options)

optionsObject
ReturnsSpeechService

Creates a new instance of SpeechService.

Available options are below:

name	type	description	default	required
`subscriptionKey`	`String`	your Speech API key	n/a	yes
`accessToken`	`String`	your Speech access token. Only required if subscriptionKey option not supplied.	n/a	no
`language`	`String`	the language you want to translate from. See supported languages in the official Microsoft Speech API docs.	`'en-US'`	no
`mode`	`String`	which recognition mode you'd like to use. Choose from `interactive`, `conversation`, or `dictation`	`'conversation'`	no
`format`	`String`	file format you'd like the text to speech to be returned as. Choose from `simple` or `detailed`	`'simple'`	no

recognizer.start()

Connects to the Speech API websocket on your behalf. Returns a promise.

recognizer.stop()

Disconnects from the established websocket connection to the Speech API. Returns a promise.

recognizer.sendStream(stream)

streamReadable Stream

Sends an audio payload stream to the Speech API websocket connection. Audio payload is a native NodeJS Buffer stream (eg. a readable stream) or an ArrayBuffer in the browser. Returns a promise.

See the 'Sending Audio' section of the official Speech API docs for details on the data format needed.

NodeJS example:

recognizer.sendFile(filepath)

filepathString

Streams an audio file from disk to the Speech API websocket connection. Also accepts a NodeJS Buffer or browser ArrayBuffer. Returns a promise.

See the 'Sending Audio' section of the official Speech API docs for details on the data format needed for the audio file.

Events

You can listen to the following events on the recognizer instance:

recognizer.on('recognition', callback)

callbackFunction

Event listener for incoming recognition message payloads from the Speech API. Message payload is a JSON object.

recognizer.on('close', callback)

callbackFunction

Event listener for Speech API websocket connection closures.

recognizer.on('error', callback)

callbackFunction

Event listener for incoming Speech API websocket connection errors.

recognizer.on('turn.start', callback)

callbackFunction

Event listener for Speech API websocket 'turn.start' event. Fires when service detects an audio stream.

recognizer.on('turn.end', callback)

callbackFunction

Event listener for Speech API websocket 'turn.end' event. Fires after 'speech.endDetected' event and the turn has ended. This event is an ideal one to listen to in order to be notified when an entire stream of audio has been processed and all results have been received.

recognizer.on('speech.startDetected', callback)

callbackFunction

Event listener for Speech API websocket 'speech.startDetected' event. Fires when the service has first detected speech in the audio stream.

recognizer.on('speech.endDetected', callback)

callbackFunction

Event listener for Speech API websocket 'speech.endDetected' event. Fires when the service has stopped being able to detect speech in the audio stream.

recognizer.on('speech.phrase', callback)

callbackFunction

Identical to the recognition event. Event listener for incoming recognition message payloads from the Speech API. Message payload is a JSON object.

recognizer.on('speech.hypothesis', callback)

callbackFunction

Event listener for Speech API websocket 'speech.hypothesis' event. Only fires when using interactive mode. Contains incomplete recognition results. This event will fire often - beware!

recognizer.on('speech.fragment', callback)

callbackFunction

Event listener for Speech API websocket 'speech.fragment' event. Only fires when using dictation mode. Contains incomplete recognition results. This event will fire often - beware!

License

MIT.

Credits

Big thanks to @michael-chi. Their bing speech example was a great foundation to build upon, particularly the response parser and header helper.

Active6 years, 1 month ago

In my windows service .NET 4.0 application I use SpeechSynthesizer to speech some text.In Windows XP works great, but on Windows 7 it doesn't play anything. Why?

Microsoft Speech To Text Service Online

(Microsoft Server Speech Platform Runtime 11.0 + Speech Text To Speech Voice Lucia)

many thanks,Riccardo

RickRick

5511 gold badge8 silver badges25 bronze badges

Microsoft Edge Text To Speech

2 Answers

Please try using System.Speech and System.Speech.Synthesize instead of Microsoft.Speech. I have developed a tool to recoganize the voice using System.Speech which is working good for me in Windows Vista, Windows 7. Please do a check whether your windows speech system is working or not.

To use 'it-IT' culture

Vimal CK

Microsoft Speech To Text Cognitive Services