Posted  by  admin

Microsoft Speech Api Download

For customers using Learning Tools for OneNote, Word, or Read Aloud in the Microsoft Edge browser, this article documents ways to download new languages for the Text-to-Speech feature in different versions of Windows.

  1. Get Familiar With Batch Transcription
  2. Microsoft Speech Api 4.0 Download
  3. Microsoft Speech Api Python

Install a new Text-to-Speech language in Windows 10

On any edition of Windows 10, click Start > Settings, and then follow these steps:

  1. Select the Start button, then select Settings > Time & Language > Region & Language.

  2. Select Add a language and choose the language you want from the list. Only languages listed in the table below will have Text-to-Speech capabilities.

  3. After the new language has been installed (this may take a few moments), select it in the Region & Language list, and then select Options.

  4. Under Language options > Speech, select Download.

  5. Restart your computer. The Text-to-Speech voices will be installed when your machine turns back on.

Microsoft Speech API (SAPI) 5.4. This is the documentation for Microsoft Speech API (SAPI) 5.4, the native API for Windows. These are interfaces and enumerations that have been added for the SAPI 5.4 release: New SAPI 5.4 Interfaces. New SAPI 5.4 Enumerations. The Speech service, part of Azure Cognitive Services, is certified by SOC, FedRAMP, PCI DSS, HIPAA, HITECH, and ISO. Your data remains yours. Your text data isn’t stored during data processing or audio generation. View and delete your custom voice data and synthesized speech models at any time. Your data is encrypted while it’s in storage.

Text-to-Speech languages and voices available in Windows

Language (Region)

Male voice name

Female voice name

Arabic

Not applicable

Hoda

Arabic (Saudi Arabia)

Naayf

Not applicable

Brazil (pt-BR)

Daniel

Maria

Catalan Spanish (Spain)

Not applicable

Herena

China (zh-CN)

Kangkang

Huihui, yaoyao

Czech (Czech Republic)

Vit

Not applicable

Danish

Not applicable

Helle

Dutch

Frank

Not applicable

English (Australia)

James

Catherine

English (Canada)

Richard

Linda

English (Great Britain)

George

Hazel, Susan

English (India)

Ravi

Heera

English (Ireland)

Shaun

Not applicable

English (United States)

David, Mark

Zira

Finnish

Not applicable

Heidi

Flemish (Belgian Dutch)

Bart

Not applicable

French (Canada)

Claude

Caroline

French (France)

Paul

Hortence, Julie

German (Germany)

Stefan

Hedda, Katja

German (Switzerland)

Karsten

Not applicable

Greek

Stefanos

Not applicable

Hindi (India)

Hermant

Kalpana

Hong Kong (zh-HK)

Danny

Tracy

Hungarian (Hungary)

Szabolcs

Not applicable

Indonesian (Indonesia)

Andika

Not applicable

Italian

Cosimo

Elsa

Japanese

Ichiro

Ayumi, Haruka

Norwegian

Jon

Not applicable

Polish (Poland)

Adam

Paulina

Portuguese (Portugal)

Not applicable

Helia

Romanian (Romania)

Andrei

Not applicable

Russian (Russia)

Pavel

Irina

Slovak (Slovakia)

Filip

Not applicable

Korean

Not applicable

Heami

Spanish (Spain)

Pablo

Helena, Laura

Spanish (Mexico)

Raul

Sabina

Swedish

Bengt

Not applicable

Taiwan (zh-TW)

Zhiwei

Yating, Hanhan

Thai (Thailand)

Pettara

Not applicable

Turkish

Tolga

Not applicable


Third-party Text-to-Speech languages:

Additional Text-to-Speech languages can be purchased from the following third-party providers:

Note: These options are provided for informational purposes only. Microsoft does not endorse any particular third-party software, nor can it offer any support for their installation and use. For help with these products, please contact their original manufacturer.

Open source Text-to-Speech languages

Free Text-to-Speech languages are available for download from Open source provider eSpeak. These languages work on Windows 7, but some may not yet work on Windows 8, Windows 8.1, or Windows 10. View a list of available eSpeak languages and codes for more information.

Get Familiar With Batch Transcription

Download eSpeak languages

  1. In your browser, visit http://espeak.sourceforge.net/download.html.

  2. Download the 'Compiled for Windows' version, titled setup_espeak-1.48.04.exe.

  3. When the download is finished, click Next at the first Setup screen to begin installation.

  4. Confirm the installation path, and then click Next.

  5. Enter the two-letter code(s) for the language(s) and flag(s) that you want to install. For example, if you planned to add Text-to-Speech for English, Spanish, Polish, Swedish, and Czech, your screen would look like this:

    To use alternate voices for a language, you can select additional commands to change various voice and pronunciation attributes. For more information, visit http://espeak.sourceforge.net/voices.html. Pre-set voice variants can be applied to any of the language voices by appending a plus sign (+) and a variant name. The variants for male voices are +m1, +m2, +m3, +m4, +m5, +m6, and +m7. The variants for female voices are +f1, +f2, +f3, +f4, and +f5. You can also choose optional voice effects such as +croak or +whisper.

  6. Select Next twice, then select Install to complete installation.

Microsoft
page_typelanguagesnamedescriptionproducts
csharp
javascript
objc
swift
Learn how to use the Microsoft Cognitive Services Speech SDK to add speech-enabled features to your apps.

This project hosts the samples for the Microsoft Cognitive Services Speech SDK. To find out more about the Microsoft Cognitive Services Speech SDK itself, please visit the SDK documentation site.

News

Please check here for release notes and older releases.

Features

This repository hosts samples that help you to get started with several features of the SDK.In addition more complex scenarios are included to give you a head-start on using speech technology in your application.

We tested the samples with the latest released version of the SDK on Windows 10, Linux (on supported Linux distributions and target architectures), Android devices (API 23: Android 6.0 Marshmallow or higher), Mac x64 (OS version 10.14 or higher) and Mac M1 arm64 (OS version 11.0 or higher) and iOS 11.4 devices.

Getting Started

The SDK documentation has extensive sections about getting started, setting up the SDK, as well as the process to acquire the required subscription keys. You will need subscription keys to run the samples on your machines, you therefore should follow the instructions on these pages before continuing.

Get the samples

  • The easiest way to use these samples without using Git is to download the current version as a ZIP file.

    • On Windows, before you unzip the archive, right-click it, select Properties, and then select Unblock.
    • Be sure to unzip the entire archive, and not just individual samples.
  • Clone this sample repository using a Git client.

Build and run the samples

Note: the samples make use of the Microsoft Cognitive Services Speech SDK.By downloading the Microsoft Cognitive Services Speech SDK, you acknowledge its license, see Speech SDK license agreement.

Please see the description of each individual sample for instructions on how to build and run it.

Create a free Azure account

Speech recognition quickstarts

The following quickstarts demonstrate how to perform one-shot speech recognition using a microphone.If you want to build them from scratch, please follow the quickstart or basics articles on our documentation page.

QuickstartPlatformDescription
Quickstart C++ for LinuxLinuxDemonstrates one-shot speech recognition from a microphone.
Quickstart C++ for WindowsWindowsDemonstrates one-shot speech recognition from a microphone.
Quickstart C++ for macOSmacOS
Quickstart C# .NET for WindowsWindowsDemonstrates one-shot speech recognition from a microphone.
Quickstart C# .NET CoreWindows, Linux, macOSDemonstrates one-shot speech recognition from a microphone.
Quickstart C# UWP for WindowsWindowsDemonstrates one-shot speech recognition from a microphone.
Quickstart C# Unity (Windows or Android)Windows, AndroidDemonstrates one-shot speech recognition from a microphone.
Quickstart for AndroidAndroidDemonstrates one-shot speech recognition from a microphone.
Quickstart Java JREWindows, Linux, macOSDemonstrates one-shot speech recognition from a microphone.
Quickstart JavaScriptWebDemonstrates one-shot speech recognition from a microphone.
Quickstart Node.jsNode.jsDemonstrates one-shot speech recognition from a file.
Quickstart PythonWindows, Linux, macOSDemonstrates one-shot speech recognition from a microphone.
Quickstart Objective-C iOSiOSDemonstrates one-shot speech recognition from a file with recorded speech.
Quickstart Swift iOSiOSDemonstrates one-shot speech recognition from a microphone.
Quickstart Objective-C macOSmacOSDemonstrates one-shot speech recognition from a microphone.
Quickstart Swift macOSmacOSDemonstrates one-shot speech recognition from a microphone.

Speech translation quickstarts

The following quickstarts demonstrate how to perform one-shot speech translation using a microphone.If you want to build them from scratch, please follow the quickstart or basics articles on our documentation page.

QuickstartPlatformDescription
Quickstart C++ for WindowsWindowsDemonstrates one-shot speech translation/transcription from a microphone.
Quickstart C# .NET Framework for WindowsWindowsDemonstrates one-shot speech translation/transcription from a microphone.
Quickstart C# .NET CoreWindows, Linux, macOSDemonstrates one-shot speech translation/transcription from a microphone.
Quickstart C# UWP for WindowsWindowsDemonstrates one-shot speech translation/transcription from a microphone.
Quickstart Java JREWindows, Linux, macOSDemonstrates one-shot speech translation/transcription from a microphone.

Speech synthesis quickstarts

The following quickstarts demonstrate how to perform one-shot speech synthesis to a speaker.If you want to build them from scratch, please follow the quickstart or basics articles on our documentation page.

QuickstartPlatformDescription
Quickstart C++ for LinuxLinuxDemonstrates one-shot speech synthesis to the default speaker.
Quickstart C++ for WindowsWindowsDemonstrates one-shot speech synthesis to the default speaker.
Quickstart C++ for macOSmacOSDemonstrates one-shot speech synthesis to the default speaker.
Quickstart C# .NET for WindowsWindowsDemonstrates one-shot speech synthesis to the default speaker.
Quickstart C# UWP for WindowsWindowsDemonstrates one-shot speech synthesis to the default speaker.
Quickstart C# .NET CoreWindows, LinuxDemonstrates one-shot speech synthesis to the default speaker.
Quickstart for C# Unity (Windows or Android)Windows, AndroidDemonstrates one-shot speech synthesis to a synthesis result and then rendering to the default speaker.
Quickstart for AndroidAndroidDemonstrates one-shot speech synthesis to the default speaker.
Quickstart Java JREWindows, Linux, macOSDemonstrates one-shot speech synthesis to the default speaker.
Quickstart PythonWindows, Linux, macOSDemonstrates one-shot speech synthesis to the default speaker.
Quickstart Objective-C iOSiOSDemonstrates one-shot speech synthesis to a synthesis result and then rendering to the default speaker.
Quickstart Swift iOSiOSDemonstrates one-shot speech synthesis to the default speaker.
Quickstart Objective-C macOSmacOSDemonstrates one-shot speech synthesis to the default speaker.
Quickstart Swift macOSmacOSDemonstrates one-shot speech synthesis to the default speaker.

Voice assistants quickstarts

The following quickstarts demonstrate how to create a custom voice assistant.The applications will connect to a previously authored bot configured to use the Direct Line Speech channel, send a voice request, and return a voice response activity (if configured).If you want to build these quickstarts from scratch, please follow the quickstart or basics articles on our documentation page.

QuickstartPlatformDescription
Quickstart Java JREWindows, Linux, macOSDemonstrates speech recognition through the DialogServiceConnector and receiving activity responses.
Quickstart C# UWP for WindowsWindowsDemonstrates speech recognition through the DialogServiceConnector and receiving activity responses.
Get familiar with Batch transcription

Samples

The following samples demonstrate additional capabilities of the Speech SDK, such as additional modes of speech recognition as well as intent recognition and translation. Voice Assistant samples can be found in a separate GitHub repo.

SamplePlatformDescription
C++ Console app for WindowsWindowsDemonstrates speech recognition, speech synthesis, intent recognition, conversation transcription and translation
C++ Speech Recognition from MP3/Opus file (Linux only)LinuxDemonstrates speech recognition from an MP3/Opus file
C# Console app for .NET Framework on WindowsWindowsDemonstrates speech recognition, speech synthesis, intent recognition, and translation
C# Console app for .NET Core (Windows or Linux)Windows, Linux, macOSDemonstrates speech recognition, speech synthesis, intent recognition, and translation
Java Console app for JREWindows, Linux, macOSDemonstrates speech recognition, speech synthesis, intent recognition, and translation
Python Console appWindows, Linux, macOSDemonstrates speech recognition, speech synthesis, intent recognition, and translation
Speech-to-text WPF sample for .NET Framework on WindowsWindowsDemonstrates speech recognition
Translation WPF sample for .NET Framework on WindowsWindowsDemonstrates translation
Speech-to-text UWP sampleWindowsDemonstrates speech recognition
Text-to-speech UWP sampleWindowsDemonstrates speech synthesis
Speech recognition sample for AndroidAndroidDemonstrates speech and intent recognition
Speech recognition and translation sample for the browser, using JavaScriptWebDemonstrates speech recognition, intent recognition, and translation
Speech recognition and translation sample using JavaScript and Node.jsNode.jsDemonstrates speech recognition, intent recognition, and translation
Speech recognition sample for iOS using a connection objectiOSDemonstrates speech recognition
Extended speech recognition sample for iOSiOSDemonstrates speech recognition using streams etc.
Speech synthesis sample for iOSiOSDemonstrates speech synthesis using streams etc.
C# UWP DialogServiceConnector sample for WindowsWindowsDemonstrates speech recognition through the DialogServiceConnector and receiving activity responses.
C# Unity sample for Windows or AndroidWindows, AndroidDemonstrates speech recognition, intent recognition, and translation for Unity
C# Unity SpeechBotConnector sample for Windows or AndroidWindows, AndroidDemonstrates speech recognition through the SpeechBotConnector and receiving activity responses.

Samples for using the Speech Service REST API (no Speech SDK installation required):

SampleDescription
Batch transcriptionDemonstrates usage of batch transcription from different programming languages

Microsoft Speech Api 4.0 Download

Sample data for Custom Speech

Microsoft Speech Api Python

Resources