Bing Speech Synthesis

Price per Channel

$30.00

By using the Bing Speech Synthesis (BingSS) plugin for UniMRCP Server, IVR platforms can use the Microsoft Bing Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

The Microsoft Bing Speech API converts text to speech and supports the following main features.

Human Sounding Speech

Text is instantly synthesized into human-sounding speech and can be used for real-time conversion.

Languages

The text to speech API supports a variety of languages.

Voice Output Parameters

By supporting SSML 1.0, the API allows changing certain characteristics of generated voice output like speaking rate, volume and pronunciation.
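As an illustration of the voice output parameters mentioned above, the following minimal sketch builds an SSML 1.0 document that adjusts speaking rate and volume with the standard `prosody` element. The helper name `build_prosody_ssml` and the sample prompt are hypothetical, and the attribute values each speech service accepts may differ, so treat this as a sketch rather than a reference for any particular plugin.

```python
import xml.etree.ElementTree as ET


def build_prosody_ssml(text, rate="medium", volume="medium", lang="en-US"):
    """Wrap text in an SSML 1.0 <speak> document with a <prosody> element.

    Hypothetical helper for illustration; it is not part of UniMRCP
    or of any speech service SDK.
    """
    speak = ET.Element("speak", {
        "version": "1.0",
        "xmlns": "http://www.w3.org/2001/10/synthesis",
        "xml:lang": lang,
    })
    # rate and volume are standard SSML 1.0 prosody attributes.
    prosody = ET.SubElement(speak, "prosody", {"rate": rate, "volume": volume})
    # ElementTree escapes special characters in the text on serialization.
    prosody.text = text
    return ET.tostring(speak, encoding="unicode")


ssml = build_prosody_ssml("Your call is important to us.", rate="slow", volume="loud")
print(ssml)
```

The resulting string is the kind of content an MRCP client could pass as the body of a synthesis request with an SSML content type.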

Voices

The text to speech API supports a variety of different male and female voices.

Google Speech Synthesis

Price per Channel

$30.00

By using the Google Speech Synthesis (GSS) plugin for UniMRCP Server, IVR platforms can use the Google Cloud Text-to-Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

Google Cloud Text-to-Speech API synthesizes natural-sounding speech, providing the following main features.

Multilingual

Supports 32 voices in 12 languages and variants, with more to come soon.

WaveNet Voices

Exclusive access to DeepMind WaveNet voices that provide the most natural-sounding speech.

Text and SSML support

Customize your speech with SSML tags that allow you to add pauses, numbers, date and time formatting, and other pronunciation instructions.
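The SSML tags described above can be sketched as follows: a pause is inserted with `break`, and a date and a time are marked up with `say-as`. The function name `appointment_prompt` and the sample values are hypothetical, and the `format` values shown are assumptions; check the service documentation for the exact values it accepts.

```python
import xml.etree.ElementTree as ET


def appointment_prompt(name, date_text, time_text):
    """Build an SSML prompt with a pause and date/time formatting hints.

    Hypothetical helper for illustration only.
    """
    speak = ET.Element("speak")
    speak.text = f"Hello {name}."

    # A half-second pause before the rest of the sentence.
    pause = ET.SubElement(speak, "break", {"time": "500ms"})
    pause.tail = " Your appointment is on "

    # Ask the synthesizer to read the value as a date (format is an assumption).
    date = ET.SubElement(speak, "say-as", {"interpret-as": "date", "format": "yyyymmdd"})
    date.text = date_text
    date.tail = " at "

    # Ask the synthesizer to read the value as a 12-hour time (format is an assumption).
    time = ET.SubElement(speak, "say-as", {"interpret-as": "time", "format": "hms12"})
    time.text = time_text
    time.tail = "."

    return ET.tostring(speak, encoding="unicode")


prompt = appointment_prompt("Alex", "2024-03-15", "2:30pm")
print(prompt)
```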

Speaking Rate Tuning

Customize your speaking rate to be 4x faster or slower than the normal rate.

Pitch Tuning

Customize the pitch of your selected voice, up to 20 semitones more or less than the default output.

Volume Gain Control

Increase the output volume by up to 16 dB or decrease it by up to 96 dB.
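The three tuning limits listed above (speaking rate, pitch, and volume gain) can be enforced on the client side before a request is sent. The sketch below clamps each value to the stated range; the `TuningParams` class and `clamp_tuning` function are hypothetical names, and reading "4x faster or slower" as a rate multiplier between 0.25 and 4.0 is an assumption.

```python
from dataclasses import dataclass

# Ranges taken from the feature list above; the 0.25 lower bound assumes
# "4x slower" means one quarter of the normal rate.
SPEAKING_RATE_RANGE = (0.25, 4.0)    # multiplier of the normal rate
PITCH_RANGE = (-20.0, 20.0)          # semitones relative to the default
VOLUME_GAIN_DB_RANGE = (-96.0, 16.0) # dB relative to the default


def clamp(value, bounds):
    """Limit value to the inclusive (low, high) bounds."""
    low, high = bounds
    return max(low, min(high, value))


@dataclass(frozen=True)
class TuningParams:
    """Hypothetical container for the three tuning values."""
    speaking_rate: float = 1.0
    pitch: float = 0.0
    volume_gain_db: float = 0.0


def clamp_tuning(speaking_rate=1.0, pitch=0.0, volume_gain_db=0.0):
    """Return tuning values forced into the supported ranges."""
    return TuningParams(
        speaking_rate=clamp(speaking_rate, SPEAKING_RATE_RANGE),
        pitch=clamp(pitch, PITCH_RANGE),
        volume_gain_db=clamp(volume_gain_db, VOLUME_GAIN_DB_RANGE),
    )


params = clamp_tuning(speaking_rate=8.0, pitch=-30.0, volume_gain_db=20.0)
print(params)
```

Clamping silently corrects out-of-range input; an application that prefers to reject bad values could raise an error instead.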

Watson Speech Synthesis

Price per Channel

$30.00

By using the Watson Speech Synthesis (SS) plugin for UniMRCP Server, IVR platforms can use the IBM Watson Text to Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

The IBM Watson Text to Speech API converts text to speech and supports the following main features.

Human Sounding Speech

Text is instantly synthesized into human-sounding speech and can be used for real-time conversion.

Languages

The text to speech API supports a variety of languages.

Voice Output Parameters

By supporting SSML 1.0, the API allows changing certain characteristics of generated voice output like speaking rate, volume and pronunciation.

Voices

The text to speech API supports a variety of different male and female voices.

Polly Speech Synthesis

Price per Channel

$30.00

By using the Amazon Web Services (AWS) Polly plugin for UniMRCP Server, IVR platforms can use the AWS Polly Text-to-Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

AWS Polly is a Text-to-Speech service that uses advanced deep learning technologies to synthesize speech that sounds like a human voice.

High Quality

Polly uses best-in-class Text-to-Speech (TTS) technology to synthesize natural speech with high pronunciation accuracy (including abbreviations, acronym expansions, date/time interpretations, and homograph disambiguation).

Low Latency

Polly ensures fast response times, which make it a viable option for low-latency use cases such as dialog systems.

Large Portfolio of Languages and Voices

Polly supports dozens of voices and multiple languages, offering male and female voice options for most languages.

Cloud-based Solution

Text-to-Speech conversion done in the cloud dramatically reduces local resource requirements. This enables support of all the available languages and voices at the best possible quality. Moreover, speech improvements are instantly available to all end-users and do not require additional updates for devices.

Yandex Speech Synthesis

Price per Channel

$30.00

By using the Yandex Speech Synthesis (SS) plugin for UniMRCP Server, IVR platforms can use the Yandex SpeechKit Text to Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

The Yandex SpeechKit Text to Speech API converts text to speech and supports the following main features.

Natural-sounding Speech

Yandex SpeechKit composes speech from more than a million individual phonemes, with intonation set by a neural network trained on numerous real-life examples.

Languages

The text to speech API currently supports four languages.

Real-time Synthesis

The API responds quickly enough to allow efficient streaming of audio data.

Voices

The text to speech API supports a variety of different male and female voices.
