Xiaoyi voice

The voice Xiaoyi is available in the Azure Text-to-Speech service for the Chinese language.

Voice Name: Xiaoyi
Voice ID: zh-CN-XiaoyiNeural
Language: Chinese
Gender: Female
Words Per Minute: 263

How to use Xiaoyi voice in your videos

To use Xiaoyi voice in your videos, you can use the following JSON2Video code:

Xiaoyi supports SSML

SSML stands for Speech Synthesis Markup Language. It's a way to add instructions to your text so that a Text-To-Speech (TTS) system knows how to read it aloud.

You use SSML like HTML, but for controlling speech. It helps you adjust things like: Pronunciation, Pauses, Pitch and Volume, Emphasis, Speaking Rate.

Xiaoyi supports different voice styles

As part of SSML, you can use the style tags to change the voice style.

Xiaoyi supports these styles: affectionate angry cheerful disgruntled embarrassed fearful gentle sad serious

Xiaoyi is a neural voice

In Azure Cognitive Services, a Neural voice refers to a voice generated using neural network technology. This means the Text-To-Speech system uses advanced machine learning models to create more natural, human-like speech compared to traditional methods.

Key characteristics of Neural voices: