Masaru voice
The voice Masaru is available in the Azure Text-to-Speech service for the Japanese language.
How to use Masaru voice in your videos
To use Masaru voice in your videos, you can use the following JSON2Video code:
{
"type": "voice",
"model": "azure",
"voice": "ja-JP-MasaruMultilingualNeural",
"text": "\u6625\u306b\u306a\u308b\u3068\u3001\u5ead\u306f\u8272\u3068\u308a\u3069\u308a\u306e\u82b1\u3084\u9cf4\u304d\u9ce5\u3067\u3044\u3063\u3071\u3044\u306b\u306a\u308a\u307e\u3059\u3002\u53e4\u3044\u30aa\u30fc\u30af\u306e\u6728\u306f\u8a2a\u308c\u308b\u4eba\u3005\u306b\u65e5\u9670\u3092\u63d0\u4f9b\u3057\u3001\u30d0\u30e9\u306e\u9593\u3067\u8776\u3005\u304c\u8e0a\u308a\u307e\u3059\u3002\u5c0f\u3055\u306a\u5674\u6c34\u306f\u9759\u304b\u306a\u97f3\u3092\u594f\u3067\u3001\u81ea\u7136\u306e\u7f8e\u3057\u3055\u3092\u697d\u3057\u3093\u3067\u30ea\u30e9\u30c3\u30af\u30b9\u3059\u308b\u306e\u306b\u6700\u9069\u306a\u30b9\u30dd\u30c3\u30c8\u3068\u306a\u308a\u307e\u3059\u3002"
}
Masaru supports SSML
SSML stands for Speech Synthesis Markup Language. It's a way to add instructions to your text so that a Text-To-Speech (TTS) system knows how to read it aloud.
You use SSML like HTML, but for controlling speech. It helps you adjust things like: Pronunciation, Pauses, Pitch and Volume, Emphasis, Speaking Rate.
{
"type": "voice",
"voice": "ja-JP-MasaruMultilingualNeural",
"text": "<speak>Hello, <break time="500ms"/> how are you today? <emphasis level="strong">This is important!</emphasis></speak>"
}
Masaru is a neural voice
In Azure Cognitive Services, a Neural voice refers to a voice generated using neural network technology. This means the Text-To-Speech system uses advanced machine learning models to create more natural, human-like speech compared to traditional methods.
Key characteristics of Neural voices:
- More expressive and realistic
- Better at handling pitch, tone, and rhythm variations
- Sounds closer to how humans naturally speak
Masaru is a multilingual voice
A Multilingual voice in Azure Cognitive Services refers to a voice that can speak in multiple languages or accents while maintaining a consistent speaking style.
Key points about Multilingual voices:
- The same voice can pronounce text in different languages accurately
- Useful for applications that switch between languages or need to handle multilingual content
- Maintains natural tone and consistency, even when switching languages