POST https://api.novita.ai/v3/minimax-speech-2.8-turbo
MiniMax Speech 2.8 Turbo Sync Text-to-Speech
curl --request POST \
  --url https://api.novita.ai/v3/minimax-speech-2.8-turbo \
  --header 'Authorization: Bearer <API Key>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "text": "<string>",
  "stream": true,
  "voice_modify": {
    "pitch": 123,
    "timbre": 123,
    "intensity": 123,
    "sound_effects": "<string>"
  },
  "audio_setting": {
    "format": "<string>",
    "bitrate": 123,
    "channel": 123,
    "force_cbr": true,
    "sample_rate": 123
  },
  "output_format": "<string>",
  "voice_setting": {
    "vol": 123,
    "pitch": 123,
    "speed": 123,
    "emotion": "<string>",
    "voice_id": "<string>",
    "latex_read": true,
    "text_normalization": true
  },
  "aigc_watermark": true,
  "language_boost": "<string>",
  "stream_options": {
    "exclude_aggregated_audio": true
  },
  "timber_weights": [
    {
      "weight": 123,
      "voice_id": "<string>"
    }
  ],
  "subtitle_enable": true,
  "continuous_sound": true,
  "pronunciation_dict": {
    "tone": [
      {}
    ]
  }
}
'

Example Response

{
  "data": {
    "audio": "<string>",
    "status": 123,
    "subtitle_file": "<string>"
  },
  "trace_id": "<string>",
  "base_resp": {
    "status_msg": "<string>",
    "status_code": 123
  },
  "extra_info": {
    "bitrate": 123,
    "audio_size": 123,
    "word_count": 123,
    "audio_format": "<string>",
    "audio_length": 123,
    "audio_channel": 123,
    "usage_characters": 123,
    "audio_sample_rate": 123,
    "invisible_character_ratio": 123
  }
}
Synchronous MiniMax text-to-speech API over HTTP. Supports settings for voice, emotion, speed, and other parameters.

Request Headers

Content-Type
string
required
Supports: application/json
Authorization
string
required
Bearer authentication format, for example: Bearer {{API Key}}.
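
As a minimal sketch of the two required headers, the Python snippet below builds a request object without sending it. The helper name build_request and the NOVITA_API_KEY environment variable are assumptions made for illustration; only the endpoint URL, the application/json content type, and the Bearer format come from this page.

```python
import json
import os
import urllib.request

# Endpoint taken from the top of this page.
ENDPOINT = "https://api.novita.ai/v3/minimax-speech-2.8-turbo"


def build_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying the two required headers."""
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


if __name__ == "__main__":
    # NOVITA_API_KEY is a hypothetical variable name, not defined by this page.
    key = os.environ.get("NOVITA_API_KEY", "sk-example")
    req = build_request(key, {"text": "Hello"})
    print(req.get_method(), req.full_url)
    print(req.header_items())
```

Passing the returned object to urllib.request.urlopen would send the request; that step is left out here so the sketch runs without network access or a real key.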

Request Body

text
string
required
Text to synthesize into speech. Must be fewer than 10,000 characters; streaming output is recommended when the text is longer than 3,000 characters. Supports paragraph breaks (newlines), pause control (the <#x#> tag), and interjection tags such as (laughs) and (coughs). Interjection tags are supported only by speech-2.8-hd and speech-2.8-turbo.
stream
boolean
default:false
Whether to enable streaming output. Default is false.
voice_modify
object
audio_setting
object
output_format
string
default:"hex"
Output format of the synthesized audio, either url or hex. Default is hex. This parameter applies only to non-streaming requests. A returned URL is valid for 24 hours.
Optional values: url, hex
voice_setting
object
aigc_watermark
boolean
default:false
Whether to add an audio rhythm identifier at the end of the synthesized audio. Default is false. This parameter applies only to non-streaming synthesis.
language_boost
string
Whether to enhance recognition of the specified minor languages and dialects. Default is null. Set to auto to let the model decide automatically.
Optional values: Chinese, Chinese,Yue, English, Arabic, Russian, Spanish, French, Portuguese, German, Turkish, Dutch, Ukrainian, Vietnamese, Indonesian, Japanese, Italian, Korean, Thai, Polish, Romanian, Greek, Czech, Finnish, Hindi, Bulgarian, Danish, Hebrew, Malay, Persian, Slovak, Swedish, Croatian, Filipino, Hungarian, Norwegian, Slovenian, Catalan, Nynorsk, Tamil, Afrikaans, auto
stream_options
object
timber_weights
array
Voice-mixing settings. Supports mixing up to 4 voices.
subtitle_enable
boolean
default:false
Whether to enable the subtitle service. Default is false. This parameter applies only to non-streaming output and only to the speech-2.6-hd, speech-2.6-turbo, speech-02-turbo, speech-02-hd, speech-01-turbo, and speech-01-hd models.
continuous_sound
boolean
default:false
When enabled, makes transitions between clauses sound more natural. Supported only by the speech-2.8-hd and speech-2.8-turbo models.
pronunciation_dict
object
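
The request-body limits described above (text under 10,000 characters, streaming recommended above 3,000 characters, at most 4 mixed voices, output_format of url or hex for non-streaming requests) can be checked on the client before sending. The sketch below shows one way to do that. The function name build_payload, its arguments, and the choice to turn stream on automatically for long text are assumptions made for illustration, not behavior of the API, and the voice_id value is a placeholder.

```python
from typing import Optional

MAX_TEXT_CHARS = 10_000          # "text" must be shorter than this
STREAM_RECOMMENDED_OVER = 3_000  # streaming is recommended above this length
MAX_MIXED_VOICES = 4             # "timber_weights" allows up to 4 voices


def build_payload(
    text: str,
    voice_id: Optional[str] = None,
    timber_weights: Optional[list] = None,
    output_format: str = "hex",
    stream: Optional[bool] = None,
) -> dict:
    """Validate inputs against the documented limits and return a request body."""
    if len(text) >= MAX_TEXT_CHARS:
        raise ValueError(f"text must be under {MAX_TEXT_CHARS} characters")
    if output_format not in ("url", "hex"):
        raise ValueError("output_format must be 'url' or 'hex'")
    if timber_weights and len(timber_weights) > MAX_MIXED_VOICES:
        raise ValueError(f"timber_weights supports at most {MAX_MIXED_VOICES} voices")

    # Assumption of this sketch: follow the streaming recommendation
    # automatically unless the caller chose explicitly.
    if stream is None:
        stream = len(text) > STREAM_RECOMMENDED_OVER

    payload = {"text": text, "stream": stream}
    if not stream:
        # output_format is only valid for non-streaming requests.
        payload["output_format"] = output_format
    if voice_id:
        payload["voice_setting"] = {"voice_id": voice_id}
    if timber_weights:
        payload["timber_weights"] = list(timber_weights)
    return payload


if __name__ == "__main__":
    # "example-voice" is a placeholder, not a real voice ID.
    print(build_payload("Hello there.", voice_id="example-voice"))
```

Failing early on the client avoids a round trip for requests that would break a documented limit; the server remains the authority on what it accepts.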

Response

data
object
The returned synthesis data object. May be null, so check for null before use.
trace_id
string
Session ID for this request, used for troubleshooting
base_resp
object
Status code and details for this request
extra_info
object
Additional audio information
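
A non-streaming response with output_format set to hex can be handled roughly as follows: check base_resp, guard against a null data object, then decode the hex string into audio bytes. In this sketch the function name extract_audio is hypothetical, and treating a status_code of 0 as success is an assumption that this page does not state, so verify it against the status-code documentation. The sample response is made up for illustration.

```python
def extract_audio(response: dict) -> bytes:
    """Return decoded audio bytes from a parsed non-streaming JSON response."""
    base = response.get("base_resp") or {}
    code = base.get("status_code")
    # ASSUMPTION: status_code 0 means success; this page does not define the codes.
    if code != 0:
        raise RuntimeError(
            f"request failed: status_code={code}, "
            f"status_msg={base.get('status_msg')!r}, "
            f"trace_id={response.get('trace_id')!r}"
        )

    # "data" may be null, so check it before reading fields from it.
    data = response.get("data")
    if data is None:
        raise RuntimeError("response contained no data object")

    audio_hex = data.get("audio")
    if not audio_hex:
        raise RuntimeError("response contained no audio")

    # With output_format "hex", the audio field is a hex-encoded byte string.
    return bytes.fromhex(audio_hex)


if __name__ == "__main__":
    # Made-up sample; "494433" is the hex encoding of the bytes b"ID3".
    sample = {
        "data": {"audio": "494433", "status": 2},
        "trace_id": "example-trace",
        "base_resp": {"status_code": 0, "status_msg": "success"},
    }
    audio = extract_audio(sample)
    print(len(audio), "bytes decoded")
```

Including trace_id in the error message keeps the session ID at hand for troubleshooting.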