GLM Audio to Text

curl --request POST \
  --url https://api.novita.ai/v3/glm-asr \
  --header 'Authorization: Bearer <API Key>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "file": "<string>",
  "prompt": "<string>",
  "hotwords": [
    "<string>"
  ]
}
'

{
  "text": "<string>"
}

POST https://api.novita.ai/v3/glm-asr

Transcribes audio files into text using the GLM-ASR-2512 model, with support for multiple languages.

Request Headers

Content-Type

string

required

Supported value: application/json

Authorization

string

required

Uses Bearer authentication, for example: Bearer {{API Key}}.

Request Body

file

string

required

The audio to transcribe, given as a file URL or a Base64-encoded string. Supported audio formats: .wav and .mp3. Limits: file size ≤ 25 MB, audio duration ≤ 30 seconds.

prompt

string

For long-text scenarios, you can pass the previous transcription result as context. Keeping it under 8000 characters is recommended.
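Because each request accepts at most 30 seconds of audio, a longer recording has to be split into segments by the caller. The following is a minimal sketch of carrying context from one segment to the next through `prompt`. It assumes a hypothetical `transcribe_segment(file, prompt)` callable that wraps the HTTP request and returns the `text` field. Trimming the context to its last 8000 characters and joining results with a space are illustrative choices, not documented behavior.

```python
from typing import Callable, Iterable, List

MAX_PROMPT_CHARS = 8000  # recommended upper bound from the parameter description


def transcribe_in_sequence(
    segments: Iterable[str],
    transcribe_segment: Callable[[str, str], str],
) -> str:
    """Transcribe segments in order, passing earlier text as context.

    `segments` holds one URL or Base64 string per audio segment (each <= 30 s).
    `transcribe_segment` is a hypothetical helper: (file, prompt) -> text.
    """
    pieces: List[str] = []
    for segment in segments:
        # Keep only the tail of the running transcript as the prompt.
        context = " ".join(pieces)[-MAX_PROMPT_CHARS:]
        pieces.append(transcribe_segment(segment, context))
    return " ".join(pieces)
```

The first segment is sent with an empty context. For languages written without spaces, joining with an empty string may be a better fit.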

hotwords

array

A list of hotwords that improves recognition accuracy for domain-specific vocabulary. Format example: ["person name", "place name"]. Keeping the list to 100 items or fewer is recommended. Array length: 0 - 100
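The body parameters above can be assembled and checked on the client before a request is sent. This is a sketch under stated assumptions: the Base64 form of `file` is taken to be the plain encoded bytes with no data-URI prefix, 25 MB is read as 25 × 1024 × 1024 bytes, and the 30-second duration limit is not checked because that would require decoding the audio. The names `encode_audio` and `build_body` are illustrative only.

```python
import base64
from pathlib import Path
from typing import List, Optional

MAX_FILE_BYTES = 25 * 1024 * 1024  # 25 MB limit; assumes MB means MiB here
MAX_HOTWORDS = 100
ALLOWED_SUFFIXES = {".wav", ".mp3"}


def encode_audio(path: str) -> str:
    """Return the Base64 text of a local .wav/.mp3 file after basic checks."""
    audio = Path(path)
    if audio.suffix.lower() not in ALLOWED_SUFFIXES:
        raise ValueError(f"unsupported audio format: {audio.suffix}")
    data = audio.read_bytes()
    if len(data) > MAX_FILE_BYTES:
        raise ValueError("audio file exceeds 25 MB")
    # Assumption: the API accepts the raw Base64 string without a prefix.
    return base64.b64encode(data).decode("ascii")


def build_body(
    file: str,
    prompt: Optional[str] = None,
    hotwords: Optional[List[str]] = None,
) -> dict:
    """Assemble the JSON body; `file` is a URL or a Base64 string."""
    body = {"file": file}
    if prompt:
        body["prompt"] = prompt
    if hotwords:
        if len(hotwords) > MAX_HOTWORDS:
            raise ValueError("hotwords accepts at most 100 items")
        body["hotwords"] = list(hotwords)
    return body
```

Optional fields are left out of the body when they are empty, so a request with only `file` stays minimal.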

Response

text

string

The complete transcribed text of the audio.
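To tie the headers, body, and response together, here is a minimal sketch of one call using only the Python standard library. The endpoint URL, header names, and the `text` field come from this page. The function names, the 60-second timeout, and the lack of retry logic are assumptions made for illustration, and the format of error responses is not covered here.

```python
import json
import urllib.request

ENDPOINT = "https://api.novita.ai/v3/glm-asr"


def build_request(api_key: str, body: dict) -> urllib.request.Request:
    """Create the POST request with the two required headers."""
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def parse_transcript(raw: bytes) -> str:
    """Extract the `text` field from a JSON response payload."""
    return json.loads(raw.decode("utf-8"))["text"]


def transcribe(api_key: str, body: dict, timeout: float = 60.0) -> str:
    """Send the request and return the transcript.

    urllib raises urllib.error.HTTPError for non-2xx status codes.
    """
    request = build_request(api_key, body)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_transcript(response.read())
```

A call might look like `transcribe(api_key, {"file": "https://example.com/sample.wav"})`, where the URL is a placeholder for a publicly reachable audio file.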
