Use the GLM-ASR-2512 model to transcribe audio files into text, supporting multi-language transcription.
Supports: application/json
Bearer authentication format, for example: Bearer {{API Key}}.
Request Body
The audio file URL or Base64 encoded string to be transcribed. Supported audio formats: .wav / .mp3. Limitations: file size ≤ 25 MB, audio duration ≤ 30 seconds
For long text scenarios, you can provide previous transcription results as context. Recommended to be less than 8000 characters.
Hotword list to improve recognition accuracy for domain-specific vocabulary. Format example: [“person name”, “place name”]. Recommended not to exceed 100 items.Array length: 0 - 100
Response
The complete transcribed content of the audio