POST /v1/audio/translations when you want uploaded speech converted into English text.
This endpoint is similar to transcription, but the returned text is translated English output rather than a same-language transcript.
Required and optional fields
| Field | Required | Notes |
|---|---|---|
model | Yes | Translation model |
file | Yes | Binary audio upload |
prompt | No | Optional formatting or name-preservation hint |
language | No | Optional language hint |
Multipart Request Example
Response Shape
When to use translation vs transcription
- use translation when you want English text output from non-English audio
- use Speech to Text when you want a same-language transcript
Common mistakes
- expecting the output language to match the source audio
- forgetting multipart form encoding
- using the wrong endpoint when a transcript would be enough