Use POST /v1/audio/transcriptions when you want a same-language text transcript of uploaded audio.
This endpoint accepts multipart form uploads and returns one JSON response with a text field.
Required and optional fields
| Field | Required | Notes |
|---|---|---|
| model | Yes | Transcription model |
| file | Yes | Binary audio upload |
| language | No | Optional language hint |
| prompt | No | Optional formatting or name-preservation hint |
Multipart Request Example
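The following is a minimal sketch of a multipart upload using only the Python standard library, not a definitive client. The host in `BASE_URL`, the bearer-token `Authorization` header, and the helper names `build_multipart` and `transcribe` are assumptions made for illustration; only the path and the field names `model`, `file`, `language`, and `prompt` come from this page.

```python
import json
import os
import urllib.request
import uuid

# Hypothetical host; replace with the real API base URL.
BASE_URL = "https://api.example.com"


def build_multipart(fields, file_name, file_bytes, boundary=None):
    """Encode text fields plus one 'file' part as multipart/form-data."""
    boundary = boundary or uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            (
                f"--{boundary}\r\n"
                f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
                f"{value}\r\n"
            ).encode("utf-8")
        )
    # The audio goes in the 'file' part as raw bytes.
    parts.append(
        (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="file"; '
            f'filename="{file_name}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        ).encode("utf-8")
    )
    parts.append(file_bytes)
    parts.append(f"\r\n--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcribe(path, model, api_key, language=None, prompt=None):
    """Upload one audio file and return the 'text' field of the response."""
    fields = {"model": model}
    if language:
        fields["language"] = language  # optional language hint
    if prompt:
        fields["prompt"] = prompt  # optional formatting or name hint
    with open(path, "rb") as fh:
        audio = fh.read()
    body, content_type = build_multipart(fields, os.path.basename(path), audio)
    request = urllib.request.Request(
        BASE_URL + "/v1/audio/transcriptions",
        data=body,
        method="POST",
        headers={
            "Content-Type": content_type,
            # Assumed auth scheme; check your provider's documentation.
            "Authorization": f"Bearer {api_key}",
        },
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))["text"]
```

A call might look like `transcribe("meeting.wav", model="<model-name>", api_key="<key>", language="en")`, where the model name and key are placeholders you supply. The body is built by hand here to make the multipart encoding explicit; an HTTP library that handles form uploads would serve equally well.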
Response Shape
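As a rough illustration of the single JSON response described above, the sketch below parses a response body and reads its text field. The sample payload is invented for the example, and the helper name `extract_text` is hypothetical; a real response may carry additional fields not shown here.

```python
import json


def extract_text(raw_body):
    """Return the transcript from a JSON response body, or raise ValueError."""
    payload = json.loads(raw_body)
    if not isinstance(payload, dict) or not isinstance(payload.get("text"), str):
        raise ValueError("response does not contain a string 'text' field")
    return payload["text"]


# Invented sample body, shown only to illustrate the shape.
sample_body = '{"text": "Hello, world."}'
print(extract_text(sample_body))
```

Because the whole transcript arrives in one object, the body can be parsed once after the request completes; there is no stream to consume incrementally.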
When to use transcription vs translation
- use transcription when you want text in the same language as the audio
- use Speech Translation when you want translated English text
Common Mistakes
- uploading a corrupted or unsupported audio file
- forgetting multipart form encoding
- assuming the response is a streamed token feed rather than one final JSON object