Audio API when you need speech synthesis, transcription, or translation as direct audio workflows.
Audio Operations
POST /v1/audio/speechconverts text into audio outputPOST /v1/audio/transcriptionsconverts uploaded audio into textPOST /v1/audio/translationstranslates uploaded audio into text
- text-to-speech returns streamed audio bytes
- transcription returns JSON text
- translation returns JSON text
Text to Speech
Convert text into streamed audio output.
Speech to Text
Transcribe uploaded audio into text.
Speech Translation
Translate uploaded audio into text.
Formats and Uploads
Check supported file formats and upload guidance.