Speech Translation - NagaAI Documentation

Use POST /v1/audio/translations when you want uploaded speech converted into English text. This endpoint is similar to transcription, but the returned text is translated English output rather than a same-language transcript.

Required and optional fields

Field	Required	Notes
`model`	Yes	Translation model
`file`	Yes	Binary audio upload
`prompt`	No	Optional formatting or name-preservation hint
`language`	No	Optional language hint

Multipart Request Example

from pathlib import Path
from openai import OpenAI

client = OpenAI(
    base_url="https://api.naga.ac/v1",
    api_key="YOUR_API_KEY",
)

translation = client.audio.translations.create(
    model="whisper-1",
    file=Path("spanish-interview.mp3"),
    prompt="Preserve product names exactly as spoken.",
)

print(translation.text)

Response Shape

{
  "text": "The speaker says the launch will happen next week."
}

When to use translation vs transcription

use translation when you want English text output from non-English audio
use Speech to Text when you want a same-language transcript

Common mistakes

expecting the output language to match the source audio
forgetting multipart form encoding
using the wrong endpoint when a transcript would be enough

Reference

Create translation

​Required and optional fields

​Multipart Request Example

​Response Shape

​When to use translation vs transcription

​Common mistakes

​Reference