OpenAI-compatible
The request/response format mirrors OpenAI's Chat Completions API. Bring your existing OpenAI SDK code and change only the `base_url` and `api_key`.
Request Parameters
- `messages` (array, required): Chat messages. Each item:
  - `role` (string): "system" | "user" | "assistant" | "tool"
  - `content` (string or array): Text, or an array of blocks including:
    - Text: `{ "type": "text", "text": "..." }`
    - Image: `{ "type": "image_url", "image_url": { "url": "https://...", "detail": "low|high|auto" } }`
    - File (PDF): `{ "type": "file", "file": { "filename": "...", "file_data": "https://..." } }`
    - Audio (Gemini only): `{ "type": "input_audio", "input_audio": { "data": "BASE64", "format": "wav|mp3" } }`
- `tools` (array): Tool definitions (JSON), OpenAI-compatible.
- `response_format` (object): Output format. One of:
  - Text: `{ "type": "text" }`
  - JSON object: `{ "type": "json_object" }`
  - JSON schema: `{ "type": "json_schema", "json_schema": { "name": "...", "schema": { ... }, "strict": true } }`
- `temperature` (number): Sampling temperature (0..2).
- `top_p` (number): Nucleus sampling probability (0..1).
- `stream` (boolean): Whether to stream the response.
- `stream_options` (object): `{ "include_usage": boolean }`
- `max_tokens` (integer): Maximum number of tokens to generate.
- `reasoning_effort` (string): "minimal" | "low" | "medium" | "high"
- `presence_penalty` (number): Penalizes new tokens based on their presence in the text so far (-2..2).
- `frequency_penalty` (number): Penalizes new tokens based on their frequency in the text so far (-2..2).
- `logit_bias` (object): Modifies the likelihood of specified tokens appearing in the completion.
- `parallel_tool_calls` (boolean): Whether to enable parallel tool calls.
- `prediction.static_content`: Pre-seeded content for structured tasks.
- Web search: Optional web search config.
- Image generation configuration (for models with native image generation such as `gemini-2.5-flash-image`):
  - `aspect_ratio` (string): Aspect ratio for generated images. Supported values: "1:1", "2:3", "3:2", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9"
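As an illustration of an entry in the OpenAI-compatible `tools` array described above — the function name and schema here are made up for this sketch:

```python
# Illustrative OpenAI-style tool (function) definition.
# "get_weather" and its parameters are invented for this example.
get_weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    },
}

# Passed to the API as: tools=[get_weather_tool]
```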
See Features → Multimodal for details about text, image, file, and audio blocks in messages.
Example Request
```python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.naga.ac/v1",
    api_key="YOUR_API_KEY",
)

resp = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "user", "content": "What's 2+2?"},
    ],
    temperature=0.2,
)
print(resp.choices[0].message.content)
```
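For structured output, the same request can carry a strict JSON-schema response format; the schema below ("math_answer" and its fields) is illustrative, not part of the API:

```python
# Illustrative response_format payload requesting strict JSON-schema output.
# The schema name and fields are invented for this sketch.
response_format = {
    "type": "json_schema",
    "json_schema": {
        "name": "math_answer",
        "schema": {
            "type": "object",
            "properties": {"answer": {"type": "number"}},
            "required": ["answer"],
            "additionalProperties": False,
        },
        "strict": True,
    },
}

# Passed to the API as: response_format=response_format
```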
Multimodal Example
Image, audio, and file inputs follow the same content-array format:
```json
{
  "model": "gemini-2.5-flash",
  "messages": [
    {
      "role": "user",
      "content": [
        { "type": "text", "text": "What is in this audio and document?" },
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
          }
        },
        {
          "type": "input_audio",
          "input_audio": { "data": "BASE64_AUDIO", "format": "wav" }
        },
        {
          "type": "file",
          "file": {
            "filename": "document.pdf",
            "file_data": "https://bitcoin.org/bitcoin.pdf"
          }
        }
      ]
    }
  ]
}
```
See Features → Multimodal for provider-specific modality support.
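The `input_audio` block expects base64-encoded bytes rather than a URL. A minimal sketch of building such a block from raw WAV bytes (the placeholder bytes stand in for a real audio file):

```python
import base64

# Sketch: build an "input_audio" content block from raw WAV bytes.
def audio_block(wav_bytes: bytes) -> dict:
    return {
        "type": "input_audio",
        "input_audio": {
            "data": base64.b64encode(wav_bytes).decode("ascii"),
            "format": "wav",
        },
    }

# Placeholder bytes, not a real WAV file:
block = audio_block(b"RIFF....WAVE")
```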
Response
A standard OpenAI-compatible response including:
- `id`, `object`, `created`, `model`
- `choices` (array) with `message`, `finish_reason`, etc.
- Optional `usage`, when enabled (`stream_options.include_usage`) or in the final response
When stream=true, server-sent events follow the OpenAI streaming format.
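Streamed text arrives as incremental deltas that the client concatenates. A sketch of that assembly — with the real SDK you would iterate the result of `client.chat.completions.create(..., stream=True)`; here `fake_chunks` stands in for that event stream:

```python
from types import SimpleNamespace

# Concatenate the content deltas from a stream of chat.completion.chunk events.
def collect_text(chunks) -> str:
    parts = []
    for chunk in chunks:
        delta = chunk.choices[0].delta
        if getattr(delta, "content", None):
            parts.append(delta.content)
    return "".join(parts)

# Stand-in chunks shaped like streaming deltas (not real API objects):
fake_chunks = [
    SimpleNamespace(choices=[SimpleNamespace(delta=SimpleNamespace(content=piece))])
    for piece in ["2 + 2 ", "is ", "4."]
]
print(collect_text(fake_chunks))  # -> 2 + 2 is 4.
```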
Response Fields
- `id`: Unique request identifier
- `object`: Always "chat.completion", or "chat.completion.chunk" when streaming
- `created`: Unix timestamp of request creation
- `model`: The model used for the completion
- `choices`: Array of completion choices, each with:
  - `message`: The completion message
    - `content`: The generated response text
    - `tool_calls`: Tool calls, if tools were used
  - `finish_reason`: Reason the completion stopped ("stop", "length", "tool_calls", etc.)
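A sketch of branching on `finish_reason` when handling a parsed response; the `choice` dict below is a hand-built stand-in for one element of the `choices` array, not a live API result:

```python
# Summarize one choice from a parsed chat-completion response dict.
def summarize(choice: dict) -> str:
    reason = choice["finish_reason"]
    if reason == "tool_calls":
        # The model wants tools run instead of returning text.
        names = [c["function"]["name"] for c in choice["message"]["tool_calls"]]
        return "model requested tools: " + ", ".join(names)
    if reason == "length":
        return "output truncated by max_tokens"
    return choice["message"]["content"] or ""

# Stand-in choice (the tool call is invented for this sketch):
choice = {
    "finish_reason": "tool_calls",
    "message": {
        "content": None,
        "tool_calls": [
            {"function": {"name": "get_weather", "arguments": "{\"city\": \"Oslo\"}"}}
        ],
    },
}
print(summarize(choice))  # -> model requested tools: get_weather
```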