| Item | Content |
|---|---|
| Base URL | https://llm.ai-nebula.com/v1/chat/completions |
| Authentication Method | API Key (Token) |
| Request Headers | Authorization: Bearer sk-xxxx, Content-Type: application/json |
tool_calls (content is usually null, finish_reason=tool_calls). You need to execute the corresponding function on your server based on tool_calls[*].function.name/arguments.role:"tool" message and continue completion (streaming is supported).tool_call_id must match the one returned in Phase 1.choices, usage.usage aggregation may be appended at the end; if stream_options.include_usage=true is enabled, chunks may contain real-time usage.grok-4-fast-reasoning), usage in the response will distinguish between completion_tokens and reasoning_tokens.text_tokens = completion_tokens - reasoning_tokensresponse_format: json_schema and provide a strict JSON Schema; if necessary, combine with lowering temperature and setting max_tokens.tool_calls from incremental chunks, execute the function on the server, and pass the result back to the model as a tool message.seed parameter is supported; it is recommended to enable it only for workflows requiring reproducibility.grok-3 and grok-4 are standard versions, suitable for most scenarios.grok-3-fast is a fast version, suitable for scenarios requiring quick responses.grok-4-fast-reasoning is a reasoning version, suitable for scenarios requiring deep thinking and complex reasoning.temperature under strict JSON mode.grok-3-fast for quick responses and grok-4-fast-reasoning for complex reasoning.grok-4, grok-3) support reasoning capabilities but do not output visual chain-of-thought text.grok-4-fast-reasoning is a fast reasoning version, specialized for scenarios requiring deep reasoning.usage field in the response will contain reasoning_tokens statistics (inside completion_token_details) to understand the model's reasoning consumption.text_tokens = completion_tokens - reasoning_tokens, making it easy to distinguish between actual output text and tokens consumed by the reasoning process.