The OpenAI Chat Completions API endpoint lets you send chat messages and receive AI-generated responses using the familiar OpenAI request format. You can use GPT, Claude, and Gemini models through this single endpoint, making it easy to switch between model families without changing your integration code. Base URL:Documentation Index
Fetch the complete documentation index at: https://docs.xingchaoyiqing.com/llms.txt
Use this file to discover all available pages before exploring further.
http://apillm.globalaiopc.com/gw_llm_power
Endpoint: POST /v1/chat/completions
Authentication
Authenticate every request using theAuthorization header with your API key:
Supported Models
GPT Models
| Standard | -official | -hc | -fd | -low |
|---|---|---|---|---|
gpt-5.4 | gpt-5.4-official | gpt-5.4-hc | gpt-5.4-fd | gpt-5.4-low |
gpt-5.5 | gpt-5.5-official | gpt-5.5-hc | gpt-5.5-fd | gpt-5.5-low |
Claude Models
| Standard | -official | -hc | -fd |
|---|---|---|---|
claude-haiku-4-5 | claude-haiku-4-5-official | claude-haiku-4-5-hc | claude-haiku-4-5-fd |
claude-opus-4-5 | claude-opus-4-5-official | claude-opus-4-5-hc | claude-opus-4-5-fd |
claude-opus-4-6 | claude-opus-4-6-official | claude-opus-4-6-hc | claude-opus-4-6-fd |
claude-opus-4-7 | claude-opus-4-7-official | claude-opus-4-7-hc | claude-opus-4-7-fd |
claude-sonnet-4-5 | claude-sonnet-4-5-official | claude-sonnet-4-5-hc | claude-sonnet-4-5-fd |
claude-sonnet-4-6 | claude-sonnet-4-6-official | claude-sonnet-4-6-hc | claude-sonnet-4-6-fd |
Gemini Models
| Standard | -official | -low |
|---|---|---|
gemini-2.5-flash-lite | — | — |
gemini-2.5-pro | gemini-2.5-pro-official | gemini-2.5-pro-low |
gemini-3-flash-preview | gemini-3-flash-preview-official | gemini-3-flash-preview-low |
gemini-3.1-flash-lite-preview | — | — |
gemini-3.1-pro-preview | gemini-3.1-pro-preview-official | gemini-3.1-pro-preview-low |
Model Suffix Reference
| Suffix | Description |
|---|---|
| (none) | Standard version |
-official | Official version |
-hc | High-quality pool (AWS or premium account pool) |
-fd | Proxy pool / mixed account pool |
-low | Budget version |
Multimodal Support
| Capability | Supported Models |
|---|---|
| Image analysis | All models |
| Video analysis | Gemini models only |
Request Parameters
The name of the model to use. For example:
gpt-5.4, claude-opus-4-7, or gemini-3.1-pro-preview.An array of chat message objects forming the conversation history. Each object must include a
role and content field.The role of the message author. Accepted values:
system, user, or assistant.The content of the message. Pass a plain text string for standard text input, or an OpenAI-compatible multimodal array for image or video analysis.
Controls the randomness of the model’s output. Values range from
0 to 2. Lower values produce more focused, deterministic responses; higher values produce more varied output. We recommend adjusting either temperature or top_p, but not both simultaneously.Nucleus sampling parameter. The model considers only the tokens comprising the top
top_p probability mass. We recommend adjusting either top_p or temperature, but not both simultaneously.When set to
true, the response is returned as a stream of Server-Sent Events (SSE). The stream ends with a final data: [DONE] message.The maximum number of tokens the model may generate in its response.
One or more sequences at which the model will stop generating further tokens. Pass a single string or an array of up to four strings.
Response Fields
A unique identifier for the request.
The type of the returned object. Always
chat.completion for non-streaming responses.The Unix timestamp (in seconds) of when the response was created.
The name of the model that was used to generate the response.
The role of the generated message. Always
assistant.The text content generated by the model.
The reason the model stopped generating tokens. Common values include
stop (natural end) and length (token limit reached).The number of tokens in the input messages.
The number of tokens in the generated response.
The total number of tokens used in the request (prompt + completion).
Code Examples
Example Response
To receive a streaming response, set
"stream": true in your request body. The API will return a series of Server-Sent Events (SSE), each containing a partial response delta. The stream terminates with a final data: [DONE] message.