The Gemini Native API lets you interact with Gemini models using Google’s native request format, including theDocumentation Index
Fetch the complete documentation index at: https://docs.xingchaoyiqing.com/llms.txt
Use this file to discover all available pages before exploring further.
contents and generationConfig structure. All Gemini models accessible through this endpoint support both image and video analysis, making it the right choice whenever you need multimodal capabilities with the full flexibility of the native Gemini protocol.
Base URL: http://apillm.globalaiopc.com/gw_llm_power
Endpoints:
POST /v1/models/{model}:generateContent— Standard (non-streaming) responsePOST /v1/models/{model}:streamGenerateContent— Streaming response
Authentication
Authenticate every request using theAuthorization header with your API key:
Supported Models
All Gemini models available through this endpoint support image and video analysis.| Model | Variants |
|---|---|
gemini-2.5-flash-lite | Standard only |
gemini-2.5-pro | Standard, -official, -low |
gemini-3-flash-preview | Standard, -official, -low |
gemini-3.1-flash-lite-preview | Standard only |
gemini-3.1-pro-preview | Standard, -official, -low |
Model Suffix Reference
| Suffix | Description |
|---|---|
| (none) | Standard / stable version |
-official | Official version |
-low | Budget version |
Request Parameters
The array of message content objects that make up the conversation. Each object typically contains a
role and a parts array.The role of the message author. Use
user for human turns and model for prior model turns in multi-turn conversations.An array of content parts for the message. Typically contains
text objects for plain text input. The native Gemini multimodal structure (inline images, video, etc.) is also supported.The text content of the part.
Controls the randomness of the model’s output. Lower values produce more focused, deterministic responses; higher values produce more creative output.
Nucleus sampling parameter. The model considers only the tokens comprising the top
topP probability mass.The maximum number of tokens the model may generate in its response.
An optional system prompt that sets the context and behavior for the model. Provide this as a text part within the
systemInstruction object.Response Fields
The text content generated by the model.
The reason the model stopped generating. Common values include
STOP (natural end) and MAX_TOKENS (token limit reached).The number of tokens in the input contents and system instruction.
The number of tokens in the generated response candidates.
The total number of tokens used in the request (prompt + candidates).
Code Examples
Example Response
To receive a streaming response, replace
:generateContent with :streamGenerateContent in the request URL. The API will return a series of incremental response chunks in the native Gemini streaming format.