PatewayAI 文档索引 在线阅读

API 参考

限频说明#

为保障所有用户的服务质量,API 对每个账户设有以下速率限制:

限制类型 默认额度 说明
RPM(每分钟请求数) 300 次/分钟 每个模型独立计数
RPD(每日请求数) 216,000 次/天 所有模型共享额度
TPM(每分钟 Token 数) 6,144,000 tokens/分钟 每个模型独立计数

限流说明:

触发限流时:

如需提频,请联系 support@pateway.ai

Anthropic 协议#

POST /v1/messages

使用 Anthropic 原生格式调用 Claude 系列模型。请求头中需包含x-api-key: YOUR_API_KEY

请求体参数#

参数类型说明必填
modelstring模型IDtrue
messagearray对话消息数组true
max_tokensinteger最大输出 token 数true

更多参数,详见Anthropic官方。

示例请求#

curl --request POST \  --url https://api.pateway.ai/v1/messages \  --header 'content-type: application/json' \  --header 'x-api-key: YOUR_API_KEY' \  --data '{    "model": "claude-sonnet-4-6",    "max_tokens": 1024,    "system": "你是一个专业的代码助手",    "messages": [      {        "role": "user",        "content": "帮我用 Python 写一个快速排序"      }    ]  }'

响应示例#

{  "content": [    {      "type": "text",      "text": "你好!很高兴见到你 😊 有什么我可以帮你的吗?"    }  ],  "id": "msg_01PXFpjoBEXMn1yEGDdp8RrH",  "model": "claude-opus-4-6",  "role": "assistant",  "stop_details": null,  "stop_reason": "end_turn",  "stop_sequence": null,  "type": "message",  "usage": {    "input_tokens": 10,    "cache_creation_input_tokens": 0,    "cache_read_input_tokens": 0,    "cache_creation": {      "ephemeral_5m_input_tokens": 0,      "ephemeral_1h_input_tokens": 0    },    "output_tokens": 31,    "service_tier": "standard",    "inference_geo": "not_available"  }}

流式输出#

请求示例

curl -N --request POST \  --url https://api.pateway.ai/v1/messages \  --header 'content-type: application/json' \  --header 'x-api-key: YOUR_API_KEY' \  --header 'accept: text/event-stream' \  --data '{    "model": "claude-opus-4-6",    "max_tokens": 1024,    "messages": [      {        "role": "user",        "content": [          {            "type": "text",            "text": "你好"          }        ]      }    ],    "stream": true  }'

响应以 SSE 格式逐块返回,示例:

event: message_startdata: {"type":"message_start","message":{"id":"msg_01...","type":"message","role":"assistant","model":"claude-opus-4-6","stop_reason":null,"usage":{"input_tokens":10,"output_tokens":0}}} event: content_block_deltadata: {"type":"content_block_delta","index":0,"delta":{"type":"text_delta","text":"你好"}} event: message_deltadata: {"type":"message_delta","delta":{"stop_reason":"end_turn"},"usage":{"output_tokens":5}} event: message_stopdata: {"type":"message_stop"}

OpenAI 协议(Responses API)#

POST /v1/responses

使用 OpenAI Responses API 格式调用 Codex 系列模型。请求头中需包含 Authorization: Bearer YOUR_API_KEY

请求体参数#

参数类型说明必填
modelstring模型IDtrue
inputstring用户输入内容true
streamboolean是否开启流式输出,默认 falsefalse

更多参数,详见 OpenAI 官方文档。

示例请求#

curl --request POST \  --url https://api.pateway.ai/v1/responses \  --header 'Content-Type: application/json' \  --header 'Authorization: Bearer YOUR_API_KEY' \  --data '{    "model": "YOUR_MODEL_ID",    "input": "帮我用 Python 写一个快速排序",    "stream": false  }'

响应示例#

{  "id": "resp_xxxxx",  "object": "response",  "model": "YOUR_MODEL_ID",  "output_text": "def quicksort(arr):\n    if len(arr) <= 1:\n        return arr\n    ...",  "usage": {    "input_tokens": 12,    "output_tokens": 86,    "total_tokens": 98  }}

流式输出#

curl -N --request POST \  --url https://api.pateway.ai/v1/responses \  --header 'Content-Type: application/json' \  --header 'Authorization: Bearer YOUR_API_KEY' \  --data '{    "model": "YOUR_MODEL_ID",    "input": "你好",    "stream": true  }'

响应以 SSE 格式逐块返回,示例:

event: response.output_text.deltadata: {"type":"response.output_text.delta","delta":"你好"} event: response.output_text.donedata: {"type":"response.output_text.done","text":"你好!有什么我可以帮你的吗?"} event: response.completeddata: {"type":"response.completed","response":{"id":"resp_xxxxx","status":"completed","output_text":"你好!有什么我可以帮你的吗?","usage":{"input_tokens":5,"output_tokens":12,"total_tokens":17}}}

错误码#

HTTP状态码语义示例错误信息
400请求体无法读取或解析失败。请求体解析失败: unexpected end of JSON input
400请求的模型不存在或当前不可用。The model 'xxx' does not exist
400上游供应商返回 400,请求参数被上游拒绝,原始响应透传。(透传上游原始响应体)
401请求未携带 API Key(需通过 x-api-keyAuthorization: Bearer 传递)。Missing API Key
401API Key 不存在、格式错误或未注册。Invalid API Key
402用户余额不足以承担本次请求预估的费用。余额不足: balance not enough
403API Key 已被禁用或已过期。Key disabled or expired
403用户账号已被冻结或禁用。account suspended: frozen
413请求体超过允许的最大大小。请求体过大
429请求频次过高rate limit exceeded

补充说明#

429 响应头

当触发限频(429)时,响应会包含以下辅助头信息:

响应头 说明
Retry-After 建议的重试等待秒数
X-RateLimit-Limit-Requests RPM 上限
X-RateLimit-Remaining-Requests RPM 剩余次数
X-RateLimit-Limit-Tokens TPM 上限
X-RateLimit-Remaining-Tokens TPM 剩余额度

查询可用模型#

GET /v1/models

示例请求#

curl --request GET \  --url https://api.pateway.ai/v1/models \  --header 'Content-Type: application/json' \  --header 'x-api-key: YOUR_API_KEY'
curl --request GET \  --url https://api.pateway.ai/v1/models \  --header 'Content-Type: application/json' \  --header 'Authorization: Bearer YOUR_API_KEY'

响应示例#

{  "data": [    {      "id": "claude-haiku-4-5-20251001",      "type": "model",      "display_name": "Claude Haiku 4.5"    },    {      "id": "claude-opus-4-6",      "type": "model",      "display_name": "Claude Opus 4.6"    },    {      "id": "claude-sonnet-4-6",      "type": "model",      "display_name": "Claude Sonnet 4.6"    }  ],  "has_more": false,  "first_id": "claude-haiku-4-5-20251001",  "last_id": "claude-sonnet-4-6"}
{  "data": [    {      "id": "gpt-5.3-codex",      "type": "model",      "display_name": "GPT-5.3 Codex"    },    {      "id": "gpt-5.5",      "type": "model",      "display_name": "GPT-5.5"    }  ],  "has_more": false,  "first_id": "gpt-5.3-codex",  "last_id": "gpt-5.5"}
建议以此接口返回结果为准,获取最新可用模型名称。