Free Models · One-Click Aggregation · Global Access · Chinese LLM API

Chinese LLM APIs — Accessible and Affordable for Global Developers

Token Router unifies access to Zhipu, Baidu, Alibaba, Tencent, MiniMax, Moonshot and more — with free models available and enterprise-grade paid services.

Core Capabilities

Token Router — China's One-Stop Bridge for Large Model APIs

Chinese Models · One-Stop Aggregation

Unified API — switch models with a single line of code. Free tier: daily credits for chat, coding, and image generation. Paid upgrade: high-performance models, longer context, enterprise SLA.

China Models · Global Access

Leverage China's computing and electricity cost advantages with multi-region deployment, cross-border settlement, and overseas optimization — giving global developers access to cost-effective Chinese models.

Agent & Clients · Full Compatibility

Compatible with Claude, Codex, Trae and other mainstream Agents. Supports OpenClaw, Cursor, Deep Code and other client tools. Simple configuration required.

Open Source · Self-Hosted

Secondary deployment based on Llama 3, Qwen, DeepSeek and other open-source models. Supports hybrid cloud and custom fine-tuning.

Model Plaza

All Models — 统一 API 接入，按量计费

月之暗面

Kimi K2.6

Kimi K2.6 是 Kimi 最新最智能的模型，Kimi K2.6 的通用 Agent、代码、视觉理解等综合能力得到全面提升，其中在博士级难度的完整版人类最后的考试（Humanity’s Last Exam）、在考察模型真实软件工程能力的 SWE-Bench Pro、评估 Agent 深度检索能力的 DeepSearchQA 等基准测试中均取得行业领先的成绩，同时支持文本、图片与视频输入，思考与非思考模式，对话与 Agent 任务。

Input Price $6.50 /1M Tokens

Output Price $27.00 /1M Tokens

月之暗面

Kimi K2.5

Kimi K2.5 支持文本、图片与视频输入，思考与非思考模式，对话与 Agent 任务

Input Price $4.00 /1M Tokens

Output Price $21.00 /1M Tokens

深度求索

DeepSeek-V4-Flash

DeepSeek-V4-Flash 是 DeepSeek 于 2026年4月推出的高效能大语言模型（304B参数，激活13B）。它主打极低延迟与超高性价比，原生支持100万token超长上下文（可处理整本书），在长文本任务中的算力需求仅为旗舰版的10%。

Input Price $1.00 /1M Tokens

Output Price $2.00 /1M Tokens

深度求索

DeepSeek-V4-Pro

DeepSeek-V4-Pro 是 DeepSeek 于 2026年4月发布的第四代旗舰大语言模型（总参数未公开，激活参数约70-100B）。它采用混合专家（MoE）架构与创新的混合注意力机制（CSA+HCA），原生支持 1M token 超长上下文。

Input Price $3.00 /1M Tokens

Output Price $6.00 /1M Tokens

智谱AI

GLM-4.7-Flash

最新基座模型的普惠版本。GLM-4.7-Flash 作为 30B 级 SOTA 模型，提供了一个兼顾性能与效率的新选择。面向 Agentic Coding 场景强化了编码能力、长程任务规划与工具协同，并在多个公开基准的当期榜单中取得同尺寸开源模型中的出色表现。在执行复杂智能体任务，在工具调用时指令遵循更强，Artifacts 与 Agentic Coding 的前端美感和长程任务完成效率进一步提升。

Input Price $0.00 /1M Tokens

Output Price $0.00 /1M Tokens

智谱AI

GLM-4.6V-Flash

GLM-4.6V-Flash 是 GLM-4.6V 的免费版本，是 GLM 系列在多模态方向上的一次重要迭代，支持开启或关闭思考模式。它将训练时上下文窗口提升到128k tokens，在视觉理解精度上达到同参数规模 SOTA，并首次在模型架构中将 Function Call（工具调用）能力原生融入视觉模型，打通从「视觉感知」到「可执行行动（Action）」的链路，为真实业务场景中的多模态 Agent 提供统一的技术底座。适用于图片OCR信息提取、图片内容理解与其相关属性提取，多模态时序融合、动态内容分析

Input Price $0.00 /1M Tokens

Output Price $0.00 /1M Tokens

All Models

Limited Time Offer

Start Free

Feedback

Have a question or suggestion? Let us know

Submit Feedback

Contact Information

Working Hours：Monday to Friday 9:00-18:00

WeChat：Fandy0923Y

QQ：2212206431

Email：service@pingtoken.cn

Address

Building 92, East District, lujiajiao Xinyuan, Lane 38, Le'ai Road, Qingpu District, Shanghai China