Model Catalog

AIOrouter provides access to Chinese AI models through a single API key. Model availability is evidence-backed — only providers with verified credentials and live contract verification are available for routing.

Available Models

Model Provider Context JSON Mode Functions Vision Availability
DeepSeek V4 Pro DeepSeek 1M ✅ Available
DeepSeek R2 (CoT) DeepSeek 1M ✅ Available
Qwen3-235B Alibaba 128K ⚠️ ✅ Available
Kimi K2 Moonshot 1M 🏆 ✅ Available
GLM-5 Zhipu 128K ⚠️ ⏳ Pending
Ernie 5.0 Baidu 128K ⚠️ ❌ Unavailable
Doubao Pro ByteDance 128K ⚠️ ❌ Unavailable

Availability Legend:

BETA Note (2026-05-12): DeepSeek, Qwen, and Kimi are launch candidates pending live contract smoke verification (P2-W7-019). GLM is pending business verification. Ernie and Doubao are excluded from BETA due to mainland China phone registration blockers. See docs/provider-credentials-registry.md and docs/provider-contract-verification.md for current evidence.

⚠️ Qwen, GLM, Ernie, and Doubao context windows are from internal estimates (2026-05-03). Not verified against live provider documentation. DeepSeek and Kimi data verified from official sources.

Model Selection Guide

DeepSeek V4 Pro — Flagship All-Rounder

DeepSeek R2 — Chain-of-Thought Reasoning

Qwen3-235B — Alibaba's Flagship

GLM-5 — Bilingual Specialist

Kimi K2 — 1M Token Context Window 🏆

Ernie 5.0 — Enterprise-Grade

Doubao Pro — Absolute Cost Leader

Auto-Routing

If you don't specify a model, AIOrouter's intelligent router automatically selects the best provider based on:

  1. Availability: Routes away from degraded providers
  2. Capability: Matches model to request features (JSON mode, functions, etc.)
  3. Context: Ensures request fits within the model's context window
  4. Latency: Prefers providers with lower current latency

You can always see which provider handled your request in the X-Provider response header.

Pricing

Subscriptions include a provider-cost indexed monthly allowance. Token-equivalent usage varies by model cost: value models go further, while reasoning or western models consume allowance faster or may require prepaid credits.

See Billing Guide for subscription pricing and quota information.

Data freshness: DeepSeek data verified against api-docs.deepseek.com on 2026-05-06. Qwen, GLM, Ernie, and Doubao context windows have NOT been verified against live provider docs — they are from our internal model-capabilities.json (2026-05-03) and should be verified before public launch.