Model Explorer

409+ Models

100% official direct connections. Real-time pricing. Zero quantization.

Ai21Ai21

AI21: Jamba Large 1.7

ai21/jamba-large-1.7 from Ai21, optimized for chat completions workloads and available through Yi-AI.

Input

$2.20/M tokens

chatresponses
Aion LabsAion Labs

AionLabs: Aion-1.0

aion-labs/aion-1.0 from Aion Labs, optimized for chat completions workloads and available through Yi-AI.

Input

$4.40/M tokens

chatresponses
Aion LabsAion Labs

AionLabs: Aion-1.0-Mini

aion-labs/aion-1.0-mini from Aion Labs, optimized for chat completions workloads and available through Yi-AI.

Input

$0.770/M tokens

chatresponses
Aion LabsAion Labs

AionLabs: Aion-2.0

aion-labs/aion-2.0 from Aion Labs, optimized for chat completions workloads and available through Yi-AI.

Input

$0.880/M tokens

chatresponses
Aion LabsAion Labs

AionLabs: Aion-RP 1.0 (8B)

aion-labs/aion-rp-llama-3.1-8b from Aion Labs, optimized for chat completions workloads and available through Yi-AI.

Input

$0.880/M tokens

chatresponses
AllenaiAllenai

AllenAI: Olmo 3 32B Think

allenai/olmo-3-32b-think from Allenai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.165/M tokens

chatresponses
AmazonAmazon

Amazon: Nova 2 Lite

amazon/nova-2-lite-v1 from Amazon, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.330/M tokens

chatresponses
AmazonAmazon

Amazon: Nova Lite 1.0

amazon/nova-lite-v1 from Amazon, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.066/M tokens

chatresponses
AmazonAmazon

Amazon: Nova Micro 1.0

amazon/nova-micro-v1 from Amazon, optimized for chat completions workloads and available through Yi-AI.

Input

$0.038/M tokens

chatresponses
AmazonAmazon

Amazon: Nova Premier 1.0

amazon/nova-premier-v1 from Amazon, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$2.75/M tokens

chatresponses
AmazonAmazon

Amazon: Nova Pro 1.0

amazon/nova-pro-v1 from Amazon, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.880/M tokens

chatresponses
Anthracite OrgAnthracite Org

Magnum v4 72B

anthracite-org/magnum-v4-72b from Anthracite Org, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

Anthropic Claude Haiku Latest

~anthropic/claude-haiku-latest from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$1.10/M tokens

chatresponses
AnthropicAnthropic

Anthropic Claude Sonnet Latest

~anthropic/claude-sonnet-latest from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude 3 Haiku

anthropic/claude-3-haiku from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.275/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Fable 5

anthropic/claude-fable-5 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$11.00/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Fable Latest

~anthropic/claude-fable-latest from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$11.00/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Haiku 4.5

anthropic/claude-haiku-4.5 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$1.10/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4

anthropic/claude-opus-4 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$16.50/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.1

anthropic/claude-opus-4.1 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$16.50/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.5

anthropic/claude-opus-4.5 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.6

anthropic/claude-opus-4.6 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.7

anthropic/claude-opus-4.7 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.7 (Fast)

anthropic/claude-opus-4.7-fast from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$33.00/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.8

anthropic/claude-opus-4.8 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus 4.8 (Fast)

anthropic/claude-opus-4.8-fast from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$11.00/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Opus Latest

~anthropic/claude-opus-latest from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Sonnet 4

anthropic/claude-sonnet-4 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Sonnet 4.5

anthropic/claude-sonnet-4.5 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

Anthropic: Claude Sonnet 4.6

anthropic/claude-sonnet-4.6 from Anthropic, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-3-5-haiku-20241022

claude-3-5-haiku-20241022 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$0.880/M tokens

chatresponses
AnthropicAnthropic

claude-3-5-haiku-latest

claude-3-5-haiku-latest from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$1.10/M tokens

chatresponses
AnthropicAnthropic

claude-3-5-sonnet-20240620

claude-3-5-sonnet-20240620 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-3-5-sonnet-20241022

claude-3-5-sonnet-20241022 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-3-5-sonnet-latest

claude-3-5-sonnet-latest from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-3-haiku-20240307

claude-3-haiku-20240307 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$0.275/M tokens

chatresponses
AnthropicAnthropic

claude-3-opus-20240229

claude-3-opus-20240229 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

claude-3-sonnet-20240229

claude-3-sonnet-20240229 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-haiku-4-5

claude-haiku-4-5 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$1.10/M tokens

chatresponses
AnthropicAnthropic

claude-haiku-latest

claude-haiku-latest from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$1.10/M tokens

chatresponses
AnthropicAnthropic

claude-opus-4

claude-opus-4 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$16.50/M tokens

chatresponses
AnthropicAnthropic

claude-opus-4-1

claude-opus-4-1 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$16.50/M tokens

chatresponses
AnthropicAnthropic

claude-opus-4-5

claude-opus-4-5 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

claude-opus-4-6

claude-opus-4-6 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

claude-opus-4-6-fast

claude-opus-4-6-fast from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$33.00/M tokens

chatresponses
AnthropicAnthropic

claude-opus-4-7

claude-opus-4-7 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

claude-opus-4-7-fast

claude-opus-4-7-fast from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$33.00/M tokens

chatresponses
AnthropicAnthropic

claude-opus-latest

claude-opus-latest from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$5.50/M tokens

chatresponses
AnthropicAnthropic

claude-sonnet-4

claude-sonnet-4 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-sonnet-4-20250514

claude-sonnet-4-20250514 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-sonnet-4-5

claude-sonnet-4-5 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-sonnet-4-6

claude-sonnet-4-6 from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
AnthropicAnthropic

claude-sonnet-latest

claude-sonnet-latest from Anthropic, optimized for chat completions workloads and available through Yi-AI.

Input

$3.30/M tokens

chatresponses
Arcee AiArcee Ai

Arcee AI: Coder Large

arcee-ai/coder-large from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.550/M tokens

chatresponses
Arcee AiArcee Ai

Arcee AI: Trinity Large Thinking

arcee-ai/trinity-large-thinking from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.275/M tokens

chatresponses
Arcee AiArcee Ai

Arcee AI: Trinity Mini

arcee-ai/trinity-mini from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.050/M tokens

chatresponses
Arcee AiArcee Ai

Arcee AI: Virtuoso Large

arcee-ai/virtuoso-large from Arcee Ai, optimized for chat completions workloads and available through Yi-AI.

Input

$0.825/M tokens

chatresponses
BaaiBaai

BAAI: bge-base-en-v1.5

baai/bge-base-en-v1.5 from Baai, optimized for embedding generation workloads and available through Yi-AI.

Input

$0.0055/M tokens

embeddings
BaaiBaai

BAAI: bge-large-en-v1.5

baai/bge-large-en-v1.5 from Baai, optimized for embedding generation workloads and available through Yi-AI.

Input

$0.011/M tokens

embeddings
BaaiBaai

BAAI: bge-m3

baai/bge-m3 from Baai, optimized for embedding generation workloads and available through Yi-AI.

Input

$0.011/M tokens

embeddings
BaiduBaidu

Baidu: ERNIE 4.5 VL 424B A47B

baidu/ernie-4.5-vl-424b-a47b from Baidu, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.462/M tokens

chatresponses
BytedanceBytedance

ByteDance: UI-TARS 7B

bytedance/ui-tars-1.5-7b from Bytedance, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.110/M tokens

chatresponses
Bytedance SeedBytedance Seed

ByteDance Seed: Seed 1.6

bytedance-seed/seed-1.6 from Bytedance Seed, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.275/M tokens

chatresponses
Bytedance SeedBytedance Seed

ByteDance Seed: Seed 1.6 Flash

bytedance-seed/seed-1.6-flash from Bytedance Seed, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.083/M tokens

chatresponses
Bytedance SeedBytedance Seed

ByteDance Seed: Seed-2.0-Lite

bytedance-seed/seed-2.0-lite from Bytedance Seed, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.275/M tokens

chatresponses
Bytedance SeedBytedance Seed

ByteDance Seed: Seed-2.0-Mini

bytedance-seed/seed-2.0-mini from Bytedance Seed, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$0.110/M tokens

chatresponses
CanopylabsCanopylabs

Canopy Labs: Orpheus 3B

canopylabs/orpheus-3b-0.1-ft from Canopylabs, optimized for multimodal understanding workloads and available through Yi-AI.

Input

$7.70/M tokens

chatresponses
CohereCohere

Cohere: Command A

cohere/command-a from Cohere, optimized for chat completions workloads and available through Yi-AI.

Input

$2.75/M tokens

chatresponses
CohereCohere

Cohere: Command R (08-2024)

cohere/command-r-08-2024 from Cohere, optimized for chat completions workloads and available through Yi-AI.

Input

$0.165/M tokens

chatresponses
CohereCohere

Cohere: Command R+ (08-2024)

cohere/command-r-plus-08-2024 from Cohere, optimized for chat completions workloads and available through Yi-AI.

Input

$2.75/M tokens

chatresponses
CohereCohere

Cohere: Command R7B (12-2024)

cohere/command-r7b-12-2024 from Cohere, optimized for chat completions workloads and available through Yi-AI.

Input

$0.041/M tokens

chatresponses
DeepcogitoDeepcogito

Deep Cogito: Cogito v2.1 671B

deepcogito/cogito-v2.1-671b from Deepcogito, optimized for chat completions workloads and available through Yi-AI.

Input

$1.38/M tokens

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3

deepseek/deepseek-chat from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.220/M tokens

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3 0324

deepseek/deepseek-chat-v3-0324 from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.220/M tokens

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.1

deepseek/deepseek-chat-v3.1 from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.231/M tokens

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.1 Terminus

deepseek/deepseek-v3.1-terminus from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.297/M tokens

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.2

deepseek/deepseek-v3.2 from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.252/M tokens

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V3.2 Exp

deepseek/deepseek-v3.2-exp from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.297/M tokens

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V4 Flash

deepseek/deepseek-v4-flash from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.099/M tokens

chatresponses
DeepSeekDeepSeek

DeepSeek: DeepSeek V4 Pro

deepseek/deepseek-v4-pro from DeepSeek, optimized for chat completions workloads and available through Yi-AI.

Input

$0.478/M tokens

chatresponses
Showing 80 of 409 models. Use search to narrow results.
YiAI Router

YiAI Router provides unified model access for Claude Code / Cursor development workflows. Developers are responsible for complying with local laws and for the content they generate.

Account

Language

© 2026 YiAI Infrastructure.Unified Access Layer for AI Coding Workflows