Models
604 models, sorted by pricing (low to high)
| Model Name & ID | Input ($/1M tokens) | Output ($/1M tokens) | Context (tokens) |
|---|---|---|---|
| AllenAI: Olmo 3.1 32B Think (free) `allenai/olmo-3.1-32b-think:free` | $0 | $0 | 65,536 |
| Xiaomi: MiMo-V2-Flash (free) `xiaomi/mimo-v2-flash:free` | $0 | $0 | 262,144 |
| NVIDIA: Nemotron 3 Nano 30B A3B (free) `nvidia/nemotron-3-nano-30b-a3b:free` | $0 | $0 | 256,000 |
| Mistral: Devstral 2 2512 (free) `mistralai/devstral-2512:free` | $0 | $0 | 262,144 |
| Nex AGI: DeepSeek V3.1 Nex N1 (free) `nex-agi/deepseek-v3.1-nex-n1:free` | $0 | $0 | 131,072 |
| Arcee AI: Trinity Mini (free) `arcee-ai/trinity-mini:free` | $0 | $0 | 131,072 |
| AllenAI: Olmo 3 32B Think (free) `allenai/olmo-3-32b-think:free` | $0 | $0 | 65,536 |
| Kwaipilot: KAT-Coder-Pro V1 (free) `kwaipilot/kat-coder-pro:free` | $0 | $0 | 256,000 |
| NVIDIA: Nemotron Nano 12B 2 VL (free) `nvidia/nemotron-nano-12b-v2-vl:free` | $0 | $0 | 128,000 |
| NVIDIA: Nemotron Nano 9B V2 (free) `nvidia/nemotron-nano-9b-v2:free` | $0 | $0 | 128,000 |
| OpenAI: gpt-oss-120b (free) `openai/gpt-oss-120b:free` | $0 | $0 | 131,072 |
| OpenAI: gpt-oss-20b (free) `openai/gpt-oss-20b:free` | $0 | $0 | 131,072 |
| Z.AI: GLM 4.5 Air (free) `z-ai/glm-4.5-air:free` | $0 | $0 | 131,072 |
| Qwen: Qwen3 Coder 480B A35B (free) `qwen/qwen3-coder:free` | $0 | $0 | 262,000 |
| MoonshotAI: Kimi K2 0711 (free) `moonshotai/kimi-k2:free` | $0 | $0 | 32,768 |
| Venice: Uncensored (free) `cognitivecomputations/dolphin-mistral-24b-venice-edition:free` | $0 | $0 | 32,768 |
| Google: Gemma 3n 2B (free) `google/gemma-3n-e2b-it:free` | $0 | $0 | 8,192 |
| TNG: DeepSeek R1T2 Chimera (free) `tngtech/deepseek-r1t2-chimera:free` | $0 | $0 | 163,840 |
| DeepSeek: R1 0528 (free) `deepseek/deepseek-r1-0528:free` | $0 | $0 | 163,840 |
| Google: Gemma 3n 4B (free) `google/gemma-3n-e4b-it:free` | $0 | $0 | 8,192 |
| Qwen: Qwen3 4B (free) `qwen/qwen3-4b:free` | $0 | $0 | 40,960 |
| TNG: DeepSeek R1T Chimera (free) `tngtech/deepseek-r1t-chimera:free` | $0 | $0 | 163,840 |
| Mistral: Mistral Small 3.1 24B (free) `mistralai/mistral-small-3.1-24b-instruct:free` | $0 | $0 | 128,000 |
| Google: Gemma 3 4B (free) `google/gemma-3-4b-it:free` | $0 | $0 | 32,768 |
| Google: Gemma 3 12B (free) `google/gemma-3-12b-it:free` | $0 | $0 | 32,768 |
| Google: Gemma 3 27B (free) `google/gemma-3-27b-it:free` | $0 | $0 | 131,072 |
| Google: Gemini 2.0 Flash Experimental (free) `google/gemini-2.0-flash-exp:free` | $0 | $0 | 1,048,576 |
| Meta: Llama 3.3 70B Instruct (free) `meta-llama/llama-3.3-70b-instruct:free` | $0 | $0 | 131,072 |
| Meta: Llama 3.2 3B Instruct (free) `meta-llama/llama-3.2-3b-instruct:free` | $0 | $0 | 131,072 |
| Qwen: Qwen2.5-VL 7B Instruct (free) `qwen/qwen-2.5-vl-7b-instruct:free` | $0 | $0 | 32,768 |
| Nous: Hermes 3 405B Instruct (free) `nousresearch/hermes-3-llama-3.1-405b:free` | $0 | $0 | 131,072 |
| Meta: Llama 3.1 405B Instruct (free) `meta-llama/llama-3.1-405b-instruct:free` | $0 | $0 | 131,072 |
| Mistral: Mistral 7B Instruct (free) `mistralai/mistral-7b-instruct:free` | $0 (100% off) | $0 (100% off) | 32,768 |
| Thenlper: GTE-Base `thenlper/gte-base` | $0.005 | $0 | 512 |
| Intfloat: E5-Base-v2 `intfloat/e5-base-v2` | $0.005 | $0 | 512 |
| Sentence Transformers: paraphrase-MiniLM-L6-v2 `sentence-transformers/paraphrase-minilm-l6-v2` | $0.005 | $0 | 512 |
| Sentence Transformers: all-MiniLM-L12-v2 `sentence-transformers/all-minilm-l12-v2` | $0.005 | $0 | 512 |
| BAAI: bge-base-en-v1.5 `baai/bge-base-en-v1.5` | $0.005 | $0 | 512 |
| Sentence Transformers: multi-qa-mpnet-base-dot-v1 `sentence-transformers/multi-qa-mpnet-base-dot-v1` | $0.005 | $0 | 512 |
| Sentence Transformers: all-mpnet-base-v2 `sentence-transformers/all-mpnet-base-v2` | $0.005 | $0 | 512 |
| Sentence Transformers: all-MiniLM-L6-v2 `sentence-transformers/all-minilm-l6-v2` | $0.005 | $0 | 512 |
| Thenlper: GTE-Large `thenlper/gte-large` | $0.01 | $0 | 512 |
| Intfloat: E5-Large-v2 `intfloat/e5-large-v2` | $0.01 | $0 | 512 |
| Intfloat: Multilingual-E5-Large `intfloat/multilingual-e5-large` | $0.01 | $0 | 512 |
| BAAI: bge-large-en-v1.5 `baai/bge-large-en-v1.5` | $0.01 | $0 | 512 |
| BAAI: bge-m3 `baai/bge-m3` | $0.01 | $0 | 8,192 |
| Qwen: Qwen3 Embedding 8B `qwen/qwen3-embedding-8b` | $0.01 | $0 | 32,768 |
| OpenAI: Text Embedding 3 Small `openai/text-embedding-3-small` | $0.02 | $0 | 8,192 |
| Qwen: Qwen3 Embedding 4B `qwen/qwen3-embedding-4b` | $0.02 | $0 | 32,768 |
| OpenAI: gpt-oss-20b `openai/gpt-oss-20b` | $0.016 | $0.06 | 131,072 |
| Meta: Llama 3.2 3B Instruct `meta-llama/llama-3.2-3b-instruct` | $0.02 | $0.02 | 131,072 |
| Meta: Llama 3.1 8B Instruct `meta-llama/llama-3.1-8b-instruct` | $0.02 | $0.03 | 131,072 |
| Google: Gemma 3 4B `google/gemma-3-4b-it` | $0.01703 | $0.06815 | 96,000 |
| Google: Gemma 3n 4B `google/gemma-3n-e4b-it` | $0.02 | $0.04 | 32,768 |
| Mistral: Mistral Nemo `mistralai/mistral-nemo` | $0.02 | $0.04 | 131,072 |
| Llama Guard 3 8B `meta-llama/llama-guard-3-8b` | $0.02 | $0.06 | 131,072 |
| IBM: Granite 4.0 Micro `ibm-granite/granite-4.0-h-micro` | $0.017 | $0.11 | 131,000 |
| OpenAI: gpt-oss-120b `openai/gpt-oss-120b` | $0.02 | $0.10 | 131,072 |
| Nous: DeepHermes 3 Mistral 24B Preview `nousresearch/deephermes-3-mistral-24b-preview` | $0.02 | $0.10 | 32,768 |
| NousResearch: Hermes 2 Pro – Llama-3 8B `nousresearch/hermes-2-pro-llama-3-8b` | $0.025 | $0.08 | 8,192 |
| Mistral: Mistral 7B Instruct `mistralai/mistral-7b-instruct` | $0.028 | $0.054 | 32,768 |
| Meta: Llama 3 8B Instruct `meta-llama/llama-3-8b-instruct` | $0.03 | $0.06 | 8,192 |
| Qwen: Qwen2.5 Coder 7B Instruct `qwen/qwen2.5-coder-7b-instruct` | $0.03 | $0.09 | 32,768 |
| Google: Gemma 2 9B `google/gemma-2-9b-it` | $0.03 | $0.09 | 8,192 |
| Google: Gemma 3 12B `google/gemma-3-12b-it` | $0.03 | $0.10 | 131,072 |
| Mistral: Mistral Small 3.1 24B `mistralai/mistral-small-3.1-24b-instruct` | $0.03 | $0.11 | 131,072 |
| Mistral: Mistral Small 3 `mistralai/mistral-small-24b-instruct-2501` | $0.03 | $0.11 | 32,768 |
| DeepSeek: R1 Distill Llama 70B `deepseek/deepseek-r1-distill-llama-70b` | $0.03 | $0.11 | 131,072 |
| Qwen2.5 Coder 32B Instruct `qwen/qwen-2.5-coder-32b-instruct` | $0.03 | $0.11 | 32,768 |
| Google: Gemma 3 27B `google/gemma-3-27b-it` | $0.036 | $0.064 | 131,072 |
| Mistral: Ministral 3B `mistralai/ministral-3b` | $0.04 | $0.04 | 131,072 |
| Sao10K: Llama 3 8B Lunaris `sao10k/l3-lunaris-8b` | $0.04 | $0.05 | 8,192 |
| Meta: Llama 3.2 1B Instruct `meta-llama/llama-3.2-1b-instruct` | $0.027 | $0.20 | 60,000 |
| THUDM: GLM 4.1V 9B Thinking `thudm/glm-4.1v-9b-thinking` | $0.035 | $0.138 | 65,536 |
| Qwen: Qwen3 8B `qwen/qwen3-8b` | $0.035 | $0.138 | 128,000 |
| Amazon: Nova Micro 1.0 `amazon/nova-micro-v1` | $0.035 | $0.14 | 128,000 |
| Qwen: Qwen2.5 7B Instruct `qwen/qwen-2.5-7b-instruct` | $0.04 | $0.10 | 32,768 |
| Cohere: Command R7B (12-2024) `cohere/command-r7b-12-2024` | $0.0375 | $0.15 | 128,000 |
| Meta: Llama 3.2 11B Vision Instruct `meta-llama/llama-3.2-11b-vision-instruct` | $0.049 | $0.049 | 131,072 |
| NVIDIA: Nemotron Nano 9B V2 `nvidia/nemotron-nano-9b-v2` | $0.04 | $0.16 | 131,072 |
| OpenAI: gpt-oss-120b (exacto) `openai/gpt-oss-120b:exacto` | $0.039 | $0.19 | 131,072 |
| Arcee AI: Trinity Mini `arcee-ai/trinity-mini` | $0.045 | $0.15 | 131,072 |
| LiquidAI/LFM2-8B-A1B `liquid/lfm2-8b-a1b` | $0.05 | $0.10 | 32,768 |
| LiquidAI/LFM2-2.6B `liquid/lfm-2.2-6b` | $0.05 | $0.10 | 32,768 |
| Microsoft: Phi 4 Multimodal Instruct `microsoft/phi-4-multimodal-instruct` | $0.05 | $0.10 | 131,072 |
| MythoMax 13B `gryphe/mythomax-l2-13b` | $0.06 | $0.06 | 4,096 |
| DeepSeek: DeepSeek R1 0528 Qwen3 8B `deepseek/deepseek-r1-0528-qwen3-8b` | $0.06 | $0.09 | 128,000 |
| AllenAI: Olmo 2 32B Instruct `allenai/olmo-2-0325-32b-instruct` | $0.05 | $0.20 | 128,000 |
| Qwen: Qwen-Turbo `qwen/qwen-turbo` | $0.05 | $0.20 | 1,000,000 |
| Mistral: Devstral 2 2512 `mistralai/devstral-2512` | $0.05 | $0.22 | 262,144 |
| Z.AI: GLM 4.5 Air `z-ai/glm-4.5-air` | $0.05 | $0.22 | 131,072 |
| Mistral: Devstral Small 2505 `mistralai/devstral-small-2505` | $0.06 | $0.12 | 128,000 |
| Qwen: Qwen3 14B `qwen/qwen3-14b` | $0.05 | $0.22 | 40,960 |
| Qwen: Qwen2.5 VL 32B Instruct `qwen/qwen2.5-vl-32b-instruct` | $0.05 | $0.22 | 16,384 |
| Microsoft: Phi 4 `microsoft/phi-4` | $0.06 | $0.14 | 16,384 |
| Mistral: Mistral Small 3.2 24B `mistralai/mistral-small-3.2-24b-instruct` | $0.06 | $0.18 | 131,072 |
| Qwen: Qwen3 30B A3B `qwen/qwen3-30b-a3b` | $0.06 | $0.22 | 40,960 |
| NVIDIA: Nemotron 3 Nano 30B A3B `nvidia/nemotron-3-nano-30b-a3b` | $0.06 | $0.24 | 262,144 |
| Amazon: Nova Lite 1.0 `amazon/nova-lite-v1` | $0.06 | $0.24 | 300,000 |
| Qwen: Qwen3 30B A3B Thinking 2507 `qwen/qwen3-30b-a3b-thinking-2507` | $0.051 | $0.34 | 32,768 |
| OpenAI: GPT-5 Nano `openai/gpt-5-nano` | $0.05 | $0.40 | 400,000 |
| Qwen: Qwen3 Coder 30B A3B Instruct `qwen/qwen3-coder-30b-a3b-instruct` | $0.07 | $0.27 | 160,000 |
| Baidu: ERNIE 4.5 21B A3B Thinking `baidu/ernie-4.5-21b-a3b-thinking` | $0.07 | $0.28 | 131,072 |
| Baidu: ERNIE 4.5 21B A3B `baidu/ernie-4.5-21b-a3b` | $0.07 | $0.28 | 120,000 |
| Mistral: Devstral Small 1.1 `mistralai/devstral-small` | $0.07 | $0.28 | 128,000 |
| Mistral: Mistral Embed 2312 `mistralai/mistral-embed-2312` | $0.10 | $0 | 8,192 |
| OpenAI: Text Embedding Ada 002 `openai/text-embedding-ada-002` | $0.10 | $0 | 8,192 |
| Qwen: Qwen3 32B `qwen/qwen3-32b` | $0.08 | $0.24 | 40,960 |
| ByteDance Seed: Seed 1.6 Flash `bytedance-seed/seed-1.6-flash` | $0.075 | $0.30 | 262,144 |
| OpenAI: gpt-oss-safeguard-20b `openai/gpt-oss-safeguard-20b` | $0.075 | $0.30 | 131,072 |
| Microsoft: Phi 4 Reasoning Plus `microsoft/phi-4-reasoning-plus` | $0.07 | $0.35 | 32,768 |
| Google: Gemini 2.0 Flash Lite `google/gemini-2.0-flash-lite-001` | $0.075 | $0.30 | 1,048,576 |
| Mistral: Ministral 3 3B 2512 `mistralai/ministral-3b-2512` | $0.10 | $0.10 | 131,072 |
| Z.AI: GLM 4 32B `z-ai/glm-4-32b` | $0.10 | $0.10 | 128,000 |
| Meta: Llama 4 Scout `meta-llama/llama-4-scout` | $0.08 | $0.30 | 327,680 |
| Mistral: Ministral 8B `mistralai/ministral-8b` | $0.10 | $0.10 | 131,072 |
| Mistral: Pixtral 12B `mistralai/pixtral-12b` | $0.10 | $0.10 | 32,768 |
| Microsoft: Phi-3.5 Mini 128K Instruct `microsoft/phi-3.5-mini-128k-instruct` | $0.10 | $0.10 | 128,000 |
| Microsoft: Phi-3 Mini 128K Instruct `microsoft/phi-3-mini-128k-instruct` | $0.10 | $0.10 | 128,000 |
| Qwen: Qwen3 30B A3B Instruct 2507 `qwen/qwen3-30b-a3b-instruct-2507` | $0.08 | $0.33 | 262,144 |
| Qwen: Qwen3 235B A22B Instruct 2507 `qwen/qwen3-235b-a22b-2507` | $0.071 | $0.463 | 262,144 |
| AllenAI: Olmo 3 7B Instruct `allenai/olmo-3-7b-instruct` | $0.10 | $0.20 | 65,536 |
| Qwen: Qwen3 Next 80B A3B Instruct `qwen/qwen3-next-80b-a3b-instruct` | $0.06 | $0.60 | 262,144 |
| ByteDance: UI-TARS 7B `bytedance/ui-tars-1.5-7b` | $0.10 | $0.20 | 128,000 |
| Mistral: Mistral 7B Instruct v0.1 `mistralai/mistral-7b-instruct-v0.1` | $0.11 | $0.19 | 2,824 |
| Mistral: Mistral Small Creative `mistralai/mistral-small-creative` | $0.10 | $0.30 | 32,768 |
| OpenAI: Text Embedding 3 Large `openai/text-embedding-3-large` | $0.13 | $0 | 8,192 |
| Mistral: Voxtral Small 24B 2507 `mistralai/voxtral-small-24b-2507` | $0.10 | $0.30 | 32,000 |
| Qwen: Qwen3 VL 8B Instruct `qwen/qwen3-vl-8b-instruct` | $0.08 | $0.50 | 131,072 |
| Tongyi DeepResearch 30B A3B `alibaba/tongyi-deepresearch-30b-a3b` | $0.09 | $0.40 | 131,072 |
| Meta: Llama 3.3 70B Instruct `meta-llama/llama-3.3-70b-instruct` | $0.10 | $0.32 | 131,072 |
| OpenGVLab: InternVL3 78B `opengvlab/internvl3-78b` | $0.10 | $0.39 | 32,768 |
| AllenAI: Olmo 3 7B Think `allenai/olmo-3-7b-think` | $0.12 | $0.20 | 65,536 |
| NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 `nvidia/llama-3.3-nemotron-super-49b-v1.5` | $0.10 | $0.40 | 131,072 |
| Google: Gemini 2.5 Flash Lite Preview 09-2025 `google/gemini-2.5-flash-lite-preview-09-2025` | $0.10 | $0.40 | 1,048,576 |
| Google: Gemini 2.5 Flash Lite `google/gemini-2.5-flash-lite` | $0.10 | $0.40 | 1,048,576 |
| OpenAI: GPT-4.1 Nano `openai/gpt-4.1-nano` | $0.10 | $0.40 | 1,047,576 |
| Google: Gemini 2.0 Flash `google/gemini-2.0-flash-001` | $0.10 | $0.40 | 1,048,576 |
| Nous: Hermes 4 70B `nousresearch/hermes-4-70b` | $0.11 | $0.38 | 131,072 |
| Google: Gemini Embedding 001 `google/gemini-embedding-001` | $0.15 | $0 | 20,000 |
| Mistral: Codestral Embed 2505 `mistralai/codestral-embed-2505` | $0.15 | $0 | 8,192 |
| NeverSleep: Lumimaid v0.2 8B `neversleep/llama-3.1-lumimaid-8b` | $0.09 | $0.60 | 32,768 |
| Qwen2.5 72B Instruct `qwen/qwen-2.5-72b-instruct` | $0.12 | $0.39 | 32,768 |
| EssentialAI: Rnj 1 Instruct `essentialai/rnj-1-instruct` | $0.15 | $0.15 | 32,768 |
| Mistral: Ministral 3 8B 2512 `mistralai/ministral-8b-2512` | $0.15 | $0.15 | 262,144 |
| DeepSeek: R1 Distill Qwen 14B `deepseek/deepseek-r1-distill-qwen-14b` | $0.15 | $0.15 | 32,768 |
| MiniMax: MiniMax M2.1 `minimax/minimax-m2.1` | $0.12 | $0.48 | 196,608 |
| Qwen: Qwen3 235B A22B Thinking 2507 `qwen/qwen3-235b-a22b-thinking-2507` | $0.11 | $0.60 | 262,144 |
| Qwen: Qwen3 VL 235B A22B Instruct `qwen/qwen3-vl-235b-a22b-instruct` | $0.12 | $0.56 | 262,144 |
| Qwen: QwQ 32B `qwen/qwq-32b` | $0.15 | $0.40 | 32,768 |
| Baidu: ERNIE 4.5 VL 28B A3B `baidu/ernie-4.5-vl-28b-a3b` | $0.14 | $0.56 | 30,000 |
| Tencent: Hunyuan A13B Instruct `tencent/hunyuan-a13b-instruct` | $0.14 | $0.57 | 131,072 |
| Arcee AI: Spotlight `arcee-ai/spotlight` | $0.18 | $0.18 | 131,072 |
| Meta: Llama Guard 4 12B `meta-llama/llama-guard-4-12b` | $0.18 | $0.18 | 163,840 |
| Qwen: Qwen3 VL 30B A3B Instruct `qwen/qwen3-vl-30b-a3b-instruct` | $0.15 | $0.60 | 262,144 |
| Meta: Llama 4 Maverick `meta-llama/llama-4-maverick` | $0.15 | $0.60 | 1,048,576 |
| Qwen: Qwen2.5 VL 72B Instruct `qwen/qwen2.5-vl-72b-instruct` | $0.15 | $0.60 | 32,768 |
| Cohere: Command R (08-2024) `cohere/command-r-08-2024` | $0.15 | $0.60 | 128,000 |
| OpenAI: GPT-4o-mini (2024-07-18) `openai/gpt-4o-mini-2024-07-18` | $0.15 | $0.60 | 128,000 |
| OpenAI: GPT-4o-mini `openai/gpt-4o-mini` | $0.15 | $0.60 | 128,000 |
| TheDrummer: Rocinante 12B `thedrummer/rocinante-12b` | $0.17 | $0.43 | 32,768 |
| Mistral: Ministral 3 14B 2512 `mistralai/ministral-14b-2512` | $0.20 | $0.20 | 262,144 |
| Qwen: Qwen2.5-VL 7B Instruct `qwen/qwen-2.5-vl-7b-instruct` | $0.20 | $0.20 | 32,768 |
| Mistral: Mistral 7B Instruct v0.3 `mistralai/mistral-7b-instruct-v0.3` | $0.20 | $0.20 | 32,768 |
| Meta: LlamaGuard 2 8B `meta-llama/llama-guard-2-8b` | $0.20 | $0.20 | 8,192 |
| Mistral: Mistral 7B Instruct v0.2 `mistralai/mistral-7b-instruct-v0.2` | $0.20 | $0.20 | 32,768 |
| DeepSeek: DeepSeek V3.1 `deepseek/deepseek-chat-v3.1` | $0.15 | $0.75 | 32,768 |
| Qwen: Qwen3 235B A22B `qwen/qwen3-235b-a22b` | $0.18 | $0.54 | 40,960 |
| Cogito V2 Preview Llama 109B `deepcogito/cogito-v2-preview-llama-109b-moe` | $0.18 | $0.59 | 32,767 |
| Z.AI: GLM 4.7 `z-ai/glm-4.7` | $0.16 | $0.80 | 202,752 |
| AI21: Jamba Mini 1.7 `ai21/jamba-mini-1.7` | $0.20 | $0.40 | 256,000 |
| DeepSeek: DeepSeek V3.2 Exp `deepseek/deepseek-v3.2-exp` | $0.21 | $0.32 | 163,840 |
| xAI: Grok 4.1 Fast `x-ai/grok-4.1-fast` | $0.20 | $0.50 | 2,000,000 |
| xAI: Grok 4 Fast `x-ai/grok-4-fast` | $0.20 | $0.50 | 2,000,000 |
| NVIDIA: Nemotron Nano 12B 2 VL `nvidia/nemotron-nano-12b-v2-vl` | $0.20 | $0.60 | 131,072 |
| Mistral: Saba `mistralai/mistral-saba` | $0.20 | $0.60 | 32,768 |
| Qwen: Qwen3 Next 80B A3B Thinking `qwen/qwen3-next-80b-a3b-thinking` | $0.15 | $1.20 | 262,144 |
| Qwen: Qwen VL Plus `qwen/qwen-vl-plus` | $0.21 | $0.63 | 7,500 |
| Mistral Tiny `mistralai/mistral-tiny` | $0.25 | $0.25 | 32,768 |
| DeepSeek: DeepSeek V3 0324 `deepseek/deepseek-chat-v3-0324` | $0.19 | $0.87 | 163,840 |
| Meituan: LongCat Flash Chat `meituan/longcat-flash-chat` | $0.20 | $0.80 | 131,072 |
| DeepSeek: DeepSeek V3.2 `deepseek/deepseek-v3.2` | $0.25 | $0.38 | 163,840 |
| DeepSeek: DeepSeek V3.1 Terminus (exacto) `deepseek/deepseek-v3.1-terminus:exacto` | $0.21 | $0.79 | 163,840 |
| DeepSeek: DeepSeek V3.1 Terminus `deepseek/deepseek-v3.1-terminus` | $0.21 | $0.79 | 163,840 |
| Kwaipilot: KAT-Coder-Pro V1 `kwaipilot/kat-coder-pro` | $0.207 (31% off) | $0.828 (31% off) | 256,000 |
| DeepSeek: R1 Distill Qwen 32B `deepseek/deepseek-r1-distill-qwen-32b` | $0.27 | $0.27 | 131,072 |
| MiniMax: MiniMax M2 `minimax/minimax-m2` | $0.20 | $1 | 196,608 |
| Qwen: Qwen3 VL 30B A3B Thinking `qwen/qwen3-vl-30b-a3b-thinking` | $0.20 | $1 | 131,072 |
| Prime Intellect: INTELLECT-3 `prime-intellect/intellect-3` | $0.20 | $1.10 | 131,072 |
| MiniMax: MiniMax-01 `minimax/minimax-01` | $0.20 | $1.10 | 1,000,192 |
| DeepSeek: DeepSeek V3.2 Speciale `deepseek/deepseek-v3.2-speciale` | $0.27 | $0.41 | 163,840 |
| Qwen: Qwen3 Coder 480B A35B `qwen/qwen3-coder` | $0.22 | $0.95 | 262,144 |
| Nous: Hermes 3 70B Instruct `nousresearch/hermes-3-llama-3.1-70b` | $0.30 | $0.30 | 65,536 |
| TNG: R1T Chimera `tngtech/tng-r1t-chimera` | $0.25 | $0.85 | 163,840 |
| TNG: DeepSeek R1T2 Chimera `tngtech/deepseek-r1t2-chimera` | $0.25 | $0.85 | 163,840 |
| Meta: Llama 3 70B Instruct `meta-llama/llama-3-70b-instruct` | $0.30 | $0.40 | 8,192 |
| TheDrummer: Cydonia 24B V4.1 `thedrummer/cydonia-24b-v4.1` | $0.30 | $0.50 | 131,072 |
| xAI: Grok Code Fast 1 `x-ai/grok-code-fast-1` | $0.20 | $1.50 | 256,000 |
| Inception: Mercury `inception/mercury` | $0.25 | $1 | 128,000 |
| xAI: Grok 3 Mini `x-ai/grok-3-mini` | $0.30 | $0.50 | 131,072 |
| Inception: Mercury Coder `inception/mercury-coder` | $0.25 | $1 | 128,000 |
| xAI: Grok 3 Mini Beta `x-ai/grok-3-mini-beta` | $0.30 | $0.50 | 131,072 |
| MoonshotAI: Kimi K2 Thinking `moonshotai/kimi-k2-thinking` | $0.32 | $0.48 | 262,144 |
| Anthropic: Claude 3 Haiku `anthropic/claude-3-haiku` | $0.25 | $1.25 | 200,000 |
| Z.AI: GLM 4.6V `z-ai/glm-4.6v` | $0.30 | $0.90 | 131,072 |
| Qwen: Qwen3 VL 8B Thinking `qwen/qwen3-vl-8b-thinking` | $0.18 | $2.10 | 256,000 |
| Mistral: Codestral 2508 `mistralai/codestral-2508` | $0.30 | $0.90 | 256,000 |
| Baidu: ERNIE 4.5 300B A47B `baidu/ernie-4.5-300b-a47b` | $0.28 | $1.10 | 123,000 |
| Meta: Llama 3.2 90B Vision Instruct `meta-llama/llama-3.2-90b-vision-instruct` | $0.35 | $0.40 | 32,768 |
| Qwen: Qwen3 Coder 480B A35B (exacto) `qwen/qwen3-coder:exacto` | $0.22 | $1.80 | 262,144 |
| MoonshotAI: Kimi Dev 72B `moonshotai/kimi-dev-72b` | $0.29 | $1.15 | 131,072 |
| TNG: DeepSeek R1T Chimera `tngtech/deepseek-r1t-chimera` | $0.30 | $1.20 | 163,840 |
| DeepSeek: DeepSeek V3 `deepseek/deepseek-chat` | $0.30 | $1.20 | 163,840 |
| TheDrummer: UnslopNemo 12B `thedrummer/unslopnemo-12b` | $0.40 | $0.40 | 32,768 |
| Meta: Llama 3.1 70B Instruct `meta-llama/llama-3.1-70b-instruct` | $0.40 | $0.40 | 131,072 |
| ByteDance Seed: Seed 1.6 `bytedance-seed/seed-1.6` | $0.25 | $2 | 262,144 |
| OpenAI: GPT-5.1-Codex-Mini `openai/gpt-5.1-codex-mini` | $0.25 | $2 | 400,000 |
| Qwen: Qwen3 Coder Flash `qwen/qwen3-coder-flash` | $0.30 | $1.50 | 128,000 |
| OpenAI: GPT-5 Mini `openai/gpt-5-mini` | $0.25 | $2 | 400,000 |
| Z.AI: GLM 4.6 `z-ai/glm-4.6` | $0.35 | $1.50 | 202,752 |
| Z.AI: GLM 4.5 `z-ai/glm-4.5` | $0.35 | $1.55 | 131,072 |
| ReMM SLERP 13B `undi95/remm-slerp-l2-13b` | $0.45 | $0.65 | 6,144 |
| Qwen: Qwen Plus 0728 `qwen/qwen-plus-2025-07-28` | $0.40 | $1.20 | 1,000,000 |
| Qwen: Qwen-Plus `qwen/qwen-plus` | $0.40 | $1.20 | 131,072 |
| WizardLM-2 8x22B `microsoft/wizardlm-2-8x22b` | $0.48 | $0.48 | 65,536 |
| Baidu: ERNIE 4.5 VL 424B A47B `baidu/ernie-4.5-vl-424b-a47b` | $0.42 | $1.25 | 123,000 |
| Amazon: Nova 2 Lite `amazon/nova-2-lite-v1` | $0.30 | $2.50 | 1,000,000 |
| Google: Gemini 2.5 Flash Image (Nano Banana) `google/gemini-2.5-flash-image` | $0.30 | $2.50 | 32,768 |
| Google: Gemini 2.5 Flash Preview 09-2025 `google/gemini-2.5-flash-preview-09-2025` | $0.30 | $2.50 | 1,048,576 |
| Google: Gemini 2.5 Flash Image Preview (Nano Banana) `google/gemini-2.5-flash-image-preview` | $0.30 | $2.50 | 32,768 |
| Google: Gemini 2.5 Flash `google/gemini-2.5-flash` | $0.30 | $2.50 | 1,048,576 |
| OpenAI: GPT-4.1 Mini `openai/gpt-4.1-mini` | $0.40 | $1.60 | 1,047,576 |
| DeepSeek: R1 0528 `deepseek/deepseek-r1-0528` | $0.40 | $1.75 | 163,840 |
| MoonshotAI: Kimi K2 0905 `moonshotai/kimi-k2-0905` | $0.39 | $1.90 | 262,144 |
| Arcee AI: Coder Large `arcee-ai/coder-large` | $0.50 | $0.80 | 32,768 |
| Mistral: Mixtral 8x7B Instruct `mistralai/mixtral-8x7b-instruct` | $0.54 | $0.54 | 32,768 |
| Mistral: Mistral Medium 3.1 `mistralai/mistral-medium-3.1` | $0.40 | $2 | 131,072 |
| Mistral: Devstral Medium `mistralai/devstral-medium` | $0.40 | $2 | 131,072 |
| Mistral: Mistral Medium 3 `mistralai/mistral-medium-3` | $0.40 | $2 | 131,072 |
| Z.AI: GLM 4.6 (exacto) `z-ai/glm-4.6:exacto` | $0.44 | $1.76 | 204,800 |
| MiniMax: MiniMax M1 `minimax/minimax-m1` | $0.40 | $2.20 | 1,000,000 |
| TheDrummer: Skyfall 36B V2 `thedrummer/skyfall-36b-v2` | $0.55 | $0.80 | 32,768 |
| Mistral: Mistral Large 3 2512 `mistralai/mistral-large-2512` | $0.50 | $1.50 | 262,144 |
| Qwen: Qwen3 VL 32B Instruct `qwen/qwen3-vl-32b-instruct` | $0.50 | $1.50 | 262,144 |
| OpenAI: GPT-3.5 Turbo `openai/gpt-3.5-turbo` | $0.50 | $1.50 | 16,385 |
| StepFun: Step3 `stepfun-ai/step3` | $0.57 | $1.42 | 65,536 |
| Google: Gemma 2 27B `google/gemma-2-27b-it` | $0.65 | $0.65 | 8,192 |
| DeepSeek: DeepSeek Prover V2 `deepseek/deepseek-prover-v2` | $0.50 | $2.18 | 163,840 |
| Sourceful: Riverflow V2 Fast Preview `sourceful/riverflow-v2-fast-preview` | $0 | $7.19 | 8,192 |
| Sao10K: Llama 3.3 Euryale 70B `sao10k/l3.3-euryale-70b` | $0.65 | $0.75 | 131,072 |
| Sao10K: Llama 3.1 Euryale 70B v2.2 `sao10k/l3.1-euryale-70b` | $0.65 | $0.75 | 32,768 |
| MoonshotAI: Kimi K2 0711 `moonshotai/kimi-k2` | $0.50 | $2.40 | 131,072 |
| Z.AI: GLM 4.5V `z-ai/glm-4.5v` | $0.60 | $1.80 | 65,536 |
| NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 `nvidia/llama-3.1-nemotron-ultra-253b-v1` | $0.60 | $1.80 | 131,072 |
| Google: Gemini 3 Flash Preview `google/gemini-3-flash-preview` | $0.50 | $3 | 1,048,576 |
| Qwen: Qwen3 VL 235B A22B Thinking `qwen/qwen3-vl-235b-a22b-thinking` | $0.45 | $3.50 | 262,144 |
| Qwen: Qwen Plus 0728 (thinking) `qwen/qwen-plus-2025-07-28:thinking` | $0.40 | $4 | 1,000,000 |
| Sourceful: Riverflow V2 Standard Preview `sourceful/riverflow-v2-standard-preview` | $0 | $8.38 | 8,192 |
| AionLabs: Aion-1.0-Mini `aion-labs/aion-1.0-mini` | $0.70 | $1.40 | 131,072 |
| MoonshotAI: Kimi K2 0905 (exacto) `moonshotai/kimi-k2-0905:exacto` | $0.60 | $2.50 | 262,144 |
| Mancer: Weaver (alpha) `mancer/weaver` | $0.75 | $1 | 8,000 |
| Arcee AI: Virtuoso Large `arcee-ai/virtuoso-large` | $0.75 | $1.20 | 131,072 |
| Morph: Morph V3 Fast `morph/morph-v3-fast` | $0.80 | $1.20 | 81,920 |
| EleutherAI: Llemma 7b `eleutherai/llemma_7b` | $0.80 | $1.20 | 4,096 |
| AlfredPros: CodeLLaMa 7B Instruct Solidity `alfredpros/codellama-7b-instruct-solidity` | $0.80 | $1.20 | 4,096 |
| DeepSeek: R1 `deepseek/deepseek-r1` | $0.70 | $2.40 | 163,840 |
| ByteDance Seed: Seedream 4.5 `bytedance-seed/seedream-4.5` | $0 | $9.581 | 4,096 |
| AionLabs: Aion-RP 1.0 (8B) `aion-labs/aion-rp-llama-3.1-8b` | $0.80 | $1.60 | 32,768 |
| Deep Cogito: Cogito V2 Preview Llama 70B `deepcogito/cogito-v2-preview-llama-70b` | $0.88 | $0.88 | 32,768 |
| Relace: Relace Apply 3 `relace/relace-apply-3` | $0.85 | $1.25 | 256,000 |
| Morph: Morph V3 Large `morph/morph-v3-large` | $0.90 | $1.90 | 262,144 |
| Nous: Hermes 3 405B Instruct `nousresearch/hermes-3-llama-3.1-405b` | $1 | $1 | 131,072 |
| Microsoft: Phi-3 Medium 128K Instruct `microsoft/phi-3-medium-128k-instruct` | $1 | $1 | 128,000 |
| Qwen: Qwen VL Max `qwen/qwen-vl-max` | $0.80 | $3.20 | 131,072 |
| Amazon: Nova Pro 1.0 `amazon/nova-pro-v1` | $0.80 | $3.20 | 300,000 |
| Noromaid 20B `neversleep/noromaid-20b` | $1 | $1.75 | 4,096 |
| Switchpoint Router `switchpoint/router` | $0.85 | $3.40 | 131,072 |
| Anthropic: Claude 3.5 Haiku (2024-10-22) `anthropic/claude-3.5-haiku-20241022` | $0.80 | $4 | 200,000 |
| Anthropic: Claude 3.5 Haiku `anthropic/claude-3.5-haiku` | $0.80 | $4 | 200,000 |
| OpenAI: GPT-3.5 Turbo (older v0613) `openai/gpt-3.5-turbo-0613` | $1 | $2 | 4,095 |
| Arcee AI: Maestro Reasoning `arcee-ai/maestro-reasoning` | $0.90 | $3.30 | 131,072 |
| Relace: Relace Search `relace/relace-search` | $1 | $3 | 256,000 |
| Nous: Hermes 4 405B `nousresearch/hermes-4-405b` | $1 | $3 | 131,072 |
| NVIDIA: Llama 3.1 Nemotron 70B Instruct `nvidia/llama-3.1-nemotron-70b-instruct` | $1.20 | $1.20 | 131,072 |
| Deep Cogito: Cogito v2.1 671B `deepcogito/cogito-v2.1-671b` | $1.25 | $1.25 | 128,000 |
| Anthropic: Claude Haiku 4.5 `anthropic/claude-haiku-4.5` | $1 | $5 | 200,000 |
| Qwen: Qwen3 Coder Plus `qwen/qwen3-coder-plus` | $1 | $5 | 128,000 |
| OpenAI: o4 Mini High `openai/o4-mini-high` | $1.10 | $4.40 | 200,000 |
| OpenAI: o4 Mini `openai/o4-mini` | $1.10 | $4.40 | 200,000 |
| OpenAI: o3 Mini High `openai/o3-mini-high` | $1.10 | $4.40 | 200,000 |
| OpenAI: o3 Mini `openai/o3-mini` | $1.10 | $4.40 | 200,000 |
| Sao10k: Llama 3 Euryale 70B v2.1 `sao10k/l3-euryale-70b` | $1.48 | $1.48 | 8,192 |
| OpenAI: GPT-3.5 Turbo Instruct `openai/gpt-3.5-turbo-instruct` | $1.50 | $2 | 4,095 |
| Sourceful: Riverflow V2 Max Preview `sourceful/riverflow-v2-max-preview` | $0 | $17.96 | 8,192 |
| Qwen: Qwen3 Max `qwen/qwen3-max` | $1.20 | $6 | 256,000 |
| OpenAI: Codex Mini `openai/codex-mini` | $1.50 | $6 | 200,000 |
| Qwen: Qwen-Max `qwen/qwen-max` | $1.60 | $6.40 | 32,768 |
| OpenAI: GPT-5.1-Codex-Max `openai/gpt-5.1-codex-max` | $1.25 | $10 | 400,000 |
| OpenAI: GPT-5.1 `openai/gpt-5.1` | $1.25 | $10 | 400,000 |
| OpenAI: GPT-5.1 Chat `openai/gpt-5.1-chat` | $1.25 | $10 | 128,000 |
| OpenAI: GPT-5.1-Codex `openai/gpt-5.1-codex` | $1.25 | $10 | 400,000 |
| OpenAI: GPT-5 Codex `openai/gpt-5-codex` | $1.25 | $10 | 400,000 |
| OpenAI: GPT-5 Chat `openai/gpt-5-chat` | $1.25 | $10 | 128,000 |
| OpenAI: GPT-5 `openai/gpt-5` | $1.25 | $10 | 400,000 |
| Google: Gemini 2.5 Pro `google/gemini-2.5-pro` | $1.25 | $10 | 1,048,576 |
| Google: Gemini 2.5 Pro Preview 06-05 `google/gemini-2.5-pro-preview` | $1.25 | $10 | 1,048,576 |
| Google: Gemini 2.5 Pro Preview 05-06 `google/gemini-2.5-pro-preview-05-06` | $1.25 | $10 | 1,048,576 |
| Mistral Large 2411 `mistralai/mistral-large-2411` | $2 | $6 | 131,072 |
| Mistral Large 2407 `mistralai/mistral-large-2407` | $2 | $6 | 131,072 |
| Mistral: Pixtral Large 2411 `mistralai/pixtral-large-2411` | $2 | $6 | 131,072 |
| Mistral: Mixtral 8x22B Instruct `mistralai/mixtral-8x22b-instruct` | $2 | $6 | 65,536 |
| Mistral Large `mistralai/mistral-large` | $2 | $6 | 128,000 |
| OpenAI: GPT-5 Image Mini `openai/gpt-5-image-mini` | $2.50 | $2 | 400,000 |
| OpenAI: o4 Mini Deep Research `openai/o4-mini-deep-research` | $2 | $8 | 200,000 |
| AI21: Jamba Large 1.7 `ai21/jamba-large-1.7` | $2 | $8 | 256,000 |
| OpenAI: o3 `openai/o3` | $2 | $8 | 200,000 |
| OpenAI: GPT-4.1 `openai/gpt-4.1` | $2 | $8 | 1,047,576 |
| Perplexity: Sonar Reasoning Pro `perplexity/sonar-reasoning-pro` | $2 | $8 | 128,000 |
| Perplexity: Sonar Deep Research `perplexity/sonar-deep-research` | $2 | $8 | 128,000 |
| OpenAI: GPT-5.2 Chat `openai/gpt-5.2-chat` | $1.75 | $14 | 128,000 |
| OpenAI: GPT-5.2 `openai/gpt-5.2` | $1.75 | $14 | 400,000 |
| Google: Nano Banana Pro (Gemini 3 Pro Image Preview) `google/gemini-3-pro-image-preview` | $2 | $12 | 65,536 |
| Google: Gemini 3 Pro Preview `google/gemini-3-pro-preview` | $2 | $12 | 1,048,576 |
| Sao10K: Llama 3.1 70B Hanami x1 `sao10k/l3.1-70b-hanami-x1` | $3 | $3 | 16,000 |
| OpenAI: GPT-3.5 Turbo 16k `openai/gpt-3.5-turbo-16k` | $3 | $4 | 16,385 |
| OpenAI: GPT-4o Audio `openai/gpt-4o-audio-preview` | $2.50 | $10 | 128,000 |
| Cohere: Command A `cohere/command-a` | $2.50 | $10 | 256,000 |
| OpenAI: GPT-4o (2024-11-20) `openai/gpt-4o-2024-11-20` | $2.50 | $10 | 128,000 |
| Magnum v4 72B `anthracite-org/magnum-v4-72b` | $3 | $5 | 16,384 |
| Inflection: Inflection 3 Pi `inflection/inflection-3-pi` | $2.50 | $10 | 8,000 |
| Inflection: Inflection 3 Productivity `inflection/inflection-3-productivity` | $2.50 | $10 | 8,000 |
| Cohere: Command R+ (08-2024) `cohere/command-r-plus-08-2024` | $2.50 | $10 | 128,000 |
| OpenAI: GPT-4o (2024-08-06) `openai/gpt-4o-2024-08-06` | $2.50 | $10 | 128,000 |
| OpenAI: GPT-4o `openai/gpt-4o` | $2.50 | $10 | 128,000 |
| Amazon: Nova Premier 1.0 `amazon/nova-premier-v1` | $2.50 | $12.50 | 1,000,000 |
| Deep Cogito: Cogito V2 Preview Llama 405B `deepcogito/cogito-v2-preview-llama-405b` | $3.50 | $3.50 | 32,768 |
| Meta: Llama 3.1 405B Instruct `meta-llama/llama-3.1-405b-instruct` | $3.50 | $3.50 | 10,000 |
| Black Forest Labs: FLUX.2 Pro `black-forest-labs/flux.2-pro` | $3.66 | $3.66 | 46,864 |
| Meta: Llama 3.1 405B (base) `meta-llama/llama-3.1-405b` | $4 | $4 | 32,768 |
| Anthropic: Claude Sonnet 4.5 `anthropic/claude-sonnet-4.5` | $3 | $15 | 1,000,000 |
| xAI: Grok 4 `x-ai/grok-4` | $3 | $15 | 256,000 |
| xAI: Grok 3 `x-ai/grok-3` | $3 | $15 | 131,072 |
| Anthropic: Claude Sonnet 4 `anthropic/claude-sonnet-4` | $3 | $15 | 1,000,000 |
| xAI: Grok 3 Beta `x-ai/grok-3-beta` | $3 | $15 | 131,072 |
| Perplexity: Sonar Pro `perplexity/sonar-pro` | $3 | $15 | 200,000 |
| Anthropic: Claude 3.7 Sonnet (thinking) `anthropic/claude-3.7-sonnet:thinking` | $3 | $15 | 200,000 |
| Anthropic: Claude 3.7 Sonnet `anthropic/claude-3.7-sonnet` | $3 | $15 | 200,000 |
| AionLabs: Aion-1.0 `aion-labs/aion-1.0` | $4 | $8 | 131,072 |
| SorcererLM 8x22B `raifle/sorcererlm-8x22b` | $4.50 | $4.50 | 16,000 |
| Perplexity: Sonar `perplexity/sonar` | $1 | $1 | 127,072 |
| Perplexity: Sonar Reasoning `perplexity/sonar-reasoning` | $1 | $5 | 127,000 |
| OpenAI: ChatGPT-4o `openai/chatgpt-4o-latest` | $5 | $15 | 128,000 |
| OpenAI: GPT-4o (2024-05-13) `openai/gpt-4o-2024-05-13` | $5 | $15 | 128,000 |
| Goliath 120B `alpindale/goliath-120b` | $6 | $8 | 6,144 |
| Anthropic: Claude Opus 4.5 `anthropic/claude-opus-4.5` | $5 | $25 | 200,000 |
| OpenAI: GPT-4o (extended) `openai/gpt-4o:extended` | $6 | $18 | 128,000 |
| Black Forest Labs: FLUX.2 Max `black-forest-labs/flux.2-max` | $7.32 | $7.32 | 46,864 |
| Anthropic: Claude 3.5 Sonnet `anthropic/claude-3.5-sonnet` | $6 | $30 | 200,000 |
| OpenAI: GPT-5 Image `openai/gpt-5-image` | $10 | $10 | 400,000 |
| OpenAI: GPT-4 Turbo `openai/gpt-4-turbo` | $10 | $30 | 128,000 |
| OpenAI: GPT-4 Turbo Preview `openai/gpt-4-turbo-preview` | $10 | $30 | 128,000 |
| OpenAI: GPT-4 Turbo (older v1106) `openai/gpt-4-1106-preview` | $10 | $30 | 128,000 |
| OpenAI: o3 Deep Research `openai/o3-deep-research` | $10 | $40 | 200,000 |
| Black Forest Labs: FLUX.2 Flex `black-forest-labs/flux.2-flex` | $14.64 | $14.64 | 67,344 |
| OpenAI: o1 `openai/o1` | $15 | $60 | 200,000 |
| Perplexity: Sonar Pro Search `perplexity/sonar-pro-search` | $3 | $15 | 200,000 |
| Anthropic: Claude Opus 4.1 `anthropic/claude-opus-4.1` | $15 | $75 | 200,000 |
| Anthropic: Claude Opus 4 `anthropic/claude-opus-4` | $15 | $75 | 200,000 |
| Anthropic: Claude 3 Opus `anthropic/claude-3-opus` | $15 | $75 | 200,000 |
| OpenAI: GPT-5 Pro `openai/gpt-5-pro` | $15 | $120 | 400,000 |
| OpenAI: GPT-4o-mini Search Preview `openai/gpt-4o-mini-search-preview` | $0.15 | $0.60 | 128,000 |
| OpenAI: o3 Pro `openai/o3-pro` | $20 | $80 | 200,000 |
| OpenAI: GPT-4 (older v0314) `openai/gpt-4-0314` | $30 | $60 | 8,191 |
| OpenAI: GPT-4 `openai/gpt-4` | $30 | $60 | 8,191 |
| OpenAI: GPT-5.2 Pro `openai/gpt-5.2-pro` | $21 | $168 | 400,000 |
| OpenAI: GPT-4o Search Preview `openai/gpt-4o-search-preview` | $2.50 | $10 | 128,000 |
| OpenAI: o1-pro `openai/o1-pro` | $150 | $600 | 200,000 |
| Body Builder (beta) `openrouter/bodybuilder` | — | — | 128,000 |
| Bert-Nebulon Alpha `openrouter/bert-nebulon-alpha` | — | — | 256,000 |
| Sherlock Dash Alpha `openrouter/sherlock-dash-alpha` | — | — | 1,840,000 |
| Sherlock Think Alpha `openrouter/sherlock-think-alpha` | — | — | 1,840,000 |
| Polaris Alpha `openrouter/polaris-alpha` | — | — | 256,000 |
| Qwen: Qwen3 Embedding 0.6B `qwen/qwen3-embedding-0.6b` | — | — | 8,192 |
| Andromeda Alpha `openrouter/andromeda-alpha` | — | — | 128,000 |
| Arcee AI: AFM 4.5B `arcee-ai/afm-4.5b` | — | — | 65,536 |
| Sonoma Dusk Alpha `openrouter/sonoma-dusk-alpha` | — | — | 2,000,000 |
| Sonoma Sky Alpha `openrouter/sonoma-sky-alpha` | — | — | 2,000,000 |
| ByteDance: Seed OSS 36B Instruct `bytedance/seed-oss-36b-instruct` | — | — | 131,072 |
| Deep Cogito: Cogito V2 Preview Deepseek 671B `deepcogito/cogito-v2-preview-deepseek-671b` | — | — | 131,072 |
| DeepSeek: DeepSeek V3.1 Base `deepseek/deepseek-v3.1-base` | — | — | 163,840 |
| Horizon Beta `openrouter/horizon-beta` | — | — | 256,000 |
| Horizon Alpha `openrouter/horizon-alpha` | — | — | 256,000 |
| Cypher Alpha `openrouter/cypher-alpha` | — | — | 1,000,000 |
| Morph: Fast Apply `morph/morph-v2` | — | — | 32,000 |
| Mistral: Magistral Small 2506 `mistralai/magistral-small-2506` | — | — | 40,000 |
| Mistral: Magistral Medium 2506 `mistralai/magistral-medium-2506` | — | — | 40,960 |
| SentientAGI: Dobby Mini Plus Llama 3.1 8B `sentientagi/dobby-mini-unhinged-plus-llama-3.1-8b` | — | — | 131,072 |
| DeepSeek: R1 Distill Qwen 7B `deepseek/deepseek-r1-distill-qwen-7b` | — | — | 131,072 |
| Google: Gemma 1 2B `google/gemma-2b-it` | — | — | 8,192 |
| Sarvam AI: Sarvam-M `sarvamai/sarvam-m` | — | — | 32,768 |
| TheDrummer: Valkyrie 49B V1 `thedrummer/valkyrie-49b-v1` | — | — | 131,072 |
| Meta: Llama 3.3 8B Instruct `meta-llama/llama-3.3-8b-instruct` | — | — | 128,000 |
| Arcee AI: Caller Large `arcee-ai/caller-large` | — | — | 32,768 |
| Arcee AI: Virtuoso Medium V2 `arcee-ai/virtuoso-medium-v2` | — | — | 131,072 |
| Arcee AI: Arcee Blitz `arcee-ai/arcee-blitz` | — | — | 32,768 |
| Microsoft: Phi 4 Reasoning `microsoft/phi-4-reasoning` | — | — | 32,768 |
| Qwen: Qwen3 0.6B `qwen/qwen3-0.6b-04-28` | — | — | 32,000 |
| Qwen: Qwen3 1.7B `qwen/qwen3-1.7b` | — | — | 32,000 |
| OpenGVLab: InternVL3 14B `opengvlab/internvl3-14b` | — | — | 32,000 |
| OpenGVLab: InternVL3 2B `opengvlab/internvl3-2b` | — | — | 32,000 |
| THUDM: GLM Z1 Rumination 32B `thudm/glm-z1-rumination-32b` | — | — | 32,000 |
| THUDM: GLM Z1 9B `thudm/glm-z1-9b` | — | — | 32,000 |
| THUDM: GLM 4 9B `thudm/glm-4-9b` | — | — | 32,000 |
| Microsoft: MAI DS R1 `microsoft/mai-ds-r1` | — | — | 163,840 |
| THUDM: GLM Z1 32B `thudm/glm-z1-32b` | — | — | 32,768 |
| THUDM: GLM 4 32B `thudm/glm-4-32b` | — | — | 32,768 |
| ArliAI: QwQ 32B RpR v1 `arliai/qwq-32b-arliai-rpr-v1` | — | — | 32,768 |
| Agentica: Deepcoder 14B Preview `agentica-org/deepcoder-14b-preview` | — | — | 96,000 |
| MoonshotAI: Kimi VL A3B Thinking `moonshotai/kimi-vl-a3b-thinking` | — | — | 131,072 |
| Optimus Alpha `openrouter/optimus-alpha` | — | — | 1,000,000 |
| NVIDIA: Llama 3.1 Nemotron Nano 8B v1 `nvidia/llama-3.1-nemotron-nano-8b-v1` | — | — | 131,072 |
| NVIDIA: Llama 3.3 Nemotron Super 49B v1 `nvidia/llama-3.3-nemotron-super-49b-v1` | — | — | 131,072 |
| Swallow: Llama 3.1 Swallow 8B Instruct V0.3 `tokyotech-llm/llama-3.1-swallow-8b-instruct-v0.3` | — | — | 16,384 |
| Quasar Alpha `openrouter/quasar-alpha` | — | — | 1,000,000 |
| OpenHands LM 32B V0.1 `all-hands/openhands-lm-32b-v0.1` | — | — | 131,072 |
| DeepSeek: DeepSeek V3 Base `deepseek/deepseek-v3-base` | — | — | 131,072 |
| Typhoon2 8B Instruct `scb10x/llama3.1-typhoon2-8b-instruct` | — | — | 8,192 |
| Typhoon2 70B Instruct `scb10x/llama3.1-typhoon2-70b-instruct` | — | — | 8,192 |
| Bytedance: UI-TARS 72B `bytedance-research/ui-tars-72b` | — | — | 32,768 |
| Qwen: Qwen2.5 VL 3B Instruct `qwen/qwen2.5-vl-3b-instruct` | — | — | 64,000 |
| Google: Gemini 2.5 Pro Experimental `google/gemini-2.5-pro-exp-03-25` | — | — | 1,048,576 |
| Qrwkv 72B `featherless/qwerky-72b` | — | — | 32,768 |
| OlympicCoder 32B `open-r1/olympiccoder-32b` | — | — | 32,768 |
| SteelSkull: L3.3 Electra R1 70B `steelskull/l3.3-electra-r1-70b` | — | — | 128,000 |
| Google: Gemma 3 1B `google/gemma-3-1b-it` | — | — | 32,000 |
| AI21: Jamba 1.6 Large `ai21/jamba-1.6-large` | — | — | 256,000 |
| AI21: Jamba Mini 1.6 `ai21/jamba-1.6-mini` | — | — | 256,000 |
| Reka: Flash 3 `rekaai/reka-flash-3` | — | — | 32,000 |
LatitudeGames: Wayfarer Large 70B Llama 3.3latitudegames/wayfarer-large-70b-llama-3.3 | — | — | 128,000 |
DeepSeek: DeepSeek R1 Zerodeepseek/deepseek-r1-zero | — | — | 163,840 |
Qwen: Qwen2.5 32B Instructqwen/qwen2.5-32b-instruct | — | — | 131,072 |
MoonshotAI: Moonlight 16B A3B Instructmoonshotai/moonlight-16b-a3b-instruct | — | — | 8,192 |
Nous: DeepHermes 3 Llama 3 8B Previewnousresearch/deephermes-3-llama-3-8b-preview | — | — | 131,072 |
OpenAI: GPT-4.5 (Preview)openai/gpt-4.5-preview | — | — | 128,000 |
Perplexity: R1 1776perplexity/r1-1776 | — | — | 128,000 |
Dolphin3.0 R1 Mistral 24Bcognitivecomputations/dolphin3.0-r1-mistral-24b | — | — | 32,768 |
Dolphin3.0 Mistral 24Bcognitivecomputations/dolphin3.0-mistral-24b | — | — | 32,768 |
Llama 3.1 Tulu 3 405B `allenai/llama-3.1-tulu-3-405b` | — | — | |
DeepSeek: R1 Distill Llama 8B `deepseek/deepseek-r1-distill-llama-8b` | — | — | |
DeepSeek: R1 Distill Qwen 1.5B `deepseek/deepseek-r1-distill-qwen-1.5b` | — | — | 131,072 |
Liquid: LFM 7B `liquid/lfm-7b` | — | — | 32,768 |
Liquid: LFM 3B `liquid/lfm-3b` | — | — | 32,768 |
Mistral: Codestral 2501 `mistralai/codestral-2501` | — | — | 256,000 |
Inflatebot: Mag Mell R1 12B `inflatebot/mn-mag-mell-r1` | — | — | 32,000 |
EVA Llama 3.33 70B `eva-unit-01/eva-llama-3.33-70b` | — | — | 16,384 |
xAI: Grok 2 Vision 1212 `x-ai/grok-2-vision-1212` | — | — | 32,768 |
xAI: Grok 2 1212 `x-ai/grok-2-1212` | — | — | 131,072 |
Qwen: QwQ 32B Preview `qwen/qwq-32b-preview` | — | — | 32,768 |
Google: Gemini Experimental 1121 `google/gemini-exp-1121` | — | — | 40,960 |
EVA Qwen2.5 72B `eva-unit-01/eva-qwen-2.5-72b` | — | — | 32,000 |
xAI: Grok Vision Beta `x-ai/grok-vision-beta` | — | — | 8,192 |
Google: Gemini Experimental 1114 `google/gemini-exp-1114` | — | — | 40,960 |
Infermatic: Mistral Nemo Inferor 12B `infermatic/mn-inferor-12b` | — | — | 32,000 |
EVA Qwen2.5 32B `eva-unit-01/eva-qwen-2.5-32b` | — | — | 32,000 |
NeverSleep: Lumimaid v0.2 70B `neversleep/llama-3.1-lumimaid-70b` | — | — | 131,072 |
xAI: Grok Beta `x-ai/grok-beta` | — | — | 131,072 |
xAI: Grok 2 `x-ai/grok-2` | — | — | 32,768 |
xAI: Grok 2 mini `x-ai/grok-2-mini` | — | — | 32,768 |
Google: Gemini 1.5 Flash 8B `google/gemini-flash-1.5-8b` | — | — | 1,000,000 |
Liquid: LFM 40B MoE `liquid/lfm-40b` | — | — | 32,768 |
EVA Qwen2.5 14B `eva-unit-01/eva-qwen-2.5-14b` | — | — | 32,768 |
Magnum v2 72B `anthracite-org/magnum-v2-72b` | — | — | 32,768 |
OpenAI: o1-mini (2024-09-12) `openai/o1-mini-2024-09-12` | — | — | 128,000 |
OpenAI: o1-mini `openai/o1-mini` | — | — | 128,000 |
OpenAI: o1-preview (2024-09-12) `openai/o1-preview-2024-09-12` | — | — | 128,000 |
OpenAI: o1-preview `openai/o1-preview` | — | — | 128,000 |
Reflection 70B `mattshumer/reflection-70b` | — | — | 131,072 |
Google: Gemini 1.5 Flash Experimental `google/gemini-flash-1.5-exp` | — | — | 1,000,000 |
Lynn: Llama 3 Soliloquy 7B v3 32K `lynn/soliloquy-v3` | — | — | 32,768 |
AI21: Jamba 1.5 Mini `ai21/jamba-1-5-mini` | — | — | 256,000 |
Yi 1.5 34B Chat `01-ai/yi-1.5-34b-chat` | — | — | 4,096 |
AI21: Jamba 1.5 Large `ai21/jamba-1-5-large` | — | — | 256,000 |
Aetherwiing: Starcannon 12B `aetherwiing/mn-starcannon-12b` | — | — | 12,000 |
01.AI: Yi Vision `01-ai/yi-vision` | — | — | 16,384 |
01.AI: Yi Large FC `01-ai/yi-large-fc` | — | — | 16,384 |
01.AI: Yi Large Turbo `01-ai/yi-large-turbo` | — | — | 4,096 |
Mistral Nemo 12B Celeste `nothingiisreal/mn-celeste-12b` | — | — | 32,000 |
Perplexity: Llama 3.1 Sonar 70B Online `perplexity/llama-3.1-sonar-large-128k-online` | — | — | 127,072 |
Perplexity: Llama 3.1 Sonar 8B Online `perplexity/llama-3.1-sonar-small-128k-online` | — | — | 127,072 |
Google: Gemini 1.5 Pro Experimental `google/gemini-pro-1.5-exp` | — | — | 1,000,000 |
Dolphin Llama 3 70B 🐬 `cognitivecomputations/dolphin-llama-3-70b` | — | — | 8,192 |
Mistral: Codestral Mamba `mistralai/codestral-mamba` | — | — | 256,000 |
Qwen 2 7B Instruct `qwen/qwen-2-7b-instruct` | — | — | 32,768 |
Magnum 72B `alpindale/magnum-72b` | — | — | 16,384 |
Nous: Hermes 2 Theta 8B `nousresearch/hermes-2-theta-llama-3-8b` | — | — | 16,384 |
Sao10K: Llama 3 Stheno 8B v3.3 32K `sao10k/l3-stheno-8b` | — | — | 32,000 |
AI21: Jamba Instruct `ai21/jamba-instruct` | — | — | 256,000 |
01.AI: Yi Large `01-ai/yi-large` | — | — | 32,768 |
NVIDIA: Nemotron-4 340B Instruct `nvidia/nemotron-4-340b-instruct` | — | — | 4,096 |
Anthropic: Claude 3.5 Sonnet (2024-06-20) `anthropic/claude-3.5-sonnet-20240620` | — | — | 200,000 |
Microsoft: Phi-3 Medium 4K Instruct `microsoft/phi-3-medium-4k-instruct` | — | — | 4,000 |
StarCoder2 15B Instruct `bigcode/starcoder2-15b-instruct` | — | — | 16,384 |
Dolphin 2.9.2 Mixtral 8x22B 🐬 `cognitivecomputations/dolphin-mixtral-8x22b` | — | — | 65,536 |
Qwen 2 72B Instruct `qwen/qwen-2-72b-instruct` | — | — | 32,768 |
OpenChat 3.6 8B `openchat/openchat-8b` | — | — | 8,192 |
NeverSleep: Llama 3 Lumimaid 70B `neversleep/llama-3-lumimaid-70b` | — | — | 8,192 |
Perplexity: Llama3 Sonar 70B `perplexity/llama-3-sonar-large-32k-chat` | — | — | 32,768 |
Perplexity: Llama3 Sonar 8B Online `perplexity/llama-3-sonar-small-32k-online` | — | — | 28,000 |
Perplexity: Llama3 Sonar 8B `perplexity/llama-3-sonar-small-32k-chat` | — | — | 32,768 |
DeepSeek V2.5 `deepseek/deepseek-chat-v2.5` | — | — | 128,000 |
Perplexity: Llama3 Sonar 70B Online `perplexity/llama-3-sonar-large-32k-online` | — | — | 28,000 |
Google: Gemini 1.5 Flash `google/gemini-flash-1.5` | — | — | 1,000,000 |
Meta: Llama 3 70B (Base) `meta-llama/llama-3-70b` | — | — | 8,192 |
Meta: Llama 3 8B (Base) `meta-llama/llama-3-8b` | — | — | 8,192 |
LLaVA v1.6 34B `liuhaotian/llava-yi-34b` | — | — | 4,096 |
OLMo 7B Instruct `allenai/olmo-7b-instruct` | — | — | 2,048 |
Qwen 1.5 4B Chat `qwen/qwen-4b-chat` | — | — | 32,768 |
Qwen 1.5 7B Chat `qwen/qwen-7b-chat` | — | — | 32,768 |
Qwen 1.5 14B Chat `qwen/qwen-14b-chat` | — | — | 32,768 |
Qwen 1.5 32B Chat `qwen/qwen-32b-chat` | — | — | 32,768 |
Qwen 1.5 72B Chat `qwen/qwen-72b-chat` | — | — | 32,768 |
Qwen 1.5 110B Chat `qwen/qwen-110b-chat` | — | — | 32,768 |
NeverSleep: Llama 3 Lumimaid 8B `neversleep/llama-3-lumimaid-8b` | — | — | 24,576 |
Snowflake: Arctic Instruct `snowflake/snowflake-arctic-instruct` | — | — | 4,096 |
Fireworks: FireLLaVA 13B `fireworks/firellava-13b` | — | — | 4,096 |
Lynn: Llama 3 Soliloquy 8B v2 `lynn/soliloquy-l3` | — | — | 24,576 |
Fimbulvetr 11B v2 `sao10k/fimbulvetr-11b-v2` | — | — | 8,192 |
WizardLM-2 7B `microsoft/wizardlm-2-7b` | — | — | 32,000 |
Zephyr 141B-A35B `huggingfaceh4/zephyr-orpo-141b-a35b` | — | — | 65,536 |
Mistral: Mixtral 8x22B (base) `mistralai/mixtral-8x22b` | — | — | 65,536 |
Google: Gemini 1.5 Pro `google/gemini-pro-1.5` | — | — | 2,000,000 |
Cohere: Command R+ `cohere/command-r-plus` | — | — | 128,000 |
Cohere: Command R+ (04-2024) `cohere/command-r-plus-04-2024` | — | — | 128,000 |
Databricks: DBRX 132B Instruct `databricks/dbrx-instruct` | — | — | 32,768 |
Midnight Rose 70B `sophosympatheia/midnight-rose-70b` | — | — | 4,096 |
Cohere: Command R `cohere/command-r` | — | — | 128,000 |
Cohere: Command `cohere/command` | — | — | 4,096 |
Anthropic: Claude 3 Sonnet `anthropic/claude-3-sonnet` | — | — | 200,000 |
Cohere: Command R (03-2024) `cohere/command-r-03-2024` | — | — | 128,000 |
Google: Gemma 7B `google/gemma-7b-it` | — | — | 8,192 |
Nous: Hermes 2 Mistral 7B DPO `nousresearch/nous-hermes-2-mistral-7b-dpo` | — | — | 8,192 |
Meta: CodeLlama 70B Instruct `meta-llama/codellama-70b-instruct` | — | — | 2,048 |
RWKV v5: Eagle 7B `recursal/eagle-7b` | — | — | 10,000 |
Yi 34B 200K `01-ai/yi-34b-200k` | — | — | 200,000 |
Nous: Hermes 2 Mixtral 8x7B SFT `nousresearch/nous-hermes-2-mixtral-8x7b-sft` | — | — | 32,768 |
Nous: Hermes 2 Mixtral 8x7B DPO `nousresearch/nous-hermes-2-mixtral-8x7b-dpo` | — | — | 32,768 |
Mistral Small `mistralai/mistral-small` | — | — | 32,000 |
Mistral Medium `mistralai/mistral-medium` | — | — | 32,000 |
Bagel 34B v0.2 `jondurbin/bagel-34b` | — | — | 200,000 |
Noromaid Mixtral 8x7B Instruct `neversleep/noromaid-mixtral-8x7b-instruct` | — | — | 8,000 |
Nous: Hermes 2 Yi 34B `nousresearch/nous-hermes-yi-34b` | — | — | 4,096 |
Dolphin 2.6 Mixtral 8x7B 🐬 `cognitivecomputations/dolphin-mixtral-8x7b` | — | — | 32,768 |
RWKV v5 3B AI Town `recursal/rwkv-5-3b-ai-town` | — | — | 10,000 |
RWKV v5 World 3B `rwkv/rwkv-5-world-3b` | — | — | 10,000 |
StripedHyena Hessian 7B (base) `togethercomputer/stripedhyena-hessian-7b` | — | — | 32,768 |
StripedHyena Nous 7B `togethercomputer/stripedhyena-nous-7b` | — | — | 32,768 |
Psyfighter v2 13B `koboldai/psyfighter-13b-2` | — | — | 4,096 |
Nous: Hermes 2 Vision 7B (alpha) `nousresearch/nous-hermes-2-vision-7b` | — | — | 4,096 |
MythoMist 7B `gryphe/mythomist-7b` | — | — | 32,768 |
Yi 6B (base) `01-ai/yi-6b` | — | — | 4,096 |
Yi 34B Chat `01-ai/yi-34b-chat` | — | — | 4,096 |
Yi 34B (base) `01-ai/yi-34b` | — | — | 4,096 |
Cinematika 7B (alpha) `openrouter/cinematika-7b` | — | — | 8,000 |
Nous: Capybara 7B `nousresearch/nous-capybara-7b` | — | — | 8,192 |
Psyfighter 13B `jebcarter/psyfighter-13b` | — | — | 4,096 |
OpenChat 3.5 7B `openchat/openchat-7b` | — | — | 8,192 |
Neural Chat 7B v3.1 `intel/neural-chat-7b` | — | — | 4,096 |
Anthropic: Claude Instant v1.1 `anthropic/claude-instant-1.1` | — | — | 100,000 |
Anthropic: Claude v2 `anthropic/claude-2` | — | — | 200,000 |
Anthropic: Claude v2.1 `anthropic/claude-2.1` | — | — | 200,000 |
OpenHermes 2.5 Mistral 7B `teknium/openhermes-2.5-mistral-7b` | — | — | 4,096 |
LLaVA 13B `liuhaotian/llava-13b` | — | — | 2,048 |
Nous: Capybara 34B `nousresearch/nous-capybara-34b` | — | — | 200,000 |
OpenAI: GPT-4 Vision `openai/gpt-4-vision-preview` | — | — | 128,000 |
lzlv 70B `lizpreciatior/lzlv-70b-fp16-hf` | — | — | 4,096 |
Toppy M 7B `undi95/toppy-m-7b` | — | — | 4,096 |
Auto Router `openrouter/auto` | — | — | 2,000,000 |
OpenAI: GPT-3.5 Turbo 16k (older v1106) `openai/gpt-3.5-turbo-1106` | — | — | 16,385 |
Google: PaLM 2 Code Chat 32k `google/palm-2-codechat-bison-32k` | — | — | 32,760 |
Google: PaLM 2 Chat 32k `google/palm-2-chat-bison-32k` | — | — | 32,760 |
OpenHermes 2 Mistral 7B `teknium/openhermes-2-mistral-7b` | — | — | 8,192 |
Mistral OpenOrca 7B `open-orca/mistral-7b-openorca` | — | — | 8,192 |
Airoboros 70B `jondurbin/airoboros-l2-70b` | — | — | 4,096 |
Nous: Hermes 70B `nousresearch/nous-hermes-llama2-70b` | — | — | 4,096 |
Xwin 70B `xwin-lm/xwin-lm-70b` | — | — | 8,192 |
Synthia 70B `migtissera/synthia-70b` | — | — | 8,192 |
Pygmalion: Mythalion 13B `pygmalionai/mythalion-13b` | — | — | 8,192 |
OpenAI: GPT-4 32k (older v0314) `openai/gpt-4-32k-0314` | — | — | 32,767 |
OpenAI: GPT-4 32k `openai/gpt-4-32k` | — | — | 32,767 |
Nous: Hermes 13B `nousresearch/nous-hermes-llama2-13b` | — | — | 4,096 |
Phind: CodeLlama 34B v2 `phind/phind-codellama-34b` | — | — | 4,096 |
Meta: CodeLlama 34B Instruct `meta-llama/codellama-34b-instruct` | — | — | 8,192 |
Hugging Face: Zephyr 7B `huggingfaceh4/zephyr-7b-beta` | — | — | 4,096 |
Anthropic: Claude Instant v1.0 `anthropic/claude-instant-1.0` | — | — | 100,000 |
Anthropic: Claude v1.2 `anthropic/claude-1.2` | — | — | 100,000 |
Anthropic: Claude v1 `anthropic/claude-1` | — | — | 100,000 |
Anthropic: Claude Instant v1 `anthropic/claude-instant-1` | — | — | 100,000 |
Anthropic: Claude v2.0 `anthropic/claude-2.0` | — | — | 100,000 |
Google: PaLM 2 Code Chat `google/palm-2-codechat-bison` | — | — | 7,168 |
Google: PaLM 2 Chat `google/palm-2-chat-bison` | — | — | 9,216 |
Meta: Llama 2 70B Chat `meta-llama/llama-2-70b-chat` | — | — | 4,096 |
Meta: Llama 2 13B Chat `meta-llama/llama-2-13b-chat` | — | — | 4,096 |
OpenAI: GPT-3.5 Turbo 16k `openai/gpt-3.5-turbo-0125` | — | — | 16,385 |
OpenAI: GPT-3.5 Turbo (older v0301) `openai/gpt-3.5-turbo-0301` | — | — | 4,095 |