openrouter pric

Models

604 modelsReset FiltersPricing: Low to HighNewestTop WeeklyPricing: Low to HighPricing: High to LowContext: High to LowThroughput: High to LowLatency: Low to High

00

Model Name & IDInput ($/1M tokens)Output ($/1M tokens)Context (tokens)
AllenAI: Olmo 3.1 32B Think (free)allenai/olmo-3.1-32b-think:free$0$065,536
Xiaomi: MiMo-V2-Flash (free)xiaomi/mimo-v2-flash:free$0$0262,144
NVIDIA: Nemotron 3 Nano 30B A3B (free)nvidia/nemotron-3-nano-30b-a3b:free$0$0256,000
Mistral: Devstral 2 2512 (free)mistralai/devstral-2512:free$0$0262,144
Nex AGI: DeepSeek V3.1 Nex N1 (free)nex-agi/deepseek-v3.1-nex-n1:free$0$0131,072
Arcee AI: Trinity Mini (free)arcee-ai/trinity-mini:free$0$0131,072
AllenAI: Olmo 3 32B Think (free)allenai/olmo-3-32b-think:free$0$065,536
Kwaipilot: KAT-Coder-Pro V1 (free)kwaipilot/kat-coder-pro:free$0$0256,000
NVIDIA: Nemotron Nano 12B 2 VL (free)nvidia/nemotron-nano-12b-v2-vl:free$0$0128,000
NVIDIA: Nemotron Nano 9B V2 (free)nvidia/nemotron-nano-9b-v2:free$0$0128,000
OpenAI: gpt-oss-120b (free)openai/gpt-oss-120b:free$0$0131,072
OpenAI: gpt-oss-20b (free)openai/gpt-oss-20b:free$0$0131,072
Z.AI: GLM 4.5 Air (free)z-ai/glm-4.5-air:free$0$0131,072
Qwen: Qwen3 Coder 480B A35B (free)qwen/qwen3-coder:free$0$0262,000
MoonshotAI: Kimi K2 0711 (free)moonshotai/kimi-k2:free$0$032,768
Venice: Uncensored (free)cognitivecomputations/dolphin-mistral-24b-venice-edition:free$0$032,768
Google: Gemma 3n 2B (free)google/gemma-3n-e2b-it:free$0$08,192
TNG: DeepSeek R1T2 Chimera (free)tngtech/deepseek-r1t2-chimera:free$0$0163,840
DeepSeek: R1 0528 (free)deepseek/deepseek-r1-0528:free$0$0163,840
Google: Gemma 3n 4B (free)google/gemma-3n-e4b-it:free$0$08,192
Qwen: Qwen3 4B (free)qwen/qwen3-4b:free$0$040,960
TNG: DeepSeek R1T Chimera (free)tngtech/deepseek-r1t-chimera:free$0$0163,840
Mistral: Mistral Small 3.1 24B (free)mistralai/mistral-small-3.1-24b-instruct:free$0$0128,000
Google: Gemma 3 4B (free)google/gemma-3-4b-it:free$0$032,768
Google: Gemma 3 12B (free)google/gemma-3-12b-it:free$0$032,768
Google: Gemma 3 27B (free)google/gemma-3-27b-it:free$0$0131,072
Google: Gemini 2.0 Flash Experimental (free)google/gemini-2.0-flash-exp:free$0$01,048,576
Meta: Llama 3.3 70B Instruct (free)meta-llama/llama-3.3-70b-instruct:free$0$0131,072
Meta: Llama 3.2 3B Instruct (free)meta-llama/llama-3.2-3b-instruct:free$0$0131,072
Qwen: Qwen2.5-VL 7B Instruct (free)qwen/qwen-2.5-vl-7b-instruct:free$0$032,768
Nous: Hermes 3 405B Instruct (free)nousresearch/hermes-3-llama-3.1-405b:free$0$0131,072
Meta: Llama 3.1 405B Instruct (free)meta-llama/llama-3.1-405b-instruct:free$0$0131,072
Mistral: Mistral 7B Instruct (free)mistralai/mistral-7b-instruct:free$0100% off$0100% off32,768
Thenlper: GTE-Basethenlper/gte-base$0.005$0512
Intfloat: E5-Base-v2intfloat/e5-base-v2$0.005$0512
Sentence Transformers: paraphrase-MiniLM-L6-v2sentence-transformers/paraphrase-minilm-l6-v2$0.005$0512
Sentence Transformers: all-MiniLM-L12-v2sentence-transformers/all-minilm-l12-v2$0.005$0512
BAAI: bge-base-en-v1.5baai/bge-base-en-v1.5$0.005$0512
Sentence Transformers: multi-qa-mpnet-base-dot-v1sentence-transformers/multi-qa-mpnet-base-dot-v1$0.005$0512
Sentence Transformers: all-mpnet-base-v2sentence-transformers/all-mpnet-base-v2$0.005$0512
Sentence Transformers: all-MiniLM-L6-v2sentence-transformers/all-minilm-l6-v2$0.005$0512
Thenlper: GTE-Largethenlper/gte-large$0.01$0512
Intfloat: E5-Large-v2intfloat/e5-large-v2$0.01$0512
Intfloat: Multilingual-E5-Largeintfloat/multilingual-e5-large$0.01$0512
BAAI: bge-large-en-v1.5baai/bge-large-en-v1.5$0.01$0512
BAAI: bge-m3baai/bge-m3$0.01$08,192
Qwen: Qwen3 Embedding 8Bqwen/qwen3-embedding-8b$0.01$032,768
OpenAI: Text Embedding 3 Smallopenai/text-embedding-3-small$0.02$08,192
Qwen: Qwen3 Embedding 4Bqwen/qwen3-embedding-4b$0.02$032,768
OpenAI: gpt-oss-20bopenai/gpt-oss-20b$0.016$0.06131,072
Meta: Llama 3.2 3B Instructmeta-llama/llama-3.2-3b-instruct$0.02$0.02131,072
Meta: Llama 3.1 8B Instructmeta-llama/llama-3.1-8b-instruct$0.02$0.03131,072
Google: Gemma 3 4Bgoogle/gemma-3-4b-it$0.01703$0.0681596,000
Google: Gemma 3n 4Bgoogle/gemma-3n-e4b-it$0.02$0.0432,768
Mistral: Mistral Nemomistralai/mistral-nemo$0.02$0.04131,072
Llama Guard 3 8Bmeta-llama/llama-guard-3-8b$0.02$0.06131,072
IBM: Granite 4.0 Microibm-granite/granite-4.0-h-micro$0.017$0.11131,000
OpenAI: gpt-oss-120bopenai/gpt-oss-120b$0.02$0.10131,072
Nous: DeepHermes 3 Mistral 24B Previewnousresearch/deephermes-3-mistral-24b-preview$0.02$0.1032,768
NousResearch: Hermes 2 Pro – Llama-3 8Bnousresearch/hermes-2-pro-llama-3-8b$0.025$0.088,192
Mistral: Mistral 7B Instructmistralai/mistral-7b-instruct$0.028$0.05432,768
Meta: Llama 3 8B Instructmeta-llama/llama-3-8b-instruct$0.03$0.068,192
Qwen: Qwen2.5 Coder 7B Instructqwen/qwen2.5-coder-7b-instruct$0.03$0.0932,768
Google: Gemma 2 9Bgoogle/gemma-2-9b-it$0.03$0.098,192
Google: Gemma 3 12Bgoogle/gemma-3-12b-it$0.03$0.10131,072
Mistral: Mistral Small 3.1 24Bmistralai/mistral-small-3.1-24b-instruct$0.03$0.11131,072
Mistral: Mistral Small 3mistralai/mistral-small-24b-instruct-2501$0.03$0.1132,768
DeepSeek: R1 Distill Llama 70Bdeepseek/deepseek-r1-distill-llama-70b$0.03$0.11131,072
Qwen2.5 Coder 32B Instructqwen/qwen-2.5-coder-32b-instruct$0.03$0.1132,768
Google: Gemma 3 27Bgoogle/gemma-3-27b-it$0.036$0.064131,072
Mistral: Ministral 3Bmistralai/ministral-3b$0.04$0.04131,072
Sao10K: Llama 3 8B Lunarissao10k/l3-lunaris-8b$0.04$0.058,192
Meta: Llama 3.2 1B Instructmeta-llama/llama-3.2-1b-instruct$0.027$0.2060,000
THUDM: GLM 4.1V 9B Thinkingthudm/glm-4.1v-9b-thinking$0.035$0.13865,536
Qwen: Qwen3 8Bqwen/qwen3-8b$0.035$0.138128,000
Amazon: Nova Micro 1.0amazon/nova-micro-v1$0.035$0.14128,000
Qwen: Qwen2.5 7B Instructqwen/qwen-2.5-7b-instruct$0.04$0.1032,768
Cohere: Command R7B (12-2024)cohere/command-r7b-12-2024$0.0375$0.15128,000
Meta: Llama 3.2 11B Vision Instructmeta-llama/llama-3.2-11b-vision-instruct$0.049$0.049131,072
NVIDIA: Nemotron Nano 9B V2nvidia/nemotron-nano-9b-v2$0.04$0.16131,072
OpenAI: gpt-oss-120b (exacto)openai/gpt-oss-120b:exacto$0.039$0.19131,072
Arcee AI: Trinity Miniarcee-ai/trinity-mini$0.045$0.15131,072
LiquidAI/LFM2-8B-A1Bliquid/lfm2-8b-a1b$0.05$0.1032,768
LiquidAI/LFM2-2.6Bliquid/lfm-2.2-6b$0.05$0.1032,768
Microsoft: Phi 4 Multimodal Instructmicrosoft/phi-4-multimodal-instruct$0.05$0.10131,072
MythoMax 13Bgryphe/mythomax-l2-13b$0.06$0.064,096
DeepSeek: DeepSeek R1 0528 Qwen3 8Bdeepseek/deepseek-r1-0528-qwen3-8b$0.06$0.09128,000
AllenAI: Olmo 2 32B Instructallenai/olmo-2-0325-32b-instruct$0.05$0.20128,000
Qwen: Qwen-Turboqwen/qwen-turbo$0.05$0.201,000,000
Mistral: Devstral 2 2512mistralai/devstral-2512$0.05$0.22262,144
Z.AI: GLM 4.5 Airz-ai/glm-4.5-air$0.05$0.22131,072
Mistral: Devstral Small 2505mistralai/devstral-small-2505$0.06$0.12128,000
Qwen: Qwen3 14Bqwen/qwen3-14b$0.05$0.2240,960
Qwen: Qwen2.5 VL 32B Instructqwen/qwen2.5-vl-32b-instruct$0.05$0.2216,384
Microsoft: Phi 4microsoft/phi-4$0.06$0.1416,384
Mistral: Mistral Small 3.2 24Bmistralai/mistral-small-3.2-24b-instruct$0.06$0.18131,072
Qwen: Qwen3 30B A3Bqwen/qwen3-30b-a3b$0.06$0.2240,960
NVIDIA: Nemotron 3 Nano 30B A3Bnvidia/nemotron-3-nano-30b-a3b$0.06$0.24262,144
Amazon: Nova Lite 1.0amazon/nova-lite-v1$0.06$0.24300,000
Qwen: Qwen3 30B A3B Thinking 2507qwen/qwen3-30b-a3b-thinking-2507$0.051$0.3432,768
OpenAI: GPT-5 Nanoopenai/gpt-5-nano$0.05$0.40400,000
Qwen: Qwen3 Coder 30B A3B Instructqwen/qwen3-coder-30b-a3b-instruct$0.07$0.27160,000
Baidu: ERNIE 4.5 21B A3B Thinkingbaidu/ernie-4.5-21b-a3b-thinking$0.07$0.28131,072
Baidu: ERNIE 4.5 21B A3Bbaidu/ernie-4.5-21b-a3b$0.07$0.28120,000
Mistral: Devstral Small 1.1mistralai/devstral-small$0.07$0.28128,000
Mistral: Mistral Embed 2312mistralai/mistral-embed-2312$0.10$08,192
OpenAI: Text Embedding Ada 002openai/text-embedding-ada-002$0.10$08,192
Qwen: Qwen3 32Bqwen/qwen3-32b$0.08$0.2440,960
ByteDance Seed: Seed 1.6 Flashbytedance-seed/seed-1.6-flash$0.075$0.30262,144
OpenAI: gpt-oss-safeguard-20bopenai/gpt-oss-safeguard-20b$0.075$0.30131,072
Microsoft: Phi 4 Reasoning Plusmicrosoft/phi-4-reasoning-plus$0.07$0.3532,768
Google: Gemini 2.0 Flash Litegoogle/gemini-2.0-flash-lite-001$0.075$0.301,048,576
Mistral: Ministral 3 3B 2512mistralai/ministral-3b-2512$0.10$0.10131,072
Z.AI: GLM 4 32Bz-ai/glm-4-32b$0.10$0.10128,000
Meta: Llama 4 Scoutmeta-llama/llama-4-scout$0.08$0.30327,680
Mistral: Ministral 8Bmistralai/ministral-8b$0.10$0.10131,072
Mistral: Pixtral 12Bmistralai/pixtral-12b$0.10$0.1032,768
Microsoft: Phi-3.5 Mini 128K Instructmicrosoft/phi-3.5-mini-128k-instruct$0.10$0.10128,000
Microsoft: Phi-3 Mini 128K Instructmicrosoft/phi-3-mini-128k-instruct$0.10$0.10128,000
Qwen: Qwen3 30B A3B Instruct 2507qwen/qwen3-30b-a3b-instruct-2507$0.08$0.33262,144
Qwen: Qwen3 235B A22B Instruct 2507qwen/qwen3-235b-a22b-2507$0.071$0.463262,144
AllenAI: Olmo 3 7B Instructallenai/olmo-3-7b-instruct$0.10$0.2065,536
Qwen: Qwen3 Next 80B A3B Instructqwen/qwen3-next-80b-a3b-instruct$0.06$0.60262,144
ByteDance: UI-TARS 7Bbytedance/ui-tars-1.5-7b$0.10$0.20128,000
Mistral: Mistral 7B Instruct v0.1mistralai/mistral-7b-instruct-v0.1$0.11$0.192,824
Mistral: Mistral Small Creativemistralai/mistral-small-creative$0.10$0.3032,768
OpenAI: Text Embedding 3 Largeopenai/text-embedding-3-large$0.13$08,192
Mistral: Voxtral Small 24B 2507mistralai/voxtral-small-24b-2507$0.10$0.3032,000
Qwen: Qwen3 VL 8B Instructqwen/qwen3-vl-8b-instruct$0.08$0.50131,072
Tongyi DeepResearch 30B A3Balibaba/tongyi-deepresearch-30b-a3b$0.09$0.40131,072
Meta: Llama 3.3 70B Instructmeta-llama/llama-3.3-70b-instruct$0.10$0.32131,072
OpenGVLab: InternVL3 78Bopengvlab/internvl3-78b$0.10$0.3932,768
AllenAI: Olmo 3 7B Thinkallenai/olmo-3-7b-think$0.12$0.2065,536
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5nvidia/llama-3.3-nemotron-super-49b-v1.5$0.10$0.40131,072
Google: Gemini 2.5 Flash Lite Preview 09-2025google/gemini-2.5-flash-lite-preview-09-2025$0.10$0.401,048,576
Google: Gemini 2.5 Flash Litegoogle/gemini-2.5-flash-lite$0.10$0.401,048,576
OpenAI: GPT-4.1 Nanoopenai/gpt-4.1-nano$0.10$0.401,047,576
Google: Gemini 2.0 Flashgoogle/gemini-2.0-flash-001$0.10$0.401,048,576
Nous: Hermes 4 70Bnousresearch/hermes-4-70b$0.11$0.38131,072
Google: Gemini Embedding 001google/gemini-embedding-001$0.15$020,000
Mistral: Codestral Embed 2505mistralai/codestral-embed-2505$0.15$08,192
NeverSleep: Lumimaid v0.2 8Bneversleep/llama-3.1-lumimaid-8b$0.09$0.6032,768
Qwen2.5 72B Instructqwen/qwen-2.5-72b-instruct$0.12$0.3932,768
EssentialAI: Rnj 1 Instructessentialai/rnj-1-instruct$0.15$0.1532,768
Mistral: Ministral 3 8B 2512mistralai/ministral-8b-2512$0.15$0.15262,144
DeepSeek: R1 Distill Qwen 14Bdeepseek/deepseek-r1-distill-qwen-14b$0.15$0.1532,768
MiniMax: MiniMax M2.1minimax/minimax-m2.1$0.12$0.48196,608
Qwen: Qwen3 235B A22B Thinking 2507qwen/qwen3-235b-a22b-thinking-2507$0.11$0.60262,144
Qwen: Qwen3 VL 235B A22B Instructqwen/qwen3-vl-235b-a22b-instruct$0.12$0.56262,144
Qwen: QwQ 32Bqwen/qwq-32b$0.15$0.4032,768
Baidu: ERNIE 4.5 VL 28B A3Bbaidu/ernie-4.5-vl-28b-a3b$0.14$0.5630,000
Tencent: Hunyuan A13B Instructtencent/hunyuan-a13b-instruct$0.14$0.57131,072
Arcee AI: Spotlightarcee-ai/spotlight$0.18$0.18131,072
Meta: Llama Guard 4 12Bmeta-llama/llama-guard-4-12b$0.18$0.18163,840
Qwen: Qwen3 VL 30B A3B Instructqwen/qwen3-vl-30b-a3b-instruct$0.15$0.60262,144
Meta: Llama 4 Maverickmeta-llama/llama-4-maverick$0.15$0.601,048,576
Qwen: Qwen2.5 VL 72B Instructqwen/qwen2.5-vl-72b-instruct$0.15$0.6032,768
Cohere: Command R (08-2024)cohere/command-r-08-2024$0.15$0.60128,000
OpenAI: GPT-4o-mini (2024-07-18)openai/gpt-4o-mini-2024-07-18$0.15$0.60128,000
OpenAI: GPT-4o-miniopenai/gpt-4o-mini$0.15$0.60128,000
TheDrummer: Rocinante 12Bthedrummer/rocinante-12b$0.17$0.4332,768
Mistral: Ministral 3 14B 2512mistralai/ministral-14b-2512$0.20$0.20262,144
Qwen: Qwen2.5-VL 7B Instructqwen/qwen-2.5-vl-7b-instruct$0.20$0.2032,768
Mistral: Mistral 7B Instruct v0.3mistralai/mistral-7b-instruct-v0.3$0.20$0.2032,768
Meta: LlamaGuard 2 8Bmeta-llama/llama-guard-2-8b$0.20$0.208,192
Mistral: Mistral 7B Instruct v0.2mistralai/mistral-7b-instruct-v0.2$0.20$0.2032,768
DeepSeek: DeepSeek V3.1deepseek/deepseek-chat-v3.1$0.15$0.7532,768
Qwen: Qwen3 235B A22Bqwen/qwen3-235b-a22b$0.18$0.5440,960
Cogito V2 Preview Llama 109Bdeepcogito/cogito-v2-preview-llama-109b-moe$0.18$0.5932,767
Z.AI: GLM 4.7z-ai/glm-4.7$0.16$0.80202,752
AI21: Jamba Mini 1.7ai21/jamba-mini-1.7$0.20$0.40256,000
DeepSeek: DeepSeek V3.2 Expdeepseek/deepseek-v3.2-exp$0.21$0.32163,840
xAI: Grok 4.1 Fastx-ai/grok-4.1-fast$0.20$0.502,000,000
xAI: Grok 4 Fastx-ai/grok-4-fast$0.20$0.502,000,000
NVIDIA: Nemotron Nano 12B 2 VLnvidia/nemotron-nano-12b-v2-vl$0.20$0.60131,072
Mistral: Sabamistralai/mistral-saba$0.20$0.6032,768
Qwen: Qwen3 Next 80B A3B Thinkingqwen/qwen3-next-80b-a3b-thinking$0.15$1.20262,144
Qwen: Qwen VL Plusqwen/qwen-vl-plus$0.21$0.637,500
Mistral Tinymistralai/mistral-tiny$0.25$0.2532,768
DeepSeek: DeepSeek V3 0324deepseek/deepseek-chat-v3-0324$0.19$0.87163,840
Meituan: LongCat Flash Chatmeituan/longcat-flash-chat$0.20$0.80131,072
DeepSeek: DeepSeek V3.2deepseek/deepseek-v3.2$0.25$0.38163,840
DeepSeek: DeepSeek V3.1 Terminus (exacto)deepseek/deepseek-v3.1-terminus:exacto$0.21$0.79163,840
DeepSeek: DeepSeek V3.1 Terminusdeepseek/deepseek-v3.1-terminus$0.21$0.79163,840
Kwaipilot: KAT-Coder-Pro V1kwaipilot/kat-coder-pro$0.20731% off$0.82831% off256,000
DeepSeek: R1 Distill Qwen 32Bdeepseek/deepseek-r1-distill-qwen-32b$0.27$0.27131,072
MiniMax: MiniMax M2minimax/minimax-m2$0.20$1196,608
Qwen: Qwen3 VL 30B A3B Thinkingqwen/qwen3-vl-30b-a3b-thinking$0.20$1131,072
Prime Intellect: INTELLECT-3prime-intellect/intellect-3$0.20$1.10131,072
MiniMax: MiniMax-01minimax/minimax-01$0.20$1.101,000,192
DeepSeek: DeepSeek V3.2 Specialedeepseek/deepseek-v3.2-speciale$0.27$0.41163,840
Qwen: Qwen3 Coder 480B A35Bqwen/qwen3-coder$0.22$0.95262,144
Nous: Hermes 3 70B Instructnousresearch/hermes-3-llama-3.1-70b$0.30$0.3065,536
TNG: R1T Chimeratngtech/tng-r1t-chimera$0.25$0.85163,840
TNG: DeepSeek R1T2 Chimeratngtech/deepseek-r1t2-chimera$0.25$0.85163,840
Meta: Llama 3 70B Instructmeta-llama/llama-3-70b-instruct$0.30$0.408,192
TheDrummer: Cydonia 24B V4.1thedrummer/cydonia-24b-v4.1$0.30$0.50131,072
xAI: Grok Code Fast 1x-ai/grok-code-fast-1$0.20$1.50256,000
Inception: Mercuryinception/mercury$0.25$1128,000
xAI: Grok 3 Minix-ai/grok-3-mini$0.30$0.50131,072
Inception: Mercury Coderinception/mercury-coder$0.25$1128,000
xAI: Grok 3 Mini Betax-ai/grok-3-mini-beta$0.30$0.50131,072
MoonshotAI: Kimi K2 Thinkingmoonshotai/kimi-k2-thinking$0.32$0.48262,144
Anthropic: Claude 3 Haikuanthropic/claude-3-haiku$0.25$1.25200,000
Z.AI: GLM 4.6Vz-ai/glm-4.6v$0.30$0.90131,072
Qwen: Qwen3 VL 8B Thinkingqwen/qwen3-vl-8b-thinking$0.18$2.10256,000
Mistral: Codestral 2508mistralai/codestral-2508$0.30$0.90256,000
Baidu: ERNIE 4.5 300B A47Bbaidu/ernie-4.5-300b-a47b$0.28$1.10123,000
Meta: Llama 3.2 90B Vision Instructmeta-llama/llama-3.2-90b-vision-instruct$0.35$0.4032,768
Qwen: Qwen3 Coder 480B A35B (exacto)qwen/qwen3-coder:exacto$0.22$1.80262,144
MoonshotAI: Kimi Dev 72Bmoonshotai/kimi-dev-72b$0.29$1.15131,072
TNG: DeepSeek R1T Chimeratngtech/deepseek-r1t-chimera$0.30$1.20163,840
DeepSeek: DeepSeek V3deepseek/deepseek-chat$0.30$1.20163,840
TheDrummer: UnslopNemo 12Bthedrummer/unslopnemo-12b$0.40$0.4032,768
Meta: Llama 3.1 70B Instructmeta-llama/llama-3.1-70b-instruct$0.40$0.40131,072
ByteDance Seed: Seed 1.6bytedance-seed/seed-1.6$0.25$2262,144
OpenAI: GPT-5.1-Codex-Miniopenai/gpt-5.1-codex-mini$0.25$2400,000
Qwen: Qwen3 Coder Flashqwen/qwen3-coder-flash$0.30$1.50128,000
OpenAI: GPT-5 Miniopenai/gpt-5-mini$0.25$2400,000
Z.AI: GLM 4.6z-ai/glm-4.6$0.35$1.50202,752
Z.AI: GLM 4.5z-ai/glm-4.5$0.35$1.55131,072
ReMM SLERP 13Bundi95/remm-slerp-l2-13b$0.45$0.656,144
Qwen: Qwen Plus 0728qwen/qwen-plus-2025-07-28$0.40$1.201,000,000
Qwen: Qwen-Plusqwen/qwen-plus$0.40$1.20131,072
WizardLM-2 8x22Bmicrosoft/wizardlm-2-8x22b$0.48$0.4865,536
Baidu: ERNIE 4.5 VL 424B A47Bbaidu/ernie-4.5-vl-424b-a47b$0.42$1.25123,000
Amazon: Nova 2 Liteamazon/nova-2-lite-v1$0.30$2.501,000,000
Google: Gemini 2.5 Flash Image (Nano Banana)google/gemini-2.5-flash-image$0.30$2.5032,768
Google: Gemini 2.5 Flash Preview 09-2025google/gemini-2.5-flash-preview-09-2025$0.30$2.501,048,576
Google: Gemini 2.5 Flash Image Preview (Nano Banana)google/gemini-2.5-flash-image-preview$0.30$2.5032,768
Google: Gemini 2.5 Flashgoogle/gemini-2.5-flash$0.30$2.501,048,576
OpenAI: GPT-4.1 Miniopenai/gpt-4.1-mini$0.40$1.601,047,576
DeepSeek: R1 0528deepseek/deepseek-r1-0528$0.40$1.75163,840
MoonshotAI: Kimi K2 0905moonshotai/kimi-k2-0905$0.39$1.90262,144
Arcee AI: Coder Largearcee-ai/coder-large$0.50$0.8032,768
Mistral: Mixtral 8x7B Instructmistralai/mixtral-8x7b-instruct$0.54$0.5432,768
Mistral: Mistral Medium 3.1mistralai/mistral-medium-3.1$0.40$2131,072
Mistral: Devstral Mediummistralai/devstral-medium$0.40$2131,072
Mistral: Mistral Medium 3mistralai/mistral-medium-3$0.40$2131,072
Z.AI: GLM 4.6 (exacto)z-ai/glm-4.6:exacto$0.44$1.76204,800
MiniMax: MiniMax M1minimax/minimax-m1$0.40$2.201,000,000
TheDrummer: Skyfall 36B V2thedrummer/skyfall-36b-v2$0.55$0.8032,768
Mistral: Mistral Large 3 2512mistralai/mistral-large-2512$0.50$1.50262,144
Qwen: Qwen3 VL 32B Instructqwen/qwen3-vl-32b-instruct$0.50$1.50262,144
OpenAI: GPT-3.5 Turboopenai/gpt-3.5-turbo$0.50$1.5016,385
StepFun: Step3stepfun-ai/step3$0.57$1.4265,536
Google: Gemma 2 27Bgoogle/gemma-2-27b-it$0.65$0.658,192
DeepSeek: DeepSeek Prover V2deepseek/deepseek-prover-v2$0.50$2.18163,840
Sourceful: Riverflow V2 Fast Previewsourceful/riverflow-v2-fast-preview$0$7.198,192
Sao10K: Llama 3.3 Euryale 70Bsao10k/l3.3-euryale-70b$0.65$0.75131,072
Sao10K: Llama 3.1 Euryale 70B v2.2sao10k/l3.1-euryale-70b$0.65$0.7532,768
MoonshotAI: Kimi K2 0711moonshotai/kimi-k2$0.50$2.40131,072
Z.AI: GLM 4.5Vz-ai/glm-4.5v$0.60$1.8065,536
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1nvidia/llama-3.1-nemotron-ultra-253b-v1$0.60$1.80131,072
Google: Gemini 3 Flash Previewgoogle/gemini-3-flash-preview$0.50$31,048,576
Qwen: Qwen3 VL 235B A22B Thinkingqwen/qwen3-vl-235b-a22b-thinking$0.45$3.50262,144
Qwen: Qwen Plus 0728 (thinking)qwen/qwen-plus-2025-07-28:thinking$0.40$41,000,000
Sourceful: Riverflow V2 Standard Previewsourceful/riverflow-v2-standard-preview$0$8.388,192
AionLabs: Aion-1.0-Miniaion-labs/aion-1.0-mini$0.70$1.40131,072
MoonshotAI: Kimi K2 0905 (exacto)moonshotai/kimi-k2-0905:exacto$0.60$2.50262,144
Mancer: Weaver (alpha)mancer/weaver$0.75$18,000
Arcee AI: Virtuoso Largearcee-ai/virtuoso-large$0.75$1.20131,072
Morph: Morph V3 Fastmorph/morph-v3-fast$0.80$1.2081,920
EleutherAI: Llemma 7beleutherai/llemma_7b$0.80$1.204,096
AlfredPros: CodeLLaMa 7B Instruct Solidityalfredpros/codellama-7b-instruct-solidity$0.80$1.204,096
DeepSeek: R1deepseek/deepseek-r1$0.70$2.40163,840
ByteDance Seed: Seedream 4.5bytedance-seed/seedream-4.5$0$9.5814,096
AionLabs: Aion-RP 1.0 (8B)aion-labs/aion-rp-llama-3.1-8b$0.80$1.6032,768
Deep Cogito: Cogito V2 Preview Llama 70Bdeepcogito/cogito-v2-preview-llama-70b$0.88$0.8832,768
Relace: Relace Apply 3relace/relace-apply-3$0.85$1.25256,000
Morph: Morph V3 Largemorph/morph-v3-large$0.90$1.90262,144
Nous: Hermes 3 405B Instructnousresearch/hermes-3-llama-3.1-405b$1$1131,072
Microsoft: Phi-3 Medium 128K Instructmicrosoft/phi-3-medium-128k-instruct$1$1128,000
Qwen: Qwen VL Maxqwen/qwen-vl-max$0.80$3.20131,072
Amazon: Nova Pro 1.0amazon/nova-pro-v1$0.80$3.20300,000
Noromaid 20Bneversleep/noromaid-20b$1$1.754,096
Switchpoint Routerswitchpoint/router$0.85$3.40131,072
Anthropic: Claude 3.5 Haiku (2024-10-22)anthropic/claude-3.5-haiku-20241022$0.80$4200,000
Anthropic: Claude 3.5 Haikuanthropic/claude-3.5-haiku$0.80$4200,000
OpenAI: GPT-3.5 Turbo (older v0613)openai/gpt-3.5-turbo-0613$1$24,095
Arcee AI: Maestro Reasoningarcee-ai/maestro-reasoning$0.90$3.30131,072
Relace: Relace Searchrelace/relace-search$1$3256,000
Nous: Hermes 4 405Bnousresearch/hermes-4-405b$1$3131,072
NVIDIA: Llama 3.1 Nemotron 70B Instructnvidia/llama-3.1-nemotron-70b-instruct$1.20$1.20131,072
Deep Cogito: Cogito v2.1 671Bdeepcogito/cogito-v2.1-671b$1.25$1.25128,000
Anthropic: Claude Haiku 4.5anthropic/claude-haiku-4.5$1$5200,000
Qwen: Qwen3 Coder Plusqwen/qwen3-coder-plus$1$5128,000
OpenAI: o4 Mini Highopenai/o4-mini-high$1.10$4.40200,000
OpenAI: o4 Miniopenai/o4-mini$1.10$4.40200,000
OpenAI: o3 Mini Highopenai/o3-mini-high$1.10$4.40200,000
OpenAI: o3 Miniopenai/o3-mini$1.10$4.40200,000
Sao10k: Llama 3 Euryale 70B v2.1sao10k/l3-euryale-70b$1.48$1.488,192
OpenAI: GPT-3.5 Turbo Instructopenai/gpt-3.5-turbo-instruct$1.50$24,095
Sourceful: Riverflow V2 Max Previewsourceful/riverflow-v2-max-preview$0$17.968,192
Qwen: Qwen3 Maxqwen/qwen3-max$1.20$6256,000
OpenAI: Codex Miniopenai/codex-mini$1.50$6200,000
Qwen: Qwen-Maxqwen/qwen-max$1.60$6.4032,768
OpenAI: GPT-5.1-Codex-Maxopenai/gpt-5.1-codex-max$1.25$10400,000
OpenAI: GPT-5.1openai/gpt-5.1$1.25$10400,000
OpenAI: GPT-5.1 Chatopenai/gpt-5.1-chat$1.25$10128,000
OpenAI: GPT-5.1-Codexopenai/gpt-5.1-codex$1.25$10400,000
OpenAI: GPT-5 Codexopenai/gpt-5-codex$1.25$10400,000
OpenAI: GPT-5 Chatopenai/gpt-5-chat$1.25$10128,000
OpenAI: GPT-5openai/gpt-5$1.25$10400,000
Google: Gemini 2.5 Progoogle/gemini-2.5-pro$1.25$101,048,576
Google: Gemini 2.5 Pro Preview 06-05google/gemini-2.5-pro-preview$1.25$101,048,576
Google: Gemini 2.5 Pro Preview 05-06google/gemini-2.5-pro-preview-05-06$1.25$101,048,576
Mistral Large 2411mistralai/mistral-large-2411$2$6131,072
Mistral Large 2407mistralai/mistral-large-2407$2$6131,072
Mistral: Pixtral Large 2411mistralai/pixtral-large-2411$2$6131,072
Mistral: Mixtral 8x22B Instructmistralai/mixtral-8x22b-instruct$2$665,536
Mistral Largemistralai/mistral-large$2$6128,000
OpenAI: GPT-5 Image Miniopenai/gpt-5-image-mini$2.50$2400,000
OpenAI: o4 Mini Deep Researchopenai/o4-mini-deep-research$2$8200,000
AI21: Jamba Large 1.7ai21/jamba-large-1.7$2$8256,000
OpenAI: o3openai/o3$2$8200,000
OpenAI: GPT-4.1openai/gpt-4.1$2$81,047,576
Perplexity: Sonar Reasoning Properplexity/sonar-reasoning-pro$2$8128,000
Perplexity: Sonar Deep Researchperplexity/sonar-deep-research$2$8128,000
OpenAI: GPT-5.2 Chatopenai/gpt-5.2-chat$1.75$14128,000
OpenAI: GPT-5.2openai/gpt-5.2$1.75$14400,000
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google/gemini-3-pro-image-preview$2$1265,536
Google: Gemini 3 Pro Previewgoogle/gemini-3-pro-preview$2$121,048,576
Sao10K: Llama 3.1 70B Hanami x1sao10k/l3.1-70b-hanami-x1$3$316,000
OpenAI: GPT-3.5 Turbo 16kopenai/gpt-3.5-turbo-16k$3$416,385
OpenAI: GPT-4o Audioopenai/gpt-4o-audio-preview$2.50$10128,000
Cohere: Command Acohere/command-a$2.50$10256,000
OpenAI: GPT-4o (2024-11-20)openai/gpt-4o-2024-11-20$2.50$10128,000
Magnum v4 72Banthracite-org/magnum-v4-72b$3$516,384
Inflection: Inflection 3 Piinflection/inflection-3-pi$2.50$108,000
Inflection: Inflection 3 Productivityinflection/inflection-3-productivity$2.50$108,000
Cohere: Command R+ (08-2024)cohere/command-r-plus-08-2024$2.50$10128,000
OpenAI: GPT-4o (2024-08-06)openai/gpt-4o-2024-08-06$2.50$10128,000
OpenAI: GPT-4oopenai/gpt-4o$2.50$10128,000
Amazon: Nova Premier 1.0amazon/nova-premier-v1$2.50$12.501,000,000
Deep Cogito: Cogito V2 Preview Llama 405Bdeepcogito/cogito-v2-preview-llama-405b$3.50$3.5032,768
Meta: Llama 3.1 405B Instructmeta-llama/llama-3.1-405b-instruct$3.50$3.5010,000
Black Forest Labs: FLUX.2 Problack-forest-labs/flux.2-pro$3.66$3.6646,864
Meta: Llama 3.1 405B (base)meta-llama/llama-3.1-405b$4$432,768
Anthropic: Claude Sonnet 4.5anthropic/claude-sonnet-4.5$3$151,000,000
xAI: Grok 4x-ai/grok-4$3$15256,000
xAI: Grok 3x-ai/grok-3$3$15131,072
Anthropic: Claude Sonnet 4anthropic/claude-sonnet-4$3$151,000,000
xAI: Grok 3 Betax-ai/grok-3-beta$3$15131,072
Perplexity: Sonar Properplexity/sonar-pro$3$15200,000
Anthropic: Claude 3.7 Sonnet (thinking)anthropic/claude-3.7-sonnet:thinking$3$15200,000
Anthropic: Claude 3.7 Sonnetanthropic/claude-3.7-sonnet$3$15200,000
AionLabs: Aion-1.0aion-labs/aion-1.0$4$8131,072
SorcererLM 8x22Braifle/sorcererlm-8x22b$4.50$4.5016,000
Perplexity: Sonarperplexity/sonar$1$1127,072
Perplexity: Sonar Reasoningperplexity/sonar-reasoning$1$5127,000
OpenAI: ChatGPT-4oopenai/chatgpt-4o-latest$5$15128,000
OpenAI: GPT-4o (2024-05-13)openai/gpt-4o-2024-05-13$5$15128,000
Goliath 120Balpindale/goliath-120b$6$86,144
Anthropic: Claude Opus 4.5anthropic/claude-opus-4.5$5$25200,000
OpenAI: GPT-4o (extended)openai/gpt-4o:extended$6$18128,000
Black Forest Labs: FLUX.2 Maxblack-forest-labs/flux.2-max$7.32$7.3246,864
Anthropic: Claude 3.5 Sonnetanthropic/claude-3.5-sonnet$6$30200,000
OpenAI: GPT-5 Imageopenai/gpt-5-image$10$10400,000
OpenAI: GPT-4 Turboopenai/gpt-4-turbo$10$30128,000
OpenAI: GPT-4 Turbo Previewopenai/gpt-4-turbo-preview$10$30128,000
OpenAI: GPT-4 Turbo (older v1106)openai/gpt-4-1106-preview$10$30128,000
OpenAI: o3 Deep Researchopenai/o3-deep-research$10$40200,000
Black Forest Labs: FLUX.2 Flexblack-forest-labs/flux.2-flex$14.64$14.6467,344
OpenAI: o1openai/o1$15$60200,000
Perplexity: Sonar Pro Searchperplexity/sonar-pro-search$3$15200,000
Anthropic: Claude Opus 4.1anthropic/claude-opus-4.1$15$75200,000
Anthropic: Claude Opus 4anthropic/claude-opus-4$15$75200,000
Anthropic: Claude 3 Opusanthropic/claude-3-opus$15$75200,000
OpenAI: GPT-5 Proopenai/gpt-5-pro$15$120400,000
OpenAI: GPT-4o-mini Search Previewopenai/gpt-4o-mini-search-preview$0.15$0.60128,000
OpenAI: o3 Proopenai/o3-pro$20$80200,000
OpenAI: GPT-4 (older v0314)openai/gpt-4-0314$30$608,191
OpenAI: GPT-4openai/gpt-4$30$608,191
OpenAI: GPT-5.2 Proopenai/gpt-5.2-pro$21$168400,000
OpenAI: GPT-4o Search Previewopenai/gpt-4o-search-preview$2.50$10128,000
OpenAI: o1-proopenai/o1-pro$150$600200,000
Body Builder (beta)openrouter/bodybuilder128,000
Bert-Nebulon Alphaopenrouter/bert-nebulon-alpha256,000
Sherlock Dash Alphaopenrouter/sherlock-dash-alpha1,840,000
Sherlock Think Alphaopenrouter/sherlock-think-alpha1,840,000
Polaris Alphaopenrouter/polaris-alpha256,000
Qwen: Qwen3 Embedding 0.6Bqwen/qwen3-embedding-0.6b8,192
Andromeda Alphaopenrouter/andromeda-alpha128,000
Arcee AI: AFM 4.5Barcee-ai/afm-4.5b65,536
Sonoma Dusk Alphaopenrouter/sonoma-dusk-alpha2,000,000
Sonoma Sky Alphaopenrouter/sonoma-sky-alpha2,000,000
ByteDance: Seed OSS 36B Instructbytedance/seed-oss-36b-instruct131,072
Deep Cogito: Cogito V2 Preview Deepseek 671Bdeepcogito/cogito-v2-preview-deepseek-671b131,072
DeepSeek: DeepSeek V3.1 Basedeepseek/deepseek-v3.1-base163,840
Horizon Betaopenrouter/horizon-beta256,000
Horizon Alphaopenrouter/horizon-alpha256,000
Cypher Alphaopenrouter/cypher-alpha1,000,000
Morph: Fast Applymorph/morph-v232,000
Mistral: Magistral Small 2506mistralai/magistral-small-250640,000
Mistral: Magistral Medium 2506mistralai/magistral-medium-250640,960
SentientAGI: Dobby Mini Plus Llama 3.1 8Bsentientagi/dobby-mini-unhinged-plus-llama-3.1-8b131,072
DeepSeek: R1 Distill Qwen 7Bdeepseek/deepseek-r1-distill-qwen-7b131,072
Google: Gemma 1 2Bgoogle/gemma-2b-it8,192
Sarvam AI: Sarvam-Msarvamai/sarvam-m32,768
TheDrummer: Valkyrie 49B V1thedrummer/valkyrie-49b-v1131,072
Meta: Llama 3.3 8B Instructmeta-llama/llama-3.3-8b-instruct128,000
Arcee AI: Caller Largearcee-ai/caller-large32,768
Arcee AI: Virtuoso Medium V2arcee-ai/virtuoso-medium-v2131,072
Arcee AI: Arcee Blitzarcee-ai/arcee-blitz32,768
Microsoft: Phi 4 Reasoningmicrosoft/phi-4-reasoning32,768
Qwen: Qwen3 0.6Bqwen/qwen3-0.6b-04-2832,000
Qwen: Qwen3 1.7Bqwen/qwen3-1.7b32,000
OpenGVLab: InternVL3 14Bopengvlab/internvl3-14b32,000
OpenGVLab: InternVL3 2Bopengvlab/internvl3-2b32,000
THUDM: GLM Z1 Rumination 32Bthudm/glm-z1-rumination-32b32,000
THUDM: GLM Z1 9Bthudm/glm-z1-9b32,000
THUDM: GLM 4 9Bthudm/glm-4-9b32,000
Microsoft: MAI DS R1microsoft/mai-ds-r1163,840
THUDM: GLM Z1 32Bthudm/glm-z1-32b32,768
THUDM: GLM 4 32Bthudm/glm-4-32b32,768
ArliAI: QwQ 32B RpR v1arliai/qwq-32b-arliai-rpr-v132,768
Agentica: Deepcoder 14B Previewagentica-org/deepcoder-14b-preview96,000
MoonshotAI: Kimi VL A3B Thinkingmoonshotai/kimi-vl-a3b-thinking131,072
Optimus Alphaopenrouter/optimus-alpha1,000,000
NVIDIA: Llama 3.1 Nemotron Nano 8B v1nvidia/llama-3.1-nemotron-nano-8b-v1131,072
NVIDIA: Llama 3.3 Nemotron Super 49B v1nvidia/llama-3.3-nemotron-super-49b-v1131,072
Swallow: Llama 3.1 Swallow 8B Instruct V0.3tokyotech-llm/llama-3.1-swallow-8b-instruct-v0.316,384
Quasar Alphaopenrouter/quasar-alpha1,000,000
OpenHands LM 32B V0.1all-hands/openhands-lm-32b-v0.1131,072
DeepSeek: DeepSeek V3 Basedeepseek/deepseek-v3-base131,072
Typhoon2 8B Instructscb10x/llama3.1-typhoon2-8b-instruct8,192
Typhoon2 70B Instructscb10x/llama3.1-typhoon2-70b-instruct8,192
Bytedance: UI-TARS 72Bbytedance-research/ui-tars-72b32,768
Qwen: Qwen2.5 VL 3B Instructqwen/qwen2.5-vl-3b-instruct64,000
Google: Gemini 2.5 Pro Experimentalgoogle/gemini-2.5-pro-exp-03-251,048,576
Qrwkv 72Bfeatherless/qwerky-72b32,768
OlympicCoder 32Bopen-r1/olympiccoder-32b32,768
SteelSkull: L3.3 Electra R1 70Bsteelskull/l3.3-electra-r1-70b128,000
Google: Gemma 3 1Bgoogle/gemma-3-1b-it32,000
AI21: Jamba 1.6 Largeai21/jamba-1.6-large256,000
AI21: Jamba Mini 1.6ai21/jamba-1.6-mini256,000
Reka: Flash 3rekaai/reka-flash-332,000
LatitudeGames: Wayfarer Large 70B Llama 3.3latitudegames/wayfarer-large-70b-llama-3.3128,000
DeepSeek: DeepSeek R1 Zerodeepseek/deepseek-r1-zero163,840
Qwen: Qwen2.5 32B Instructqwen/qwen2.5-32b-instruct131,072
MoonshotAI: Moonlight 16B A3B Instructmoonshotai/moonlight-16b-a3b-instruct8,192
Nous: DeepHermes 3 Llama 3 8B Previewnousresearch/deephermes-3-llama-3-8b-preview131,072
OpenAI: GPT-4.5 (Preview)openai/gpt-4.5-preview128,000
Perplexity: R1 1776perplexity/r1-1776128,000
Dolphin3.0 R1 Mistral 24Bcognitivecomputations/dolphin3.0-r1-mistral-24b32,768
Dolphin3.0 Mistral 24Bcognitivecomputations/dolphin3.0-mistral-24b32,768
Llama 3.1 Tulu 3 405Ballenai/llama-3.1-tulu-3-405b
DeepSeek: R1 Distill Llama 8Bdeepseek/deepseek-r1-distill-llama-8b
DeepSeek: R1 Distill Qwen 1.5Bdeepseek/deepseek-r1-distill-qwen-1.5b131,072
Liquid: LFM 7Bliquid/lfm-7b32,768
Liquid: LFM 3Bliquid/lfm-3b32,768
Mistral: Codestral 2501mistralai/codestral-2501256,000
Inflatebot: Mag Mell R1 12Binflatebot/mn-mag-mell-r132,000
EVA Llama 3.33 70Beva-unit-01/eva-llama-3.33-70b16,384
xAI: Grok 2 Vision 1212x-ai/grok-2-vision-121232,768
xAI: Grok 2 1212x-ai/grok-2-1212131,072
Qwen: QwQ 32B Previewqwen/qwq-32b-preview32,768
Google: Gemini Experimental 1121google/gemini-exp-112140,960
EVA Qwen2.5 72Beva-unit-01/eva-qwen-2.5-72b32,000
xAI: Grok Vision Betax-ai/grok-vision-beta8,192
Google: Gemini Experimental 1114google/gemini-exp-111440,960
Infermatic: Mistral Nemo Inferor 12Binfermatic/mn-inferor-12b32,000
EVA Qwen2.5 32Beva-unit-01/eva-qwen-2.5-32b32,000
NeverSleep: Lumimaid v0.2 70Bneversleep/llama-3.1-lumimaid-70b131,072
xAI: Grok Betax-ai/grok-beta131,072
xAI: Grok 2x-ai/grok-232,768
xAI: Grok 2 minix-ai/grok-2-mini32,768
Google: Gemini 1.5 Flash 8Bgoogle/gemini-flash-1.5-8b1,000,000
Liquid: LFM 40B MoEliquid/lfm-40b32,768
EVA Qwen2.5 14Beva-unit-01/eva-qwen-2.5-14b32,768
Magnum v2 72Banthracite-org/magnum-v2-72b32,768
OpenAI: o1-mini (2024-09-12)openai/o1-mini-2024-09-12128,000
OpenAI: o1-miniopenai/o1-mini128,000
OpenAI: o1-preview (2024-09-12)openai/o1-preview-2024-09-12128,000
OpenAI: o1-previewopenai/o1-preview128,000
Reflection 70Bmattshumer/reflection-70b131,072
Google: Gemini 1.5 Flash Experimentalgoogle/gemini-flash-1.5-exp1,000,000
Lynn: Llama 3 Soliloquy 7B v3 32Klynn/soliloquy-v332,768
AI21: Jamba 1.5 Miniai21/jamba-1-5-mini256,000
Yi 1.5 34B Chat01-ai/yi-1.5-34b-chat4,096
AI21: Jamba 1.5 Largeai21/jamba-1-5-large256,000
Aetherwiing: Starcannon 12Baetherwiing/mn-starcannon-12b12,000
01.AI: Yi Vision01-ai/yi-vision16,384
01.AI: Yi Large FC01-ai/yi-large-fc16,384
01.AI: Yi Large Turbo01-ai/yi-large-turbo4,096
Mistral Nemo 12B Celestenothingiisreal/mn-celeste-12b32,000
Perplexity: Llama 3.1 Sonar 70B Onlineperplexity/llama-3.1-sonar-large-128k-online127,072
Perplexity: Llama 3.1 Sonar 8B Onlineperplexity/llama-3.1-sonar-small-128k-online127,072
Google: Gemini 1.5 Pro Experimentalgoogle/gemini-pro-1.5-exp1,000,000
Dolphin Llama 3 70B 🐬cognitivecomputations/dolphin-llama-3-70b8,192
Mistral: Codestral Mambamistralai/codestral-mamba256,000
Qwen 2 7B Instructqwen/qwen-2-7b-instruct32,768
Magnum 72Balpindale/magnum-72b16,384
Nous: Hermes 2 Theta 8Bnousresearch/hermes-2-theta-llama-3-8b16,384
Sao10K: Llama 3 Stheno 8B v3.3 32Ksao10k/l3-stheno-8b32,000
AI21: Jamba Instructai21/jamba-instruct256,000
01.AI: Yi Large01-ai/yi-large32,768
NVIDIA: Nemotron-4 340B Instructnvidia/nemotron-4-340b-instruct4,096
Anthropic: Claude 3.5 Sonnet (2024-06-20)anthropic/claude-3.5-sonnet-20240620200,000
Microsoft: Phi-3 Medium 4K Instructmicrosoft/phi-3-medium-4k-instruct4,000
StarCoder2 15B Instructbigcode/starcoder2-15b-instruct16,384
Dolphin 2.9.2 Mixtral 8x22B 🐬cognitivecomputations/dolphin-mixtral-8x22b65,536
Qwen 2 72B Instructqwen/qwen-2-72b-instruct32,768
OpenChat 3.6 8Bopenchat/openchat-8b8,192
NeverSleep: Llama 3 Lumimaid 70Bneversleep/llama-3-lumimaid-70b8,192
Perplexity: Llama3 Sonar 70Bperplexity/llama-3-sonar-large-32k-chat32,768
Perplexity: Llama3 Sonar 8B Onlineperplexity/llama-3-sonar-small-32k-online28,000
Perplexity: Llama3 Sonar 8Bperplexity/llama-3-sonar-small-32k-chat32,768
DeepSeek V2.5deepseek/deepseek-chat-v2.5128,000
Perplexity: Llama3 Sonar 70B Onlineperplexity/llama-3-sonar-large-32k-online28,000
Google: Gemini 1.5 Flashgoogle/gemini-flash-1.51,000,000
Meta: Llama 3 70B (Base)meta-llama/llama-3-70b8,192
Meta: Llama 3 8B (Base)meta-llama/llama-3-8b8,192
LLaVA v1.6 34Bliuhaotian/llava-yi-34b4,096
OLMo 7B Instructallenai/olmo-7b-instruct2,048
Qwen 1.5 4B Chatqwen/qwen-4b-chat32,768
Qwen 1.5 7B Chatqwen/qwen-7b-chat32,768
Qwen 1.5 14B Chatqwen/qwen-14b-chat32,768
Qwen 1.5 32B Chatqwen/qwen-32b-chat32,768
Qwen 1.5 72B Chatqwen/qwen-72b-chat32,768
Qwen 1.5 110B Chatqwen/qwen-110b-chat32,768
NeverSleep: Llama 3 Lumimaid 8Bneversleep/llama-3-lumimaid-8b24,576
Snowflake: Arctic Instructsnowflake/snowflake-arctic-instruct4,096
Fireworks: FireLLaVA 13Bfireworks/firellava-13b4,096
Lynn: Llama 3 Soliloquy 8B v2lynn/soliloquy-l324,576
Fimbulvetr 11B v2sao10k/fimbulvetr-11b-v28,192
WizardLM-2 7Bmicrosoft/wizardlm-2-7b32,000
Zephyr 141B-A35Bhuggingfaceh4/zephyr-orpo-141b-a35b65,536
Mistral: Mixtral 8x22B (base)mistralai/mixtral-8x22b65,536
Google: Gemini 1.5 Progoogle/gemini-pro-1.52,000,000
Cohere: Command R+cohere/command-r-plus128,000
Cohere: Command R+ (04-2024)cohere/command-r-plus-04-2024128,000
Databricks: DBRX 132B Instructdatabricks/dbrx-instruct32,768
Midnight Rose 70Bsophosympatheia/midnight-rose-70b4,096
Cohere: Command Rcohere/command-r128,000
Cohere: Commandcohere/command4,096
Anthropic: Claude 3 Sonnetanthropic/claude-3-sonnet200,000
Cohere: Command R (03-2024)cohere/command-r-03-2024128,000
Google: Gemma 7Bgoogle/gemma-7b-it8,192
Nous: Hermes 2 Mistral 7B DPOnousresearch/nous-hermes-2-mistral-7b-dpo8,192
Meta: CodeLlama 70B Instructmeta-llama/codellama-70b-instruct2,048
RWKV v5: Eagle 7Brecursal/eagle-7b10,000
Yi 34B 200K01-ai/yi-34b-200k200,000
Nous: Hermes 2 Mixtral 8x7B SFTnousresearch/nous-hermes-2-mixtral-8x7b-sft32,768
Nous: Hermes 2 Mixtral 8x7B DPOnousresearch/nous-hermes-2-mixtral-8x7b-dpo32,768
Mistral Smallmistralai/mistral-small32,000
Mistral Mediummistralai/mistral-medium32,000
Bagel 34B v0.2jondurbin/bagel-34b200,000
Noromaid Mixtral 8x7B Instructneversleep/noromaid-mixtral-8x7b-instruct8,000
Nous: Hermes 2 Yi 34Bnousresearch/nous-hermes-yi-34b4,096
Dolphin 2.6 Mixtral 8x7B 🐬cognitivecomputations/dolphin-mixtral-8x7b32,768
RWKV v5 3B AI Townrecursal/rwkv-5-3b-ai-town10,000
RWKV v5 World 3Brwkv/rwkv-5-world-3b10,000
StripedHyena Hessian 7B (base)togethercomputer/stripedhyena-hessian-7b32,768
StripedHyena Nous 7Btogethercomputer/stripedhyena-nous-7b32,768
Psyfighter v2 13Bkoboldai/psyfighter-13b-24,096
Nous: Hermes 2 Vision 7B (alpha)nousresearch/nous-hermes-2-vision-7b4,096
MythoMist 7Bgryphe/mythomist-7b32,768
Yi 6B (base)01-ai/yi-6b4,096
Yi 34B Chat01-ai/yi-34b-chat4,096
Yi 34B (base)01-ai/yi-34b4,096
Cinematika 7B (alpha)openrouter/cinematika-7b8,000
Nous: Capybara 7Bnousresearch/nous-capybara-7b8,192
Psyfighter 13Bjebcarter/psyfighter-13b4,096
OpenChat 3.5 7Bopenchat/openchat-7b8,192
Neural Chat 7B v3.1intel/neural-chat-7b4,096
Anthropic: Claude Instant v1.1anthropic/claude-instant-1.1100,000
Anthropic: Claude v2anthropic/claude-2200,000
Anthropic: Claude v2.1anthropic/claude-2.1200,000
OpenHermes 2.5 Mistral 7Bteknium/openhermes-2.5-mistral-7b4,096
LLaVA 13Bliuhaotian/llava-13b2,048
Nous: Capybara 34Bnousresearch/nous-capybara-34b200,000
OpenAI: GPT-4 Visionopenai/gpt-4-vision-preview128,000
lzlv 70Blizpreciatior/lzlv-70b-fp16-hf4,096
Toppy M 7Bundi95/toppy-m-7b4,096
Auto Routeropenrouter/auto2,000,000
OpenAI: GPT-3.5 Turbo 16k (older v1106)openai/gpt-3.5-turbo-110616,385
Google: PaLM 2 Code Chat 32kgoogle/palm-2-codechat-bison-32k32,760
Google: PaLM 2 Chat 32kgoogle/palm-2-chat-bison-32k32,760
OpenHermes 2 Mistral 7Bteknium/openhermes-2-mistral-7b8,192
Mistral OpenOrca 7Bopen-orca/mistral-7b-openorca8,192
Airoboros 70Bjondurbin/airoboros-l2-70b4,096
Nous: Hermes 70Bnousresearch/nous-hermes-llama2-70b4,096
Xwin 70Bxwin-lm/xwin-lm-70b8,192
Synthia 70Bmigtissera/synthia-70b8,192
Pygmalion: Mythalion 13Bpygmalionai/mythalion-13b8,192
OpenAI: GPT-4 32k (older v0314)openai/gpt-4-32k-031432,767
OpenAI: GPT-4 32kopenai/gpt-4-32k32,767
Nous: Hermes 13Bnousresearch/nous-hermes-llama2-13b4,096
Phind: CodeLlama 34B v2phind/phind-codellama-34b4,096
Meta: CodeLlama 34B Instructmeta-llama/codellama-34b-instruct8,192
Hugging Face: Zephyr 7Bhuggingfaceh4/zephyr-7b-beta4,096
Anthropic: Claude Instant v1.0anthropic/claude-instant-1.0100,000
Anthropic: Claude v1.2anthropic/claude-1.2100,000
Anthropic: Claude v1anthropic/claude-1100,000
Anthropic: Claude Instant v1anthropic/claude-instant-1100,000
Anthropic: Claude v2.0anthropic/claude-2.0100,000
Google: PaLM 2 Code Chatgoogle/palm-2-codechat-bison7,168
Google: PaLM 2 Chatgoogle/palm-2-chat-bison9,216
Meta: Llama 2 70B Chatmeta-llama/llama-2-70b-chat4,096
Meta: Llama 2 13B Chatmeta-llama/llama-2-13b-chat4,096
OpenAI: GPT-3.5 Turbo 16kopenai/gpt-3.5-turbo-012516,385
OpenAI: GPT-3.5 Turbo (older v0301)openai/gpt-3.5-turbo-03014,095

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注