Model Pricing
All prices are per 1 million tokens in USD.Text Models
GLM 5
glm-5#21LargeInput Price$1.00
Output Price$3.20
Context200K
Agentic Engineering and Complex Reasoning
Kimi K2.5
kimi-k2.5#24LargeInput Price$0.60
Output Price$3.00
Context256K
Complex Agent Use and Code Development
Gemma 4 31B
gemma-4-31b#27LargeInput Price$0.15
Output Price$0.40
Context256K
Math, Science, Coding, Document Parsing, Visual Reasoning
Gemma 4 26B A4B
gemma-4-26b-a4b#40LargeInput Price$0.15
Output Price$0.40
Context256K
Efficient Reasoning, Low-Latency Inference, Image Analysis
Qwen 3.5 35B A3B
qwen35-35b-a3b#92MediumInput Price$0.30
Output Price$1.25
Context256K
General Purpose, Long Context, Image Analysis
Qwen 3.5 9B
qwen35-9bMediumInput Price$0.05
Output Price$0.15
Context256K
Fast Responses, Image Analysis, Simple Tasks
Arcee Trinity Large Thinking
arcee-trinity-large-thinking#118LargeInput Price$0.30
Output Price$1.00
Context256K
Agentic Workflows, Multi-Step Planning, Tool Orchestration
Kimi K2 Thinking
kimi-k2-thinking#50LargeInput Price$0.60
Output Price$3.00
Context256K
Complex Agent Use and Code Development
GLM 4.7
glm-4.7#23LargeInput Price$0.50
Output Price$2.25
Context198K
Complex Agent Use
GLM 4.7 Thinking
glm-4.7-thinking#38LargeInput Price$0.45
Output Price$2.00
Context198K
Complex Agent Use and Reasoning
GLM 4.7 Flash
glm-4.7-flash#126LargeInput Price$0.10
Output Price$0.50
Context128K
Fast Advanced Agent Use
Qwen3 235B
qwen3-235b#59LargeInput Price$0.40
Output Price$3.00
Context128K
Advanced Agent Use and Code Development
MiniMax M2.5
minimax-m2.5#81LargeInput Price$0.30
Output Price$1.20
Context1M
AI Agents and Autonomous Workflows
Qwen3 Coder 480B
qwen3-coder-480b-a35b-instruct#109LargeInput Price$0.70
Output Price$2.80
Context256K
Code Development
Qwen3 Next 80B
qwen3-next-80b#89MediumInput Price$0.15
Output Price$1.50
Context256K
Long-Context Chat and Content
GPT OSS 120B
gpt-oss-120b#138LargeInput Price$0.07
Output Price$0.28
Context128K
Advanced Chat and Content
Hermes 3 Llama 3.1 405B
hermes-3-llama-3.1-405b#166LargeInput Price$1.00
Output Price$3.00
Context128K
Advanced Chat and Content
Llama 3.3 70B
llama-3.3-70b#187MediumInput Price$0.70
Output Price$2.50
Context128K
General Chat and Content
Llama 3.2 3B
llama-3.2-3b#290SmallInput Price$0.10
Output Price$0.50
Context128K
Classification, Simple QA
Mistral 31 24B
mistral-31-24b#207MediumInput Price$0.50
Output Price$2.00
Context128K
Basic Agent Functionality
Venice Uncensored
venice-uncensoredMediumInput Price$0.20
Output Price$0.90
Context32K
Uncensored Creative Use
Embedding Models
BGE M3
text-embedding-bge-m3EmbeddingInput Price$0.10
Output Price$0.50
Vector Embeddings for RAG
Legend
- 🧠 Reasoning — Extended thinking and step-by-step problem solving
- ⚡ Function Calling — Can invoke tools and external APIs
- 👀 Vision — Can analyze and understand images
- Arena Rank — Position on the Chatbot Arena Leaderboard (lower is better)

