AI Model Cost Calculator
Compare API costs across all major LLMs โ GPT-4o, Claude, Gemini, Llama. Enter token counts and monthly request volume to find the cheapest option.
| Model | Per request | 1,000 req/mo | Relative cost |
|---|---|---|---|
Gemini 1.5 Flash Google | $0.00022 | $0.2250cheapest | <1% |
Gemini 2.0 Flash Google | $0.00030 | $0.3000 | <1% |
GPT-4o mini OpenAI | $0.00045 | $0.4500 | <1% |
Claude 3 Haiku Anthropic | $0.00088 | $0.8750 | 2% |
GPT-3.5 Turbo OpenAI | $0.00125 | $1.25 | 2% |
Llama 3.1 70B Meta | $0.00132 | $1.32 | 3% |
Claude 3.5 Haiku Anthropic | $0.00280 | $2.80 | 5% |
Gemini 1.5 Pro Google | $0.00375 | $3.75 | 7% |
GPT-4o OpenAI | $0.00750 | $7.50 | 14% |
Llama 3.1 405B Meta | $0.00750 | $7.50 | 14% |
o1-mini OpenAI | $0.00900 | $9.00 | 17% |
Claude 3.5 Sonnet Anthropic | $0.0105 | $10.50 | 20% |
GPT-4 Turbo OpenAI | $0.0250 | $25.00 | 48% |
o1 OpenAI | $0.0450 | $45.00 | 86% |
Claude 3 Opus Anthropic | $0.0525 | $52.50 | 100% |
Pricing as of 2025. Costs are estimates โ verify at each provider's pricing page. Input/output ratio affects the actual cost significantly.
Sponsored
Related Tools
AI Token Counter
Count tokens and estimate API costs for GPT-4, Claude, Gemini and more
Prompt Tokenizer
Estimate token count and API cost for any prompt across GPT-4.1, Claude 3.7, Gemini 2.5 and more. Adjust expected output tokens to calculate total cost.
Prompt Diff
Compare two AI prompt versions side by side. See word-level diff, token delta, quality scores, and improvement hints โ 100% in-browser.
API Key Checker
Validate your AI API keys instantly
AI Model Cost Calculator โ Compare LLM API Pricing Across GPT-4, Claude & Gemini
Not sure which LLM is the cheapest for your use case? This AI model cost calculator lets you compare API pricing across all major models โ GPT-4o, Claude 3.7, Gemini 2.5, Llama, and more โ by entering your expected token counts and monthly request volume.
LLM costs add up fast at scale. A model that costs $0.50 less per million tokens may save hundreds of dollars per month when you're running thousands of requests daily. Use this calculator to make data-driven model selection decisions.
What you can do with this tool
- Compare per-token costs across GPT-4o, GPT-4o mini, Claude 3.7 Sonnet, Gemini 2.5 Pro, and other popular models
- Enter your input token count (prompt) and output token count (response) separately
- Set your monthly request volume to see total monthly cost per model
- Quickly identify the cheapest model for your budget and latency requirements
- Side-by-side comparison table sorted by cost
How to use the Cost Calculator
- Enter the average number of input tokens per request (your prompt length)
- Enter the average number of output tokens per request (response length)
- Set your expected monthly requests
- The table updates instantly with costs per model and total monthly spend
FAQ
Why do input and output tokens have different prices?
LLM providers charge less for input tokens than output tokens because generating tokens requires more compute than reading them. Output tokens typically cost 3โ5x more than input tokens.
Are the prices up to date?
We update pricing as providers announce changes. For the latest official rates, always verify against the provider's pricing page before making production budget decisions.