💰

AI Model Cost Calculator

Compare API costs across all major LLMs — GPT-4o, Claude, Gemini, Llama. Enter token counts and monthly request volume to find the cheapest option.

Input tokens / request

Output tokens / request

Requests per month

Model	Per request	1,000 req/mo	Relative cost
Gemini 1.5 Flash Google	$0.00022	$0.2250cheapest	<1%
Gemini 2.0 Flash Google	$0.00030	$0.3000	<1%
GPT-4o mini OpenAI	$0.00045	$0.4500	<1%
Claude 3 Haiku Anthropic	$0.00088	$0.8750	2%
GPT-3.5 Turbo OpenAI	$0.00125	$1.25	2%
Llama 3.1 70B Meta	$0.00132	$1.32	3%
Claude 3.5 Haiku Anthropic	$0.00280	$2.80	5%
Gemini 1.5 Pro Google	$0.00375	$3.75	7%
GPT-4o OpenAI	$0.00750	$7.50	14%
Llama 3.1 405B Meta	$0.00750	$7.50	14%
o1-mini OpenAI	$0.00900	$9.00	17%
Claude 3.5 Sonnet Anthropic	$0.0105	$10.50	20%
GPT-4 Turbo OpenAI	$0.0250	$25.00	48%
o1 OpenAI	$0.0450	$45.00	86%
Claude 3 Opus Anthropic	$0.0525	$52.50	100%

Pricing as of 2025. Costs are estimates — verify at each provider's pricing page. Input/output ratio affects the actual cost significantly.

Related Tools

🧮AI Tools

AI Token Counter

Count tokens and estimate API costs for GPT-4, Claude, Gemini and more

Open tool

🔢AI Tools

Prompt Tokenizer

Estimate token count and API cost for any prompt across GPT-4.1, Claude 3.7, Gemini 2.5 and more. Adjust expected output tokens to calculate total cost.

Open tool

🔀AI Tools

Prompt Diff

Compare two AI prompt versions side by side. See word-level diff, token delta, quality scores, and improvement hints — 100% in-browser.

Open tool

🔑API Tools

API Key Checker

Validate your AI API keys instantly

Open tool

Sponsor this · ads@gettinytool.com

AI Model Cost Calculator — Compare LLM API Pricing Across GPT-4, Claude & Gemini

Not sure which LLM is the cheapest for your use case? This AI model cost calculator lets you compare API pricing across all major models — GPT-4o, Claude 3.7, Gemini 2.5, Llama, and more — by entering your expected token counts and monthly request volume.

LLM costs add up fast at scale. A model that costs $0.50 less per million tokens may save hundreds of dollars per month when you're running thousands of requests daily. Use this calculator to make data-driven model selection decisions.

What you can do with this tool

Compare per-token costs across GPT-4o, GPT-4o mini, Claude 3.7 Sonnet, Gemini 2.5 Pro, and other popular models
Enter your input token count (prompt) and output token count (response) separately
Set your monthly request volume to see total monthly cost per model
Quickly identify the cheapest model for your budget and latency requirements
Side-by-side comparison table sorted by cost

How to use the Cost Calculator

Enter the average number of input tokens per request (your prompt length)
Enter the average number of output tokens per request (response length)
Set your expected monthly requests
The table updates instantly with costs per model and total monthly spend

FAQ

Why do input and output tokens have different prices?

LLM providers charge less for input tokens than output tokens because generating tokens requires more compute than reading them. Output tokens typically cost 3–5x more than input tokens.

Are the prices up to date?

We update pricing as providers announce changes. For the latest official rates, always verify against the provider's pricing page before making production budget decisions.