gettinytool.com

AI Model Cost Calculator

Compare API costs across all major LLMs: GPT-4o, Claude, Gemini, Llama. Enter token counts and monthly request volume to find the cheapest option.

| Model             | Provider  | Per request | 1,000 req/mo       |
|-------------------|-----------|-------------|--------------------|
| Gemini 1.5 Flash  | Google    | $0.00022    | $0.2250 (cheapest) |
| Gemini 2.0 Flash  | Google    | $0.00030    | $0.3000            |
| GPT-4o mini       | OpenAI    | $0.00045    | $0.4500            |
| Claude 3 Haiku    | Anthropic | $0.00088    | $0.8750            |
| GPT-3.5 Turbo     | OpenAI    | $0.00125    | $1.25              |
| Llama 3.1 70B     | Meta      | $0.00132    | $1.32              |
| Claude 3.5 Haiku  | Anthropic | $0.00280    | $2.80              |
| Gemini 1.5 Pro    | Google    | $0.00375    | $3.75              |
| GPT-4o            | OpenAI    | $0.00750    | $7.50              |
| Llama 3.1 405B    | Meta      | $0.00750    | $7.50              |
| o1-mini           | OpenAI    | $0.00900    | $9.00              |
| Claude 3.5 Sonnet | Anthropic | $0.0105     | $10.50             |
| GPT-4 Turbo       | OpenAI    | $0.0250     | $25.00             |
| o1                | OpenAI    | $0.0450     | $45.00             |
| Claude 3 Opus     | Anthropic | $0.0525     | $52.50             |

Pricing as of 2025. Costs are estimates; verify at each provider's pricing page. Input/output ratio affects the actual cost significantly.


AI Model Cost Calculator: Compare LLM API Pricing Across GPT-4, Claude & Gemini

Not sure which LLM is cheapest for your use case? This AI model cost calculator lets you compare API pricing across major models, including GPT-4o, Claude, Gemini, and Llama, by entering your expected token counts and monthly request volume.

LLM costs add up fast at scale. A model that costs $0.50 less per million tokens may save hundreds of dollars per month when you're running thousands of requests daily. Use this calculator to make data-driven model selection decisions.
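As a rough illustration of the arithmetic behind that claim, the sketch below estimates monthly savings from a lower per-million-token price. The function name and the workload figures (10,000 requests per day, 1,500 tokens per request, a 30-day month) are assumptions chosen for illustration, and the price difference is applied uniformly to input and output tokens for simplicity.

```python
def monthly_savings(price_diff_per_million: float,
                    tokens_per_request: int,
                    requests_per_day: int,
                    days_per_month: int = 30) -> float:
    """Estimate monthly savings (USD) from a lower per-million-token price."""
    monthly_tokens = tokens_per_request * requests_per_day * days_per_month
    return monthly_tokens / 1_000_000 * price_diff_per_million


# Hypothetical workload: 10,000 requests/day at 1,500 tokens each.
savings = monthly_savings(0.50, tokens_per_request=1_500, requests_per_day=10_000)
print(f"${savings:,.2f} per month")  # prints "$225.00 per month"
```

Under these assumptions the workload consumes 450 million tokens per month, so a $0.50 difference per million tokens comes to $225.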

What you can do with this tool

  • Compare per-token costs across GPT-4o, GPT-4o mini, Claude 3.5 Sonnet, Gemini 1.5 Pro, and other popular models
  • Enter your input token count (prompt) and output token count (response) separately
  • Set your monthly request volume to see total monthly cost per model
  • Quickly identify the cheapest model for your budget and latency requirements
  • View a side-by-side comparison table sorted by cost

How to use the Cost Calculator

  1. Enter the average number of input tokens per request (your prompt length)
  2. Enter the average number of output tokens per request (response length)
  3. Set your expected monthly requests
  4. The table updates instantly with costs per model and total monthly spend
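The steps above reduce to one formula: cost per request = (input tokens × input price + output tokens × output price) / 1,000,000, with prices quoted per million tokens. Here is a minimal sketch of that calculation, not the calculator's actual implementation. The function names are hypothetical, and the prices in `PRICES` are assumed values picked because they reproduce the matching rows of the table above at 1,000 input and 500 output tokens. Verify current rates on each provider's pricing page.

```python
# Assumed prices in USD per 1 million tokens: (input, output).
# Illustrative values only; verify against each provider's pricing page.
PRICES = {
    "Gemini 1.5 Flash": (0.075, 0.30),
    "GPT-4o mini": (0.15, 0.60),
    "GPT-4o": (2.50, 10.00),
}


def cost_per_request(input_tokens, output_tokens, input_price, output_price):
    """Cost of one request in USD, given per-million-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def compare(input_tokens, output_tokens, monthly_requests, prices=PRICES):
    """Return (model, per_request, monthly) rows sorted from cheapest up."""
    rows = []
    for model, (in_price, out_price) in prices.items():
        per_request = cost_per_request(input_tokens, output_tokens,
                                       in_price, out_price)
        rows.append((model, per_request, per_request * monthly_requests))
    return sorted(rows, key=lambda row: row[1])


# Steps 1-3: token counts and monthly volume; step 4: the sorted table.
for model, per_request, monthly in compare(1_000, 500, 1_000):
    print(f"{model:<18} ${per_request:.5f}  ${monthly:.2f}")
```

Sorting by per-request cost puts the cheapest model first. Because monthly cost is just per-request cost times volume, the ranking is the same at any request volume.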

FAQ

Why do input and output tokens have different prices?

LLM providers charge less for input tokens than output tokens because generating tokens requires more compute than reading them. Output tokens typically cost 3 to 5 times more than input tokens.
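To see why the split between prompt and response tokens matters, the sketch below prices two requests with the same total token count but opposite input/output splits. The prices ($2.50 input and $10.00 output per million tokens, a 4x ratio) are assumed for illustration, and the function name is hypothetical.

```python
INPUT_PRICE = 2.50    # assumed USD per 1M input tokens
OUTPUT_PRICE = 10.00  # assumed USD per 1M output tokens (4x input)


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost of one request in USD at the assumed prices above."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


# Both requests use 1,500 tokens in total; only the split differs.
prompt_heavy = request_cost(input_tokens=1_200, output_tokens=300)
response_heavy = request_cost(input_tokens=300, output_tokens=1_200)

print(f"prompt-heavy:   ${prompt_heavy:.5f}")
print(f"response-heavy: ${response_heavy:.5f}")
```

At these assumed prices the response-heavy request costs a little more than twice as much as the prompt-heavy one, even though both use the same number of tokens.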

Are the prices up to date?

We update pricing as providers announce changes. For the latest official rates, always verify against the provider's pricing page before making production budget decisions.
