Total Cost (MTD)
$6,742.00
+12.3%vs last month
Projected Monthly
$8,920.45
at current rate
Total Savings
$1,345.67
16.6% of spend
Optimization Score
78/100
room for improvement
Provider Breakdown
OpenAI$2,847.32
42.3%45,892 req
Anthropic$1,956.78
29.1%12,453 req
Google$1,023.45
15.2%28,456 req
Cohere$512.89
7.6%8,934 req
Mistral AI$302.12
4.5%5,621 req
Together AI$99.44
1.5%2,845 req
Optimization Suggestions
Switch GPT-4 to GPT-4o
HIGHGPT-4o is 50% cheaper with faster response times
Save $892.45/mo
Enable Claude 3.5 caching
HIGHUse cached tokens for repeated contexts (40% discount)
Save $456.32/mo
Batch API for embeddings
MEDIUMProcess embeddings in batches for 70% cost reduction
Save $312.67/mo
Switch to Gemini 1.5 Flash
MEDIUMFlash model is 95% cheaper for non-critical tasks
Save $234.89/mo
Add retry logic for failures
LOWImplement exponential backoff to reduce wasted requests
Save $178.23/mo
Use Mistral for simple tasks
LOWSwitch non-complex prompts to Mistral Small
Save $145.67/mo
| Date | Provider | Model | Requests | Tokens | Cost |
|---|---|---|---|---|---|
| 2026-02-24 | OpenAI | gpt-4o | 1,234 | 630,000 | $45.67 |
| 2026-02-24 | Anthropic | claude-3-5-sonnet | 456 | 252,000 | $32.45 |
| 2026-02-23 | OpenAI | gpt-4 | 1,567 | 730,000 | $89.23 |
| 2026-02-23 | gemini-1.5-pro | 892 | 323,000 | $28.45 | |
| 2026-02-22 | Anthropic | claude-3-opus | 234 | 201,000 | $56.78 |
| 2026-02-22 | OpenAI | gpt-4-turbo | 1,456 | 675,000 | $67.89 |
| 2026-02-21 | Cohere | command-r-plus | 678 | 245,000 | $23.45 |
| 2026-02-21 | Mistral | large | 345 | 123,000 | $18.92 |