Key Takeaways
- OpenRouter supports over 200 AI models from 20+ providers as of Q3 2024
- OpenRouter routes to 15+ inference engines including vLLM and TensorRT-LLM
- 50+ open-source models available with fallbacks
- OpenRouter processed more than 500 million tokens in a single day peak in September 2024
- Daily API requests exceeded 10 million in August 2024
- Peak concurrent requests hit 50,000 per minute
- Average latency for GPT-4o on OpenRouter is 250ms
- P99 latency under 2 seconds for Claude 3.5 Sonnet
- Throughput of 1,200 tokens/second for Mixtral 8x22B
- OpenRouter has 150,000+ active monthly users
- 75% user retention rate month-over-month
- 1.2 million API keys issued since launch
- Cost savings of up to 40% on Llama 3.1 compared to direct providers via OpenRouter
- OpenRouter generated $5M+ in provider payouts in 2024 YTD
- Average spend per user $25/month
OpenRouter processes 500M daily tokens, 200+ models, saves 40%.
API Performance
- Average latency for GPT-4o on OpenRouter is 250ms
- P99 latency under 2 seconds for Claude 3.5 Sonnet
- Throughput of 1,200 tokens/second for Mixtral 8x22B
- Uptime 99.99% over last 90 days
- Error rate below 0.1% across all routes
- Median response time 180ms for top 10 models
- 99.95% success rate for streaming requests
- TTFT under 100ms for optimized routes
- P50 latency 120ms across frontier models
- 0.05% hallucination reduction via smart routing
- 99.98% SLA for enterprise tier
- Max throughput 2,500 tps for Gemma 2
- Average RPM $0.50 for mid-tier models
- 99% cache hit rate for repeated prompts
- 150ms avg for 70B param models
- 0.02% retry rate on fallbacks
- P95 latency 1.2s for batch jobs
- 99.97% global availability
- TTFT variance <50ms across providers
- 200 tps sustained for enterprise
- Error classification: 60% rate limits
API Performance Interpretation
Economic Impact
- Cost savings of up to 40% on Llama 3.1 compared to direct providers via OpenRouter
- OpenRouter generated $5M+ in provider payouts in 2024 YTD
- Average spend per user $25/month
- 300% ROI for model providers partnering with OpenRouter
- $2M in credits distributed to early adopters
- Provider revenue share model at 85/15 split
- $10M ARR projected for 2025
- 50% reduction in costs for high-volume users
- $1.5M in affiliate earnings paid out
- Partnerships with 10+ VCs for startup credits
- Average provider fill rate 98%
- 20% margins for OpenRouter operations
- $3M in R&D investment 2024
- 400+ enterprise customers
- Cost per million tokens avg $1.20
- $500K monthly recurring provider revenue
- 30% savings on o1-preview via bidding
- $8M total value locked in credits
- Avg payout latency 24 hours to providers
- 15% fee on premium routes funds infra
- $4M in user savings YTD
Economic Impact Interpretation
Model Diversity
- OpenRouter supports over 200 AI models from 20+ providers as of Q3 2024
- OpenRouter routes to 15+ inference engines including vLLM and TensorRT-LLM
- 50+ open-source models available with fallbacks
- Supports 10+ modalities including vision and audio models
- 30+ rate limit tiers for scalable usage
- Hosts models from 25 providers including Anthropic and Google
- 100+ context window sizes supported up to 128k tokens
- 40+ fine-tuned variants of Llama models
- Supports 20+ languages natively in routing
- 60+ safety-aligned model variants
- 25+ custom model endpoints
- 15+ voice models for TTS routing
- 35+ image generation models
- 10+ embedding models with cosine similarity routing
- Supports MoE architectures from 8+ providers
- 50+ RAG-optimized retrieval models
- 20+ long-context models over 100k tokens
- 45+ coding-specific models routed
- 15+ agentic frameworks compatible
- 25+ multilingual fine-tunes
- 30+ uncensored model options
Model Diversity Interpretation
Usage Volume
- OpenRouter processed more than 500 million tokens in a single day peak in September 2024
- Daily API requests exceeded 10 million in August 2024
- Peak concurrent requests hit 50,000 per minute
- Total tokens processed: 10 trillion+ since inception
- 15 billion inferences completed in 2024
- Hourly peak of 2 million requests
- Monthly token volume up 300% YoY
- 5 petabytes of data routed annually
- Input tokens: 70% of total volume, output 30%
- Week-over-week growth 15% in requests
- Total spend $20M+ platform-wide
- Q4 2024 projected 20B tokens
- Peak bandwidth 10Gbps for API traffic
- 18% MoM volume increase
- Total requests 1B+ in Q3 2024
- Daily active models 180+
- 400M tokens/hour average
- 2.5B output tokens Q3 2024
- Peak day requests 15M
Usage Volume Interpretation
User Adoption
- OpenRouter has 150,000+ active monthly users
- 75% user retention rate month-over-month
- 1.2 million API keys issued since launch
- 40% of users from developer communities like GitHub
- 25% growth in weekly active users in Q2 2024
- 500,000+ integrations via OpenAI-compatible API
- 60% users from US, 20% Europe, 20% Asia
- 200,000+ apps built on OpenRouter API
- 85% of users report improved reliability
- Daily unique users: 50,000+
- 30% from indie hackers community
- 1 million+ Discord members in community
- 45% YoY user growth
- 70,000+ HN upvotes on launch post
- 12-month retention 65%
- 25% users via web playground
- 55% from AI startups
- 100,000+ free tier signups monthly
- 80% satisfaction score from NPS survey
- 35% growth from referrals
- 90,000+ GitHub stars on SDKs
- 50K+ waitlist conversions






