Portkey Model Catalog
Browse, compare, and select from 1,600+ AI models across providers with real-time cost, latency, and capability benchmarks
Overview:
Portkey Model Catalog is a continuously updated index of 1,600+ AI models across 200+ providers, giving teams the data they need to make confident model selection decisions. Rather than researching pricing, latency, and capabilities across individual provider documentation pages, the catalog surfaces all of this in a single comparable view.
Every model in the catalog is directly accessible via the Portkey AI Gateway. Switching models is a configuration change – not an engineering project – so teams can move between models as the landscape evolves without modifying application code. The catalog integrates with the gateway's routing and A/B testing capabilities to enable data-driven model selection on live production traffic.
- 1,600+ models indexed across OpenAI, Anthropic, Google, Mistral, Meta, Cohere, and 200+ providers.
- Side-by-side comparison of cost per token, latency, context window, modalities, and benchmark scores.
- Real-time pricing data updated continuously across all providers.
- Measured p50 and p99 latency benchmarks from Portkey's global gateway infrastructure.
- Capability filters by modality, context length, fine-tuning support, function calling, and compliance certification.
- Every catalogued model instantly accessible via the Portkey AI Gateway with no code changes required to switch.
- Historical price tracking to monitor provider pricing trends over time.
- Covers both open and closed source models across hosted and self-hosted deployment options.
Comprehensive Model Coverage
1,600+ models indexed across every major provider – OpenAI, Anthropic, Google, Mistral, Meta, Cohere, AWS Bedrock, Azure, and hundreds more – all queryable from a single interface. The catalog covers both open and closed source models and is updated continuously as new models are released.
- 1,600+ models indexed across 200+ providers.
- OpenAI, Anthropic, Google, Mistral, Meta, Cohere, AWS Bedrock, Azure, and more.
- Open and closed source models included.
- Updated continuously as new models are released.
Side-by-Side Comparison
Compare any set of models on cost per input and output token, latency, context window size, supported modalities, and published benchmark scores in a single view. Eliminate the manual effort of collating this data from individual provider documentation pages.
- Cost per input and output token comparison.
- Context window size across all models.
- Latency benchmarks side by side.
- Published benchmark score comparison.
Real-Time Pricing Data
Up-to-date pricing data for every model and provider, updated continuously and with historical price tracking so teams can monitor how provider pricing evolves over time and make cost-optimisation decisions based on accurate current data.
- Real-time price data updated continuously.
- Input and output token pricing per model.
- Cross-provider cost comparison in one view.
- Historical price tracking for trend analysis.
Latency Benchmarks
Measured p50 and p99 latency data from Portkey's global gateway infrastructure, reflecting real-world performance rather than provider-quoted figures. Updated continuously as infrastructure and model performance changes.
- p50 and p99 latency measured from global gateway infrastructure.
- Real-world performance data rather than provider estimates.
- Per-provider and per-model latency comparison.
- Updated continuously as performance changes.
Capability Filters
Filter the catalog by modality (text, vision, audio, embeddings), context window length, fine-tuning support, function calling capability, and compliance certifications to quickly narrow to the right model for a specific use case without reviewing every entry manually.
- Filter by modality: text, vision, audio, embeddings.
- Context window length filter.
- Fine-tuning support and function calling flags.
- Compliance certification filter for regulated use cases.
Stop Guessing – Choose Models With Data
The AI model landscape changes every week. New models are released, prices drop, and latency characteristics shift. Portkey's Model Catalog gives teams current, accurate data on cost, latency, and capabilities so model selection decisions are based on evidence rather than reputation or recency bias.
- 1,600+ models across 200+ providers in one searchable view.
- Real-time pricing and latency benchmarks updated continuously.
- Side-by-side capability comparison across any model set.
- Historical pricing trends for long-term cost planning.
One-Click Gateway Integration
Every model in the catalog is directly accessible via the Portkey AI Gateway. Switching from one model to another is a configuration change at the gateway layer – no application code changes, no redeployment, no new SDK integrations required. Teams can respond to new model releases or pricing changes within minutes.
- All catalogued models accessible via the Portkey AI Gateway.
- Model switching via configuration – no code changes required.
- Config-driven model selection per virtual key or team.
- Respond to new releases and pricing changes immediately.
A/B Testing via the Catalog
Combine the Model Catalog with the Portkey AI Gateway's A/B testing capability to split production traffic between two models and compare quality, latency, and cost on real requests before fully committing to a switch. Promote the better-performing model with a single configuration change.
Avoid Vendor Lock-In
Because every catalogued model is accessible through the same gateway API, teams are never locked into a single provider. Migrating between providers – or running multiple providers simultaneously for different workloads – requires no changes to application code and no new credentials exposed to application teams.
- No vendor lock-in – switch providers via configuration.
- Run multiple providers simultaneously for different workloads.
- Raw provider credentials never exposed to application teams.
- Provider migration without application redeployment.
Portkey Model Catalog Specifications:
Table 1. Model Catalog Coverage and Data |
||
|---|---|---|
| Cloud (Managed) | Self-Hosted (Enterprise) | |
| Models indexed | 1,600+ models across 200+ providers – updated continuously as new models are released | |
| Providers covered | OpenAI, Anthropic, Google, Mistral, Meta, Cohere, AWS Bedrock, Azure, and 200+ more | |
| Pricing data | Real-time input and output token pricing with historical price tracking per model | |
| Latency benchmarks | Measured p50 and p99 latency from Portkey's global gateway infrastructure | |
| Capability filters | Modality, context window, fine-tuning support, function calling, compliance certification | |
| Gateway integration | All catalogued models accessible via the Portkey AI Gateway – model switching via configuration only | |
| Deployment options | Managed cloud (US, EU) | Kubernetes, Docker, private VPC |
| Model types | Open and closed source. Text, vision, audio, and embedding models included. | |
| Table 2. Integration and Compatibility |
|---|
| Gateway Integration |
| Every catalogued model accessible via the Portkey AI Gateway. Switch models via configuration with no code changes or redeployment required. |
| SDKs |
| Python and JavaScript/TypeScript SDKs. Full OpenAI SDK compatibility – models switched at the gateway layer without SDK changes. |
| A/B Testing |
| Any two catalogued models can be A/B tested via the Portkey AI Gateway's traffic-splitting capability on live production requests. |
| Agent Frameworks |
| LangChain, LlamaIndex, CrewAI, AutoGen, Vercel AI SDK, and all OpenAI-compatible frameworks. |
| Compliance |
| SOC 2 Type II, GDPR compliant. Zero data retention (ZDR) option available on Enterprise tier. |
| Table 3. Comparison and Selection Capabilities |
|---|
| Cost Comparison |
| Real-time input and output token pricing across all providers. Historical price tracking for trend analysis and long-term cost planning. |
| Latency Benchmarks |
| Measured p50 and p99 latency from Portkey's global gateway infrastructure. Updated continuously as performance changes. |
| Capability Filters |
| Filter by modality, context window length, fine-tuning support, function calling, and compliance certification to narrow model selection quickly. |
| Model Switching |
| Configuration-driven model selection at the gateway layer. No application code changes, no redeployment, no new SDK integrations required to switch. |
| Benchmark Scores |
| Published benchmark scores included per model for side-by-side quality comparison across the catalog. |
