Portkey AI Observability
Full-stack LLM observability – real-time insights, tracing, cost attribution, and eval pipelines

Portkey Observability

Portkey AI Products

AI Observability

Portkey AI Observability

#PORTKEY-OBSERVABILITY
Our Price: Request a Quote

Overview:

Portkey Observability provides full-stack visibility into every LLM interaction across your AI applications. Most LLM applications ship without proper monitoring – Portkey adds a complete observability layer including token traces, cost attribution, evaluation pipelines, and third-party integrations in minutes, not weeks.

The Portkey observability layer sits at the gateway between your application and every LLM provider it calls. Every request and response is recorded with 40+ metadata fields, linked to a distributed trace, and made available for filtering, export, and evaluation – without manual instrumentation or changes to your application code.

Every request and response logged with 40+ metadata fields including model, tokens, cost, latency, and user ID.
Distributed tracing across multi-step LLM workflows and agentic pipelines.
Structured feedback collection and eval pipelines (RAGAS, G-Eval, custom) on live production traffic.
Real-time cost attribution per model, team, user, and project with budget alert thresholds.
Advanced log filtering across 40+ dimensions with full-text search and exportable results.
Native integrations with Langfuse, Datadog, Arize, Weights & Biases, and more.
Deploy with two lines of code – no manual instrumentation required.
Compatible with all major LLM providers and agent frameworks.

Detailed Request Logging

Every request and response is recorded automatically at the gateway layer with over 40 metadata fields. Filter, search, and export logs by any attribute to debug issues, audit usage, and understand application behaviour in production.

Cost and latency captured per request.
40+ metadata fields including model, tokens, user ID, environment, and custom tags.
Filter logs by any attribute combination.
Exportable log data in JSON and CSV formats.

Distributed Tracing

Monitor the full lifecycle of LLM requests and agentic workflows in a unified, chronological trace view – from the first prompt to the final response. Portkey captures every intermediate step including tool calls, sub-agent invocations, and retrieval operations.

End-to-end request traces across multi-step workflows.
Tool call and sub-agent tracing with parent–child linking.
Timeline view of every execution step.
Cost and token usage linked to each trace node.

Feedback & Eval Pipelines

Collect structured feedback at the request or conversation level and run evaluation pipelines directly on live production traffic. Portkey supports RAGAS, G-Eval, and custom eval frameworks, enabling continuous quality measurement without a separate infrastructure.

Human and AI feedback collection at request or session level.
RAGAS, G-Eval, and custom eval pipeline support.
Hallucination and faithfulness scoring.
Eval results feed into RLHF and fine-tuning workflows.

Advanced Log Filtering

Filter production logs across 40+ dimensions including model, cost, latency, user, environment, custom metadata, and eval scores. Pinpoint issues in seconds rather than trawling through raw API logs.

Filter across 40+ dimensions simultaneously.
Full-text search across request and response payloads.
Save and share custom filter views.
Export filtered results for downstream analysis.

FinOps & Cost Dashboards

Real-time cost attribution broken down by model, team, user, and project. Identify cost leaks early, enforce per-team and per-project budgets, and drive more efficient AI workflows with historical spend trend analysis.

Cost breakdown per model, team, user, and project.
Real-time spend dashboards with configurable time ranges.
Budget guardrail alerts (soft and hard thresholds).
Historical trend analysis and exportable cost reports.

Observability Integrations

Export traces and metrics to your existing observability stack without changing your application code. Portkey provides one-click integrations with all major LLM observability and monitoring platforms.

Langfuse, Datadog, Arize, and Weights & Biases integrations.
OpenTelemetry and Grafana export support.
One-click integration – no code changes required.
Works with any LLM framework or application stack.

SIEM and Data Warehouse Export

Export all observability data – logs, traces, eval results, and cost attribution – to your SIEM or data warehouse for long-term retention, compliance reporting, and cross-system analytics.

Custom Metadata and Tagging

Attach custom metadata to every request at instrumentation time: environment, feature flag, experiment ID, user segment, or any other dimension your team tracks. All custom tags are fully filterable and exportable.

Portkey Observability Specifications:

Table 1. Observability Performance and Capacities
	Cloud (Managed)	Self-Hosted (Enterprise)
Log ingestion throughput	Up to 10,000 req/min	Unlimited (hardware-dependent)
Metadata fields per log	40+ built-in fields + unlimited custom metadata
Trace retention	30 days (extendable)	Configurable (your storage)
Supported LLM providers	200+ models across OpenAI, Anthropic, Google, Mistral, Cohere, Azure, AWS Bedrock, and more
Eval frameworks	RAGAS, G-Eval, custom eval pipelines, human feedback collection
Log export formats	JSON, CSV, Webhook, S3, BigQuery, Datadog, Grafana, SIEM integrations
High availability	Multi-region, 99.99% SLA	Active/active cluster support

Table 2. Integration and Compatibility
SDKs
Python and JavaScript/TypeScript SDKs. OpenAI SDK drop-in compatibility – change one line to route through Portkey.
Observability Platforms
Native integrations with Langfuse, Datadog, Arize, Weights & Biases, Grafana, and OpenTelemetry.
Deployment Options
Managed cloud (US, EU regions), self-hosted on Kubernetes, and private cloud (VPC). Docker images available.
Authentication
API key authentication with virtual key scoping. SSO and SAML support on Enterprise tier.
Compliance
SOC 2 Type II, GDPR compliant. Zero data retention (ZDR) option available.

Table 3. Logging and Filtering Capabilities
Built-in Log Dimensions
Model, provider, tokens (prompt/completion/total), cost, latency, user ID, environment, trace ID, and timestamp.
Custom Metadata
Attach unlimited custom key-value metadata to any request. All custom fields are indexed and fully filterable.
Search
Full-text search across request and response payloads. Saved filter views and shareable filter URLs.
Cost Attribution
Real-time cost breakdown by model, provider, team, user, project, and custom metadata dimensions.
Eval Scoring
Filterable eval scores including RAGAS faithfulness, answer relevance, context recall, G-Eval coherence, and custom metrics.