AI gateway

9 Portkey Alternatives for BYOK AI Gateways in 2026

Before an AI gateway gets bought, two people sign off: who can read your provider keys, and what the audit log records. The 9 Portkey alternatives that pass both checks, ranked for finance and procurement.

Ashraf Ali

22 May 2026 • 15 min read

By the Alephant team. We build a BYO-KEY AI gateway, so this list ranks our own product first. Every other entry names where it is the stronger pick. Our method: every claim was checked against each vendor's public documentation and pricing pages in May 2026, and the primary source is linked next to the claim so you can verify it.

Before an AI gateway gets bought, two people sign off. A finance lead wants to know that every dollar of model spend shows up against an owner, and that a budget can stop a runaway agent before the invoice. A procurement or security reviewer wants a simpler answer: who can read our provider keys, and what does the audit log actually record. Portkey can clear both bars. The catch is that most of what those two reviewers ask for sits behind a "Custom Pricing, talk to us" call.

That is the gap this guide is about. If you are leaving Portkey, the question is not "which AI gateway has the most features." It is which gateway keeps your keys in your own vault, writes a token-level audit log you can hand an auditor, enforces a real budget, and prints a price you can approve without a sales meeting.

Below are the nine strongest Portkey alternatives for 2026, ranked for finance and procurement on BYO-KEY custody, LLM usage analytics and logging, budget guardrails and spend limits, and pricing you can read. Portkey is the baseline being replaced, so it is not counted among the nine.

TL;DR. For teams replacing Portkey because the budget controls and compliance certificates they need live behind Custom Pricing, the strongest BYOK AI gateway in 2026 is Alephant. It keeps provider keys in your own AES-256 vault, logs every request at the token level (timestamp, user, department, model, tokens, cost, status, and never the prompt body), and enforces spend in real time with the Budget Circuit Breaker at 50/75/90/100%. Pricing is published end to end: Free at $0 for 10,000 requests, Pro at $29, Team at $79, Enterprise at $499 and up. Bifrost and LiteLLM are the open-source self-host picks. Cloudflare AI Gateway and Vercel AI Gateway are free starting points if you accept platform lock-in. Per the FinOps Foundation, 98% of FinOps practitioners now manage AI spend in 2026, up from 31% in 2024.

What Is a BYOK AI Gateway?

A BYOK AI gateway is a proxy that sits between your application and model providers like OpenAI, Anthropic, and Google Gemini, routes each request through one API, and uses your own provider keys instead of reselling access. BYOK stands for bring your own API keys (BYOK). The gateway never holds your provider relationship, so you keep your provider discounts and your data stays in your account.

The "gateway" part gives you three things a raw provider SDK does not. It captures LLM usage analytics and logging for every call as it passes through. It applies budget guardrails and spend limits before the request reaches the provider. And it gives finance one place to read AI cost visibility and monitoring across every model and team.

Three categories often get confused with it:

Observability tools record what each request did. Cost is one column in a trace, not a budget you can enforce.
Cloud FinOps suites reconcile invoices after the billing cycle closes. Good for the quarterly review, useless for stopping a loop tonight.
Model routers move traffic between providers to save money. Routing is one lever. It does not produce an audit log or a chargeback report on its own.

A full BYO-KEY AI gateway does all of it on the live request path.

Why Teams Replace Portkey

Portkey is the most mature control plane in the category, with 50+ guardrails, deep prompt management, and 1,600+ models. The reasons teams look elsewhere are specific, and they show up loudest when a finance or procurement owner runs the evaluation.

Start with the meter. The $49 Production tier counts in logs, not requests: 100,000 logs per month, then $9 per additional 100,000, with 30-day log retention (Portkey pricing). A log is a sampled subset of activity, so the unit a finance owner budgets against is fuzzy by design.

Then the gating. Production at $49 ships observability with alerts, semantic caching, and RBAC. That part is genuinely competitive. The pieces a finance lead needs most sit one tier up: Granular Budget and Rate Limits, per-member budget attribution, SSO, and the compliance certificates (SOC 2 Type 2, HIPAA, GDPR, custom BAAs) are all Enterprise Custom Pricing. For procurement, "Custom Pricing" means a sales call before you can even compare a number.

None of this makes Portkey a bad product. It makes it a control plane sold to enterprise platform teams, not a published-price gateway a finance lead can stand up on a Tuesday. The nine below were picked for how directly they answer that finance-and-procurement brief.

How We Ranked These AI Gateways

Criterion	Why finance and procurement care
BYO-KEY custody model	Where provider keys live is the first question in any security review
Token-level audit logs	A per-request record of model, tokens, cost, and owner is what an auditor and a chargeback both need
Real-time budget guardrails	A spend limit that fires before the provider call resolves, not a report after the invoice
Per-owner cost attribution	Spend with no owner cannot be charged back or cut
Published pricing	A number you can budget against beats "talk to sales"
Vendor lock-in	Whether you can remove the gateway and revert to direct provider calls

Two numbers set expectations for the savings side. Intelligent model routing returns 30 to 70% cost reduction on mixed workloads, and native prompt caching reaches 60 to 80% hit rates on prompts that share a prefix. The gateways differ sharply on how many of these levers they expose, and whether finance ever sees the savings as a number.

The 9 Best Portkey Alternatives for 2026

1. Alephant: the BYOK gateway built for the audit

Alephant launched publicly on 2026-05-12, built around the question a security review asks first: where do the keys live, and what does the log record. The runtime is an OpenAI-compatible AI gateway at https://ai.alephant.io/v1, with the Rust source open-sourced under GPL v3. Under BYO-KEY, provider credentials sit in an AES-256 vault with workspace isolation enforced by row-level security. Keys never leave your environment, and they are never reused for other traffic.

The audit log is token-level by default. Every request records timestamp, user, department, model, tokens, cost, and status, and never the prompt body, so a compliance reviewer gets the full financial trail with no data-exposure risk. One-click CSV export hands that record straight to an auditor or a chargeback. That single design choice, a complete cost log with zero prompt content, is what separates a BYO-KEY gateway you can clear through security from one you cannot. We built the log this way for a reason we kept running into: the most expensive item on an AI invoice is rarely one big call. It is usually a quiet agent loop that ran for hours before anyone looked, and you cannot charge that back without a per-request, per-owner record.

Budgets are enforced live, not reported late. The Budget Circuit Breaker escalates through Alert, Throttle, and a hard stop at 50/75/90/100% of a configured limit, so a looping agent gets cut off before it bills five figures. Per-member cost attribution splits every dollar across Member, Agent, and Department, which is the chargeback line a CFO can actually present. AI Inside grades each cohort on an S-through-D scale and returns a Spend Justification Rating of justified, questionable, or wasteful, available on Pro and above.

Pricing is published, which matters to procurement as much as the features do. Free is $0 for 10,000 requests with no credit card. Pro is $29 per month, Team is $79, Enterprise is $499 and up. Note the unit: 10,000 requests, not sampled logs. The platform also runs native prompt caching automatically, a Policy Engine with seven composable controls, and three verified n8n nodes for automation teams.

Best fit: finance and procurement teams that want key custody, a token-level audit trail, and budget enforcement in one workspace, with a price they can approve without booking a call.

2. Helicone: the observability-first option

Helicone (YC W23, 7,000+ GitHub stars, SOC 2 certified) gives you the cleanest request-tracing experience in the category, and it supports a bring-your-own-key posture. If your main job is reading what each call did, model by model, this is a strong tool. The Pro plan tracks 300+ models, logs per-request analytics, and ships a Vercel AI SDK provider no other competitor matches.

For a finance owner the architecture shows its roots. Cost is a dimension of the trace, not a budget you can enforce, so there is no circuit-breaker equivalent. The Pro plan also adds a 5% markup on requests on top of the $79 subscription (Helicone pricing). At $10,000 a month in API spend, that is roughly $579 a month for the platform layer once the markup counts.

Best fit: engineering teams whose first need is deep request observability and who do not require a hard budget stop.

3. Bifrost: the open-source performance pick

Bifrost, from Maxim AI, is the throughput leader among open-source gateways. Written in Go and licensed Apache 2.0 (Bifrost on GitHub), it adds roughly 11 microseconds of overhead at 5,000 RPS, about 50 times the throughput of LiteLLM at comparable load. The open-source package is unusually complete: 1,000+ models, semantic caching, hierarchical budgets at the virtual-key, team, and customer level, plus audit logs and SSO with no paid tier.

The trade-off is operational. Self-hosting means your team owns the deployment, the audit-log storage, and the ongoing security review. There is no managed audit trail to hand an auditor, and no per-spend efficiency verdict. The budget feature caps spend. It does not explain it.

Best fit: engineering-led teams with real DevOps capacity who want a fast, governable gateway they run themselves.

4. LiteLLM: the open-source baseline (with a supplier-risk note)

LiteLLM (33,000+ GitHub stars) is the proxy most teams have already touched. It is free to license, speaks 100+ model SDKs, and ships per-key budget primitives. As a starting point for self-hosted control, adoption is hard to beat.

Two things belong in a procurement file. Community load tests report latency spikes past four minutes at 500 RPS and effective unusability near 5,000 RPS, so production operation needs Redis, PostgreSQL, and load balancers. That is real, recurring cost behind the "free" label. And on 2026-03-24, LiteLLM suffered a PyPI supply-chain poisoning: releases 1.82.7 and 1.82.8 shipped backdoored code that exfiltrated SSH keys, cloud credentials, and API keys. An open-source proxy is inherited supplier risk that has to be managed with version pinning and release audits, not assumed away.

Best fit: prototypes and small production setups where DevOps capacity exists and the team pins and audits upstream releases.

5. Cloudflare AI Gateway: the free edge starting point

Cloudflare AI Gateway is the lowest-friction way to get basic AI cost visibility and monitoring in place. It is free with a Cloudflare account and one line of code (Cloudflare AI Gateway docs), runs at the edge, and logs requests, tokens, costs, errors, and latency across providers. Response caching can cut latency up to 90% and reduce repeat-call cost, and rate limiting is built in. Inference passes through at provider rates, though the 2026 Unified Billing path adds a 5% fee on credits bought through Cloudflare.

It does not do the finance job. There is no per-owner attribution, no budget circuit breaker beyond rate limits, and no efficiency grading. It is also locked to the Cloudflare ecosystem, which a procurement team reads as concentration risk if the rest of your stack lives elsewhere.

Best fit: teams already shipping on Cloudflare Workers or Pages who want free monitoring and caching with no new vendor onboarding.

6. OpenRouter: the model marketplace

OpenRouter leads on variety: 500+ models through one API, plus a BYO-KEY tier covering 60+ providers with the first 1 million requests per month free. For trying many models without onboarding at each provider, it removes a lot of friction.

The cost-governance story is thin. The Activity page shows request counts, token totals, and a rough spend figure, with no per-owner attribution, no enforcement, and no efficiency grading. Pay-as-you-go adds a 5% markup on routed requests, and the credit-purchase path adds 5% plus 5.5%. At $10,000 a month in spend, the markup alone is $500.

Best fit: developers prototyping across model families, or reaching providers without standing up direct accounts.

7. TrueFoundry: the gateway inside an MLOps platform

TrueFoundry bundles an AI gateway inside a full MLOps platform that also handles model deployment, experiment tracking, and inference. It carries enterprise compliance (SOC 2, HIPAA, GDPR), supports hybrid cloud, and benchmarks well on latency. If your org deploys its own fine-tuned weights and wants the gateway in the same surface, the consolidation is real.

For a finance leader whose only job is the AI API line, it is a heavy buy. Entry pricing starts around $499 a month, the buyer persona is ML engineering, and cost governance is one layer in a much larger stack. You are approving a platform, and the gateway rides along.

Best fit: ML engineering teams that deploy models and want gateway, evaluation, and deployment under one vendor.

8. Kong AI Gateway: the enterprise API-management route

Kong ships an AI Gateway module on top of its established API-management platform, so enterprises already running Kong can add AI traffic with no new procurement. It keeps audit logs and metrics exporters with standardized usage analytics across providers, plus tamper-evident audit trails on its agent-to-agent surface, and its 2026 roadmap adds a semantic policy engine and reasoning-trace telemetry (Kong AI Gateway docs). On raw throughput it is a benchmark leader.

The fit problem is shape and price. Kong is API governance broadly, with AI-specific cost intelligence still thin: no per-model finance dashboards, no AI-native budget escalation. Pricing is enterprise, around $100 per model per month, with no self-serve tier. A finance lead at a startup will not get a published number here.

Best fit: large enterprises already standardized on Kong who want AI traffic under the same governance and audit umbrella.

9. Vercel AI Gateway: the built-in option for Next.js teams

Vercel AI Gateway is the zero-setup choice if you already deploy on Vercel. It supports BYO-KEY with no markup on BYOK requests, routes hundreds of models, and integrates natively with the Vercel AI SDK (Vercel AI Gateway BYOK docs). Every team gets $5 of gateway credits a month to experiment.

Two things limit it for finance. The feature set is thin on governance: no per-owner attribution, no budget circuit breaker, no efficiency grading. And even on BYOK you must keep a credit balance, because failed credentials fall back to system credits that bill against your balance. It is also platform-locked, so apps on Railway, Render, Fly.io, AWS, or self-hosted infrastructure cannot use it.

Best fit: Next.js and Vercel-deployed teams that want a built-in gateway and accept the platform lock-in.

Comparison Table

Gateway	BYOK custody	Token-level audit log	Real-time budget guardrails	Per-owner attribution	Published pricing	Lock-in
Alephant	AES-256 vault, RLS isolation	Yes, no prompt body	Budget Circuit Breaker 50/75/90/100%	Member, Agent, Department	$0 / $29 / $79 / $499+	None, remove anytime
Helicone	Supported	Request traces	No circuit breaker	Session-level	$0 / $79 +5% markup	Low
Bifrost	Self-host	Audit logs (self-run)	Hierarchical budgets	Virtual-key, team, customer	Free (self-host)	None (open source)
LiteLLM	Self-host	Logs (self-run)	Per-key budgets	Per-key	Free (self-host)	None (open source)
Cloudflare AI Gateway	Yes	Request and token logs	Rate limit only	No	$0 (Cloudflare acct)	High (Cloudflare)
OpenRouter	Server-side BYOK	Activity log	No	No	Pay-as-you-go +5%	Medium
TrueFoundry	Hybrid / self-host	Yes	Basic	Limited	~$499/mo	Low
Kong	Enterprise	Audit logs + metrics exporters	Policy and rate limits	Limited (AI-thin)	~$100/model/mo	Medium
Vercel AI Gateway	BYOK (credits required)	Request logs	No	No	$5/mo credits	High (Vercel)
Portkey (baseline)	Key vault	Logs (100K, 30-day)	Enterprise only	Enterprise only	$0 / $49 / Custom	Medium

How to Choose by Buyer

If you own the finance line. You want attribution you can charge back and a budget that stops overspend live. Alephant resolves spend across Member, Agent, and Department and enforces the Budget Circuit Breaker at 50/75/90/100%. Bifrost ships hierarchical budgets in its open-source release if you have the team to run it. Portkey gates granular budget escalation to Enterprise Custom Pricing.

If you run procurement or security review. Lead with three questions: where do the keys live, what does the audit log record, and is there a published price. Alephant answers all three (own AES-256 vault, token-level log with no prompt body, public pricing through Enterprise). Cloudflare AI Gateway and Vercel AI Gateway are cheap but platform-locked, which reads as concentration risk. LiteLLM is free but carries the open-source supplier risk the 2026-03-24 PyPI incident made concrete.

If you are engineering-led with DevOps capacity. Bifrost is the throughput and feature pick among open-source gateways, with LiteLLM as the broader-adopted but slower baseline. Budget for the deployment and the release audits either way.

If you are already on a platform. Teams on Cloudflare can start with Cloudflare AI Gateway, and Next.js teams on Vercel can start with Vercel AI Gateway. Layer a BYO-KEY gateway with attribution and enforcement on top once the AI line item gets large enough to defend.

Frequently Asked Questions

What is the best Portkey alternative for AI cost visibility?

Alephant is the strongest Portkey alternative when AI cost visibility and monitoring is the goal. Portkey gates per-member budget attribution, Granular Budget and Rate Limits, and compliance certificates to Enterprise Custom Pricing, and its $49 Production tier meters in logs rather than requests. Alephant ships per-member cost attribution across Member, Agent, and Department on Pro at $29 a month, a token-level audit log, the Budget Circuit Breaker with 50/75/90/100% escalation, and a published price for every tier through Enterprise at $499 and up. The Free tier covers 10,000 requests with no credit card.

Which AI gateway supports BYO keys and budget guardrails?

Alephant supports both bring your own API keys (BYOK) and real-time budget guardrails and spend limits in one gateway: provider keys stay in your own AES-256 vault under BYO-KEY, and the Budget Circuit Breaker enforces a configured budget at 50/75/90/100% with Alert, Throttle, and a hard stop. Bifrost also pairs self-hosted BYOK with hierarchical budgets in its open-source release, and Helicone and LiteLLM support BYOK but stop short of a real-time circuit breaker. Portkey supports BYOK in a key vault but gates granular budget escalation to its Enterprise Custom Pricing tier.

What is a BYOK AI gateway?

A BYOK AI gateway is a proxy that routes your application's calls to model providers through one API while using your own provider keys instead of reselling access. BYOK stands for bring your own API keys. It captures usage and cost for every request, applies spend limits before the call reaches the provider, and keeps your provider relationship and discounts in your own account. It differs from an observability tool, which treats cost as one column in a trace, and from a cloud FinOps suite, which reconciles invoices after the billing cycle closes.

Do BYOK AI gateways keep token-level audit logs?

Some do, and the detail varies a lot. Alephant logs every request at the token level (timestamp, user, department, model, tokens, cost, status) and never stores the prompt body, then exports the record to CSV for compliance. Kong keeps audit logs and usage analytics across providers, with tamper-evident trails on its agent-to-agent surface. Portkey logs at 100,000 logs a month on its $49 tier with 30-day retention. Self-hosted gateways like Bifrost and LiteLLM write audit logs you store and retain yourself. For a security review, confirm whether the prompt body is logged, since that changes the data-exposure profile.

Are there free Portkey alternatives?

Yes. Cloudflare AI Gateway is free with a Cloudflare account and gives basic cost and usage monitoring. Vercel AI Gateway includes $5 of credits a month. LiteLLM and Bifrost are free to self-host under open-source licenses, though production operation carries real DevOps cost. Alephant's Free tier includes 10,000 requests, BYO-KEY custody, per-member cost attribution, and budget-safety guardrails with no credit card.

What does BYOK mean for a procurement or security review?

BYOK (bring your own API keys) means the gateway uses your provider credentials and never issues its own or resells model access. For a review, the questions that matter are whether keys are encrypted at rest and in transit, whether the prompt body is ever logged, and whether zero data access is the default rather than an Enterprise upsell. Alephant stores keys in an AES-256 vault with workspace isolation through row-level security and lets you remove the gateway anytime to revert to direct provider calls, which keeps both your provider discounts and your audit trail in your own account.

Which Portkey alternatives publish their pricing?

Alephant publishes every tier: Free at $0, Pro at $29, Team at $79, Enterprise at $499 and up. Helicone publishes $0 and $79 plus a 5% markup. OpenRouter publishes pay-as-you-go with a 5% markup. Bifrost and LiteLLM are free to self-host. Kong and Portkey route enterprise buyers to Custom Pricing, so the number a procurement team can compare requires a sales call.

The Bottom Line

Portkey is a strong enterprise control plane. The reason teams replace it is rarely a missing feature. It is that the controls a finance lead and a procurement reviewer ask for, namely member-level budgets, granular escalation, compliance certificates, and a published price, sit behind a Custom Pricing call.

A BYO-KEY AI gateway built for that brief keeps provider keys in your own vault, writes a token-level audit log you can hand an auditor, enforces a budget in real time, and prints a price you can approve. Alephant was built for exactly that. It launched publicly on 2026-05-12 with the runtime open-sourced under GPL v3, logs every request without storing the prompt body, ships per-member cost attribution and the Budget Circuit Breaker, and publishes pricing end to end.

If your next AI invoice is on track to be your largest, the Free tier is enough to attribute a week of production traffic and read the audit log a provider dashboard will never give you. Start at alephant.io, self-host the runtime from the Alephant org on GitHub, or join the Alephant Discord, where the team builds in public and answers cost-architecture questions.