The Portkey alternative

The AI cost optimization gateway
Portkey customers are switching to.

Palo Alto Networks bought Portkey on April 30, 2026. Portkey's roadmap is now security-driven (Prisma AIRS), not cost-driven. If you chose Portkey to cut your AI bill — not because you needed enterprise AI security governance — Trimio is built for the lane Portkey just left.

The category split

After the PANW deal, AI gateway
isn't one category anymore.

Three distinct gateways, three distinct buyers. Match yours.

Security-first
For CISO budgets
Bought as a security suite extension
Portkey/PANW, Gravitee, SlashLLM. Compliance and runtime security are the headline. Cost optimization is not the roadmap.
Performance-first
For platform engineering
Open-source adoption first, paid tier later
LiteLLM, Bifrost, Envoy AI Gateway. Broad provider coverage, fast routing. Limited cost intelligence; recent CVEs on the leading project.
Economics-first
For FinOps and CFO offices
Bought as budget governance and savings share
Trimio is the only gateway built end-to-end for AI cost reduction. Compression + ML-based routing + provider-cache maximization, with measured savings in dollars, not estimates.
Side by side

Trimio vs. Portkey
where the bill actually gets cut.

Both platforms cover the gateway basics — virtual keys, logging, RBAC, SSO. The difference is where savings come from.

Capability Trimio Portkey (PANW / Prisma AIRS)
Cost mechanisms
Prompt compression engineStrip stale tool results, redundant system context, non-load-bearing chunks. Cache-anchor preserving. Yes — 18–30% savings on agentic, patent-pending No equivalent
Least-Cost Routing (LCR)Real-time per-request model selection across capability tiers. Yes — ML scorer + 9 condition types, ~44% savings on Moderate preset, 93% quality PASS Basic conditional routing rules — no ML, no quality validation, no savings dashboard
Provider cache maximizationAnchor tracking and incremental tail compression to preserve Anthropic prompt-cache hits across agentic sessions. Yes — 70–90% prefix hit on agentic, 81% reduction on re-reads Basic response cache, no provider-side anchor tracking
FinOps budget cap on routingHard ceiling — never reroutes more than N% of a key's traffic in a 24h window. Yes — operator-configurable safety valve No equivalent
Pricing
Pricing model Savings share — 20% of documented savings. You pay nothing until we save you something. Per-seat / per-request subscription. Pay regardless of outcome.
Alpha terms No platform fee for 90 days, 20% savings share only Standard enterprise contract pricing
Architecture
SQL injection surfaceLiteLLM and others in the Python/JS proxy space have shipped two critical pre-auth SQLi CVEs in 60 days (CVE-2026-42208). No web-facing PG surface on LLM API routes. All SQL parameterized. PR-time lint gate. Not directly affected by LiteLLM CVE; specific posture not publicly disclosed
Quality assurance
Live quality monitor5-dimension judge (accuracy, completeness, coherence, instruction-following, fidelity) scoring real production traffic against reference. Yes — async reference call, zero client latency impact, 7-day rolling alerts No equivalent
Operator-runnable eval frameworkCustomer points the eval at their own traffic with their chosen judge model before any production routing change. Yes — 16-workload corpus, customer-configurable No equivalent
Roadmap direction
Product focus AI cost optimization (FinOps + budget governance) Now AI runtime security (Prisma AIRS integration)
Buyer CFO, FinOps, VP Engineering CISO (post-PANW)
The math

What 30–50% off your bill looks like.

Conservative 35% blended savings. Trimio takes 20% of documented savings. You keep 80%. Numbers below assume current monthly spend with no other changes.

$10K / mo today
$2,800 / mo
Net retained after Trimio fee. $33,600/year back to your team.
Trimio share: $700/mo. Annual share: $8,400. No platform fee in alpha.
$50K / mo today
$14,000 / mo
Net retained. $168,000/year reclaimed. Pays for itself many times over.
Trimio share: $3,500/mo. Annual share: $42,000. No platform fee in alpha.
$100K / mo today
$28,000 / mo
Net retained. $336,000/year reclaimed. Documented monthly, reconcilable against your provider invoices.
Trimio share: $7,000/mo. Annual share: $84,000. No platform fee in alpha.
Common questions from Portkey customers

What changes when you switch.

How long does the switch take?
One URL change in your client config. Same OpenAI / Anthropic / Bedrock SDKs, same model IDs, same request shapes. Most customers are live in 30 minutes.
Do I lose features by leaving Portkey?
Trimio covers the gateway basics — virtual keys, RBAC, SSO, audit log, observability, PII detection. Where Portkey had stronger guardrail tooling for security review, that's now the PANW Prisma AIRS lane and a separate procurement. If your reason for Portkey was cost, Trimio is the upgrade.
What about my Portkey contract?
Run Trimio in parallel on a single virtual key during the 2-week shadow evaluation. Measure the savings on your own traffic against your existing setup before any commercial decision. We don't ask for contract changes until the numbers are yours.
What does "savings share" actually mean?
You pay 20% of the documented savings each month. If we save you nothing, you pay nothing. If we save you $50K, you pay $10K and keep $40K. We only make money when you save money — and the savings are visible in the dashboard, reconcilable against your provider invoices.
Is Trimio SOC 2 attested?
SOC 2 Type I observation date is October 2026. Currently at ~88-92% readiness per internal self-assessment, modeled on SSAE 18. Full readiness report and 12+ supporting policies available under NDA for procurement diligence.
What if the PANW deal doesn't close?
Expected close July 31, 2026. Whether or not it closes on time, Portkey's roadmap is already publicly committed to AI runtime security via Prisma AIRS. The cost-optimization direction is set. The migration question is when, not if.
Next step

See the savings math on your traffic.

20 minutes. No pitch deck. We run a savings estimate against a representative slice of your usage pattern and show you the numbers. Free during alpha.