Proxyify - One API Key. Every AI Model.
AI Gateway

One key.
Every AI model.

No provider accounts. No API juggling.
Access 100+ models with a single endpoint.

JK
SR
AL
MP
+
Trusted by 500+ developers
scroll

Models

100+ models.
One endpoint.

Switch between providers without changing a line of code.

OpenAI
Claude
Gemini
Llama
Mistral
Cohere
DeepSeek
Groq
Perplexity
xAI Grok
OpenAI
Claude
Gemini
Llama
Mistral
Cohere
DeepSeek
Groq
Perplexity
xAI Grok
Gemini 2.0 Flash
GPT-4o
Claude 3.7 Sonnet
Llama 3.3 70B
Mistral Large
DeepSeek-R1
Command R+
Groq Llama3
Gemini 1.5 Pro
o3-mini
Claude Haiku
Gemini 2.0 Flash
GPT-4o
Claude 3.7 Sonnet
Llama 3.3 70B
Mistral Large
DeepSeek-R1
Command R+
Groq Llama3
Gemini 1.5 Pro
o3-mini
Claude Haiku


Dashboard

Full visibility into
every API call

Real-time logs, usage analytics and credit tracking - all in one place.

proxyify.dev/dashboard

Total Requests

2.4M

↑ 18% this week

Credits Used

$1,247

of $5,000 balance

Active Keys

3

across 2 projects

Avg Latency

142 ms

↓ 8 ms below avg

API Requests

Last 7 days

Weekly
Mon
Tue
Wed
Thu
Fri
Sat
Sun

Model Usage

38% GPT-4o
GPT-4o38%
Claude 3.724%
Gemini 2.019%
Others19%

Request Log

Live activity

Live
KeyModelTokensCreditsLatencyStatus
Smart Suggest

Cost Optimizer

Last 100 requests using GPT-4o. Switch to DeepSeek R1:

64%

cost saved · low quality diff

Configure Rules →



Platform capabilities

Built for security.
Designed for speed.

Every AI model, one key — with the enterprise security controls your OpenAI key never had.

POST /v1/chat/completions
Text · Image · Video · Audio
100+ models
Change one line
"https://api.openai.com/v1"
"https://proxyify.dev/v1"
openai SDK LangChain Vercel AI LlamaIndex

01 — Access & Integration

One key. 100+ models. Zero setup.

No provider accounts, no API key juggling. Register once and every model — text, image, video, TTS, STT — is instantly available. Swap your base_url and every existing SDK works unchanged.

100+ models all modalities OpenAI-compatible instant access
IP Whitelist 10.0.0.0/24
Origin Lock myapp.com
Blocked Countries RU · CN · KP
Active Hours 09:00–18:00 UTC
Spending Limit 500 cr / month
Max Tokens 2,048 / req
Expires At 2025-12-31
Environment production
Safe to embed in frontend code

02 — Security

API Key Controls You Won't Find Elsewhere

Every key is a security policy. Lock it to IPs, domains, or countries. Set the hours it's active. Cap spending per day, week, or month. Limit tokens per request. Add an expiry date. Tag it as production, staging, or dev. None of this exists anywhere else.

IP whitelist origin lock country block active hours spending cap token limit key expiry env tag
Permanent key
prx-xxxxx
backend only
↓ generates
Short-lived token
bt-yyyyy
1h TTL

Inherits all key restrictions · Safe in browser & mobile

03 — Mobile & Browser

Ephemeral tokens for client-side apps

Generate short-lived bt- tokens from your permanent key and hand them to browsers or native apps. They expire automatically — and they inherit every IP, origin, and spending restriction of the parent key.

configurable TTL auto-expire inherits restrictions zero leak risk
API response · _proxyify
credits_used 12.40
model_used gpt-4o
latency_ms 843
cheaper_model claude-haiku-3.5
↓ 83% cheaper · same capability

04 — Cost Intelligence

Every response tells you how to spend less

Every API response includes a _proxyify.cheaper_model field — the cheapest model with equivalent capabilities that could have handled your request. Credits used, cost in USD, and latency are also returned on every call.

cheaper_model hint credits_used cost_usd latency_ms
"stream": true
data: {"token": "The"}
data: {"token": " answer"}
data: [DONE]
Per-user tracking
X-Proxyify-User-Id: user_7a3f
user_7a3f38 req · 120 cr

05 — Streaming & Observability

SSE streaming and per-user request logs

Add "stream": true for real-time token delivery. Pass X-Proxyify-User-Id to track exactly who in your app is spending what — filterable and CSV-exportable from your dashboard.

SSE streaming all text models per-user logs CSV export


Pricing

Pay for what you use.

Buy credits once, use them anytime. No subscriptions, no resets.

Free

$0

500 credits to start

Get started free
  • 20 requests / minute
  • 200 requests / day
  • No credit card required

Starter

$9

12000 credits

Buy credits
  • 100 requests / minute
  • No daily cap
  • Email support
Most popular

Pro

$29

30000 credits

Buy credits
  • 300 requests / minute
  • No daily cap
  • Priority email support

Scale

$99

120000 credits

Buy credits
  • Unlimited requests / minute
  • No daily cap
  • Priority email support

Included in every plan

100+ models, all modalities
SSE streaming
Origin & IP key locking
Ephemeral tokens
Per-key model allowlists
Active hours & country block
Key expiry & environment tags
Webhook & credit alerts
Request logs & insights
Credits never expire

Pay per token, not per month  ·  Credits never expire  ·  No commitment  ·  Larger packs, better value



FAQ

Common questions

Everything you need to know before getting started.

Proxyify is an AI gateway that gives you a single API endpoint to access 100+ AI models — text, image, video and audio — from providers like OpenAI, Anthropic, Google, Meta and more. You manage one key, we handle routing, billing and monitoring. No provider accounts needed.

With direct provider APIs you need separate accounts, keys and billing for every provider. Proxyify unifies everything: one key, one endpoint, one dashboard. You also get per-key spending limits, IP/origin restrictions, country blocking and real-time logs — features individual providers don't offer.

No. You only need a Proxyify account. We handle the provider relationships on our end — you just buy credits and start making requests. This is the core difference from tools like Portkey, LiteLLM or Helicone, which all require you to bring your own provider keys.

Yes. Proxyify is fully compatible with the OpenAI SDK. Just change the base_url to point to Proxyify — no other code changes required. Python openai, JavaScript openai, LangChain and LlamaIndex all work out of the box.

Credits never expire — they stay in your account until used. New accounts start with 500 free credits, no credit card required. After that, you top up with a one-time credit pack whenever you need to. Usage is charged per request based on the model's token, second or character pricing.

Yes. Each key supports: allowed IP addresses (with CIDR ranges), allowed HTTP origins, country blocking, model allowlists, category locks (e.g. text-only), time-based access windows, spending caps and key expiry (TTL). Requests that violate any rule are rejected before consuming any credits.

Rate limits are plan-based. Free accounts are limited to 20 requests per minute. Starter accounts get 100 RPM, Pro gets 300 RPM and Scale has no platform-level limit. All plans also benefit from SSE streaming support.

No. We never log or store prompt content or model responses. Your request logs only contain metadata: model used, token count, credit cost, latency and status code. Your data stays between you and the model provider.

Sign in with Google, create your first API key from the dashboard and make your first request to POST /v1/chat/completions. No credit card required. The quickstart guide in our docs walks you through a working example in under 5 minutes.