Endpoint · hosted · waitlist

Flat-rate capacity for high-volume AI work.

Chompute Endpoint is the cost-reduction lane behind the control plane. Point Claude Code, Codex, or any OpenAI-compatible client at one URL, then route repeated workloads to predictable capacity instead of a per-token meter.

OpenAI compatible

Change the base URL. Keep the SDK.

Chompute speaks the OpenAI protocol end-to-end. Swap the endpoint, set your Chompute API key and model name, and your existing tools, agents, and libraries keep working as before.

Python OpenAI SDK
import os
from openai import OpenAI

# Read the key from the environment and point the SDK at Chompute.
client = OpenAI(
    api_key=os.environ["CHOMPUTE_API_KEY"],
    base_url="https://chompute-services.dragonfruit.ai/openai/v1",
)

# Same chat completions call as with any OpenAI-compatible backend.
completion = client.chat.completions.create(
    model="chompute-lite",
    messages=[
        {"role": "user", "content": "Run this agent through Chompute Endpoint."},
    ],
)

print(completion.choices[0].message.content)

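For clients that do not use the SDK, the same call can be sketched at the HTTP level. The example below only builds the request and does not send it. It assumes Chompute follows the standard OpenAI chat completions shape (a POST to `/chat/completions` under the base URL, authenticated with a bearer token), which is what "OpenAI compatible" normally implies. The helper name `build_chat_request` is illustrative.

```python
import json
import urllib.request

# Base URL and model name are taken from the SDK example above.
BASE_URL = "https://chompute-services.dragonfruit.ai/openai/v1"


def build_chat_request(
    api_key: str, prompt: str, model: str = "chompute-lite"
) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completions request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # The key here is a placeholder, not a real credential.
    req = build_chat_request("sk-placeholder", "Run this agent through Chompute Endpoint.")
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send it. Any HTTP client that can set these two headers and post JSON should behave the same way, assuming the endpoint matches the OpenAI contract.
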
Plans

Pick quality. Pick speed. Keep spend predictable.

Quality is the model grade. Speed is how quickly your requests come back. The goal is simple: move the right work to a flat-rate lane instead of letting token usage surprise the company.

01 · AI quality
02 · AI speed
Your plan

Advanced · Standard

$100/month
Unlimited use · per seat
Join the waitlist
No per-token billing. No overages. Ever.
All combinations
Endpoint quality and speed matrix
Quality grades (rows): Best, Advanced, Regular
Speed tiers (columns): Fastest, Fast, Standard

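To judge whether a flat rate pays off, compare it with what the same volume would cost on a per-token meter. A rough sketch follows. The $100 per seat per month figure comes from the plan above, while the per-million-token price is a made-up placeholder, so substitute the blended rate you actually pay.

```python
FLAT_RATE_PER_SEAT = 100.0  # USD per month, from the plan above


def break_even_tokens(
    price_per_million_tokens: float, flat_rate: float = FLAT_RATE_PER_SEAT
) -> float:
    """Monthly tokens per seat at which metered cost equals the flat rate."""
    if price_per_million_tokens <= 0:
        raise ValueError("price_per_million_tokens must be positive")
    return flat_rate / price_per_million_tokens * 1_000_000


def metered_cost(tokens: float, price_per_million_tokens: float) -> float:
    """Cost in USD of `tokens` on a per-token meter."""
    return tokens / 1_000_000 * price_per_million_tokens


if __name__ == "__main__":
    # Hypothetical blended price of 5 USD per million tokens (an assumption).
    price = 5.0
    print(f"Break-even: {break_even_tokens(price):,.0f} tokens per seat per month")
```

At that assumed price, a seat that uses more than 20 million tokens a month would cost more on the meter than on the flat rate.
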
Who it is for

For teams whose AI usage is starting to compound.

If coding agents, API jobs, or internal automations are becoming a real budget line, Endpoint gives Chompute a lower-cost lane for suitable work.

Engineering teams

Route heavy agent loops away from per-token billing.

Use the same coding tools and OpenAI-compatible clients while Chompute moves suitable high-volume work to a predictable endpoint.

Platform teams

Turn repeated AI work into a managed cost lane.

Pair endpoint access with Chompute policy so premium models stay available for important work and routine workloads take the cheaper path.

Claude Code setup

Use Chompute as your Claude Code endpoint.

Get an Access Key, run one setup command, choose project-only or all Claude Code sessions, then keep working from your terminal while Chompute handles routing and usage visibility.

Open setup guide
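The setup command itself is covered in the setup guide and is not reproduced here. As a rough illustration of what the project-only versus all-sessions choice could correspond to, the sketch below assumes the setup ends up writing Claude Code's gateway environment variables (`ANTHROPIC_BASE_URL`, `ANTHROPIC_AUTH_TOKEN`) into either the project's `.claude/settings.json` or the user-level `~/.claude/settings.json`. The gateway URL is a placeholder and the helper names are hypothetical; none of this is Chompute's actual tooling.

```python
import json
from pathlib import Path

# Placeholder only: the real Claude Code gateway URL comes from the setup guide.
PLACEHOLDER_GATEWAY_URL = "https://gateway.example.invalid"


def settings_path(scope: str, project_dir: Path, home_dir: Path) -> Path:
    """Return the Claude Code settings file for the chosen scope."""
    if scope == "project":
        return project_dir / ".claude" / "settings.json"
    if scope == "all":
        return home_dir / ".claude" / "settings.json"
    raise ValueError(f"unknown scope: {scope!r}")


def write_gateway_settings(
    path: Path, access_key: str, base_url: str = PLACEHOLDER_GATEWAY_URL
) -> dict:
    """Merge gateway env vars into the settings file, keeping other keys."""
    settings = json.loads(path.read_text()) if path.exists() else {}
    env = settings.setdefault("env", {})
    env["ANTHROPIC_BASE_URL"] = base_url
    env["ANTHROPIC_AUTH_TOKEN"] = access_key
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(settings, indent=2))
    return settings
```

A real setup would likely keep the Access Key out of version control, for example by reading it from the environment instead of committing it in a project file.
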
Routing

One endpoint can still make policy-aware routing decisions.

The gateway keeps the client contract simple while Chompute decides when work should stay premium, move to flat-rate endpoint capacity, or follow a fallback path.

Diagram: Gateway live routing
Request: POST /v1/chat › Refactor auth middleware across 12 files
Labels: AI Gateway; Team A (flat), Team B (flat), Endpoint (flat), Premium (keep), Fallback (safe); Approved Provider
Result: routed → endpoint · flat-rate · policy route · openai-compat

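The three outcomes described above (stay premium, move to flat-rate endpoint capacity, or follow a fallback path) can be pictured as a small decision function. This is only a sketch of the idea: the `Job` fields, the rule order, and the route names are assumptions for illustration and do not describe Chompute's actual policy engine.

```python
from dataclasses import dataclass
from enum import Enum


class Route(Enum):
    PREMIUM = "premium"    # keep the work on a premium model
    ENDPOINT = "endpoint"  # move the work to flat-rate endpoint capacity
    FALLBACK = "fallback"  # safe path when the preferred lane is unavailable


@dataclass
class Job:
    description: str
    important: bool = False          # hypothetical policy flag
    endpoint_available: bool = True  # hypothetical health signal


def choose_route(job: Job) -> Route:
    """Pick a lane for one request using a toy policy."""
    if job.important:
        return Route.PREMIUM
    if job.endpoint_available:
        return Route.ENDPOINT
    return Route.FALLBACK


if __name__ == "__main__":
    job = Job("Refactor auth middleware across 12 files")
    print(f"routed -> {choose_route(job).value}")
```

Because the client only ever sees one URL, a policy like this can change on the gateway side without any change to the calling code.
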
Waitlist

Get early access.

Tell us about your workload in the waitlist form. We will follow up with endpoint availability, pricing details, and the policy-aware migration path for your stack.