The growth engine for AI
Price AI agents with confidence, know your margins, and bill for the value your customers get.
20+
Supported chat models
15 min
Median setup time
6
Providers routed
POST /chat/completionsProviders and models
Keep your customers on the best model for every request
Cephas abstracts providers so you can switch models instantly without breaking your API contract.
One endpoint for every model
Route chat traffic across OpenAI, Claude, and Gemini with a single API and a shared message format.
Company margins by default
Assign a margin per company or per model so every request reflects your pricing strategy.
Customer API keys
Customers generate a key once and reuse it across companies while you retain routing control.
Secure routing
Keys are encrypted, traffic is protected, and you control which providers are active per company.
Model-aware billing
Usage is tracked per model with clear cost breakdowns and margin totals for each company.
From signup to routed traffic in three steps
Companies configure providers and margins, customers use a single API key, and every call flows through Cephas.
Company setup
Create a company profile
Set default margin, connect providers, and enable the models you want to expose to customers.
Acme Inc · $0.50 / 1k tokens · gpt-4o, gemini-1.5-pro
Customer access
Issue customer API keys
Customers generate their API key once and reuse it across all the companies they work with.
API Key · cph_••••••••••••••••
API calls
Route every chat request
Send the API key plus company ID. Cephas handles routing, usage tracking, and margin billing.
POST /chat/completions with API key + company ID
Base model rates plus your margin
Pay only for usage while you keep margin on every company. No platform fees or long-term contracts.
Usage-based pricing
We bill at the provider model rate. You set a margin per company and keep the spread.
Example margin
Gemini 1.5 Pro
+$0.50 / 1k tokens
Launch plan
$20
Platform fee per month
Wrap every model behind one company-ready API
Cephas gives you unified routing, usage visibility, and margin control so you can scale AI features without unpredictable costs.