Models & pricing

The catalog, the providers behind it, fallback chains, and how per-token pricing works.

The catalog

Conduix curates a catalog of production-grade models across eight providers. Browse the full live catalog at /dashboard/models — each row shows the model id, provider, context window, and the exact per-1M-token rate you pay.

OpenAI

gpt-4o, gpt-4o-mini, o3, o4-mini

Anthropic

claude-opus-4, claude-sonnet-4, claude-haiku-4-5

Google

gemini-2.5-pro, gemini-2.5-flash

Groq

llama-3.3-70b, llama-4-maverick

Together

qwen-3-72b-instruct, llama-4-maverick

Mistral

mistral-large-2, codestral-2024

DeepSeek

deepseek-v3, deepseek-r1

Fireworks

firefunction-v2, llama-4-maverick

How pricing works

Transparent per-token pricing. The rate at /dashboard/models is what you pay — all-in. Credits behave like dollars (1 credit ≈ $1). No per-seat fees, no surprise overages.

Pay-as-you-go

Buy credits in advance, spend as you go

Auto-reload

Top up automatically when you hit a threshold

Subscription plans

Explorer, Builder, Scale — see /pricing

Volume discounts

Custom unit pricing for committed volume

Enterprise

NET-30 invoice, POs, custom MSAs

Calling a model

Use the model id in the standard OpenAI request body. Conduix routes to the correct provider based on the id.

python

client.chat.completions.create(
    model="claude-haiku-4-5-20251001",  # → routed to Anthropic
    messages=[{"role": "user", "content": "Summarize…"}],
)

Automatic fallback chains

Each model has a curated fallback list. If the primary fails (provider down, circuit breaker tripped, rate-limited upstream), Conduix retries against the next entry — invisibly. Your customer never sees the failure.

text

Requested:  gpt-4o-mini
Fallback chain:
  1. gpt-4o-mini      (openai)
  2. claude-haiku-4-5 (anthropic)
  3. gemini-2.5-flash (google)

When a fallback served the request, the response includes x-conduix-fallback: true and x-conduix-model-served shows the actual model used. See Observability.

Restricting which models a key can call

Per-key allowlists keep developers in their lane (e.g. internal tools can't accidentally call frontier-tier models). Configure in the dashboard or via the keys API:

json

{
  "name": "Internal tools",
  "allowed_models": ["gpt-4o-mini", "claude-haiku-4-5-20251001"],
  "rate_limit": 60
}

Using a non-catalog model

Need a model Conduix doesn't curate? Or your own self-hosted model? Use BYO endpoints: register your own OpenAI-compatible URL and call model: "byo:<slug>/<your-model>".

Tiers

Models are grouped into tiers for governance. You can build allowlists at the tier level rather than naming each model:

frontier

Newest, top-performing, most expensive (e.g. claude-opus-4, o3)

premium

Strong general-purpose (gpt-4o, claude-sonnet-4, gemini-2.5-pro)

mid

Cost-effective workhorses (gpt-4o-mini, claude-haiku-4-5)

budget

Cheapest — open-weight via Together/Groq/DeepSeek

Authentication Billing & credits