LiteLLM

Route LiteLLM calls through Chutes.

LiteLLM can send OpenAI-compatible chat requests to Chutes when api_base points at https://llm.chutes.ai/v1 and the key is Bearer-compatible.
OpenAI-compatible guideConnect draftEndpoint guide
Status
OpenAI-compatible client
Base URL
https://llm.chutes.ai/v1
Live model example
Qwen/Qwen3-32B-TEE
Catalog checked
Jun 25, 2026; context 40K

Full setup walkthrough

Everything you need is on this page — no need to leave for the toolkit. Each step is copy-paste runnable and uses a live model id from the catalog.

  1. 01

    Install LiteLLM

    bash
    pip install litellm
  2. 02

    Call the native Chutes provider

    LiteLLM ships a native chutes_ai/ provider that already knows the base URL — no api_base needed.
    litellm.completion
    python
    import os
    import litellm
    
    response = litellm.completion(
        model=f"chutes_ai/{os.environ.get('CHUTES_MODEL', 'Qwen/Qwen3-32B-TEE')}",
        api_key=os.environ["CHUTES_API_KEY"],
        messages=[{"role": "user", "content": "Say hello in one sentence."}],
    )
    print(response.choices[0].message.content)
  3. 03

    Provide the API key

    Store your cpk_ key as CHUTES_API_KEY in the environment or a secret store. Send it as Authorization: Bearer — never X-API-Key, which inference ignores.
    bash
    export CHUTES_API_KEY=cpk_...

The framework contract

Set
base_url / baseURL / api_base
Value
https://llm.chutes.ai/v1
Set
model
Value
Qwen/Qwen3-32B-TEE from the live catalog

Troubleshooting

Symptom
The model ID fails
Likely cause
The catalog changed or the client cached model metadata.
Fix
Refresh /v1/models and update the config.
Symptom
Auth falls into anonymous limits
Likely cause
The key is missing or not sent as Bearer auth.
Fix
Export CHUTES_API_KEY=cpk_... and send Authorization: Bearer.
Symptom
Routing aliases fail
Likely cause
Saved default aliases need a dashboard Model Routing pool.
Fix
Use a concrete model ID or an inline comma-separated pool first.