LiteLLM
Route LiteLLM calls through Chutes.
LiteLLM can send OpenAI-compatible chat requests to Chutes when api_base points at https://llm.chutes.ai/v1 and the key is Bearer-compatible.
Status
OpenAI-compatible client
Base URL
https://llm.chutes.ai/v1
Live model example
Qwen/Qwen3-32B-TEE
Catalog checked
Jun 25, 2026; context 40K
Full setup walkthrough
Everything you need is on this page — no need to leave for the toolkit. Each step is copy-paste runnable and uses a live model id from the catalog.
- 01
Install LiteLLM
bashpip install litellm - 02
Call the native Chutes provider
LiteLLM ships a nativechutes_ai/provider that already knows the base URL — noapi_baseneeded.litellm.completionpythonimport os import litellm response = litellm.completion( model=f"chutes_ai/{os.environ.get('CHUTES_MODEL', 'Qwen/Qwen3-32B-TEE')}", api_key=os.environ["CHUTES_API_KEY"], messages=[{"role": "user", "content": "Say hello in one sentence."}], ) print(response.choices[0].message.content) - 03
Provide the API key
Store yourcpk_key asCHUTES_API_KEYin the environment or a secret store. Send it asAuthorization: Bearer— neverX-API-Key, which inference ignores.bashexport CHUTES_API_KEY=cpk_...
The framework contract
Set
base_url / baseURL / api_base
Value
https://llm.chutes.ai/v1
Set
model
Value
Qwen/Qwen3-32B-TEE from the live catalog
Troubleshooting
Symptom
The model ID fails
Likely cause
The catalog changed or the client cached model metadata.
Fix
Refresh /v1/models and update the config.
Symptom
Auth falls into anonymous limits
Likely cause
The key is missing or not sent as Bearer auth.
Fix
Export CHUTES_API_KEY=cpk_... and send Authorization: Bearer.
Symptom
Routing aliases fail
Likely cause
Saved default aliases need a dashboard Model Routing pool.
Fix
Use a concrete model ID or an inline comma-separated pool first.