Announcing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experienceAnnouncing TEE: Trusted Execution Environments - Now Publicly Available
NEW Chutes Search - AI-Powered Search is Live! NEW fictio - Create your own experiencePer-token rates and TEE GPU deployments, priced in the open.
No middlemen. No markup tiers. Pay for the tokens you use, or deploy your own chute on a confidential GPU and pay by the second.
Model pricing
Pay per token. No subscription, no minimum, no markup. All featured models run on confidential TEE compute.
Estimate your cost
Pick a workload. Prices update below.
Pay from a Bittensor wallet in TAO
Looking for something else? Browse the full model catalog in the app.
Browse all modelsDeploy your own private chute
Run your own dedicated AI workload on verified self-serve confidential GPU capacity.
Your code, your weights, your data, isolated end-to-end. Deployed in minutes from the chutes CLI. Use this when you need a private
model, a custom fine-tune, or a dedicated instance for production traffic.
How it works
- Build your image, define your chute
Use the CLI to build a container, declare your
NodeSelectorwith the GPU class, VRAM, and count your workload needs, then expose your endpoints via cords. - Deploy with one command
chutes deployregisters your chute, pays the one-time fee, and brings it online as private by default. - Pay only while it runs
Billed by the second at the GPU's hourly rate. Idle instances shut down automatically, so no GPU-seconds are wasted.
# In your chute definition:
# choose the self-serve private GPU class
# tee=True, node_selector=NodeSelector(gpu_count=1, include=["pro_6000"])
chutes build my_chute:chute --wait
chutes deploy my_chute:chute --accept-feePrivate Chutes are billed at the GPU's hourly rate for however long the instance runs, plus a one-time deployment fee equal to 3× the hourly rate at the time of deployment. No subscription required.
GPUs shown here must have live pricing and TEE measurement support. The deployment
fee is paid once when you run chutes deploy; after that,
you pay only the per-second hourly rate while the instance is up. Actual placement
follows your NodeSelector and available capacity.
Prefer a monthly bill?
Plus and Pro bundle a daily request quota with a discount on per-token rates, in one predictable monthly payment. Pick a plan if you'd rather budget a fixed amount each month than top up your balance as you go.
- Bundled daily quota
- 6% off PAYG rates beyond the quota
- Larger daily quota
- 10% off PAYG rates beyond the quota