Simple, Transparent Pricing

No hidden fees. Choose the plan that fits your needs and scale as you grow.

Free

Perfect for getting started

Free

Get Started

What's included:

Host unlimited public models and datasets
Create unlimited orgs with no member limits
Access the latest ML tools and open-source software
Community support
5,000 API calls per month
Basic compute with free CPUs

Limitations:

No private models
Limited compute resources
Standard response times

Pro

Unlock advanced features

$9/month

Subscribe Now

What's included:

Everything in Free tier
ZeroGPU and Dev Mode for Spaces
Free credits across all Inference Providers
Early access to upcoming features
Pro badge on your profile
100,000 API calls per month
Pay-as-you-go option for additional usage

Limitations:

No enterprise features
Standard SLA

Enterprise Hub

Accelerate your AI roadmap

$20/month

Contact Sales

What's included:

Everything in Pro tier
SSO and SAML support
Select data location with Storage Regions
Detailed action reviews with Audit logs
Granular access control with Resource groups
Centralized token control and approval
Dataset Viewer for private datasets
Advanced compute options for Spaces
5x more ZeroGPU quota for members
Deploy Inference on your own infrastructure
Managed billing with yearly commits
Priority support
Unlimited API calls

Plan Comparison

Features	Free	Pro	Enterprise Hub
Core Features
Public Models Access
API Access
Storage	1GB	15GB	Unlimited
Advanced Features
Custom Model Hosting		Up to 5	Unlimited
Private Models
Custom Training
Model Versioning
Support
Support Level	Community	Email	24/7 Priority
SLA Guarantee
Dedicated Account Manager

Additional Computing Options

Spaces Hardware

Upgrade your Space compute

Starting at $0/hour

Free CPUs
Build more advanced Spaces
7 optimized hardware options
From CPU to GPU to Accelerators

Inference Endpoints

Deploy models on fully managed infrastructure

Starting at $0.032/hour

Deploy dedicated Endpoints in seconds
Keep your costs low
Fully managed autoscaling
Enterprise security

API Usage

Pay only for what you use

Starting at $0.001 per 1,000 tokens

Ultra low per-token pricing
Volume discounts available
No minimum commitments
Transparent usage dashboard

GPU Turbo Scaling

Auto-scaling GPU clusters on demand

Starting at $0.50/hour

On-demand NVIDIA GPUs
Automatic cluster scaling
Real-time performance metrics
Zero wait time provisioning

Custom ASIC Support

Specialized hardware acceleration

Starting at $1.20/hour

Dedicated TPU/ASIC hardware
10x faster inference speed
Optimized for large models
Hardware-specific optimizations

Edge Deployment

Push models to IoT devices

Starting at $0.10/device/month

Ultra-low latency inference
Model compression technology
Optimized for IoT and mobile
Remote updates and monitoring

Multi-Region Deployment

Global low-latency inference

Starting at $0.25/region/hour

Deploy to 15+ global regions
Traffic-based auto-routing
Regional data compliance
Geo-redundant failover

Quantum Processing Units

Next-gen quantum acceleration

$5.00/hour (preview pricing)

Experimental QPU access
Quantum ML algorithm library
Specialized problem acceleration
Academic research priority

Serverless Inference

Pay-per-use compute scaling

Starting at $0.15 per million predictions

Zero infrastructure management
Infinite scale potential
Cold-start optimization
Cost-effective for variable loads
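
To get a rough feel for how the usage-based rates above add up, here is a minimal Python sketch. It uses only the "starting at" rates quoted on this page for Inference Endpoints, API Usage, and Serverless Inference, so the results are lower-bound estimates, not quotes. The function names and the 720-hour month are assumptions made for illustration; they are not part of any official SDK or billing tool.

```python
from decimal import Decimal

# "Starting at" rates copied from this page. Real rates depend on the
# hardware or model selected, so treat every result as a lower bound.
ENDPOINT_RATE_PER_HOUR = Decimal("0.032")       # Inference Endpoints
API_RATE_PER_1000_TOKENS = Decimal("0.001")     # API Usage
SERVERLESS_RATE_PER_MILLION = Decimal("0.15")   # Serverless Inference


def endpoint_cost(hours):
    """Hypothetical helper: cost of running one endpoint for `hours` hours."""
    return ENDPOINT_RATE_PER_HOUR * Decimal(hours)


def api_cost(tokens):
    """Hypothetical helper: cost of `tokens` tokens at the per-1,000 rate."""
    return API_RATE_PER_1000_TOKENS * Decimal(tokens) / Decimal(1000)


def serverless_cost(predictions):
    """Hypothetical helper: cost of `predictions` serverless predictions."""
    return SERVERLESS_RATE_PER_MILLION * Decimal(predictions) / Decimal(1_000_000)


if __name__ == "__main__":
    # Assumed workload: one endpoint running for a 720-hour month,
    # 2 million tokens, and 10 million serverless predictions.
    total = endpoint_cost(720) + api_cost(2_000_000) + serverless_cost(10_000_000)
    print(f"Estimated minimum monthly cost: ${total:.2f}")
```

Decimal is used instead of float so that small per-unit rates multiply out without rounding noise. Volume discounts, which this page mentions but does not quantify, are not modeled.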

Ready to level up your AI game?

Start building with our platform today. No credit card required for the free tier.

Get Started Free

Talk to Sales