Simple, Transparent Pricing

No hidden fees, no cap. Choose the plan that's right for your vibe and scale as you grow.

Free

Perfect for getting started

Free
Get Started

What's included:

  • Host unlimited public models and datasets
  • Create unlimited orgs with no member limits
  • Access the latest ML tools and open source
  • Community support
  • 5,000 API calls per month
  • Basic compute with free CPUs

Limitations:

  • No private models
  • Limited compute resources
  • Standard response times
MOST POPULAR

Pro

Unlock advanced features

$9/month
Subscribe Now

What's included:

  • Everything in Free tier
  • ZeroGPU and Dev Mode for Spaces
  • Free credits across all Inference Providers
  • Early access to upcoming features
  • Pro badge on your profile
  • 100,000 API calls per month
  • Pay-as-you-go option for additional usage

Limitations:

  • No enterprise features
  • Standard SLA

Enterprise Hub

Accelerate your AI roadmap

$20/month
Contact Sales

What's included:

  • Everything in Pro tier
  • SSO and SAML support
  • Select data location with Storage Regions
  • Detailed action reviews with Audit logs
  • Granular access control with Resource groups
  • Centralized token control and approval
  • Dataset Viewer for private datasets
  • Advanced compute options for Spaces
  • 5x more ZeroGPU quota for members
  • Deploy Inference on your own Infra
  • Managed billing with yearly commits
  • Priority support
  • Unlimited API calls
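
The monthly API-call allowances listed for the three plans (5,000 for Free, 100,000 for Pro, unlimited for Enterprise Hub) can be turned into a quick lookup. The following is a minimal sketch, not an official tool: the function name `smallest_plan_for` and the `PLAN_CALL_LIMITS` table are hypothetical, and the sketch ignores the Pro plan's pay-as-you-go option, which may make Pro viable above its included allowance.

```python
from typing import Optional

# Included monthly API calls per plan, as listed on this page.
# None stands for the "Unlimited API calls" entry (illustrative encoding).
PLAN_CALL_LIMITS: list[tuple[str, Optional[int]]] = [
    ("Free", 5_000),
    ("Pro", 100_000),
    ("Enterprise Hub", None),
]


def smallest_plan_for(calls_per_month: int) -> str:
    """Return the first plan whose included allowance covers the given volume."""
    if calls_per_month < 0:
        raise ValueError("calls_per_month must be non-negative")
    for name, limit in PLAN_CALL_LIMITS:
        if limit is None or calls_per_month <= limit:
            return name
    # Unreachable while the last plan is unlimited; kept as a safeguard.
    raise RuntimeError("no plan matched")


if __name__ == "__main__":
    for volume in (4_000, 60_000, 250_000):
        print(volume, "->", smallest_plan_for(volume))
```

Plans are checked from smallest to largest, so the result is the cheapest listed tier whose included calls cover the volume.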

Plan Comparison

Features | Free Tier | Pro | Enterprise
Core Features
Public Models Access
API Access
Storage | 1GB | 15GB | Unlimited
Advanced Features
Custom Model Hosting | Up to 5 | Unlimited
Private Models
Custom Training
Model Versioning
Support
Support Level | Community | Email | 24/7 Priority
SLA Guarantee
Dedicated Account Manager

Additional Computing Options

Spaces Hardware

Upgrade your Space compute

Starting at $0/hour

  • Free CPUs
  • Build more advanced Spaces
  • 7 optimized hardware options
  • From CPU to GPU to Accelerators

Inference Endpoints

Deploy models on fully managed infrastructure

Starting at $0.032/hour

  • Deploy dedicated Endpoints in seconds
  • Keep your costs low
  • Fully-managed autoscaling
  • Enterprise security

API Usage

Pay only for what you use

Starting at $0.001/1000 tokens

  • Ultra low per-token pricing
  • Volume discounts available
  • No minimum commitments
  • Transparent usage dashboard
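
As a rough illustration of the per-token arithmetic, the sketch below applies the listed starting rate of $0.001 per 1,000 tokens as a flat rate. This is an assumption made for the example: the page only gives a "starting at" price, so actual rates may be higher, and the volume discounts mentioned above are not modeled. The function name `estimate_token_cost` is hypothetical.

```python
from decimal import Decimal

# Assumption: the "starting at" rate is applied flat to every token,
# with no volume discounts or per-model differences.
STARTING_RATE_PER_1000_TOKENS = Decimal("0.001")  # USD


def estimate_token_cost(tokens: int) -> Decimal:
    """Estimate the USD cost of a token volume at the listed starting rate."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return Decimal(tokens) * STARTING_RATE_PER_1000_TOKENS / Decimal(1000)


if __name__ == "__main__":
    # 2.5 million tokens at the starting rate
    print(f"${estimate_token_cost(2_500_000):.2f}")
```

`Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding error when summed across many requests.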

GPU Turbo Scaling

Auto-scaling GPU clusters on demand

Starting at $0.50/hour

  • On-demand NVIDIA GPUs
  • Automatic cluster scaling
  • Real-time performance metrics
  • Zero wait time provisioning

Custom ASIC Support

Specialized hardware acceleration

Starting at $1.20/hour

  • Dedicated TPU/ASIC hardware
  • 10x faster inference
  • Optimized for large models
  • Hardware-specific optimizations

Edge Deployment

Push models to IoT devices

Starting at $0.10/device/month

  • Ultra-low latency inference
  • Model compression technology
  • Optimized for IoT and mobile
  • Remote updates and monitoring

Multi-Region Deployment

Global low-latency inference

Starting at $0.25/region/hour

  • Deploy to 15+ global regions
  • Traffic-based auto-routing
  • Regional data compliance
  • Geo-redundant failover

Quantum Processing Units

Next-gen quantum acceleration

$5.00/hour (preview pricing)

  • Experimental QPU access
  • Quantum ML algorithm library
  • Specialized problem acceleration
  • Academic research priority

Serverless Inference

Pay-per-use compute scaling

Starting at $0.15/million predictions

  • Zero infrastructure management
  • Infinite scale potential
  • Cold-start optimization
  • Cost-effective for variable loads

Ready to level up your AI game?

Start building with our platform today. No credit card required for the free tier.