Replicate GPU Pricing
4 GPU instances across 1 region. 4 GPU models available, from $0.81/hr.
Model hosting platform that lets you run open-source ML models in the cloud with a single API call. Replicate handles infrastructure while you pay per second of GPU time.
- Run any open-source model with one API call
- Per-second billing — very cost-effective for bursty workloads
- Community model library with thousands of models
- Custom model deployment via Cog
- Not suitable for training — inference only
- No persistent storage
- Cold start latency for infrequently-used models
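To make the per-second billing point concrete, here is a minimal sketch of the arithmetic. It assumes billing is strictly linear in seconds, with no minimum charge, and it does not model whether cold-start time is billed. The function name `job_cost` and the workload figures are hypothetical; only the $0.81/hr rate comes from this page.

```python
def job_cost(hourly_rate_usd: float, seconds: float) -> float:
    """Cost in USD of `seconds` of GPU time at an hourly rate, billed per second.

    Assumption: linear per-second billing with no minimum charge.
    """
    return hourly_rate_usd / 3600 * seconds


STARTING_RATE = 0.81  # USD/hr, the lowest listed price on this page

# Hypothetical bursty workload: 200 predictions of 3 seconds each.
burst_seconds = 200 * 3

per_second_cost = job_cost(STARTING_RATE, burst_seconds)  # pay only for 600 s
full_hour_cost = job_cost(STARTING_RATE, 3600)            # pay for a whole hour

print(f"billed per second: ${per_second_cost:.3f}")
print(f"billed for a full hour: ${full_hour_cost:.2f}")
```

Under these assumptions the burst costs about $0.135 instead of $0.81, which is why per-second billing suits short, irregular workloads.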
Replicate currently lists 4 GPU instances across 4 GPU models and 1 region. Pricing starts at $0.81/hr, while the median listing price is $4.14/hr. Compare by model, commitment type, and region before treating the cheapest row as the best choice.
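One way to see why the cheapest row is not automatically the best choice is to compare cost per job rather than cost per hour. The sketch below uses the $0.81/hr starting price and the $4.14/hr median from this page; the runtimes and the function names are hypothetical, chosen only to illustrate the break-even point.

```python
def cost_per_job(hourly_rate_usd: float, runtime_seconds: float) -> float:
    """USD cost of one job that occupies the GPU for `runtime_seconds`."""
    return hourly_rate_usd * runtime_seconds / 3600


def break_even_speedup(cheap_rate: float, pricier_rate: float) -> float:
    """How many times faster the pricier GPU must be to match cost per job."""
    return pricier_rate / cheap_rate


CHEAPEST_RATE = 0.81  # USD/hr, starting price on this page
MEDIAN_RATE = 4.14    # USD/hr, median listing price on this page

speedup_needed = break_even_speedup(CHEAPEST_RATE, MEDIAN_RATE)

# Hypothetical job: 600 s on the cheapest listing.
cheap_cost = cost_per_job(CHEAPEST_RATE, 600)
# Hypothetical runtimes on the pricier listing at 4x and 6x the speed.
pricier_cost_4x = cost_per_job(MEDIAN_RATE, 600 / 4)
pricier_cost_6x = cost_per_job(MEDIAN_RATE, 600 / 6)

print(f"break-even speedup: {speedup_needed:.2f}x")
print(f"cheapest: ${cheap_cost:.4f}, 4x faster: ${pricier_cost_4x:.4f}, "
      f"6x faster: ${pricier_cost_6x:.4f}")
```

With these figures the median-priced listing only wins on cost per job if it finishes the work more than about 5.1 times faster. At a 4x speedup the cheapest row is still cheaper; at 6x it is not.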
GPU Models at Replicate
All Replicate GPU Instances
4 results
Compare Other Providers
Replicate vs … (Head-to-Head)
Replicate GPU Cloud — FAQ
Replicate GPU instances start from $0.81/hr. The average price is $3.15/hr. Prices depend on GPU model, region, and commitment type (on-demand vs spot).
Replicate offers 4 GPU models: T4, A40, A100 40GB, A100 80GB. Browse the full list above to compare prices per model.
GPU Tracker’s pricing comparisons are paired with true cost and risk signals. Read the methodology page for how refresh cadence, cost assumptions, and reliability indicators are defined.