How much does Replicate charge for GPU cloud?

Replicate GPU prices start from $0.81/hr. They offer 4 GPU models including T4, A40, A100 40GB across 1 region. The median price is $4.14/hr.

What GPUs does Replicate offer?

Replicate offers 4 GPU models: T4, A40, A100 40GB, A100 80GB. Compare all Replicate GPU prices on GPU Tracker, updated every 6 hours.

What is the cheapest Replicate on-demand GPU?

The cheapest Replicate on-demand GPU starts from $0.81/hr. GPU Tracker tracks 4 Replicate GPU instances in real-time.

Replicate GPU Pricing

4 GPU instances across 1 regions.4 GPU models available — from $0.81/hr.

Visit Replicate

ServerlessInferencePer-second billing

Model hosting platform that lets you run open-source ML models in the cloud with a single API call. Replicate handles infrastructure while you pay per second of GPU time.

Strengths

Run any open-source model with one API call
Per-second billing — very cost-effective for bursty workloads
Community model library with thousands of models
Custom model deployment via Cog

Considerations

Not suitable for training — inference only
No persistent storage
Cold start latency for infrequently-used models

Stop to pause billing

Free egress

Quick Answer•Updated Apr 19, 12:31 PM•Methodology

Replicate currently lists 4 GPU instances across 4 GPU models and 1 regions. Pricing starts at $0.81/hr, while the median listing price is $4.14/hr. Compare by model, commitment type, and region before treating the cheapest row as the best choice.

Starting at

$0.81/hr

Median

$4.14/hr

Models

Spot share

T4 pricing A40 pricing A100 40GB pricing A100 80GB pricing Market trends

Starting at

$0.81/hr

cheapest instance

GPU Models

available

Instances

total

Regions

covered

GPU Models at Replicate

T4$0.810/hr A40$2.61/hr A100 40GB$4.14/hr A100 80GB$5.04/hr

All Replicate GPU Instances

4 results

GPU Model	Instance	Count	VRAM	Region	Type	Price/hr	$/GPU/hr
T4	T4-replicate	1×	16GB	US-West	Serverless	$0.8100	—	Rent
A40	A40-replicate	1×	48GB	US-West	Serverless	$2.6100	—	Rent
A100 40GB	A100-40GB-replicate	1×	40GB	US-West	Serverless	$4.1400	—	Rent
A100 80GB	A100-80GB-replicate	1×	80GB	US-West	Serverless	$5.0400	—	Rent

Replicate GPU Cloud — FAQ

How much does Replicate charge for GPUs?

Replicate GPU instances start from $0.81/hr. The average price is $3.15/hr. Prices depend on GPU model, region, and commitment type (on-demand vs spot).

What GPU models does Replicate offer?

Replicate offers 4 GPU models: T4, A40, A100 40GB, A100 80GB. Browse the full list above to compare prices per model.

Where can I see billing assumptions and risk methodology?

GPU Tracker’s pricing comparisons are paired with true cost and risk signals. Read the methodology page for how refresh cadence, cost assumptions, and reliability indicators are defined.

Replicate GPU Pricing

GPU Models at Replicate

All Replicate GPU Instances

Compare Other Providers

Replicate vs … (Head-to-Head)

Replicate GPU Cloud — FAQ