
Fal.ai GPU Pricing

4 GPU instances in 1 region. 3 GPU models available, from $2.70/hr.

Inference · Per-second billing

Serverless GPU inference platform optimized for image generation and media AI. Per-millisecond billing with fast cold starts and a marketplace of pre-built models.
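To make the billing model concrete, here is a minimal sketch of how an hourly rate translates into the cost of a short serverless job. It is not Fal.ai's billing code: the function `estimate_cost`, the `increment_ms` parameter, and the round-up rule are all assumptions made for illustration, and the sketch ignores any minimum charge or cold-start billing. The $3.60/hr rate is the 1x H100 price listed on this page; the 2,500 ms runtime is a hypothetical workload.

```python
from decimal import Decimal

MS_PER_HOUR = 3_600_000


def estimate_cost(hourly_rate, runtime_ms, increment_ms=1):
    """Estimate the cost of one job at a given hourly rate.

    Assumption: runtime is rounded UP to a whole number of billing
    increments (1 ms by default, 1000 ms for per-second billing).
    """
    # Ceiling division on integers, then back to milliseconds.
    billed_ms = -(-runtime_ms // increment_ms) * increment_ms
    return hourly_rate * billed_ms / MS_PER_HOUR


h100_rate = Decimal("3.60")   # 1x H100 price per hour, from this page
runtime_ms = 2_500            # hypothetical job length

per_ms_cost = estimate_cost(h100_rate, runtime_ms)                         # billed for 2,500 ms
per_second_cost = estimate_cost(h100_rate, runtime_ms, increment_ms=1000)  # billed for 3,000 ms

print(f"per-millisecond billing: ${per_ms_cost:.4f} per job")
print(f"per-second billing:      ${per_second_cost:.4f} per job")
print(f"1,000 jobs, per-ms:      ${per_ms_cost * 1000:.2f}")
```

Under these assumptions the same 2.5-second job costs $0.0025 with millisecond granularity and $0.0030 with second granularity, so the billing increment matters most for short, frequent requests.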

Strengths
  • Per-millisecond billing
  • Fast cold starts
  • Image generation specialist
  • No idle costs
Considerations
  • Serverless only
  • Less suitable for long training runs
Stop to pause billing
Free egress
Quick Answer
Updated Apr 12, 7:00 AM · Methodology

Fal.ai currently lists 4 GPU instances across 3 GPU models and 1 region. Pricing starts at $2.70/hr, while the median listing price is $21.60/hr (the upper of the two middle listings). Compare by model, commitment type, and region before treating the cheapest row as the best choice.
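The summary figures can be rechecked from the four listed hourly prices. The sketch below uses Python's standard `statistics` module. How this page computes its median is not stated; the listed $21.60/hr happens to coincide with the upper median (`median_high`), so that interpretation is an assumption rather than a documented method.

```python
from decimal import Decimal, ROUND_HALF_UP
from statistics import mean, median, median_high

# Hourly prices (USD) of the four listings on this page.
prices = [Decimal("2.70"), Decimal("3.60"), Decimal("21.60"), Decimal("25.92")]

starting_price = min(prices)
conventional_median = median(prices)   # mean of the two middle values
upper_median = median_high(prices)     # larger of the two middle values
average_price = mean(prices)

cents = Decimal("0.01")
print("starting at:        ", starting_price)
print("conventional median:", conventional_median.quantize(cents))
print("upper median:       ", upper_median)
print("average:            ", average_price.quantize(cents, rounding=ROUND_HALF_UP))
```

With an even number of listings the conventional median is $12.60/hr, well below the $21.60/hr shown here, and the mean of $13.455/hr rounds to the $13.46/hr average quoted in the FAQ. With only four rows spanning single-GPU and 8-GPU instances, none of these summary figures says much on its own.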

  • Starting at: $2.70/hr (cheapest instance)
  • Median: $21.60/hr
  • GPU models: 3 available
  • Instances: 4 total
  • Regions: 1 covered
  • Spot share: 0%

All Fal.ai GPU Instances

4 results
GPU Model    Instance            Count   VRAM    Region    Type         Price/hr   $/GPU/hr
A100 80GB    1x-A100-80GB-fal    1       80GB    US-East   Serverless   $2.7000    $2.7000
H100         1x-H100-fal         1       80GB    US-East   Serverless   $3.6000    $3.6000
A100 80GB    8x-A100-80GB-fal    8       640GB   US-East   Serverless   $21.6000   $2.7000
H100 SXM     8x-H100-SXM-fal     8       640GB   US-East   Serverless   $25.9200   $3.2400
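Because the listings mix single-GPU and 8-GPU instances, the per-GPU rate is the fairer basis for comparison. The sketch below recomputes the $/GPU/hr column from the listed prices. The GPU counts are read off the "1x"/"8x" prefix of each instance name, which is an assumption about the naming scheme, and `price_per_gpu` is a hypothetical helper.

```python
from decimal import Decimal

# (instance name, GPU count, price per hour in USD) from the listings above.
# GPU counts are inferred from the "1x"/"8x" prefix of the instance name.
listings = [
    ("1x-A100-80GB-fal", 1, Decimal("2.70")),
    ("1x-H100-fal", 1, Decimal("3.60")),
    ("8x-A100-80GB-fal", 8, Decimal("21.60")),
    ("8x-H100-SXM-fal", 8, Decimal("25.92")),
]


def price_per_gpu(gpu_count, hourly_price):
    """Normalize an instance's hourly price to a single GPU."""
    return hourly_price / gpu_count


per_gpu = {name: price_per_gpu(count, price) for name, count, price in listings}

# Print the listings from cheapest to most expensive per GPU.
for name, rate in sorted(per_gpu.items(), key=lambda item: item[1]):
    print(f"{name:<18} ${rate:.4f}/GPU/hr")
```

Normalized this way, both A100 80GB listings cost $2.70 per GPU-hour, while the 8x H100 SXM instance works out to $3.24 per GPU-hour against $3.60 for the single H100. Those two are listed as different GPU models, so the gap is not necessarily a volume discount.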

Fal.ai GPU Cloud — FAQ

How much does Fal.ai charge for GPUs?

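Fal.ai GPU instances start from $2.70/hr, and the average price across the 4 listings is $13.46/hr. Prices depend on GPU model, region, and commitment type (on-demand vs spot). All current Fal.ai listings are serverless in US-East, so here the price varies only with GPU model and GPU count.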

What GPU models does Fal.ai offer?

Fal.ai offers 3 GPU models: A100 80GB, H100, and H100 SXM. Browse the full list above to compare prices per model.

Where can I see billing assumptions and risk methodology?

GPU Tracker’s pricing comparisons are paired with true cost and risk signals. Read the methodology page for how refresh cadence, cost assumptions, and reliability indicators are defined.
