GPU Cloud Egress Fees: The Hidden Cost Nobody Talks About

GPU Cloud Egress Fees

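Egress fees are one of the most overlooked costs in GPU cloud computing.

Data leaving the data center costs $0.02-0.30/GB. Downloading a 50GB model costs $1-$15 across that range.

Intra-region transfers are usually free. Cross-region and internet transfers are where the charges add up.

A workload on a $2/hour H100 can lose hundreds of dollars to egress through routine operations: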

Downloading trained models or inference results
Uploading datasets to external APIs
Syncing checkpoints to S3 outside the region
Transferring outputs to client systems
Backing up results to separate cloud providers

Egress pricing varies dramatically by provider and region. Intra-region transfers are often free, or cost about 90% less than inter-region transfers.

Provider-Specific Charges

AWS egress rates depend heavily on destination and region. Standard pricing:

Intra-region transfer: Free (EC2 to S3 in same region)
Cross-region transfer: $0.02/GB
Internet egress: $0.09/GB (US regions)
Internet egress: $0.12-0.15/GB (Europe, Asia)

A single 50GB model download to the internet costs $4.50 from US regions. The same transfer costs $6-7.50 from European data centers.

Google Cloud egress follows the same structure as AWS, with slightly different rates:

Intra-region: Free
Cross-region: $0.01/GB
Internet egress: $0.12/GB (most regions)

Azure rates are in a similar range:

Intra-region: Free
Cross-region outbound: $0.02/GB
Internet egress: $0.087/GB (varies by region)
Special offers for Microsoft services reduce costs

RunPod and other specialized GPU platforms often include generous egress allowances or offer free transfers within their own ecosystem.

CoreWeave pricing:

Outbound: $0.12/GB (standard)
Inbound: Free
API transfers: Reduced rates

Vast.AI operates on market-based pricing with user-specified egress rates. Competition keeps costs low, typically $0.03-0.08/GB.
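
To compare the providers above for a given transfer, the quoted internet egress rates can be placed in a small lookup table. The sketch below is illustrative only: the rates are copied from this article and may be out of date, and the names `INTERNET_EGRESS_USD_PER_GB`, `egress_cost_range`, and `cheapest_first` are hypothetical helpers, not part of any provider SDK.

```python
# Internet egress rates in USD per GB as (low, high), copied from the
# figures quoted in this article. Treat them as assumptions and check
# each provider's current price list before budgeting.
INTERNET_EGRESS_USD_PER_GB = {
    "AWS (US)": (0.09, 0.09),
    "AWS (Europe/Asia)": (0.12, 0.15),
    "Google Cloud": (0.12, 0.12),
    "Azure": (0.087, 0.087),
    "CoreWeave": (0.12, 0.12),
    "Vast.AI": (0.03, 0.08),
}


def egress_cost_range(size_gb, provider):
    """Return the (low, high) cost in USD of sending size_gb to the internet."""
    low, high = INTERNET_EGRESS_USD_PER_GB[provider]
    return round(size_gb * low, 2), round(size_gb * high, 2)


def cheapest_first(size_gb):
    """Return (provider, (low, high)) pairs sorted by low cost, then high cost."""
    costs = {p: egress_cost_range(size_gb, p) for p in INTERNET_EGRESS_USD_PER_GB}
    return sorted(costs.items(), key=lambda item: item[1])


# Example: the 50GB model download used earlier in the article.
for provider, (low, high) in cheapest_first(50):
    if low == high:
        print(f"{provider:<20} ${low:.2f}")
    else:
        print(f"{provider:<20} ${low:.2f}-${high:.2f}")
```

Storing each rate as a range keeps market-priced platforms such as Vast.AI comparable with fixed-rate providers without pretending to a single number.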

Hidden Costs Beyond Egress

Egress fees represent one layer of hidden charges. Other costs accumulate quickly:

Data transfer within cloud ecosystems: Moving data from GPU instances to storage (S3, Blob Storage, Cloud Storage) incurs charges even within the same region in some cases. AWS charges $0.01/GB in each direction for EC2 traffic that crosses Availability Zones.

Storage persistence: Keeping training data in cloud storage adds $0.02-0.05 per GB monthly. A 1TB dataset costs $20-50/month just for storage.

Bandwidth for multi-GPU clusters: Intra-cluster communication on some platforms (CoreWeave, Lambda) charges per GB. A synchronization pass across 8 GPUs can cost $2-5.

API request pricing: Cloud providers charge per API call in some cases. Direct Connect and specialized network services cost thousands monthly if used.

Snapshots and backups: EBS snapshots cost $0.05 per GB per month. A 500GB instance snapshot costs $25 monthly to maintain.

Reserved capacity charges: Some providers charge upfront fees ($500-2000) to guarantee GPU availability. Reserved instances commit to multi-year terms.
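
The recurring storage and snapshot charges above are simple per-GB-month multiplications, which makes them easy to script. The following is a minimal sketch, assuming decimal units (1TB = 1,000GB) and a 30-day billing month; `monthly_cost` and `prorated_cost` are hypothetical helper names.

```python
def monthly_cost(size_gb, usd_per_gb_month):
    """Recurring monthly charge in USD for keeping size_gb stored."""
    return size_gb * usd_per_gb_month


def prorated_cost(size_gb, usd_per_gb_month, days, days_in_month=30):
    """Charge for storing size_gb for part of a month (30-day month assumed)."""
    return monthly_cost(size_gb, usd_per_gb_month) * days / days_in_month


# Figures from the article: 1TB dataset at $0.02-0.05/GB-month,
# and a 500GB snapshot at $0.05/GB-month.
dataset_low = monthly_cost(1_000, 0.02)
dataset_high = monthly_cost(1_000, 0.05)
snapshot = monthly_cost(500, 0.05)

print(f"1TB dataset:    ${dataset_low:.2f}-${dataset_high:.2f} per month")
print(f"500GB snapshot: ${snapshot:.2f} per month")
```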

Real-World Cost Scenarios

Scenario 1: Training and deployment

A team trains an LLM on H100s for one week:

Compute: 168 hours x $2/hour = $336
Storage (1TB training data): 1,000GB x $0.04/GB-month x 7/30 month = $9.33
Checkpoint egress: 50GB x $0.09 = $4.50
Final model download: 100GB x $0.09 = $9.00
Total: $358.83

Egress accounts for 3.8% of costs but compounds over multiple training runs.

Scenario 2: Inference serving

A startup runs inference at scale for 30 days:

Compute: 720 hours x $1.50/hour = $1,080
API request handling: 10M requests x $0.0001 = $1,000
Data egress to clients: 5TB x $0.12 = $600
Storage: 100GB x $0.04/month = $4
Total: $2,684

Egress jumps to 22% of monthly spend. Optimization here saves significant money.
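
The egress share quoted for this scenario can be reproduced with a short script. This is a sketch that reuses the Scenario 2 line items as given, treating 5TB as 5,000GB; `cost_breakdown` is a hypothetical helper name.

```python
def cost_breakdown(items):
    """Return (total, shares), where shares maps each line item to its fraction of total."""
    total = sum(items.values())
    shares = {name: cost / total for name, cost in items.items()}
    return total, shares


# Scenario 2 line items in USD, using the article's quantities and rates.
scenario_2 = {
    "compute": 720 * 1.50,               # 720 hours at $1.50/hour
    "api_requests": 10_000_000 * 0.0001,  # 10M requests at $0.0001 each
    "egress": 5_000 * 0.12,               # 5TB (5,000GB) at $0.12/GB
    "storage": 100 * 0.04,                # 100GB at $0.04/GB-month
}

total, shares = cost_breakdown(scenario_2)
print(f"Total: ${total:,.2f}")
for name, share in shares.items():
    print(f"{name:<14} {share:6.1%}")
```

Swapping in the line items from the other scenarios gives their egress shares the same way.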

Scenario 3: Multi-region failover

A company maintains H100 instances in three regions:

Compute (3x): 720 hours x 3 x $2 = $4,320
Data replication between regions: 500GB x $0.02 x 3 = $30
Client requests egress: 10TB x $0.12 = $1,200
Total: $5,550

Egress reaches 22% of total spend in multi-region deployments.

Optimization Strategies

Teams reduce egress costs through architectural choices:

Keep data inside the cloud: Instead of downloading models, serve them directly from cloud storage. Amazon CloudFront, Google Cloud CDN, and Azure CDN distribute content to edge locations with reduced or no origin egress charges.

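Use spot instances strategically: Spot GPU instances cost 50-70% less, but an interruption can force checkpoints and partial results to be transferred again, which adds egress. Batch results and transfer them together so each run pays for its data only once.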

Compress outputs: Model inference outputs can compress 10:1 with appropriate encoding. Transferring 10GB instead of 100GB saves about $8 per operation at $0.09/GB.
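
How well outputs compress depends heavily on the data, so it is worth measuring before counting on a 10:1 ratio. The sketch below uses Python's standard `gzip` and `json` modules on synthetic, highly repetitive records. The record layout and the helper names `pack_results` and `egress_saved_usd` are assumptions for illustration, and real inference outputs may compress far less.

```python
import gzip
import json


def pack_results(records):
    """Serialize records to compact JSON and gzip them; return (raw, packed) bytes."""
    raw = json.dumps(records, separators=(",", ":")).encode("utf-8")
    packed = gzip.compress(raw, compresslevel=6)
    return raw, packed


def egress_saved_usd(raw_size_bytes, packed_size_bytes, usd_per_gb):
    """Egress cost avoided by sending the packed payload instead of the raw one."""
    saved_gb = (raw_size_bytes - packed_size_bytes) / 1e9  # decimal GB
    return saved_gb * usd_per_gb


# Synthetic stand-in for inference output (hypothetical schema).
sample = [
    {"request_id": i, "label": "positive", "scores": [0.1, 0.7, 0.2]}
    for i in range(10_000)
]

raw, packed = pack_results(sample)
print(f"{len(raw)} bytes -> {len(packed)} bytes "
      f"(ratio {len(raw) / len(packed):.1f}:1)")
print(f"Saved on a 100GB -> 10GB transfer at $0.09/GB: "
      f"${egress_saved_usd(100e9, 10e9, 0.09):.2f}")
```

The receiving side restores the payload with `gzip.decompress`, so the saving costs only CPU time on both ends.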

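Request batching: Per-GB rates are the same for one large transfer as for many small ones, so batching does not lower the egress rate itself. It does cut per-request charges and repeated transfers of the same data. Batch inference results and model downloads.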

Regional clustering: Keeping GPU instances, storage, and users in the same region eliminates inter-region transfer costs.

Dedicated connections: Direct Connect (AWS) or Dedicated Interconnect (Google Cloud) offer fixed monthly costs for high-volume transfers. Break-even occurs around 10-20TB monthly.
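
The break-even volume for a dedicated connection is the fixed monthly fee divided by the per-GB saving over metered egress. The sketch below makes that explicit. The $1,000 monthly fee and the $0.02/GB dedicated-link rate are hypothetical inputs chosen for illustration, not quoted prices, and `break_even_gb` is a hypothetical helper name.

```python
def break_even_gb(fixed_monthly_usd, metered_usd_per_gb, dedicated_usd_per_gb):
    """Monthly volume in GB above which the dedicated connection is cheaper."""
    saving_per_gb = metered_usd_per_gb - dedicated_usd_per_gb
    if saving_per_gb <= 0:
        raise ValueError("dedicated rate must be lower than the metered rate")
    return fixed_monthly_usd / saving_per_gb


# Hypothetical inputs: $1,000/month fixed fee, $0.09/GB metered internet
# egress, $0.02/GB over the dedicated link.
volume_gb = break_even_gb(1_000, 0.09, 0.02)
print(f"Break-even: {volume_gb / 1_000:.1f}TB per month")
```

With these inputs the break-even lands at roughly 14TB per month, inside the 10-20TB range given above. A higher fixed fee or a smaller rate gap pushes it upward.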

Provider-specific features: AWS offers free transfer to CloudFront. Google Cloud provides free transfer to Firebase. Azure reduces charges for Microsoft services.

FAQ

How can I estimate egress costs for my workload? Calculate daily data movement (models downloaded, results uploaded, backups). Multiply by regional egress rates ($0.02-0.12/GB). Factor in compression and batching optimization.
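
The estimate described in this answer can be scripted as a rough sketch. The flow names and daily volumes below are made-up inputs, `estimate_monthly_egress` is a hypothetical helper, and a 30-day month is assumed.

```python
def estimate_monthly_egress(daily_gb_by_flow, usd_per_gb,
                            compression_ratio=1.0, days=30):
    """Estimate monthly egress cost in USD from daily outbound volumes.

    compression_ratio is raw size divided by compressed size (1.0 = none).
    """
    if compression_ratio <= 0:
        raise ValueError("compression_ratio must be positive")
    daily_gb = sum(daily_gb_by_flow.values()) / compression_ratio
    return daily_gb * days * usd_per_gb


# Hypothetical daily outbound volumes in GB.
flows = {"model_downloads": 50, "inference_results": 20, "backups": 30}

uncompressed = estimate_monthly_egress(flows, usd_per_gb=0.09)
compressed = estimate_monthly_egress(flows, usd_per_gb=0.09, compression_ratio=2.0)
print(f"Uncompressed:    ${uncompressed:.2f}/month")
print(f"2:1 compression: ${compressed:.2f}/month")
```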

Do intra-region transfers ever cost money? Generally no, but some services charge. AWS EC2 to EFS costs $0.01/GB if not in the same availability zone.

Can I negotiate egress rates with cloud providers? Large teams (10TB+ monthly) can negotiate AWS and Google Cloud rates. Standard discounts reach 20-40%.

What is the cheapest way to move data between cloud providers? Dedicated fiber connections cost $1000-5000/month for sustained transfers. For episodic transfers, compress and batch to minimize per-GB charges.

Do free tiers include egress allowances? AWS free tier includes 100GB outbound per month. Most providers count only a portion toward free tiers.

Sources

AWS EC2 Pricing Documentation
Google Cloud Compute Pricing (official)
Microsoft Azure Virtual Machines Pricing
CoreWeave Pricing & Terms
Vast.AI Market-Based GPU Pricing Model
