GPU and Compute Economics

Definition

The economic dynamics of AI compute: how GPUs are priced, rented, depreciated, and monetized, and why supply constraints are inverting traditional hardware depreciation assumptions.

Key Points

Open Questions

  • When will GPU supply catch up with demand? Dylan Patel suggests not this decade.
  • Will credit-based pricing replace per-token and per-seat models across the industry?
  • How will Nvidia's Groq LPU acquisition change the inference cost curve?
  • At what point does demand destruction from high prices actually kick in?
  • If "the age of scaling is over" [UNVERIFIED] (dwarkesh ilya sutskever 2), does GPU demand growth shift from training to inference and agent workloads? If so, what does that mean for capital allocation between training clusters and inference fleets?

Related Concepts