Skip to content

CHIPS.SANJAY920.COM FIELD GUIDE v0.1

How AI is actually built.

From tin droplets in San Diego to gigawatt data centers in West Texas — a field guide to every layer of the GPU supply chain, every binding constraint, and every bet on the table between now and 2030.

3.5

EUV tools / GW

the ratio that sets the ceiling

20–30 GW

annual EUV flow

70–100 tools / yr ÷ 3.5 tools / GW

20×

Hopper → Blackwell

real-world inference speedup

$13 B

yearly rent / GW

at current market pricing

Scroll to descend the stack

  1. 01 THE STACK
  2. 02 BOTTLENECKS
  3. 03 BETS
  4. 04 CHINA
  5. 05 GEOGRAPHY
  6. 06 CAST
  7. 07 TRACE

LIVE STATUS THE SUPPLY CHAIN RIGHT NOW

Updated

MAY 2026 · this quarter

Today the chain breaks at

HBM Memory.

HBM bandwidth depends on how much memory interface fits around the package.

Pressure

92 / 100

  1. ’23
  2. ’24
  3. ’25
  4. ’26
  5. ’27
  6. ’28

Why it’s binding

HBM4 stack prices have tripled in twelve months. SK Hynix and Samsung are sold out through Q3 2027 on three-year contracts. New DRAM fabs from 2025 decisions don’t produce meaningful capacity until late 2027.

TL;DR THE NUMBERS THAT MATTER

AI compute is built through a chain of physical constraints.

Twelve numbers anyone who follows this industry should be able to recall. Read this first; everything below is derivation.

3.5

EUV tools / GW

the ratio that sets the ceiling

Calc 1 GW of AI capacity ÷ ~285 MW served per EUV tool ≈ 3.5 tools

20–30 GW

annual EUV flow

70–100 tools / yr ÷ 3.5 tools / GW

Calc 70–100 EUV tools shipped / yr ÷ 3.5 tools / GW ≈ 20–29 GW / yr

20×

Hopper → Blackwell

real-world inference speedup

Source NVIDIA · Blackwell

$13 B

yearly rent / GW

at current market pricing

Calc 1 GW ≈ 1M GPUs × $3 / GPU-hour × 8,760 hours / yr × 50% utilization ≈ $13 B / yr

70

EUV tools / year

ASML’s 2026 production rate

Source ASML 2025 annual report

$400 M

price per EUV tool

by end of decade, ~$500 M

Source ASML investor day

≈90%

Taiwan’s share

of leading-edge logic capacity

Source SIA / BCG supply-chain report

2.5 TB/s

HBM4 stack bandwidth

per 13 mm of chip edge

Calc 8 channels × 256 IO / channel × 10 GT/s ÷ 8 bits / byte ≈ 2.5 TB / s

30%

of capex → memory

hyperscalers, 2026

Calc ~$180 B hyperscaler memory spend ÷ ~$600 B big-four capex ≈ 30%

↓ 600 M

smartphones in 2027

projected, from 1.1 B in 2024

Source IDC smartphone forecast

$600 B

big four capex 2026

AWS · Google · MSFT · Meta

Calc MSFT $190 B + GOOGL $185 B + AMZN $128 B + META $135 B ≈ $638 B (headline rounded down)

<1 yr

data-center build

AWS Project Rainier · announce → live

Source AWS · Project Rainier launch

§ 01b THE GUIDE DEEP-DIVE EXPLAINERS

The core layers, explained from fabrication to deployment.

Each section below introduces one layer, one figure, and the main constraint that shapes it.

01 · EUV Lithography

How a tin droplet becomes a chip pattern.

Extreme ultraviolet lithography is the light-based machine step that prints the smallest chip patterns.

A laser strikes tin droplets to make 13.5 nm light. That light prints the smallest features on advanced logic chips.

Native unit

λ 13.5 nm · watts

Constraint

Tool output is limited by how many EUV scanners can be built and qualified each year.

Open chapter

Fit overview · pinch to zoom

02 · Wafer Fabrication

How many useful accelerator dies fit on one wafer?

A polished silicon disc is patterned into many individual chip dies, but not all of them survive.

A 300 mm wafer offers many candidate reticle sites, but edge exclusion and defects reduce the number of good large dies.

Native unit

858 mm² · % yield

Constraint

Wafer economics depend on how many large dies survive edge loss and defects.

Open chapter

Fit overview · pinch to zoom

03 · Advanced Packaging

Why packaging matters as much as the chip.

This is the assembly step that joins separate silicon pieces and memory into one fast compute package.

The package connects compute dies to HBM through substrates, bridges, and interposers. Those layers determine bandwidth and yield.

Native unit

μm stack height · mm shoreline

Constraint

Advanced packaging capacity determines how many compute dies can be paired with HBM at scale.

Open chapter

Fit overview · pinch to zoom

04 · HBM Memory Binding now

How HBM gets to 2.5 TB/s.

High Bandwidth Memory is stacked DRAM placed beside the chip so data can reach it quickly.

Stacked DRAM, a base die, and thousands of IO lines add up to the bandwidth modern accelerators depend on.

Native unit

GB/s · TSVs · 12-high

Constraint

HBM bandwidth depends on how much memory interface fits around the package.

Open chapter

Fit overview · pinch to zoom

05 · Accelerator Die

Why modern accelerators are becoming multi-die systems.

This is the main compute silicon that performs the heavy parallel math behind AI workloads.

The accelerator is no longer just one rectangle of silicon. It is a tightly integrated package of logic, memory interfaces, and die-to-die links.

Native unit

transistors · tFLOPS

Constraint

Large accelerators are constrained by reticle area, package layout, and the yield of multi-die assembly.

Open chapter

Fit overview · pinch to zoom

06 · Scale-up Rack

How 72 GPUs behave like one machine.

A rack wires many accelerator trays together so they can behave more like one larger system.

NVL72 combines compute trays, switches, copper links, power, and cooling into a rack-scale training unit.

Native unit

GPUs / pod · NVLinks

Constraint

Rack-scale performance depends on how many GPUs can communicate as one coherent system.

Open chapter

Fit overview · pinch to zoom

07 · Data Center

What it takes to power a dense AI data center.

This is the building-scale layer: substations, power conversion, cooling, and deployment timing.

As racks get denser, data-center design shifts from floor space to substations, conversion losses, and cooling.

Native unit

MW · PUE

Constraint

New AI capacity depends on power delivery, interconnect timing, and cooling infrastructure.

Open chapter

Fit overview · pinch to zoom

08 · Cloud Layer

How hardware becomes cloud revenue.

This layer turns expensive servers into rented compute capacity with a payback clock attached.

This layer connects spending on compute infrastructure to rental pricing, utilization, and payback time.

Native unit

$ / hr-Hopper · depreciation

Constraint

Cloud economics depend on capex, utilization, pricing, and how quickly hardware depreciates.

Open chapter

Fit overview · pinch to zoom

09 · AI Lab → Model

How tokens inherit the cost of everything underneath them.

Infrastructure cost finally becomes model training, inference, and the price of serving tokens.

The lab is where the hardware supply chain turns into a user-facing service, with compute cost flowing into every token served.

Native unit

tokens / s · gross margin

Constraint

At the end of the supply chain, token economics are shaped by inference efficiency and compute cost.

Open chapter

Fit overview · pinch to zoom

§ 02 BOTTLENECKS

The bottleneck never sits still.

Every twelve to eighteen months, the binding constraint on AI compute moves to a different layer of the stack. Knowing where it will sit next is the difference between a great compute contract and a bad one.

Read each column as a year: packaging was yesterday’s constraint, memory is today’s, EUV and power are tomorrow’s.

  1. EUV

    100 / yr · ASML cap

  2. WAFER

    75 wph · N3

  3. PACKAGING

    CoWoS-L · TSMC cap

  4. HBM

    12-hi · 2.5 TB/s

  5. ACCELERATOR

    Multi-die package

  6. SCALE-UP

    NVL72 · 72 GPU

  7. DATA CENTER

    20 GW / yr · US

  8. CLOUD

    $480B → $57B · 8×

today’s binding artifact the seven other layers
Intensity main limiter tightening background

Fit overview · pinch to zoom

Layer
’23
’24
’25
’26
’27
’28
’29
’30

EUV Lithography

Wafer Fabs

Packaging (CoWoS)

HBM Memory

GPU Die

Scale-up Rack

Data Center

Power

What drove each transition

CoWoS bottleneck.

TSMC’s chip-on-wafer-on-substrate line was capped at a few thousand units a month while NVIDIA H100 demand quintupled. Every shipment delayed.

Long context met memory.

KV caches read on every token, sparse MoE multiplying memory demand, smartphones bidding for the same DRAM wafers. SK Hynix tripled HBM prices.

ASML capacity binds.

At ~70–100 tools/year and 3.5 tools per gigawatt, the simple flow math supports only ~20–30 GW/yr of new AI capacity before stock, allocation, and node assumptions. The end-of-decade ceiling shows up here first.

Grid won’t connect you.

Interconnection queues stretched to seven years in PJM. Turbine deposits all booked. Behind-the-meter gas became the standard answer.

§ 03 OPEN HYPOTHESES

Eight open hypotheses, tracked through resolution.

Specific, dated, falsifiable claims about which constraints will bind, which players will surprise, and which conventional wisdoms break by 2030. Each carries a confidence label; the “non-consensus” tag marks claims where mainstream supply-chain analysis disagrees.

Hypothesis 01

high confidence

Memory stays the binding constraint through 2028.

DRAM fabs take two years to build and the memory makers only started building in 2025. Even with smartphone volumes halving, HBM demand outruns supply.

resolves by 2028

Hypothesis 02

uncertain

ASML flow math is tighter than the 200 GW headline.

At 70–100 EUV tools/year and 3.5 tools per gigawatt, simple annual flow is ~20–30 GW/year before installed-base, allocation, and node assumptions. Any larger ceiling needs an explicit stock model.

resolves by 2030

Hypothesis 03

non-consensus

Apple becomes a minority customer of TSMC by 2028.

Already squeezed off N3 majority. By A16 the first customer is AI, not iPhone. TSMC’s focus follows margin, and AI clears every bar.

resolves by 2028

Hypothesis 04

non-consensus

H100 prices are higher in 2027 than they were in 2024.

Models that run on Hoppers got smarter and cheaper to serve. The depreciation cycle is longer than five years, not shorter, so the bear thesis that Hopper-class compute is rapidly worth less does not hold.

resolves by 2027

Hypothesis 05

uncertain

China gets indigenous EUV working by 2030 — not in volume.

Working tools in labs by end of decade. Mass-production hell takes another 5–7 years. China’s real catch-up is in DUV and packaging.

resolves by 2030

Hypothesis 06

non-consensus

Space data centers are a 2035 problem, not a 2030 problem.

Free power saves 10–15% of TCO. Six months of deployment delay costs more. While chips are the bottleneck, earth wins.

resolves by 2030

Hypothesis 07

uncertain

Robots offload thinking to the cloud, not to the chest cavity.

Long-horizon planning batches in data centers; only reflexes run on-board. Centralized intelligence drives decentralized motion.

resolves by 2030

Hypothesis 08

speculative

Fast AGI timelines favor the U.S. supply chain; long timelines favor China’s.

If revenue compounds fast enough that the next model is built on the last one’s gross margin, the West’s dispersed supply chain wins. If not, China’s vertically integrated one does.

open-ended

§ 04 CHINA & TAIWAN

The single-point-of-failure problem.

Of leading-edge logic

≈90%

made on a single island

Roughly nine-tenths of the world’s leading-edge logic comes from Taiwan. The tools that make it use chips that are also made in Taiwan — a snake eating its own tail.

In the high-growth scenario where Taiwan remains available, annual AI compute buildout can be modeled in the hundreds of gigawatts. If something goes wrong on the island, the same scenario collapses to perhaps 10–20 GW of annual new capacity — the limit of what Intel and Samsung can produce. Existing capacity continues but the growth curve stops, and the supply chain takes a decade to rebuild without Taiwan’s know-how.

Huawei is interesting precisely because it is verticalized: fabs (SMIC), networking, software, talent, AI researchers, end-market — the whole stack in one country. If Huawei had TSMC access, the consensus argument runs, they would be NVIDIA’s biggest competitor. They don’t, and they’re still building anyway.

Counterfactual

Roughly 10×–20× less annual new capacity. Existing fleet keeps running; new growth stalls until Arizona, Korea, and Japan rebuild the missing capacity. Best estimates: a decade.

§ 05 GEOGRAPHY

Where the world’s compute is actually made.

Critical campuses

12

The supply chain looks global on a map and concentrated on a list. Every critical layer above traces back to a handful of campuses in a handful of cities.

01

Veldhoven

NL

ASML EUV lithography tools

02

Oberkochen

DE

Carl Zeiss projection optics (18 mirrors per tool)

03

San Diego

CA

Cymer (ASML) EUV source — the tin-droplet laser

04

Wilton

CT

ASML reticle stages (9 G mechanical scanning)

05

Hsinchu

TW

TSMC N3 / N2 fabs and CoWoS packaging

06

Icheon

KR

SK Hynix HBM stacks (NVIDIA’s primary supplier)

07

Hwaseong

KR

Samsung DRAM + HBM + Foundry

08

Hiroshima

JP

Micron HBM stacks (Idaho HQ, JP fab)

09

Hillsboro

OR

Intel 18A fabs (the US backstop bet)

10

Santa Clara

CA

NVIDIA GPU + system design (no fabs)

11

Shenzhen + Shanghai

CN

Huawei Ascend, SMIC fabs (DUV only)

12

Taoyuan

TW

Victory Giant PCBs for the entire industry

§ 06 CAST OF CHARACTERS

Twelve companies, one supply chain.

Profiled

12

The people you should know by name. Each plays a role no one else can, on a campus most readers couldn’t place on a map.

01

ASML

Builds the hardest machine humans make. Without ASML, the modern world stops.

€34 B revenue sole EUV supplier

02

TSMC

Ninety percent of leading-edge logic. NVIDIA’s, Apple’s, AMD’s, and Google’s chips all start here.

$90 B revenue single point of failure

03

NVIDIA

Designs the accelerator everyone else buys. Doesn’t own a fab. Owns the customer.

$200 B revenue ~90% AI accelerator share

04

SK Hynix

Was memory’s runner-up. Picked HBM early. Is now NVIDIA’s most important external supplier.

$40 B revenue ≈50% HBM share

05

Huawei

The only vertical empire — fabs, networking, AI talent, end markets — all in one country. Sanctioned. Still climbing.

$130 B revenue on DUV, climbing fast

06

Google

Has had in-house silicon since 2015. Has the largest scale-up domain in the world. Just woke up to AGI.

$300 B revenue TPU v7 · Gemini 3

07

Anthropic

$4 B → $6 B added in two months. Now compute-constrained on every dimension simultaneously.

$25 B ARR Opus 4.6 · Claude Code

08

OpenAI

Signed deals with every NeoCloud that would have them. Now the largest commercial compute buyer on Earth.

$30 B ARR gpt-5.4 · 7 GW by EOY

09

Carl Zeiss

~1,000 artisans polishing the eighteen multilayer mirrors that go into every EUV tool. Sub-nanometer accuracy, by hand.

€10 B revenue 18 mirrors / tool

10

Crusoe

From flare-gas Bitcoin to OpenAI’s biggest data-center partner. Pioneered behind-the-meter at gigawatt scale.

private Abilene 1.2 GW build

11

Microsoft

OpenAI’s landlord. Also the cloud half of “foundry”. Stuck navigating between the two largest customers of AI.

$300 B revenue Azure + Foundry

12

AWS

Trainium 3 ships this year. The only hyperscaler trying to design its own AI silicon at scale outside of Google.

$100 B AWS rev Trainium 3 + Graviton

§ 07 TRACE A GPU SUPPLY-CHAIN WALK

Where does your chip come from?

Follow a single accelerator across eight stops, twelve companies, and roughly fourteen weeks of physical motion.

  1. 01

    Worldwide

    Silica sand

    Quartz mines in NC, Spruce Pine and elsewhere.

  2. 02

    Japan

    Silicon ingot

    Shin-Etsu and SUMCO grow 30 cm boules.

  3. 03

    Taiwan

    N3 wafer

    TSMC fab in Hsinchu · 70-mask process.

  4. 04

    Netherlands → TW

    EUV pass

    20 EUV layers from ASML tools at TSMC.

  5. 05

    Taiwan

    CoWoS

    4 dies bonded to interposer, then to substrate.

  6. 06

    Korea

    HBM4 bonded

    8 stacks from SK Hynix Icheon. The big ticket.

  7. 07

    TW · US

    Test & burn-in

    ASE / Amkor verify every chip at full clock.

  8. 08

    TW → US

    NVL72 rack

    Wiwynn / Foxconn integrate 72 GPUs + NVLink switch.

≈14 weeks

sand → powered rack

8 stops

physically traversed

12 firms

contribute to one chip

modeled

unit cost · product-specific

3 countries

hold single points of failure