About this Series
During my work, I often ask myself questions. Some are easy to answer, others are not. For example:
- Why do computers use binary, not decimal or hexadecimal?
- Since CANN already provides many operators, why are developers still writing new operators themselves?
- GPUs/NPUs/TPUs are so expensive and generate such huge revenue, so why don't manufacturers produce more? What is the limiting factor?
Now that I have some answers, I would like to write them down. To be efficient, the notes may be in English, Chinese, or a mix of both.
Understanding chips from a cost point of view
At a high level of abstraction, the cost can be written as:
COST = material + manufacturing + IP
The raw material for chips is essentially sand (silicon), and its cost is quite low. The IP cost is more or less fixed, so let's put it aside as well.
Manufacturing is the KEY. To understand the manufacturing process, why not reason from first principles?
What is inside a chip, such as a CPU?
- Die (Chip, Integrated Circuit)
  - CPU Core
    - Layer
      - Wafer
        - Transistors
      - Isolators
      - Local connectors
      - Intermediate connectors
      - Global connectors
  - GPU Core
  - Memory Controller
  - Compute Fabric
- Die …
How much does a chip factory cost?
Size: 8 football fields
Price:
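- Hundreds of machines
- Ultraviolet lithography machine: 170 million USD
- Cheapest machine: a few million USD
- 80 layers × photomasks ($300k each)
- For each layer, various materials are added and then patterned by lithography; the layers are stacked step by step until the chip is complete.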
Duration: a CPU travels among the machines for over 3 months
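As a rough sanity check on the figures above, the photomask set alone can be priced out with simple arithmetic. The sketch below uses only the numbers quoted in this section (80 layers, one photomask per layer at about $300k, one lithography machine at $170M). The variable and function names are mine, and real mask prices vary by layer, so treat the result as an order-of-magnitude estimate.

```python
# Back-of-the-envelope cost figures, using the numbers quoted above.
LAYERS = 80                       # layers per die, one photomask each
PHOTOMASK_COST_USD = 300_000      # quoted price per photomask
LITHO_MACHINE_USD = 170_000_000   # quoted price of one lithography machine


def mask_set_cost(layers: int, cost_per_mask: int) -> int:
    """Total cost of one full photomask set (one mask per layer)."""
    return layers * cost_per_mask


total = mask_set_cost(LAYERS, PHOTOMASK_COST_USD)
print(f"Photomask set: ${total:,}")
print(f"Share of one lithography machine: {total / LITHO_MACHINE_USD:.1%}")
```

Under these figures the mask set comes to $24 million, so even the "small" items on the list are far from cheap.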
Takeaways
The reason CPUs are so expensive can be understood from four angles: materials, manufacturing processes, time, and quality control.
- Materials
  - Silicon.
  - Chemical materials.
- Processes
  - Each layer needs one photomask.
  - Each layer needs chemical materials, which take time.
  - Machines are very expensive.
- Time
  - Many processes have to be serial, not parallel.
  - Chemical reactions take time.
- Quality control
  - The more complicated the process, the easier it is to fail.
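The quality-control point can be made concrete with a little probability. If every processing step must succeed for a die to work, and the steps fail independently, the overall yield is the product of the per-step yields. The sketch below is a simplified model, not how a fab actually measures yield; the 99% per-step figure and the function name `overall_yield` are assumptions for illustration.

```python
def overall_yield(per_step_yield: float, steps: int) -> float:
    """Probability that all `steps` independent steps succeed."""
    if not 0.0 <= per_step_yield <= 1.0:
        raise ValueError("per_step_yield must be between 0 and 1")
    if steps < 0:
        raise ValueError("steps must be non-negative")
    return per_step_yield ** steps


# Assumed 99% success rate per step, for illustration only.
for steps in (10, 80, 500):
    print(f"{steps:>4} steps -> {overall_yield(0.99, steps):.1%} overall yield")
```

Even with a 99% success rate per step, fewer than half of the dies survive 80 steps under this model, which is why longer and more complicated processes are so much harder to run profitably.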
Here are some interesting figures.
- A factory can be as large as 8 football fields.
- Hundreds of machines
- $170M for one ultraviolet lithography machine
- 300 mm diameter for one wafer
- 80 layers in one die
- $300K for each photomask (one per layer)
- 3 months for one CPU
Cooling System
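Water vs. air.
Thermal conductivity and specific heat capacity are the key properties. Specific heat capacity alone is enough to explain why water is the ideal coolant.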
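A quick calculation shows why specific heat capacity favors water. The heat a coolant carries away is Q = m · c · ΔT, so for the same volume and temperature rise, the comparison comes down to density times specific heat. The property values below are approximate room-temperature textbook figures and the names are mine; this sketch ignores flow rate, pumping cost, and thermal conductivity.

```python
# Approximate properties near room temperature (assumed textbook values).
WATER = {"density_kg_m3": 997.0, "specific_heat_j_kg_k": 4180.0}
AIR = {"density_kg_m3": 1.2, "specific_heat_j_kg_k": 1005.0}


def heat_absorbed_joules(coolant: dict, volume_m3: float, delta_t_k: float) -> float:
    """Q = m * c * dT, with mass m = density * volume."""
    mass_kg = coolant["density_kg_m3"] * volume_m3
    return mass_kg * coolant["specific_heat_j_kg_k"] * delta_t_k


# One litre of coolant warming by 10 K.
q_water = heat_absorbed_joules(WATER, volume_m3=0.001, delta_t_k=10.0)
q_air = heat_absorbed_joules(AIR, volume_m3=0.001, delta_t_k=10.0)
print(f"Water: {q_water:,.0f} J, air: {q_air:,.1f} J")
print(f"Water absorbs about {q_water / q_air:,.0f}x more heat per litre")
```

Per unit mass, water holds roughly four times as much heat as air; per unit volume the gap grows to several thousand times because water is so much denser.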
Next blog
The next post will explain the trade-offs in chip design.