on-chip SRAM AI ASIC


An on-chip SRAM AI ASIC is an accelerator in which most of the working set (activations, partial sums, and sometimes the weights themselves) stays in SRAM on the compute die, rather than being fetched from off-chip DRAM/HBM.

 

1. Latency dominance (especially LLM inference)

  • SRAM access: ~0.3–1 ns

  • HBM access (effective): ~50–100 ns

  • DDR access: 100+ ns

For token-by-token inference, this difference dominates user-visible latency.
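
A back-of-envelope sketch of what this means per token, splitting the time into a bandwidth-bound weight-streaming term and a serialized access-latency term. Every number here (model size, bandwidths, latencies, layer count) is an illustrative assumption, not a measurement:

```python
# Rough per-token latency model for batch-1 autoregressive decoding.
# All parameters are illustrative assumptions, not vendor measurements.

def per_token_latency_ns(bytes_per_token, bandwidth_gbps, access_ns, dependent_accesses):
    """Bandwidth-bound streaming time plus serialized (non-overlapped) access latency."""
    streaming_ns = bytes_per_token / bandwidth_gbps   # 1 GB/s == 1 byte per ns
    serial_ns = dependent_accesses * access_ns        # dependent fetches that cannot be hidden
    return streaming_ns + serial_ns

weights_bytes = 7e9   # hypothetical 7B-parameter model, INT8 weights read once per token
layers = 32           # assume one unhidden memory round-trip per layer (simplified)

hbm  = per_token_latency_ns(weights_bytes, bandwidth_gbps=3_000,  access_ns=80, dependent_accesses=layers)
sram = per_token_latency_ns(weights_bytes, bandwidth_gbps=80_000, access_ns=1,  dependent_accesses=layers)

print(f"HBM-resident weights : {hbm / 1e6:.2f} ms/token")    # ~2.3 ms with these assumptions
print(f"SRAM-resident weights: {sram / 1e6:.3f} ms/token")   # ~0.09 ms with these assumptions
```

With these assumed numbers the streaming term dominates, but both terms shrink by one to two orders of magnitude once the working set stays on-die, which is exactly the gap users feel as time-per-token.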

2. Energy efficiency

Approximate energy per bit moved:

  • SRAM: ~0.1–1 pJ/bit

  • HBM: ~3–5 pJ/bit

  • DDR: 10+ pJ/bit

LLMs are often memory-energy limited, not compute-limited.
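
The list above turns into a per-token energy estimate with a few lines of arithmetic. This is a minimal sketch assuming a hypothetical 7B-parameter INT8 model whose weights are read once per token, using mid-range pJ/bit values:

```python
# Memory energy per generated token, using mid-range pJ/bit values from the list above.
PJ_PER_BIT = {"SRAM": 0.5, "HBM": 4.0, "DDR": 12.0}   # illustrative mid-range assumptions

def memory_energy_joules(bytes_moved, tech):
    bits = bytes_moved * 8
    return bits * PJ_PER_BIT[tech] * 1e-12            # pJ -> J

bytes_per_token = 7e9   # hypothetical 7B-parameter model, INT8 weights read once per token
for tech in PJ_PER_BIT:
    print(f"{tech}: {memory_energy_joules(bytes_per_token, tech):.3f} J per token")
# SRAM ~0.03 J, HBM ~0.22 J, DDR ~0.67 J per token under these assumptions
```

At, say, 50 tokens/s, the DDR case alone is over 30 W of pure data movement, which is why memory traffic rather than arithmetic usually sets the power budget.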

3. Deterministic performance

  • No DRAM scheduling, refresh, or bank conflicts

  • Enables cycle-accurate pipelines (important for real-time systems)
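
A toy illustration of the determinism point: when every SRAM access and every compute tile has a fixed cycle count, a compiler can add them up and get the exact runtime at compile time rather than a statistical bound. The cycle counts below are made-up placeholders:

```python
# With fixed SRAM latencies and a static schedule, total runtime is exact, not a bound.
SRAM_READ_CYCLES  = 1     # placeholder values; real numbers depend on the design
MATMUL_CYCLES     = 128   # hypothetical fixed-size tile on the MAC array
SRAM_WRITE_CYCLES = 1

def static_schedule_cycles(num_tiles):
    # No refresh stalls, no bank-conflict retries, no cache misses:
    # every tile costs exactly the same number of cycles.
    per_tile = SRAM_READ_CYCLES + MATMUL_CYCLES + SRAM_WRITE_CYCLES
    return num_tiles * per_tile

print(static_schedule_cycles(num_tiles=4096), "cycles, known before the chip ever runs")
```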

Typical on-chip SRAM capacity by chip class:

  Chip class                   On-chip SRAM
  Mobile NPU                   4–32 MB
  Edge inference ASIC          32–128 MB
  Datacenter inference ASIC    100–300 MB
  Wafer-scale (Cerebras)       10s of GB

 

Famous examples (and what they optimized for)

Groq

  • All on-chip SRAM

  • Static schedule, no caches

  • Very low per-token latency

  • Limited flexibility and capacity

Google TPU v1–v3

  • Large on-chip SRAM buffers (TPU v1's unified buffer was 24 MB)

  • Matrix-centric workloads

  • Inference-only in v1; v2/v3 added training (HBM alongside the on-chip buffers)

Cerebras

  • Wafer-scale SRAM + compute

  • Avoids off-chip memory entirely for models that fit on the wafer

  • Extreme cost, extreme performance for certain models

 

When on-chip SRAM AI ASICs are the right answer

  • Ultra-low latency LLM inference

  • Real-time systems (finance, robotics, telecom)

  • Edge or power-constrained environments

  • Predictable workloads with known model shapes
