Skip to content

perf(memory): integrate new Static GPU Memory Planning#303

Open
Zhao-Shi-jie wants to merge 4 commits into
InfiniTensor:masterfrom
Zhao-Shi-jie:master
Open

perf(memory): integrate new Static GPU Memory Planning#303
Zhao-Shi-jie wants to merge 4 commits into
InfiniTensor:masterfrom
Zhao-Shi-jie:master

Conversation

@Zhao-Shi-jie

@Zhao-Shi-jie Zhao-Shi-jie commented Mar 10, 2026

Copy link
Copy Markdown

-新增静态显存规划算法;
-该算法基于lazy_allocator 简化得到memory_allocator,这是新的显存规划算法使用的allocator。
-TEST:
不同显存规划算法计算得到不同的峰值显存需求,并且中间张量的最低显存需求小于原lazy_allocator策略的显存需求(下图为使用lazy_allocator策略的测试信息,图中“non weight tensor memory: 3195136”是非权重部分张量占用的大小,减去“Total IO footprint(belong non weight tensor): 180224 bytes”就是中间张量的显存占用)
1e44a52d2ba290834d9d884619161bdb

image 测试全部通过

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant