亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

25.4 A 20nm 6GB Function-In-Memory DRAM, Based on HBM2 with a 1.2TFLOPS Programmable Computing Unit Using Bank-Level Parallelism, for Machine Learning Applications

计算机科学 带宽(计算) 内存带宽 德拉姆 利用 嵌入式系统 炸薯条 计算机体系结构 计算机硬件 计算机网络 电信 计算机安全
作者
Young-Cheon Kwon,Suk Han Lee,Jae‐Hoon Lee,Sanghyuk Kwon,Je Min Ryu,Jong-Pil Son,O Seongil,Hak-soo Yu,Haesuk Lee,Soo Young Kim,Youngmin Cho,Jin Guk Kim,Jongyoon Choi,Hyunsung Shin,Jin Kim,Bengseng Phuah,HyoungMin Kim,Myeong Jun Song,Ahn Choi,Daeho Kim
标识
DOI:10.1109/isscc42613.2021.9365862
摘要

In recent years, artificial intelligence (AI) technology has proliferated rapidly and widely into application areas such as speech recognition, health care, and autonomous driving. To increase the capabilities of AI more powerful systems are needed to process a larger amount of data. This requirement has made domain-specific accelerators, such as GPUs and TPUs, popular; as they can provide orders of magnitude higher performance than state-of-the-art CPUs. However, these accelerators can only operate at their peak performance when they get the necessary data from memory as quickly as it is processed: requiring off-chip memory with a high bandwidth and a large capacity [1]. HBM has thus far met the bandwidth and capacity requirement [2-6], but recent AI technologies such as recurrent neural networks require an even higher bandwidth than HBM [7-8]. While a further increase in off-chip bandwidth can be accomplished by various techniques, it is often limited by power constraints at the chip or system level [9]. Hence, it is essential to decrease demand for off-chip bandwidth with unconventional architectures: such as processing-in-memory. In this paper, we present function-Inmemory DRAM (FIMDRAM) that integrates a 16-wide single-instruction multiple-data engine within the memory banks and that exploits bank-level parallelism to provide 4× higher processing bandwidth than an off-chip memory solution. Second, we show techniques that do not require any modification to conventional memory controllers and their command protocols, which make FIMDRAM more practical for quick industry adoption. Finally, we conclude this paper with circuitand system-level evaluations of our fabricated FIMDRAM.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
7秒前
可爱的函函应助hefang采纳,获得20
13秒前
西瓜刀完成签到 ,获得积分10
29秒前
1分钟前
1分钟前
1分钟前
景景景完成签到,获得积分10
1分钟前
1分钟前
hefang发布了新的文献求助20
1分钟前
景景景发布了新的文献求助10
1分钟前
务实书包完成签到,获得积分10
1分钟前
1分钟前
1分钟前
1分钟前
研友_8Y26PL完成签到 ,获得积分10
1分钟前
科研通AI5应助典雅的飞丹采纳,获得10
1分钟前
h0jian09完成签到,获得积分10
1分钟前
1分钟前
骆马湖完成签到,获得积分10
2分钟前
草木发布了新的文献求助10
2分钟前
2分钟前
黑球发布了新的文献求助10
2分钟前
3分钟前
Aqib发布了新的文献求助10
3分钟前
mao应助Aqib采纳,获得10
3分钟前
3分钟前
Jinyang完成签到 ,获得积分10
4分钟前
赘婿应助黑球采纳,获得10
4分钟前
4分钟前
Sunnpy发布了新的文献求助30
4分钟前
4分钟前
4分钟前
冠状发布了新的文献求助10
4分钟前
Aqib完成签到,获得积分10
4分钟前
4分钟前
万类霜天竞自由完成签到,获得积分10
4分钟前
Lazarus完成签到,获得积分10
4分钟前
胖胖猪完成签到,获得积分10
4分钟前
八宝粥我爱喝完成签到 ,获得积分10
4分钟前
冠状完成签到,获得积分10
4分钟前
高分求助中
All the Birds of the World 4000
Production Logging: Theoretical and Interpretive Elements 3000
Animal Physiology 2000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Am Rande der Geschichte : mein Leben in China / Ruth Weiss 1500
CENTRAL BOOKS: A BRIEF HISTORY 1939 TO 1999 by Dave Cope 1000
Machine Learning Methods in Geoscience 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3736624
求助须知:如何正确求助?哪些是违规求助? 3280584
关于积分的说明 10020088
捐赠科研通 2997281
什么是DOI,文献DOI怎么找? 1644507
邀请新用户注册赠送积分活动 782041
科研通“疑难数据库(出版商)”最低求助积分说明 749648