Ascend: a Scalable and Unified Architecture for Ubiquitous Deep Neural Network Computing : Industry Track Paper

计算机科学 可扩展性 测距 嵌入式系统 以数据库为中心的体系结构 建筑 编译程序 概括性 内存层次结构 分布式计算 参考体系结构 计算机体系结构 软件体系结构 操作系统 软件 隐藏物 心理学 艺术 视觉艺术 电信 心理治疗师
作者
Heng Liao,Jiajin Tu,Jing Xia,Liu Hu,Zhou Xi-ping,Hao Yuan,Yuxing Hu
出处
期刊:High-Performance Computer Architecture 被引量:28
标识
DOI:10.1109/hpca51647.2021.00071
摘要

Deep neural networks (DNNs) have been successfully applied to a great variety of applications, ranging from small IoT devices to large scale services in a data center. In order to improve the efficiency of processing these DNN models, dedicated hardware accelerators are required for all these scenarios. Theoretically, there exists an optimized acceleration architecture for each application. However, considering the cost of chip design and corresponding tool-chain development, researchers need to trade off between efficiency and generality. In this work, we demonstrate that it is practical to use a unified architecture, called Ascend, to support those applications, ranging from IoT devices to data-center services. We provide a lot of design details to explain that the success of Ascend relies on contributions from different levels. First, heterogeneous computing units are employed to support various DNN models. And the datapath is adapted according to the requirement of computing and data access. Second, when scaling the Ascend architecture from a single core to a cluster containing thousands of cores, it involves design efforts, such as memory hierarchy and system level integration. Third, a multi-tier compiler, which provides flexible choices for developers, is the last critical piece. Experimental results show that using accelerators based on the Ascend architecture can achieve comparable or even better performance in different applications. In addition, various chips based on the Ascend architecture have been successfully commercialized. More than 100 million chips have been used in real products.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
复杂不二完成签到,获得积分10
1秒前
宗沛柔发布了新的文献求助10
2秒前
酷酷的友灵完成签到,获得积分10
2秒前
3秒前
4秒前
SClcy发布了新的文献求助10
4秒前
yyy完成签到,获得积分20
4秒前
丰知然发布了新的文献求助10
5秒前
飘逸宫苴发布了新的文献求助10
5秒前
打打应助wtqaaaa采纳,获得10
6秒前
8秒前
wch071完成签到,获得积分10
8秒前
Lion完成签到,获得积分10
9秒前
yaxianzhi完成签到,获得积分10
11秒前
yyy发布了新的文献求助30
11秒前
暴躁的初阳完成签到,获得积分0
11秒前
11秒前
stonerdog完成签到,获得积分10
14秒前
古的古的应助圆滑的铁勺采纳,获得10
15秒前
等等NANO完成签到,获得积分10
16秒前
冷艳月光完成签到,获得积分10
17秒前
huanhuan完成签到,获得积分20
17秒前
NexusExplorer应助科研通管家采纳,获得10
17秒前
Singularity应助科研通管家采纳,获得10
17秒前
HeNeArKrXeRn应助科研通管家采纳,获得10
17秒前
无花果应助科研通管家采纳,获得10
17秒前
吴彦祖应助科研通管家采纳,获得80
17秒前
Orange应助科研通管家采纳,获得10
17秒前
嗯哼应助科研通管家采纳,获得10
18秒前
18秒前
完美世界应助科研通管家采纳,获得10
18秒前
CodeCraft应助科研通管家采纳,获得10
18秒前
18秒前
18秒前
小二郎应助科研通管家采纳,获得10
18秒前
jijijibibibi完成签到,获得积分10
18秒前
pan发布了新的文献求助10
18秒前
nsk810431231完成签到 ,获得积分10
19秒前
20秒前
高分求助中
LNG地下式貯槽指針(JGA指-107-19)(Recommended practice for LNG inground storage) 1000
rhetoric, logic and argumentation: a guide to student writers 1000
QMS18Ed2 | process management. 2nd ed 1000
Eric Dunning and the Sociology of Sport 850
Operative Techniques in Pediatric Orthopaedic Surgery 510
Generalized Linear Mixed Models 第二版 500
人工地层冻结稳态温度场边界分离方法及新解答 500
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2920798
求助须知:如何正确求助?哪些是违规求助? 2563065
关于积分的说明 6932824
捐赠科研通 2220944
什么是DOI,文献DOI怎么找? 1180625
版权声明 588751
科研通“疑难数据库(出版商)”最低求助积分说明 577598