ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

任务(项目管理) 计算机体系结构 建筑 计算机科学 计算机硬件 嵌入式系统 人工智能 工程类 地理 系统工程 考古
作者
Han Cai,Ligeng Zhu,Song Han
出处
期刊:Cornell University - arXiv 被引量:1308
标识
DOI:10.48550/arxiv.1812.00332
摘要

Neural architecture search (NAS) has a great impact by automatically designing effective neural network architectures. However, the prohibitive computational demand of conventional NAS algorithms (e.g. $10^4$ GPU hours) makes it difficult to \emph{directly} search the architectures on large-scale tasks (e.g. ImageNet). Differentiable NAS can reduce the cost of GPU hours via a continuous representation of network architecture but suffers from the high GPU memory consumption issue (grow linearly w.r.t. candidate set size). As a result, they need to utilize~\emph{proxy} tasks, such as training on a smaller dataset, or learning with only a few blocks, or training just for a few epochs. These architectures optimized on proxy tasks are not guaranteed to be optimal on the target task. In this paper, we present \emph{ProxylessNAS} that can \emph{directly} learn the architectures for large-scale target tasks and target hardware platforms. We address the high memory consumption issue of differentiable NAS and reduce the computational cost (GPU hours and GPU memory) to the same level of regular training while still allowing a large candidate set. Experiments on CIFAR-10 and ImageNet demonstrate the effectiveness of directness and specialization. On CIFAR-10, our model achieves 2.08\% test error with only 5.7M parameters, better than the previous state-of-the-art architecture AmoebaNet-B, while using 6$\times$ fewer parameters. On ImageNet, our model achieves 3.1\% better top-1 accuracy than MobileNetV2, while being 1.2$\times$ faster with measured GPU latency. We also apply ProxylessNAS to specialize neural architectures for hardware with direct hardware metrics (e.g. latency) and provide insights for efficient CNN architecture design.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
cue完成签到,获得积分10
1秒前
天天完成签到,获得积分10
1秒前
小黑哥发布了新的文献求助10
1秒前
1秒前
2秒前
2秒前
Cathy17sl发布了新的文献求助10
3秒前
3秒前
1q完成签到,获得积分10
3秒前
顾文完成签到,获得积分20
4秒前
4秒前
SciGPT应助Richardisme采纳,获得10
4秒前
5秒前
YangZhang发布了新的文献求助10
5秒前
qnmd完成签到,获得积分10
5秒前
5秒前
成就的井发布了新的文献求助10
7秒前
Stranger发布了新的文献求助10
7秒前
伶俐老头发布了新的文献求助10
7秒前
8秒前
GABOfly应助shui采纳,获得10
8秒前
淡定的弘完成签到,获得积分10
8秒前
pipipi完成签到,获得积分10
8秒前
8秒前
Languagesssss发布了新的文献求助10
9秒前
Recitative关注了科研通微信公众号
10秒前
10秒前
狗东西完成签到,获得积分10
11秒前
南木完成签到 ,获得积分10
11秒前
wen发布了新的文献求助30
11秒前
杜子完成签到,获得积分10
11秒前
12秒前
SciGPT应助Feng5945采纳,获得10
12秒前
小太阳发布了新的文献求助10
13秒前
pipipi发布了新的文献求助10
13秒前
YU完成签到 ,获得积分10
13秒前
Hello应助细心秀发采纳,获得10
13秒前
13秒前
小蘑菇应助美满的冬卉采纳,获得10
13秒前
putaotang完成签到,获得积分10
14秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Comprehensive Methanol Science: Production, Applications, and Emerging Technologies 4000
Kinesiophobia : a new view of chronic pain behavior 2000
Comprehensive Methanol Science: Production, Applications, and Emerging Technologies Volume 2: Methanol Production from Fossil Fuels and Renewable Resources 1000
Comprehensive Methanol Science: Production, Applications, and Emerging Technologies Volume 1: Methanol Characteristics and Environmental Challenges in Direct Methane Conversion 1000
The Social Psychology of Citizenship 1000
Research for Social Workers 1000
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5918188
求助须知:如何正确求助?哪些是违规求助? 6882717
关于积分的说明 15805847
捐赠科研通 5044518
什么是DOI,文献DOI怎么找? 2714756
邀请新用户注册赠送积分活动 1667482
关于科研通互助平台的介绍 1605962