Computer science
Code (set theory)
Double-precision floating-point format
Single-precision floating-point format
Code generation
Speedup
Overhead (engineering)
Affine transformation
Algorithm
Parallel computing
Generator (circuit theory)
Computer engineering
Floating point
Set (abstract data type)
Mathematics
Programming language
Key (lock)
Power (physics)
Pure mathematics
Physics
Quantum mechanics
Computer security
Authors
Jinchen Xu, Guanghui Song, Bei Zhou, Li Fei, Jiangwei Hao, Jie Zhao
Identifier
DOI:10.1145/3627535.3638484
Abstract
Reducing floating-point (FP) precision is used to trade the quality of a numerical program's output for performance, but this optimization also introduces type casts, whose overhead remains unknown until a mixed-precision code version is generated. This uncertainty forced prior work to implement mixed-precision code generation and autotuning as decoupled stages. In this paper, we present a holistic approach called PrecTuner that consolidates the mixed-precision code generator and the autotuner by defining a single parameter. This parameter is first initialized with automatically sampled values and used to generate several code variants, with various loop transformations also taken into account. The generated code variants are then profiled to solve a performance model formulated in terms of this parameter, possibly under a pre-defined quality-degradation budget. The best-performing value of the parameter is finally predicted without evaluating all code variants. Experimental results on the PolyBench benchmarks show that PrecTuner outperforms LuIs by 3.28× on CPU while achieving smaller errors, and we also validate its effectiveness in optimizing a real-life large-scale application. In addition, PrecTuner obtains a mean speedup of 1.81× and 1.52×-1.73× over Pluto on single- and multi-core CPU, respectively, and 1.71× over PPCG on GPU.
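To make the trade-off concrete, below is a minimal C sketch of the idea the abstract describes; it is not PrecTuner's actual code. The parameter `cut` is a hypothetical stand-in for the single tuning parameter the paper defines: iterations before `cut` run in double precision, the rest in single precision, so the explicit (float) casts only appear, and can only be costed, once the mixed-precision variant is generated.

#include <stdio.h>
#include <stdlib.h>
#include <math.h>

#define N 1000000

/* Mixed-precision sum: `cut` selects where the kernel switches from
   double to float arithmetic. cut = N is all-double (baseline quality);
   cut = 0 is all-float (fastest, largest error). */
double mixed_sum(const double *x, int cut) {
    double acc_d = 0.0;
    float  acc_f = 0.0f;
    for (int i = 0; i < cut; i++)
        acc_d += x[i];            /* high-precision region */
    for (int i = cut; i < N; i++)
        acc_f += (float)x[i];     /* low-precision region; the (float)
                                     cast is the overhead that stays
                                     hidden until code is generated */
    return acc_d + (double)acc_f;
}

int main(void) {
    double *x = malloc(N * sizeof *x);
    if (!x) return 1;
    for (int i = 0; i < N; i++)
        x[i] = 1.0 / (i + 1);

    double ref = mixed_sum(x, N);  /* all-double reference output */
    /* An autotuner in the paper's spirit would profile a few sampled
       values of `cut`, fit a performance model over them, and predict
       the best value whose error stays within the quality budget,
       instead of evaluating every variant. */
    for (int cut = 0; cut <= N; cut += N / 4) {
        double err = fabs(ref - mixed_sum(x, cut));
        printf("cut=%7d  |error| vs all-double: %.3e\n", cut, err);
    }
    free(x);
    return 0;
}

Profiling a handful of `cut` values and interpolating, rather than timing every variant, mirrors the paper's strategy of solving a performance model over one parameter instead of exhaustive evaluation.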