计算机科学
二锂
现场可编程门阵列
散列函数
密码学
可重组计算
计算机硬件
嵌入式系统
并行计算
算法
计算机安全
量子力学
脱质子化
物理
离子
作者
Tengfei Wang,Chi Zhang,Pei Cao,Dawu Gu
出处
期刊:IEEE Transactions on Very Large Scale Integration Systems
[Institute of Electrical and Electronics Engineers]
日期:2022-06-29
卷期号:30 (9): 1158-1171
被引量:20
标识
DOI:10.1109/tvlsi.2022.3179459
摘要
In the process of NIST postquantum cryptography standardization, module lattice-based Dilithium has been chosen as one of the three third-round finalists for digital signature schemes. More evaluations of its implementation efficiency on different platforms are required for further competition. In this article, we present an efficient implementation of Dilithium on a field-programmable gate array (FPGA) system-on-chip (SoC) platform. To achieve a high computation speed, we design a hardware architecture to perform the main body of the algorithm, and the preprocessing and postprocessing steps are accomplished by the processor. For the hardware architecture, we take some optimizations on the most time-consuming operations, that is, polynomial multiplication, hashing, and sampling. Polynomial multiplications are accelerated by the radix-4 number theoretic transform (NTT) architecture with a conflict-free memory mapping scheme. A fast modular multiplication on the Dilithium modulus is proposed to support the underlying calculations. For hashing and sampling, we design a multipurpose hashing unit and a compact sampling unit. The cooperative work of the two units accelerates the sampling process significantly. We implement the Key Generation, Signing, and Verification algorithms of the round-3 Dilithium at all three security levels on the Xilinx Zynq-7000 platform. Compared with existing software/hardware codesign for Dilithium on a similar platform, our design achieves about $17\times $ and $40\times $ improvements in performance for the Signing and Verification algorithms, respectively, at the cost of about $7.8\times $ more look up table (LUT) resources.
科研通智能强力驱动
Strongly Powered by AbleSci AI