基于格的密码学
密码学
现场可编程门阵列
计算机科学
吞吐量
瓶颈
乘法(音乐)
矩阵乘法
Virtex公司
并行计算
算术
数学
算法
计算机硬件
嵌入式系统
量子
量子密码学
物理
量子信息
组合数学
无线
电信
量子力学
作者
Dur‐e‐Shahwar Kundi,Yuqing Zhang,Chenghua Wang,Ayesha Khalid,Máire O’Neill,Weiqiang Liu
出处
期刊:IEEE Transactions on Emerging Topics in Computing
[Institute of Electrical and Electronics Engineers]
日期:2022-01-25
卷期号:10 (4): 1993-2005
被引量:31
标识
DOI:10.1109/tetc.2022.3144101
摘要
Lattice-based cryptography (LBC) has emerged as the most viable substitutes to the classical cryptographic schemes as 5 out of 7 finalist schemes in the 3rd round of the NIST post-quantum cryptography (PQC) standardization process are lattice based in construction. This work explores novel architectural optimizations in the FPGA-based hardware implementation of polynomial multiplication, which is a bottleneck in every LBC construction. To target ultra-high throughput, both schoolbook polynomial multiplication (SPM) and number theoretic transform (NTT) are explored: a completely parallel architecture of an SPM is undertaken while for NTT, radix-2 and radix- $2^2$ multi-path delay commutator (MDC) based pipelined architectures are adopted. Our proposed high-speed SPM (HSPM) structure on latest Xilinx UltraScale+ FPGA is 5× faster than the state-of-the-art LBC designs. Whereas, the proposed high-speed NTT (HNTT) structure (i.e., R $2^2$ MDC) takes only 0.63 $\mu$ s for the encryption, hence achieving the highest throughput of 408 Mbps. Moreover, all of the proposed designs achieve highest design efficiencies (i.e., throughput per slice (TPS)) in comparison to available LBC designs.
科研通智能强力驱动
Strongly Powered by AbleSci AI