路由器
计算机科学
容错
可靠性(半导体)
集成电路
三维集成电路
算法
拓扑(电路)
嵌入式系统
可靠性工程
分布式计算
功率(物理)
工程类
电气工程
计算机网络
操作系统
物理
量子力学
作者
Khanh N. Dang,Akram Ben Ahmed,Abderazek Ben Abdallah,Xuan‐Tu Tran
标识
DOI:10.1109/tcad.2021.3069370
摘要
Through silicon via (TSV) is considered as the near-future solution to realize low-power and high-performance 3D-integrated circuits (3D-ICs) and 3D-Network-on-Chips (3D-NoCs). However, the lifetime reliability issue of TSV due to its fault sensitivity and the high operating temperature of 3D-ICs, which also accelerates the fault rate, is one of the most critical challenges. Meanwhile, most current works focus on detecting and correcting TSV defects after manufacturing without considering high-temperature nodes' impact on lifetime reliability. Besides, the recovery for defective clusters is also challenging because of costly redundancies. In this work, we present HotCluster : a hotspot-aware self-correction platform for clustering defects in 3D-NoCs to help understand and tackle this problem. We first give a method to predict normalized fault rates and place redundant TSV groups according to each region's fault rate. In our particular medium fault rate (normalized to the coolest area), HotCluster reduces about 60% of the redundancies in comparison to the uniformly distributed redundancies while having a higher ratio of router working in a normal state. Furthermore, HotCluster integrates both online (weight based) and offline (max-flow min-cut offline method) mapping algorithms to help the system correct the faulty TSV clusters. The experimental results show that both the max-flow min-cut offline method and weight-based online mode with a redundancy of 0.25 exhibits less than 1% of routers disabled under 50% defect rates.
科研通智能强力驱动
Strongly Powered by AbleSci AI