强化学习
计算机科学
德拉姆
串扰
马尔可夫决策过程
带宽(计算)
马尔可夫链
电子工程
马尔可夫过程
人工智能
工程类
计算机硬件
数学
电信
统计
机器学习
作者
Keunwoo Kim,Hyunwook Park,Daehwan Lho,Minsu Kim,Keeyoung Son,Kyungjune Son,Seongguk Kim,Taein Shin,Seong Gon Choi,Joungho Kim
标识
DOI:10.1109/edaps50281.2020.9312906
摘要
In this paper, we propose the through silicon via (TSV) array design optimization method using deep reinforcement learning (DRL) framework. The agent trained through the proposed method can provide an optimal TSV array that minimizes far-end crosstalk (FEXT) in one single step. We define the state, action, and reward that are elements of the Markov Decision Process (MDP) for optimizing the TSV array considering FEXT and train a deep q network (DQN) agent. For verification, we applied the proposed method to a 3 by 3 through silicon via array at stacked DRAM of High Bandwidth Memory (HBM). The network converged well, and as the result, the proposed method provided the optimal design that satisfies the target FEXT in which 3 dB lower than the initial design.
科研通智能强力驱动
Strongly Powered by AbleSci AI