强化学习                        
                
                                
                        
                            计算机科学                        
                
                                
                        
                            带宽(计算)                        
                
                                
                        
                            卷积神经网络                        
                
                                
                        
                            电子工程                        
                
                                
                        
                            信号完整性                        
                
                                
                        
                            算法                        
                
                                
                        
                            工程类                        
                
                                
                        
                            人工智能                        
                
                                
                        
                            电信                        
                
                                
                        
                            互连                        
                
                        
                    
            作者
            
                Keunwoo Kim,Hyunwook Park,Seongguk Kim,Youngwoo Kim,Kyungjune Son,Daehwan Lho,Keeyoung Son,Taein Shin,Boogyo Sim,Joonsang Park,Shinyoung Park,Joungho Kim            
         
                    
        
    
            
            标识
            
                                    DOI:10.1109/temc.2023.3343700
                                    
                                
                                 
         
        
                
            摘要
            
            In this article, a policy-based reinforcement learning (RL) method for optimizing through silicon via (TSV) array design in high-bandwidth memory (HBM) considering signal integrity is proposed. The proposed method can provide an optimal TSV-array signal/ground pattern design to maximize the eye opening (EO), which determines the bandwidth of the high-speed TSV channel. The proposed method adopts the proximal policy optimization algorithm, which directly trains the optimal policy, providing efficient handling of large action spaces rather than value-based RL. The convolutional neural network is used as a feature extractor to extract the location information of the TSV-array. To overcome the computational cost of the reward estimation, a fast EO estimation method is developed based on the equivalent circuit modeling and peak distortion analysis. The proposed method is applied to optimize 1-byte of TSV-array in a 16-high HBM and showed an 18.2% increase in EO compared with the initial design. The optimality performance of the proposed method is compared with deep q-network and random search algorithm, and the proposed method shows 3.4% and 9.6% better optimality, respectively.
         
            
 
                 
                
                    
                    科研通智能强力驱动
Strongly Powered by AbleSci AI