计算机科学
杠杆(统计)
人工神经网络
人工智能
编码
深层神经网络
深度学习
图灵
机器学习
生物化学
基因
化学
程序设计语言
作者
Adam Santoro,Sergey Bartunov,Matthew Botvinick,Daan Wierstra,Timothy Lillicrap
出处
期刊:Cornell University - arXiv
日期:2016-05-19
被引量:283
摘要
Despite recent breakthroughs in the applications of deep neural networks, one setting that presents a persistent challenge is that of one-shot learning. Traditional gradient-based networks require a lot of data to learn, often through extensive iterative training. When new data is encountered, the models must inefficiently relearn their parameters to adequately incorporate the new information without catastrophic interference. Architectures with augmented memory capacities, such as Neural Turing Machines (NTMs), offer the ability to quickly encode and retrieve new information, and hence can potentially obviate the downsides of conventional models. Here, we demonstrate the ability of a memory-augmented neural network to rapidly assimilate new data, and leverage this data to make accurate predictions after only a few samples. We also introduce a new method for accessing an external memory that focuses on memory content, unlike previous methods that additionally use memory location-based focusing mechanisms.
科研通智能强力驱动
Strongly Powered by AbleSci AI