计算机科学
初始化
人工智能
机器学习
元学习(计算机科学)
超参数
适应(眼睛)
任务(项目管理)
光学(聚焦)
一般化
数学分析
物理
光学
经济
管理
程序设计语言
数学
作者
Sungyong Baik,Myungsub Choi,Janghoon Choi,Heewon Kim,Kyoung Mu Lee
标识
DOI:10.1109/tpami.2023.3261387
摘要
The objective of few-shot learning is to design a system that can adapt to a given task with only few examples while achieving generalization. Model-agnostic meta-learning (MAML), which has recently gained the popularity for its simplicity and flexibility, learns a good initialization for fast adaptation to a task under few-data regime. However, its performance has been relatively limited especially when novel tasks are different from tasks previously seen during training. In this work, instead of searching for a better initialization, we focus on designing a better fast adaptation process. Consequently, we propose a new task-adaptive weight update rule that greatly enhances the fast adaptation process. Specifically, we introduce a small meta-network that can generate per-step hyperparameters for each given task: learning rate and weight decay coefficients. The experimental results validate that learning a good weight update rule for fast adaptation is the equally important component that has drawn relatively less attention in the recent few-shot learning approaches. Surprisingly, fast adaptation from random initialization with ALFA can already outperform MAML. Furthermore, the proposed weight-update rule is shown to consistently improve the task-adaptation capability of MAML across diverse problem domains: few-shot classification, cross-domain few-shot classification, regression, visual tracking, and video frame interpolation.
科研通智能强力驱动
Strongly Powered by AbleSci AI