Keywords
Computer science, Task (project management), Context (archaeology), Margin (machine learning), Quality (philosophy), Artificial intelligence, Baseline (sea), Natural language processing, Function (biology), Machine learning, Human–computer interaction, Paleontology, Philosophy, Oceanography, Management, Epistemology, Evolutionary biology, Economics, Biology, Geology
Authors
Yongchao Zhou, Andrei Ioan Muresanu, Ziwen Han, Keiran Paster, Silviu Pitis, Harris Chan, Jimmy Ba
Source
Journal: Cornell University - arXiv
Date: 2022-01-01
Citations: 127
Identifier
DOI: 10.48550/arxiv.2211.01910
Abstract
By conditioning on natural language instructions, large language models (LLMs) have displayed impressive capabilities as general-purpose computers. However, task performance depends significantly on the quality of the prompt used to steer the model, and most effective prompts have been handcrafted by humans. Inspired by classical program synthesis and the human approach to prompt engineering, we propose Automatic Prompt Engineer (APE) for automatic instruction generation and selection. In our method, we treat the instruction as the "program," optimized by searching over a pool of instruction candidates proposed by an LLM in order to maximize a chosen score function. To evaluate the quality of the selected instruction, we evaluate the zero-shot performance of another LLM following the selected instruction. Experiments on 24 NLP tasks show that our automatically generated instructions outperform the prior LLM baseline by a large margin and achieve better or comparable performance to the instructions generated by human annotators on 19/24 tasks. We conduct extensive qualitative and quantitative analyses to explore the performance of APE. We show that APE-engineered prompts can be applied to steer models toward truthfulness and/or informativeness, as well as to improve few-shot learning performance by simply prepending them to standard in-context learning prompts. Please check out our webpage at https://sites.google.com/view/automatic-prompt-engineer.
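The abstract describes APE's core loop: an LLM proposes candidate instructions from input-output demonstrations, each candidate is scored by the zero-shot performance of a second LLM that follows it, and the highest-scoring instruction is selected. The following is a minimal sketch of that search loop, not the authors' released implementation; the `complete` callable, function names, and the exact-match scoring are assumptions introduced here for illustration.

```python
# Minimal APE-style sketch (assumed interface: `complete(prompt) -> str`
# wraps some LLM API; all names below are hypothetical).
from typing import Callable, List, Tuple

def propose_instructions(complete: Callable[[str], str],
                         demos: List[Tuple[str, str]],
                         n_candidates: int = 8) -> List[str]:
    """Ask an LLM to infer candidate instructions from input-output demos."""
    demo_text = "\n".join(f"Input: {x}\nOutput: {y}" for x, y in demos)
    meta_prompt = (
        "I gave a friend an instruction. Based on the instruction they produced "
        f"the following input-output pairs:\n\n{demo_text}\n\nThe instruction was:"
    )
    return [complete(meta_prompt).strip() for _ in range(n_candidates)]

def score_instruction(complete: Callable[[str], str],
                      instruction: str,
                      eval_set: List[Tuple[str, str]]) -> float:
    """Zero-shot execution accuracy of an LLM following the instruction."""
    correct = 0
    for x, y in eval_set:
        prediction = complete(f"Instruction: {instruction}\nInput: {x}\nOutput:")
        correct += int(prediction.strip().lower() == y.strip().lower())
    return correct / len(eval_set)

def ape_select(complete: Callable[[str], str],
               demos: List[Tuple[str, str]],
               eval_set: List[Tuple[str, str]],
               n_candidates: int = 8) -> str:
    """Generate candidates, score each, and return the best instruction."""
    candidates = propose_instructions(complete, demos, n_candidates)
    return max(candidates, key=lambda ins: score_instruction(complete, ins, eval_set))
```

In this sketch the score function is plain exact-match accuracy on a held-out evaluation set; the paper also explores other score functions and an iterative resampling step, which are omitted here for brevity.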