变压器
计算机科学
安全性令牌
过程(计算)
比例(比率)
人工智能
机器学习
地图学
工程类
操作系统
电气工程
电压
地理
出处
期刊:Cornell University - arXiv
日期:2023-01-01
被引量:405
标识
DOI:10.48550/arxiv.2303.08774
摘要
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4.
科研通智能强力驱动
Strongly Powered by AbleSci AI