亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

ChatGPT: Jack of all trades, master of none

计算机科学 主数据 航空学 数据库 工程类
作者
Jan Kocoń,Igor Cichecki,Oliwier Kaszyca,Mateusz Kochanek,Dominika Szydło,Joanna Baran,Julita Bielaniewicz,Marcin Gruza,Arkadiusz Janz,Kamil Kanclerz,A. Kocoń,Bartłomiej Koptyra,Wiktoria Mieleszczenko-Kowszewicz,Piotr Miłkowski,Marcin Oleksy,Maciej Piasecki,Łukasz Radliński,Konrad Wojtasik,Stanisław Woźniak,Przemysław Kazienko
出处
期刊:Information Fusion [Elsevier BV]
卷期号:99: 101861-101861 被引量:395
标识
DOI:10.1016/j.inffus.2023.101861
摘要

OpenAI has released the Chat Generative Pre-trained Transformer (ChatGPT) and revolutionized the approach in artificial intelligence to human-model interaction. The first contact with the chatbot reveals its ability to provide detailed and precise answers in various areas. Several publications on ChatGPT evaluation test its effectiveness on well-known natural language processing (NLP) tasks. However, the existing studies are mostly non-automated and tested on a very limited scale. In this work, we examined ChatGPT's capabilities on 25 diverse analytical NLP tasks, most of them subjective even to humans, such as sentiment analysis, emotion recognition, offensiveness, and stance detection. In contrast, the other tasks require more objective reasoning like word sense disambiguation, linguistic acceptability, and question answering. We also evaluated GPT-4 model on five selected subsets of NLP tasks. We automated ChatGPT and GPT-4 prompting process and analyzed more than 49k responses. Our comparison of its results with available State-of-the-Art (SOTA) solutions showed that the average loss in quality of the ChatGPT model was about 25% for zero-shot and few-shot evaluation. For GPT-4 model, a loss for semantic tasks is significantly lower than for ChatGPT. We showed that the more difficult the task (lower SOTA performance), the higher the ChatGPT loss. It especially refers to pragmatic NLP problems like emotion recognition. We also tested the ability to personalize ChatGPT responses for selected subjective tasks via Random Contextual Few-Shot Personalization, and we obtained significantly better user-based predictions. Additional qualitative analysis revealed a ChatGPT bias, most likely due to the rules imposed on human trainers by OpenAI. Our results provide the basis for a fundamental discussion of whether the high quality of recent predictive NLP models can indicate a tool's usefulness to society and how the learning and validation procedures for such systems should be established.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
xttawy发布了新的文献求助10
6秒前
23秒前
小白发布了新的文献求助10
29秒前
孙老师完成签到 ,获得积分10
35秒前
40秒前
xttawy发布了新的文献求助10
43秒前
fearless完成签到,获得积分10
50秒前
改过来发布了新的文献求助10
57秒前
糟糕的豪完成签到 ,获得积分10
1分钟前
xttawy发布了新的文献求助10
1分钟前
大熊完成签到 ,获得积分10
1分钟前
共享精神应助gggkkkkhhhhh采纳,获得10
1分钟前
千里草完成签到,获得积分10
1分钟前
小白发布了新的文献求助10
1分钟前
Ava应助小白采纳,获得10
2分钟前
2分钟前
2分钟前
织梦师发布了新的文献求助10
2分钟前
xttawy发布了新的文献求助10
2分钟前
2分钟前
2分钟前
曌毓发布了新的文献求助10
2分钟前
xttawy发布了新的文献求助10
2分钟前
Augustines完成签到,获得积分10
2分钟前
织梦师完成签到,获得积分10
3分钟前
xttawy发布了新的文献求助10
3分钟前
科研通AI2S应助科研通管家采纳,获得10
3分钟前
Gydl完成签到,获得积分10
3分钟前
xi完成签到 ,获得积分10
3分钟前
3分钟前
4分钟前
李爱国应助柔弱采枫采纳,获得10
4分钟前
xttawy发布了新的文献求助10
4分钟前
红火完成签到 ,获得积分10
4分钟前
科研狗完成签到,获得积分10
4分钟前
xttawy发布了新的文献求助10
5分钟前
5分钟前
科研通AI2S应助科研通管家采纳,获得10
5分钟前
所所应助科研通管家采纳,获得10
5分钟前
xttawy发布了新的文献求助10
5分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Burger's Medicinal Chemistry, Drug Discovery and Development, Volumes 1 - 8, 8 Volume Set, 8th Edition 1800
Cronologia da história de Macau 1600
Netter collection Volume 9 Part I upper digestive tract及Part III Liver Biliary Pancreas 3rd 2024 的超高清PDF,大小约几百兆,不是几十兆版本的 1050
Current concept for improving treatment of prostate cancer based on combination of LH-RH agonists with other agents 1000
Research Handbook on the Law of the Sea 1000
Contemporary Debates in Epistemology (3rd Edition) 1000
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 纳米技术 计算机科学 化学工程 生物化学 物理 复合材料 内科学 催化作用 物理化学 光电子学 细胞生物学 基因 电极 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6165885
求助须知:如何正确求助?哪些是违规求助? 7993420
关于积分的说明 16620955
捐赠科研通 5272149
什么是DOI,文献DOI怎么找? 2812797
邀请新用户注册赠送积分活动 1792757
关于科研通互助平台的介绍 1658809