The interplay of complexity and subjectivity in opinionated discourse

主观性 计算机科学 词典 自然语言处理 论证理论 语言学 情绪分析 语料库语言学 人工智能 语料库 认识论 哲学
作者
Katharina Ehret,Maite Taboada
出处
期刊:Discourse Studies [SAGE]
卷期号:23 (2): 141-165 被引量:7
标识
DOI:10.1177/1461445620966923
摘要

This paper brings together cutting-edge, quantitative corpus methodologies and discourse analysis to explore the relationship between text complexity and subjectivity as descriptive features of opinionated language. We are specifically interested in how text complexity and markers of subjectivity and argumentation interact in opinionated discourse. Our contributions include the marriage of quantitative approaches to text complexity with corpus linguistic methods for the study of subjectivity, in addition to large-scale analyses of evaluative discourse. As our corpus, we use the Simon Fraser Opinion and Comments Corpus (SOCC), which comprises approximately 10,000 opinion articles and the corresponding reader comments from the Canadian online newspaper The Globe and Mail, as well as a parallel corpus of hard news articles also sampled from The Globe and Mail. Methodologically, we combine conditional inference trees with the analysis of random forests, an ensemble learning technique, to investigate the interplay between text complexity and subjectivity. Text complexity is defined in terms of Kolmogorov complexity, that is, the complexity of a text is measured based on its description length. In this approach, texts which can be described more efficiently are considered to be linguistically less complex. Thus, Kolmogorov complexity is a measure of structural surface redundancy. Our take on subjectivity is inspired by research in evaluative language, stance and Appraisal and defined as the expression of evaluation and opinion in language. Drawing on a sentiment analysis lexicon and the literature on stance markers, a custom set of subjectivity and argumentation markers is created. The results show that complexity can be a powerful tool in the classification of text into different text types, and that stance adverbials serve as distinctive features of subjectivity in online news comments.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
zzjiay完成签到,获得积分10
刚刚
丘比特应助假面采纳,获得10
刚刚
刚刚
Orange应助璇222采纳,获得10
3秒前
内向芒果完成签到,获得积分10
3秒前
ldx发布了新的文献求助10
4秒前
4秒前
8秒前
充电宝应助Messi采纳,获得10
9秒前
Jasper应助lx采纳,获得10
9秒前
hgh完成签到,获得积分10
10秒前
光亮的自行车完成签到 ,获得积分10
11秒前
赘婿应助蟹xie采纳,获得10
11秒前
iNk应助ldx采纳,获得10
11秒前
七院应助明亮静芙采纳,获得30
14秒前
14秒前
李健的小迷弟应助木木采纳,获得10
17秒前
pp发布了新的文献求助10
18秒前
walongjushi完成签到 ,获得积分10
18秒前
Isaac完成签到,获得积分10
18秒前
19秒前
20秒前
hongzhou完成签到,获得积分10
20秒前
muyingleng举报哈哈求助涉嫌违规
21秒前
俏皮代柔完成签到,获得积分20
21秒前
ldx完成签到,获得积分10
22秒前
lx发布了新的文献求助10
22秒前
23秒前
23秒前
铜眼科完成签到,获得积分10
23秒前
qucheng完成签到 ,获得积分10
24秒前
25秒前
小松鼠发布了新的文献求助30
28秒前
坦率问晴发布了新的文献求助10
29秒前
流星发布了新的文献求助50
31秒前
英俊的铭应助蟹xie采纳,获得10
31秒前
33秒前
leez发布了新的文献求助10
33秒前
可爱的函函应助武科大采纳,获得10
34秒前
CipherSage应助一一一采纳,获得10
34秒前
高分求助中
Production Logging: Theoretical and Interpretive Elements 2000
Very-high-order BVD Schemes Using β-variable THINC Method 1200
BIOLOGY OF NON-CHORDATES 1000
进口的时尚——14世纪东方丝绸与意大利艺术 Imported Fashion:Oriental Silks and Italian Arts in the 14th Century 800
Autoregulatory progressive resistance exercise: linear versus a velocity-based flexible model 550
Green building development for a sustainable environment with artificial intelligence technology 500
Zeitschrift für Orient-Archäologie 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 细胞生物学 免疫学 冶金
热门帖子
关注 科研通微信公众号,转发送积分 3351649
求助须知:如何正确求助?哪些是违规求助? 2977118
关于积分的说明 8677840
捐赠科研通 2658157
什么是DOI,文献DOI怎么找? 1455504
科研通“疑难数据库(出版商)”最低求助积分说明 674001
邀请新用户注册赠送积分活动 664503