可读性
计算机科学
判决
自然语言处理
人工智能
语言模型
嵌入
变压器
语义学(计算机科学)
文字嵌入
程序设计语言
量子力学
物理
电压
作者
Scott A. Crossley,Joon Choi,Yanisa Scherber,Mathis Lucka
出处
期刊:Communications in computer and information science
日期:2023-01-01
卷期号:: 422-427
被引量:2
标识
DOI:10.1007/978-3-031-36336-8_66
摘要
Readability formulas can be used to better match readers and texts. Current state-of-the-art readability formulas rely on large language models like transformer models (e.g., BERT) that model language semantics. However, the size and runtimes make them impractical in educational settings. This study examines the effectiveness of new readability formulas developed on the CommonLit Ease of Readability (CLEAR) corpus using more efficient sentence-embedding models including doc2vec, Universal Sentence Encoder, and Sentence BERT. This study compares sentence-embedding models to traditional readability formulas, newer NLP-informed linguistic feature formulas, and newer BERT-based models. The results indicate that sentence-embedding readability formulas perform well and are practical for use in various educational settings. The study also introduces an open-source NLP website to readily assess the readability of texts along with an application programming interface (API) that can be integrated into online educational learning systems to better match texts to readers.
科研通智能强力驱动
Strongly Powered by AbleSci AI