计算机科学
可靠性(半导体)
人工智能
自然语言处理
字长
读写能力
词(群论)
语言学
心理学
教育学
量子力学
物理
哲学
功率(物理)
作者
Hyun–Bin Hwang,Charlene Polio
出处
期刊:Research methods in applied linguistics
[Elsevier]
日期:2023-10-07
卷期号:2 (3): 100085-100085
被引量:4
标识
DOI:10.1016/j.rmal.2023.100085
摘要
Automated tools are widely used to assess syntactic complexity in second language (L2) writing studies; however, the effects of text length on syntactic complexity indices remain unclear. This can pose a challenge when studying underrepresented populations (e.g., young learners, adults with limited literacy skills), as their lower proficiency may result in less text production. To address this issue, we investigated the minimum text length threshold at which automated measures of syntactic complexity become the most reliable while considering L2 proficiency and prompt topic. Essays from the ICNALE corpus, a dataset of 5,200 essays with four proficiency levels, were used to create a dataset of texts of varying lengths (50, 100, 150, and 200 words). Mixed-effects regression models showed that seven out of 14 indices were not affected by text length regardless of learner proficiency and prompt topic. The other seven differed only between the 50- and 200-word texts within intermediate levels. We suggest a minimum of 100 words as a conservative threshold for the reliability of syntactic complexity indices. Finally, we emphasize the importance of transparent reporting practice regarding text length information.
科研通智能强力驱动
Strongly Powered by AbleSci AI