M André,Rafael Ferreira Mello,André Nascimento,Rafael Dueire Lins,Dragan Gašević
出处
期刊:IEEE Transactions on Learning Technologies [Institute of Electrical and Electronics Engineers] 日期:2021-12-01卷期号:14 (6): 802-816被引量:12
标识
DOI:10.1109/tlt.2022.3150663
摘要
Social presence is an essential construct of the well-known Community of Inquiry (CoI) model, which is created to support design, facilitation, and analysis of asynchronous online discussions. Social presence focuses on the extent to which participants of online discussions can see each other as “real persons” in computer-mediated communication. In the CoI model, social presence is looked at through the affective, interactive, and cohesive categories. Previous research has obtained good results in the automatic identification of these three categories in the analysis of transcripts of asynchronous online discussions using the random forest algorithms benefiting from traditional text mining features in combination with structural features such as Coh-Metrix and linguistic inquiry and word count (LIWC). In this context, this study evaluated the performance of the state-of-the-art decision tree algorithms and the deep learning linguistic model bidirectional encoder representations from transformers (BERT) for automatic detection of social presence in online discussions. The results revealed that eXtreme gradient boosting (XGBoost) and adaptive boosting (AdaBoost) outperformed the traditional random forest classifier that was commonly used in the previous works on automatic analysis of social presence reaching 0.38, 0.79, and 0.94 for the affective, interactive, and cohesive categories, respectively. Moreover, the proposed classifiers also reached better result when compared to BERT. Finally, this study explored a broad range of features used in the automatic classification of online discussion messages according to the categories of social presence. The results showed the importance of the features provided by the well-known linguistic framework LIWC and features calculated by techniques, such as social network analysis and sentiment analysis, that had never been reported previously in the literature for the automatic detection of social presence.