随机森林
机器学习
自杀意念
自杀未遂
人工智能
心理健康
毒物控制
自杀预防
召回
二元分类
计算机科学
逻辑回归
人为因素与人体工程学
心理学
精神科
支持向量机
医学
医疗急救
认知心理学
作者
Zehan Li,Iqra Ameer,Yan Hu,Ahmed Abdelhameed,Cui Tao,Salih Selek,Hua Xu
标识
DOI:10.1109/ichi57859.2023.00074
摘要
Suicide tendency is a fluid and multifaceted process that involves various stages, including suicidal ideation, planning, and attempting suicide. The use of electronic health records (EHR) and predictive algorithms has provided unprecedented opportunities for suicide research, but standard diagnosis codes for suicide tendencies are not always readily available in health records, resulting in low sensitivity when identifying suicide tendencies using structured data. Prior studies have focused on developing binary classification models to identify the presence of single suicide tendencies, such as suicide ideation or suicide attempt. In this study, we have worked on multiclass suicide tendency problem. We conducted a series of experiments to predict multiple suicide tendencies from psychiatric evaluation notes using classic machine learning models and pretrained transformer models. We manually annotated 1,000 Initial Psychiatric Evaluation (IPE) notes using a set of three classes (suicide ideation, suicide attempt, and non-suicidal). The performance of these models were evaluated using weighted F1 score, precision, recall, and accuracy. The Bio-ClinicalBERT model achieved the best performance for multiclass classification, with a weighted F1 score of 0.78, outperforming the classic machine learning models. Logistic regression and random forest models achieved comparable performance to state-of-the-art models in binary classification tasks with F1 score and accuracy of 0.93. The study contributes to mental health informatics with a novel Natural Language Paper (NLP) approach and psychiatric dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI