A Novel Validated Real-World Dataset for the Diagnosis of Multiclass Serous Effusion Cytology according to the International System and Ground-Truth Validation Data

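Medicine, Serous fluid, Cytology, Ground truth, Effusion, Pathology, Cytopathology, Radiology, Artificial intelligence, Surgery, Computer science
Authors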
Esraa Abd-Almoniem,Nadia Abd-Alsabour,Samar S. M. Elsheikh,Rasha R Mostafa,Yasmine Fathy Elesawy
Source
Journal: Acta Cytologica [Karger Publishers]
Volume (issue): 68 (2): 160-170 Citations: 2
Identifier
DOI:10.1159/000538465
Abstract

Introduction: The application of artificial intelligence (AI) algorithms in serous fluid cytology is lacking due to the deficiency in standardized publicly available datasets. Here, we develop a novel public serous effusion cytology dataset. Furthermore, we apply AI algorithms on it to test its diagnostic utility and safety in clinical practice.

Methods: The work is divided into three phases. Phase 1 entails building the dataset based on the multitiered evidence-based classification system proposed by the International System (TIS) of serous fluid cytology along with ground-truth tissue diagnosis for malignancy. To ensure reliable results of future AI research on this dataset, we carefully consider all the steps of the preparation and staining from a real-world cytopathology perspective. In phase 2, we pay special consideration to the image acquisition pipeline to ensure image integrity. Then we utilize the power of transfer learning using the convolutional layers of the VGG16 deep learning model for feature extraction. Finally, in phase 3, we apply the random forest classifier on the constructed dataset.

Results: The dataset comprises 3,731 images distributed among the four TIS diagnostic categories. The model achieves 74% accuracy in this multiclass classification problem. Using a one-versus-all classifier, the fallout rate for images that are misclassified as negative for malignancy despite being a higher risk diagnosis is 0.13. Most of these misclassified images (77%) belong to the atypia of undetermined significance category in concordance with real-life statistics.

Conclusion: This is the first and largest publicly available serous fluid cytology dataset based on a standardized diagnostic system. It is also the first dataset to include various types of effusions and pericardial fluid specimens. In addition, it is the first dataset to include the diagnostically challenging atypical categories. AI algorithms applied on this novel dataset show reliable results that can be incorporated into actual clinical practice with minimal risk of missing a diagnosis of malignancy. This work provides a foundation for researchers to develop and test further AI algorithms for the diagnosis of serous effusions.
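The one-versus-all fallout figure reported in the Results can be illustrated with a minimal sketch. It assumes that "fallout" carries its standard meaning of false positive rate, with "negative for malignancy" treated as the target class, so a false positive is a higher-risk image wrongly called negative. The label abbreviations, function names, and toy labels below are hypothetical and are not taken from the paper or its dataset.

```python
from collections import Counter

# Illustrative label for the "negative for malignancy" category (an assumption;
# the abstract does not give the label encoding used in the dataset).
NFM = "NFM"


def fallout_rate(y_true, y_pred, target=NFM):
    """One-versus-all fall-out (false positive rate) for `target`.

    FP: predicted `target` although the true class is something else.
    TN: predicted something else and the true class is something else.
    """
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != target and p == target)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t != target and p != target)
    if fp + tn == 0:
        raise ValueError("no samples outside the target class")
    return fp / (fp + tn)


def missed_by_true_class(y_true, y_pred, target=NFM):
    """Count, per true class, the images wrongly predicted as `target`."""
    return Counter(t for t, p in zip(y_true, y_pred) if t != target and p == target)


# Toy labels invented for illustration only (not data from the paper).
y_true = ["NFM", "NFM", "AUS", "AUS", "AUS", "SFM", "MAL", "MAL", "MAL", "MAL"]
y_pred = ["NFM", "AUS", "NFM", "AUS", "NFM", "SFM", "MAL", "MAL", "NFM", "MAL"]

rate = fallout_rate(y_true, y_pred)
missed = missed_by_true_class(y_true, y_pred)
print(f"fall-out for {NFM}: {rate:.3f}")
print("missed higher-risk images by true class:", dict(missed))
```

Breaking the false positives down by true class, as `missed_by_true_class` does, is the kind of tally that would support a statement such as "most misclassified images belong to the atypia of undetermined significance category".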
