医学
前列腺癌
扫描仪
人工智能
前列腺
深度学习
医学物理学
接收机工作特性
机器学习
计算机科学
放射科
核医学
癌症
内科学
作者
José Guilherme de Almeida,Nuno M. Rodrigues,Ana Sofia Castro Verde,Ana Mascarenhas Gaivão,Carlos Bilreiro,Inês Santiago,Joana Ip,Sara Belião,Celso Matos,Sara Silva,Manolis Tsiknakis,Kostas Marias,Daniele Regge,Nikolaos Papanikolaou
出处
期刊:Radiology
[Radiological Society of North America]
日期:2025-01-22
摘要
“Just Accepted” papers have undergone full peer review and have been accepted for publication in Radiology: Artificial Intelligence. This article will undergo copyediting, layout, and proof review before it is published in its final version. Please note that during production of the final copyedited article, errors may be discovered which could affect the content. Purpose To assess the impact of scanner manufacturer and scan protocol on the performance of deep learning models to classify prostate cancer (PCa) aggressiveness on biparametric MRI (bpMRI). Materials and Methods In this retrospective study, 5,478 cases from ProstateNet, a PCa bpMRI dataset with examinations from 13 centers, were used to develop five deep learning (DL) models to predict PCa aggressiveness with minimal lesion information and test how using data from different subgroups—scanner manufacturers and endorectal coil (ERC) use (Siemens, Philips, GE with and without ERC and the full dataset)—impacts model performance. Performance was assessed using the area under the receiver operating characteristic curve (AUC). The impact of clinical features (age, prostate-specific antigen level, Prostate Imaging Reporting and Data System [PI-RADS] score) on model performance was also evaluated. Results DL models were trained on 4,328 bpMRI cases, and the best model achieved AUC = 0.73 when trained and tested using data from all manufacturers. Hold-out test set performance was higher when models trained with data from a manufacturer were tested on the same manufacturer (within-and between-manufacturer AUC differences of 0.05 on average, P < .001). The addition of clinical features did not improve performance ( P = .24). Learning curve analyses showed that performance remained stable as training data increased. Analysis of DL features showed that scanner manufacturer and scan protocol heavily influenced feature distributions. Conclusion In automated classification of PCa aggressiveness using bpMRI data, scanner manufacturer and endorectal coil use had a major impact on DL model performance and features. Published under a CC BY 4.0 license.
科研通智能强力驱动
Strongly Powered by AbleSci AI