作者
Hao‐Ran Dai,Yuyao Xiao,Caixia Fu,Robert Grimm,Heinrich von Busch,Bram Stieltjes,Moon Hyung Choi,Zhoubing Xu,Guillaume Chabin,Chun Yang,Mengsu Zeng
摘要
Background The number of focal liver lesions (FLLs) detected by imaging has increased worldwide, highlighting the need to develop a robust, objective system for automatically detecting FLLs. Purpose To assess the performance of the deep learning–based artificial intelligence (AI) software in identifying and measuring lesions on contrast‐enhanced magnetic resonance imaging (MRI) images in patients with FLLs. Study Type Retrospective. Subjects 395 patients with 1149 FLLs. Field Strength/Sequence The 1.5 T and 3 T scanners, including T1‐, T2‐, diffusion‐weighted imaging, in/out‐phase imaging, and dynamic contrast‐enhanced imaging. Assessment The diagnostic performance of AI, radiologist, and their combination was compared. Using 20 mm as the cut‐off value, the lesions were divided into two groups, and then divided into four subgroups: <10, 10–20, 20–40, and ≥40 mm, to evaluate the sensitivity of radiologists and AI in the detection of lesions of different sizes. We compared the pathologic sizes of 122 surgically resected lesions with measurements obtained using AI and those made by radiologists. Statistical Tests McNemar test, Bland–Altman analyses, Friedman test, Pearson's chi‐squared test, Fisher's exact test, Dice coefficient, and intraclass correlation coefficients. A P ‐value <0.05 was considered statistically significant. Results The average Dice coefficient of AI in segmentation of liver lesions was 0.62. The combination of AI and radiologist outperformed the radiologist alone, with a significantly higher detection rate (0.894 vs. 0.825) and sensitivity (0.883 vs. 0.806). The AI showed significantly sensitivity than radiologists in detecting all lesions <20 mm (0.848 vs. 0.788). Both AI and radiologists achieved excellent detection performance for lesions ≥20 mm (0.867 vs. 0.881, P = 0.671). A remarkable agreement existed in the average tumor sizes among the three measurements ( P = 0.174). Data Conclusion AI software based on deep learning exhibited practical value in automatically identifying and measuring liver lesions. Level of Evidence 4. Technical Efficacy Stage 2.