EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation

Interpretation (philosophy), Computer science, Computer vision, Artificial intelligence, Programming language
Authors
Miloš Vukadinovic,Xiu Tang,Neal Yuan,Paul Cheng,Debiao Li,Susan Cheng,Bryan He,David Ouyang
Source
Journal: Cornell University - arXiv; Citations: 9
Identifier
DOI:10.48550/arxiv.2410.09704
Abstract

Echocardiography is the most widely used cardiac imaging modality, capturing ultrasound video data to assess cardiac structure and function. Artificial intelligence (AI) in echocardiography has the potential to streamline manual tasks and improve reproducibility and precision. However, most echocardiography AI models are single-view, single-task systems that do not synthesize complementary information from the multiple views captured during a full exam, which limits their performance and scope of application. To address this problem, we introduce EchoPrime, a multi-view, view-informed, video-based vision-language foundation model trained on over 12 million video-report pairs. EchoPrime uses contrastive learning to train a unified embedding model for all standard views in a comprehensive echocardiogram study, with representation of both rare and common diseases and diagnoses. EchoPrime then uses view classification and a view-informed anatomic attention model, which accurately maps the relationship between echocardiographic views and anatomical structures, to weight video-specific interpretations. With retrieval-augmented interpretation, EchoPrime integrates information from all echocardiogram videos in a comprehensive study and performs holistic, comprehensive clinical echocardiography interpretation. In datasets from two independent healthcare systems, EchoPrime achieves state-of-the-art performance on 23 diverse benchmarks of cardiac form and function, surpassing the performance of both task-specific approaches and prior foundation models. Following rigorous clinical evaluation, EchoPrime can assist physicians in the automated preliminary assessment of comprehensive echocardiography.
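
The abstract says that EchoPrime is trained with contrastive learning on video-report pairs, but it does not state which objective is used. As an illustration only, the sketch below shows a symmetric InfoNCE loss of the kind popularized by CLIP, which is one common way to align two embedding spaces. The function names, the `temperature` value, and the toy two-dimensional embeddings are all assumptions made for this example and are not taken from the paper.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def log_softmax(row):
    """Numerically stable log-softmax over a list of logits."""
    peak = max(row)
    log_sum_exp = peak + math.log(sum(math.exp(x - peak) for x in row))
    return [x - log_sum_exp for x in row]


def symmetric_contrastive_loss(video_embs, text_embs, temperature=0.07):
    """Symmetric InfoNCE loss over a batch of matched pairs.

    Assumption: row i of video_embs is the positive match for row i of
    text_embs, and every other row in the batch acts as a negative.
    The temperature of 0.07 is a hypothetical default, not a value from the paper.
    """
    videos = [l2_normalize(e) for e in video_embs]
    texts = [l2_normalize(e) for e in text_embs]
    n = len(videos)

    # Pairwise cosine similarities, scaled by the temperature.
    logits = [
        [sum(a * b for a, b in zip(videos[i], texts[j])) / temperature
         for j in range(n)]
        for i in range(n)
    ]

    # Video-to-text direction: each video should pick out its own report.
    video_to_text = -sum(log_softmax(logits[i])[i] for i in range(n)) / n

    # Text-to-video direction: the same computation over the columns.
    columns = [[logits[i][j] for i in range(n)] for j in range(n)]
    text_to_video = -sum(log_softmax(columns[j])[j] for j in range(n)) / n

    return 0.5 * (video_to_text + text_to_video)


# Toy embeddings: correctly paired rows versus deliberately mismatched rows.
aligned_loss = symmetric_contrastive_loss([[1.0, 0.0], [0.0, 1.0]],
                                          [[1.0, 0.0], [0.0, 1.0]])
shuffled_loss = symmetric_contrastive_loss([[1.0, 0.0], [0.0, 1.0]],
                                           [[0.0, 1.0], [1.0, 0.0]])
print(aligned_loss < shuffled_loss)
```

In this toy setting the loss is small when each video embedding points in the same direction as its paired report embedding and large when the pairs are mismatched, which is the behavior a contrastive objective is meant to reward.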