
A Text-guided Protein Design Framework

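Computer science; Representation (politics); Protein design; Construct (Python library); Natural language processing; Artificial intelligence; Information retrieval; Protein structure; Programming language; Political science; Nuclear magnetic resonance; Politics; Physics; Law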

Authors

Shengchao Liu, Yutao Zhu, Jiarui Lu, Xu Zhao, Weili Nie, Anthony Gitter, Chaowei Xiao, Jian Tang, Hongyu Guo, Anima Anandkumar

Source

Journal: Cornell University - arXiv; Date: 2023-01-01; Citations: 17

Links

arxiv.org, datacite.org, doi.org

Identifier

DOI: 10.48550/arxiv.2302.04611

Abstract

Current AI-assisted protein design mainly uses protein sequence and structure information. Meanwhile, a tremendous amount of human-curated knowledge describing proteins' high-level functionalities exists in text form. Yet whether incorporating such text data can help protein design tasks has not been explored. To bridge this gap, we propose ProteinDT, a multi-modal framework that leverages textual descriptions for protein design. ProteinDT consists of three consecutive steps: ProteinCLAP, which aligns the representations of the two modalities; a facilitator, which generates the protein representation from the text modality; and a decoder, which creates protein sequences from the representation. To train ProteinDT, we construct a large dataset, SwissProtCLAP, with 441K text and protein pairs. We quantitatively verify the effectiveness of ProteinDT on three challenging tasks: (1) over 90% accuracy for text-guided protein generation; (2) best hit ratio on 10 zero-shot text-guided protein editing tasks; (3) superior performance on four out of six protein property prediction benchmarks.
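The alignment step can be illustrated with a minimal sketch. The abstract says only that ProteinCLAP aligns the representations of the two modalities; the symmetric InfoNCE-style contrastive loss below is an assumption about how such alignment is commonly done, not the authors' confirmed objective. The function names, the temperature value, and the random toy embeddings are all hypothetical.

```python
import math
import random


def normalize(vec):
    """Scale a vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0  # guard against zero vectors
    return [v / norm for v in vec]


def cross_entropy_on_diagonal(logits):
    """Mean of -log softmax(row_i)[i]: row i's matched pair sits in column i."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the max for numerical stability
        log_norm = m + math.log(sum(math.exp(v - m) for v in row))
        total += log_norm - row[i]
    return total / len(logits)


def symmetric_contrastive_loss(text_emb, protein_emb, temperature=0.1):
    """Assumed InfoNCE-style loss: pair i is positive, other rows are negatives."""
    t = [normalize(v) for v in text_emb]
    p = [normalize(v) for v in protein_emb]
    # logits[i][j] = cosine similarity of text i and protein j, scaled by temperature
    logits = [[sum(a * b for a, b in zip(ti, pj)) / temperature for pj in p]
              for ti in t]
    transposed = [list(col) for col in zip(*logits)]
    # Average the text-to-protein and protein-to-text directions
    return 0.5 * (cross_entropy_on_diagonal(logits)
                  + cross_entropy_on_diagonal(transposed))


# Toy data standing in for encoder outputs (hypothetical, not from the paper)
random.seed(0)
text_emb = [[random.gauss(0, 1) for _ in range(16)] for _ in range(8)]
aligned = [[v + 0.01 * random.gauss(0, 1) for v in row] for row in text_emb]
misaligned = aligned[1:] + aligned[:1]  # shift rows so every pair is mismatched

loss_aligned = symmetric_contrastive_loss(text_emb, aligned)
loss_misaligned = symmetric_contrastive_loss(text_emb, misaligned)
print(loss_aligned, loss_misaligned)
```

Under this assumed objective, correctly paired text and protein embeddings yield a lower loss than mismatched pairs, which is the property an alignment stage would need before a facilitator and decoder could build on the shared representation.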
