关系(数据库)
关系抽取
人工智能
计算机科学
判决
注释
特征(语言学)
信息抽取
机器学习
自然语言处理
情报检索
比例(比率)
数据科学
过程(计算)
数据挖掘
语言学
哲学
物理
量子力学
操作系统
作者
Lixiang Hong,Jinjian Lin,Shuya Li,Fangping Wan,Hui Yang,Tao Jiang,Dan Zhao,Jianyang Zeng
标识
DOI:10.1038/s42256-020-0189-y
摘要
Knowledge about the relations between biomedical entities (such as drugs and targets) is widely distributed in more than 30 million research articles and consistently plays an important role in the development of biomedical science. In this work, we propose a novel machine learning framework, named BERE, for automatically extracting biomedical relations from large-scale literature repositories. BERE uses a hybrid encoding network to better represent each sentence from both semantic and syntactic aspects, and employs a feature aggregation network to make predictions after considering all relevant statements. More importantly, BERE can also be trained without any human annotation via a distant supervision technique. Through extensive tests, BERE has demonstrated promising performance in extracting biomedical relations, and can also find meaningful relations that were not reported in existing databases, thus providing useful hints to guide wet-lab experiments and advance the biological knowledge discovery process. A lot of scientific literature is unstructured, which makes extracting information for biomedical databases difficult. Hong and colleagues show that a distant supervision approach, using latent tree learning and recurrent units, can extract drug–target interactions from literature that were previously unknown.
科研通智能强力驱动
Strongly Powered by AbleSci AI