Hum-mPLoc: An ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites

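Classifier (UML), Subcellular localization, Computer science, Data mining, Artificial intelligence, Computational biology, Pattern recognition (psychology), Biology, Biochemistry, Cytoplasm, Art history, Art, Performing arts
Authors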
Hong‐Bin Shen,Kuo‐Chen Chou
Source
Journal: Biochemical and Biophysical Research Communications [Elsevier]
Volume (issue): 355 (4): 1006-1011 Citations: 196
Identifier
DOI:10.1016/j.bbrc.2007.02.071
Abstract

Proteins may simultaneously exist at, or move between, two or more different subcellular locations. Proteins with multiple locations or dynamic features of this kind are particularly interesting because they may have special biological functions of interest to investigators in both basic research and drug discovery. For instance, among the 6408 human protein entries that have experimentally observed subcellular location annotations in the Swiss-Prot database (version 50.7, released 19-Sept-2006), 973 (≈15%) have multiple location sites. The total number of human protein entries in the same database (excluding those annotated with "fragment" or those with fewer than 50 amino acids) is 14,370, leaving a gap of (14,370 − 6408) = 7962 entries for which no knowledge is available about their subcellular locations. Although a computational approach can be used to predict the missing information, all existing methods for predicting human protein subcellular localization are so far limited to the case of a single location site. To overcome this barrier, a new ensemble classifier, named Hum-mPLoc, was developed that can also deal with the case of multiple location sites. Hum-mPLoc is freely accessible to the public as a web server at http://202.120.37.186/bioinf/hum-multi. In addition, for the convenience of people working in the relevant areas, Hum-mPLoc has been used to predict the subcellular locations of all human protein entries in the Swiss-Prot database that have no subcellular location annotations or are annotated as uncertain. The large-scale results thus obtained have been deposited in a downloadable Microsoft Excel file named "Tab_Hum-mPLoc.xls". This file is available at the same website and will be updated twice a year to include new human protein entries and to reflect the continuous development of Hum-mPLoc.
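
The counts quoted in the abstract can be checked with a few lines of arithmetic. The sketch below only restates the figures given above; the variable names are illustrative and are not taken from the paper or from the Hum-mPLoc server.

```python
# Counts quoted in the abstract (Swiss-Prot version 50.7, human entries).
# Variable names are illustrative assumptions, not identifiers from the paper.
annotated = 6408       # entries with experimentally observed locations
multi_site = 973       # annotated entries with more than one location site
total_entries = 14370  # all entries after excluding fragments and short sequences

# Share of annotated entries that have multiple location sites.
multi_fraction = multi_site / annotated

# Entries with no subcellular location knowledge (the "gap" in the abstract).
unannotated_gap = total_entries - annotated

print(f"multi-site fraction: {multi_fraction:.1%}")
print(f"entries without location annotation: {unannotated_gap}")
```

The fraction rounds to the ≈15% stated in the abstract, and the difference reproduces the 7962-entry gap.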
