One step forward for nanopore protein sequencing

纳米孔 纳米孔测序 计算机科学 计算生物学 DNA测序 纳米技术 遗传学 生物 材料科学 基因
作者
Ziyi Li,Yakun Yi,Lei Liu,Hai‐Chen Wu
出处
期刊:Clinical and translational medicine [Springer Science+Business Media]
卷期号:14 (3) 被引量:3
标识
DOI:10.1002/ctm2.1615
摘要

With the successful completion of the Human Genome Project, decoding entire human genomes has become a reality. According to the central dogma of molecular biology, genetic information flows from nucleic acid sequences to proteins, allowing for the decoding of the amino acid composition sequence from gene sequences. However, additional information remains concealed in subsequent processes such as post-translational modifications, protein splicing, and degradation. This hidden information necessitates exploration through high-throughput and high-sensitivity proteomics research. Since the inception of the first protein sequencing method, Edman Degradation,1 in 1949, researchers have devoted considerable efforts to decode proteins. While mass spectrometry remains the gold standard in protein sequencing, it faces limitations in dynamic range, sequencing length, and accuracy.2 Recent years have witnessed the emergence of new methods, including fluorosequencing,3 single-molecule peptide fingerprinting,4 tunnelling current5 and nanopore-based protein sequencing.6-11 While the success of nanopore DNA sequencing has driven the exploration of nanopore technology for protein sequencing, the latter presents unique challenges. Protein sequencing introduces additional complexity to nanopore systems when compared to DNA sequencing. This complexity stems not only from the existence of 20 distinctive amino acids, in contrast to the four nucleic acids in DNA, resulting in an exponential growth in data volume. Moreover, peptides in proteins exhibit heterogeneous charges, unlike the uniform negative charges of DNA molecules, thereby presenting challenges for directional translocation in nanopore systems. Inspired by DNA sequencing, nanopore protein sequencing has evolved, broadly categorized into two distinct methodologies: full-length chain sequencing and enzyme-assisted sequencing (Figure 1). The full-length chain sequencing strategy involves directing the unfolded protein chain through the nanopore. Three groups independently employed a DNA motor to ratchet a DNA-peptide conjugate through a nanopore, allowing for the reading of the peptide sequence.6, 10, 11 Other attempts include the use of electroosmotic force achieved by adding guanidinium chloride to the buffer solution8 or by mutating protein pores.12 On the other hand, enzyme-assisted sequencing entails enzymatic cleavage of proteins into individual amino acids, subsequently identifying them sequentially. In a recent study, Huang et al. engineered the MspA nanopore with nitrilotriacetic acid, later combining it with Ni2+, which demonstrated the ability to distinguish all 20 proteinogenic amino acids.9 While these advancements have pushed the boundaries of nanopore protein sequencing, none of them have been able to reveal sequence information. Recently, we proposed an alternative strategy for peptide sequencing based on a combination of enzymatic cleavage and host–guest interaction-assisted nanopore sensing, aiming to achieve comprehensive peptide sequencing.7 In our previous research, we found that the host-guest interaction between phenylalanine (F) and cucurbit[7]uril (CB[7]) significantly improves nanopore recognition, resulting in consistent and prolonged current events.13 Through meticulous experimentation, the FGXD8 model peptide demonstrated optimal performance for discriminating X, where 'X' represents any of the 20 proteinogenic amino acids (Figure 2A). The negative-charged polyaspartic acid chain drove translocation in an electric field, while strong interactions between FG and CB[7] stabilized the peptide⊂CB[7] complex at the wildtype α-hemolysin (WT αHL) constriction, providing the best resolution. Using this probe with the αHL and a specific mutant (M113F)7 allowed for the discrimination of all 20 proteinogenic amino acids in the FGXD8 sequence (Figure 2B). Motivated by these outcomes, we adapted this methodology for detecting free amino acids. Conjugating the amino group of free amino acids with the sulfhydryl group on FGGCD8 created the FGGC(X)D8 probe, efficiently attaching nearly all proteinogenic amino acids (Figure 2C). With nanopore mutants and altered experimental conditions, we achieved distinct and reliable differentiation of the 20 proteinogenic amino acids. Identifying free amino acids is a preliminary step for protein sequencing. The challenge lies in obtaining sequential amino acid information. Expecting sequential peptide digestion by protease, we aimed to identify digested amino acids using the FGGC(X)D8 probe. When employing carboxypeptidase A and B (CPA and CPB) to cleave peptides from the C-terminus, we faced a significant challenge: excessively rapid enzymatic digestion hindered isolating single amino acids. Fortunately, experiments revealed a correlation between amino acid abundance and position within the peptide chain (Figure 2C). Despite rapid cleavage, position-dependent abundance variability suggested a pathway for deducing peptide sequences based on relative amino acid quantities. The inclusion of D8 in our probe, known for strong negative charges, plays a vital role in averaging charges across the 20 natural amino acids and enhancing sequencing efficiency. While effective in accurately sequencing short peptides, our strategy faced dephasing beyond 8–10 digested amino acids. To address this, we devised a stepwise enzyme digestion approach (Figure 2D). Exploiting distinct properties of CPA and CPB, we introduced CPA to determine the sequence preceding arginine (R), followed by CPB to specifically cut the R residue. Repeating these cycles significantly extended our capability to accurately sequence much longer peptides. In conclusion, our innovative peptide sequencing strategies mark a promising frontier in proteomics. Future optimization includes an integrated nanopore chip for high throughput, artificially modified enzymes to control the digestion speed, and improved probe design for reduced sequencing time. Integration of advanced data analysis algorithms and machine learning is crucial for handling the exponential growth in data volume from complex protein sequencing. Beyond basic research, in the clinical realm, nanopore protein sequencing has the potential to revolutionize diagnostics, enabling rapid and precise identification of disease biomarkers for earlier detection and personalized treatment plans, leading to improved patient outcomes. Despite the challenges, nanopore protein sequencing shows a bright future, promising to unlock the full narrative encoded within proteins and providing unprecedented insights into the molecular machinery of life, ushering in a new era of biomedical breakthroughs. Ziyi Li, Yakun Yi and Hai-Chen Wu conceived the manuscript and composed the figures. Ziyi Li, Lei Liu and Hai-Chen Wu wrote the manuscript and approved the final draft. This project was funded by the National Natural Science Foundation of China (no. 22025407) and the Institute of Chemistry, Chinese Academy of Sciences. Hai-Chen Wu, Yakun Yi and Ziyi Li have filed patents describing the strategy for the nanopore-based peptide sequencing. Lei Liu declares no conflict of interest. This article does not contain any research involving humans or animals.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
pcy应助可耐的毛衣采纳,获得10
1秒前
爱因斯宣发布了新的文献求助10
1秒前
ghost202关注了科研通微信公众号
1秒前
辛勤饼干发布了新的文献求助10
1秒前
pantio完成签到,获得积分10
1秒前
nzxnzx发布了新的文献求助10
1秒前
小蘑菇应助王小小采纳,获得10
2秒前
Akim应助顺利紫山采纳,获得10
2秒前
2秒前
歡禧发布了新的文献求助10
3秒前
深情安青应助九月采纳,获得10
3秒前
3秒前
Liens发布了新的文献求助10
3秒前
dudu完成签到,获得积分10
4秒前
hdh发布了新的文献求助10
4秒前
replica完成签到,获得积分10
4秒前
火星上问夏完成签到 ,获得积分20
5秒前
晚霞不晚完成签到,获得积分10
5秒前
5秒前
林士发布了新的文献求助10
6秒前
HenryXiao完成签到,获得积分20
6秒前
7秒前
7秒前
可爱芷容发布了新的文献求助10
7秒前
lwl完成签到,获得积分10
8秒前
时闲应助哈哈哈哈哈哈采纳,获得10
8秒前
段仁杰完成签到,获得积分10
8秒前
slayers应助淡然绝山采纳,获得10
8秒前
cs发布了新的文献求助10
8秒前
老迟到的友容完成签到,获得积分20
8秒前
8秒前
9秒前
9秒前
ZHOUCHENG完成签到,获得积分0
9秒前
9秒前
汉堡包应助歡禧采纳,获得10
10秒前
JoshuaChen发布了新的文献求助10
10秒前
10秒前
10秒前
isaac完成签到,获得积分10
12秒前
高分求助中
A new approach to the extrapolation of accelerated life test data 1000
Handbook of Marine Craft Hydrodynamics and Motion Control, 2nd Edition 500
‘Unruly’ Children: Historical Fieldnotes and Learning Morality in a Taiwan Village (New Departures in Anthropology) 400
Indomethacinのヒトにおける経皮吸収 400
Phylogenetic study of the order Polydesmida (Myriapoda: Diplopoda) 370
基于可调谐半导体激光吸收光谱技术泄漏气体检测系统的研究 350
Robot-supported joining of reinforcement textiles with one-sided sewing heads 320
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 冶金 细胞生物学 免疫学
热门帖子
关注 科研通微信公众号,转发送积分 3987078
求助须知:如何正确求助?哪些是违规求助? 3529488
关于积分的说明 11245360
捐赠科研通 3267987
什么是DOI,文献DOI怎么找? 1804013
邀请新用户注册赠送积分活动 881270
科研通“疑难数据库(出版商)”最低求助积分说明 808650