Oktoberfest: Open‐source spectral library generation and rescoring pipeline based on Prosit
Python(编程语言)
计算机科学
开源
管道(软件)
人工智能
机器学习
软件
程序设计语言
作者
Mario Picciani,Wassim Gabriel,Victor Giurcoiu,Omar Shouman,Firas Hamood,Ludwig Lautenbacher,Cecilia Bang Jensen,Julian Müller,Mostafa Kalhor,Armin Soleymaniniya,Bernhard Küster,Matthew The,Mathias Wilhelm
Abstract Machine learning (ML) and deep learning (DL) models for peptide property prediction such as Prosit have enabled the creation of high quality in silico reference libraries. These libraries are used in various applications, ranging from data‐independent acquisition (DIA) data analysis to data‐driven rescoring of search engine results. Here, we present Oktoberfest, an open source Python package of our spectral library generation and rescoring pipeline originally only available online via ProteomicsDB. Oktoberfest is largely search engine agnostic and provides access to online peptide property predictions, promoting the adoption of state‐of‐the‐art ML/DL models in proteomics analysis pipelines. We demonstrate its ability to reproduce and even improve our results from previously published rescoring analyses on two distinct use cases. Oktoberfest is freely available on GitHub ( https://github.com/wilhelm‐lab/oktoberfest ) and can easily be installed locally through the cross‐platform PyPI Python package.