作者
Abzer K. Pakkir Shah,Axel Walter,Filip Ottosson,Francesco Russo,Marcelo Navarro-Díaz,Judith Boldt,Jarmo-Charles Kalinski,Eftychia E. Kontou,James Elofson,Alexandros Polyzois,Carolina González-Marín,Stephanie Farrell,Marie Rønne Aggerbeck,Thapanee Pruksatrakul,Ngai Hang Chan,Yunshu Wang,Magdalena Pöchhacker,Corinna Brungs,Beatríz Cámara,Andrés Mauricio Caraballo‐Rodríguez,Andrés Cumsille,Fernanda de Oliveira,Kai Dührkop,Yasin El Abiead,Christian Geibel,Lana G Graves,Martin Hansen,Steffen Heuckeroth,Simon Knoblauch,Anastasiia Kostenko,Mirte C. M. Kuijpers,Kevin Mildau,Stilianos Papadopoulos Lambidis,Paulo Wender Portal Gomes,T Schramm,Karoline Steuer-Lodd,Paolo Stincone,Sibgha Tayyab,Giovanni Andrea Vitale,Berenike Wagner,Shipei Xing,Marquis T. Yazzie,Simone Zuffa,Martinus de Kruijff,Christine Beemelmanns,Hannes Link,Christoph Mayer,Justin J. J. van der Hooft,Tito Damiani,Tomáš Pluskal,Pieter C. Dorrestein,Jan Stanstrup,Robin Schmid,Mingxun Wang,Allegra T. Aron,Madeleine Ernst,Daniel Petras
摘要
Feature-Based Molecular Networking (FBMN) is a popular analysis approach for LC-MS/MS-based non-targeted metabolomics data. While processing LC-MS/MS data through FBMN is fairly streamlined, downstream data handling and statistical interrogation is often a key bottleneck. Especially, users new to statistical analysis struggle to effectively handle and analyze complex data matrices. In this protocol, we provide a comprehensive guide for the statistical analysis of FBMN results. We explain the data structure and principles of data clean-up and normalization, as well as uni- and multivariate statistical analysis of FBMN results. We provide explanations and code in two scripting languages (R and Python) as well as the QIIME2 framework for all protocol steps, from data clean-up to statistical analysis. Additionally, the protocol is accompanied by a web application with a graphical user interface (https://fbmn-statsguide.gnps2.org/), to lower the barrier of entry for new users. Together, the protocol, code, and web app provide a complete guide and toolbox for FBMN data integration, clean-up, and advanced statistical analysis, enabling new users to uncover molecular insights from their non-targeted metabolomics data. Our protocol is tailored for the seamless analysis of FBMN results from Global Natural Products Social Molecular Networking (GNPS and GNPS2) and can be adapted to other MS feature detection, annotation, and networking tools.