DEIMoS: an open-source tool for processing high-dimensional mass spectrometry data
Authors:
Sean M. Colby,
Christine H. Chang,
Jessica L. Bade,
Jamie R. Nunez,
Madison R. Blumer,
Daniel J. Orton,
Kent J. Bloodsworth,
Ernesto S. Nakayasu,
Richard D. Smith,
Yehia M. Ibrahim,
Ryan S. Renslow,
Thomas O. Metz
Abstract:
We present DEIMoS: Data Extraction for Integrated Multidimensional Spectrometry, a Python application programming interface (API) and command-line tool for high-dimensional mass spectrometry data analysis workflows that offers ease of development and access to efficient algorithmic implementations. Functionality includes feature detection, feature alignment, collision cross section (CCS) calibration, isotope detection, and MS/MS spectral deconvolution, with the output comprising detected features aligned across study samples and characterized by mass, CCS, tandem mass spectra, and isotopic signature. Notably, DEIMoS operates on N-dimensional data, largely agnostic to acquisition instrumentation; algorithm implementations simultaneously utilize all dimensions to (i) offer greater separation between features, thus improving detection sensitivity, (ii) increase alignment/feature matching confidence among datasets, and (iii) mitigate convolution artifacts in tandem mass spectra. We demonstrate DEIMoS with LC-IMS-MS/MS data to illustrate the advantages of a multidimensional approach in each data processing step.
Submitted 6 December, 2021;
originally announced December 2021.
A Validated Method for Predicting Small Molecule Ionization Sites using Gibb's Free Energies
Authors:
Jessica L. Bade,
Sean M. Colby,
Ryan S. Renslow,
Thomas O. Metz
Abstract:
Accurate molecular identification of metabolites can unlock new areas of the molecular universe and allow greater insight into complex biological and environmental systems than is currently possible. Analytical approaches for measuring the metabolome, such as NMR spectroscopy and separation techniques coupled with mass spectrometry (e.g., LC-IMS-MS), have risen to this challenge by yielding rich experimental data that can be queried by cross-reference with similar information for known standards in reference libraries. Confident identification of molecules in metabolomics studies, though, is often limited by the diversity of available data across chemical space, the unavailability of authentic reference standards, and the corresponding lack of comprehensiveness of standard reference libraries. The In Silico Chemical Library Engine (ISiCLE) addresses these hindrances by providing a first-principles cheminformatics pipeline that yields collisional cross section (CCS) values for any given molecule without the need for training data. In this pipeline, chemical identifiers undergo MD simulations, quantum chemical transformations, and ion mobility calculations to generate predicted CCS values. Here, we present a new module for ISiCLE that addresses the sensitivity of CCS predictions to ionization site location. We propose an update to adduct creation methods: a transition from pKa- and pKb-led predictions to a Gibbs free energy (GFE)-based determination of the true ionization site location. A validation set of experimentally confirmed molecular protonation sites was assembled from the literature and cross-referenced with the respective pKb-predicted locations and GFE values for all potential ionization site placements. Upon evaluation of the two methods, the lowest GFE value was found to predict the true ionization site location with 100% accuracy, while pKb was less accurate.
Submitted 4 November, 2021;
originally announced November 2021.
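The selection rule described in the abstract above (take the candidate ionization site with the lowest GFE) can be illustrated with a minimal Python sketch. This is not ISiCLE code: the function name, the site labels, and the energy values are all hypothetical and chosen only for illustration.

```python
def pick_ionization_site(gfe_by_site):
    """Return the candidate site label with the lowest Gibbs free energy.

    gfe_by_site maps a (hypothetical) site label to a GFE value; all values
    are assumed to share the same units and reference state.
    """
    if not gfe_by_site:
        raise ValueError("no candidate sites supplied")
    # min() over the keys, ranked by each key's associated energy
    return min(gfe_by_site, key=gfe_by_site.get)


# Made-up energies for three candidate protonation sites (illustrative only,
# not taken from the paper).
candidates = {"N1": -12.4, "O7": -9.8, "N3": -11.1}
best_site = pick_ionization_site(candidates)
print(best_site)
```

In practice, the energies would come from quantum chemical calculations for each candidate adduct; the sketch covers only the final comparison step.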