LibGen: Generating High Quality Spectral Libraries of Natural Products for EAD-, UVPD-, and HCD-High Resolution Mass SpectrometersClick to copy article linkArticle link copied!
- Fanzhou KongFanzhou KongChemistry Department, One Shields Avenue, University of California−Davis, Davis, California 95616, United StatesWest Coast Metabolomics Center, University of California−Davis, Davis, California 95616, United StatesMore by Fanzhou Kong
- Uri KeshetUri KeshetWest Coast Metabolomics Center, University of California−Davis, Davis, California 95616, United StatesMore by Uri Keshet
- Tong ShenTong ShenWest Coast Metabolomics Center, University of California−Davis, Davis, California 95616, United StatesMore by Tong Shen
- Elys RodriguezElys RodriguezChemistry Department, One Shields Avenue, University of California−Davis, Davis, California 95616, United StatesWest Coast Metabolomics Center, University of California−Davis, Davis, California 95616, United StatesMore by Elys Rodriguez
- Oliver Fiehn*Oliver Fiehn*Phone: +1-530-754-8258. Fax: +1-530754-9658. E-mail: [email protected]West Coast Metabolomics Center, University of California−Davis, Davis, California 95616, United StatesMore by Oliver Fiehn
Abstract
Compound annotation using spectral-matching algorithms is vital for (MS/MS)-based metabolomics research, but is hindered by the lack of high-quality reference MS/MS library spectra. Finding and removing errors from libraries, including noise ions, is mostly done manually. This process is both error-prone and time-consuming. To address these challenges, we have developed an automated library curation pipeline, LibGen, to universally build novel spectral libraries. This pipeline corrects mass errors, denoises spectra by subformula assignments, and performs quality control of the reference spectra by calculating explained intensity and spectral entropy. We employed LibGen to generate three high-quality libraries with chemical standards of 2241 natural products. To this end, we used an IQ-X orbital ion trap mass spectrometer to generate 1947 classic high-energy collision dissociation spectra (HCD) as well as 1093 ultraviolet-photodissociation (UVPD) mass spectra. The third library was generated by an electron-activated collision dissociation (EAD) 7600 ZenoTOF mass spectrometer yielding 3244 MS/MS spectra. The natural compounds covered 140 chemical classes from prenol lipids to benzypyrans with >97% of the compounds showing <0.2 Tanimoto-similarity, demonstrating a very high structural variance. Mass spectra showed much higher information content for both UVPD- and EAD-mass spectra compared to classic HCD spectra when using spectral entropy calculations. We validated the denoising algorithm by acquiring MS/MS spectra at high concentration and at 13-fold diluted chemical standards. At low concentrations, a higher proportion of spectra showed apparent fragment ions that could not be explained by subformula losses of the parent molecule. When more than 10% of the total intensity of MS/MS fragments was regarded as noise ions, spectra were considered as low quality and were not included in the libraries. As the overall process is fully automated, LibGen can be utilized by all researchers who create or curate mass spectral libraries. The libraries we created here are publicly available at MassBank.us.
Cited By
This article is cited by 5 publications.
- Michael W. Christopher, Aiden C. Ericson, Alexander C. Klug, Rhoel R. Dinglasan, Boone M. Prentice, Timothy J. Garrett. Divergent Metabolic Fates of Aromatic Amino Acid-Derived Isomers: Insights from Ex Vivo Metabolomics and HDX-HRMS/MS-Based Resolution of Tautomers. Analytical Chemistry 2024, 96
(42)
, 16917-16925. https://doi.org/10.1021/acs.analchem.4c03862
- Axel Dablanc, Solweig Hennechart, Amélie Perez, Guillaume Cabanac, Yann Guitton, Nils Paulhe, Bernard Lyan, Emilien L. Jamin, Franck Giacomoni, Guillaume Marti. FragHub: A Mass Spectral Library Data Integration Workflow. Analytical Chemistry 2024, 96
(30)
, 12489-12496. https://doi.org/10.1021/acs.analchem.4c02219
- Yunjia Lai, Jeremy P. Koelmel, Douglas I. Walker, Elliott J. Price, Stefano Papazian, Katherine E. Manz, Delia Castilla-Fernández, John A. Bowden, Vladimir Nikiforov, Arthur David, Vincent Bessonneau, Bashar Amer, Suresh Seethapathy, Xin Hu, Elizabeth Z. Lin, Akrem Jbebli, Brooklynn R. McNeil, Dinesh Barupal, Marina Cerasa, Hongyu Xie, Vrinda Kalia, Renu Nandakumar, Randolph Singh, Zhenyu Tian, Peng Gao, Yujia Zhao, Jean Froment, Pawel Rostkowski, Saurabh Dubey, Kateřina Coufalíková, Hana Seličová, Helge Hecht, Sheng Liu, Hanisha H. Udhani, Sophie Restituito, Kam-Meng Tchou-Wong, Kun Lu, Jonathan W. Martin, Benedikt Warth, Krystal J. Godri Pollitt, Jana Klánová, Oliver Fiehn, Thomas O. Metz, Kurt D. Pennell, Dean P. Jones, Gary W. Miller. High-Resolution Mass Spectrometry for Human Exposomics: Expanding Chemical Space Coverage. Environmental Science & Technology 2024, 58
(29)
, 12784-12822. https://doi.org/10.1021/acs.est.4c01156
- Wenyi Yu, Xunhao Zheng, Xiaonong Li, Jinfeng Zhu, Hongjiang Liu, Qing Xu, Aijin Shen, Yanfang Liu, Xinmiao Liang. An algorithm-driven intelligent mining and identification strategy for natural product mass spectrometry. Journal of Chromatography A 2024, 1734 , 465288. https://doi.org/10.1016/j.chroma.2024.465288
- Tomas Cajka. Liquid chromatography–mass spectrometry–based metabolomics approaches for foodomics research. Current Opinion in Food Science 2024, 58 , 101201. https://doi.org/10.1016/j.cofs.2024.101201
Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.
Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.
The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.