ACS Publications. Most Trusted. Most Cited. Most Read
My Activity
CONTENT TYPES

Correlation-Based Deconvolution (CorrDec) To Generate High-Quality MS2 Spectra from Data-Independent Acquisition in Multisample Studies

  • Ipputa Tada
    Ipputa Tada
    Department of Genetics, The Graduate University for Advanced Studies, SOKENDAI, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
    More by Ipputa Tada
  • Romanas Chaleckis
    Romanas Chaleckis
    Gunma University Initiative for Advanced Research (GIAR), Gunma University, 3-39-22, Showa-machi, Maebashi, Gunma 371-8511, Japan
    Division of Physiological Chemistry 2, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm, 171-77, Sweden
  • Hiroshi Tsugawa
    Hiroshi Tsugawa
    RIKEN Center for Sustainable Resource Science, 1-7-22, Suehiro, Tsurumi, Yokohama, Kanagawa 240-0045, Japan
    RIKEN Center for Integrative Medical Sciences, 1-7-22, Suehiro, Tsurumi, Yokohama, Kanagawa 240-0045, Japan
  • Isabel Meister
    Isabel Meister
    Gunma University Initiative for Advanced Research (GIAR), Gunma University, 3-39-22, Showa-machi, Maebashi, Gunma 371-8511, Japan
    Division of Physiological Chemistry 2, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm, 171-77, Sweden
  • Pei Zhang
    Pei Zhang
    Gunma University Initiative for Advanced Research (GIAR), Gunma University, 3-39-22, Showa-machi, Maebashi, Gunma 371-8511, Japan
    Division of Physiological Chemistry 2, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm, 171-77, Sweden
    More by Pei Zhang
  • Nikolaos Lazarinis
    Nikolaos Lazarinis
    Division of Respiratory Medicine and Allergy, Department of Medicine, Karolinska University Hospital Huddinge, Stockholm, 141-86, Sweden
  • Barbro Dahlén
    Barbro Dahlén
    Division of Respiratory Medicine and Allergy, Department of Medicine, Karolinska University Hospital Huddinge, Stockholm, 141-86, Sweden
  • Craig E. Wheelock*
    Craig E. Wheelock
    Gunma University Initiative for Advanced Research (GIAR), Gunma University, 3-39-22, Showa-machi, Maebashi, Gunma 371-8511, Japan
    Division of Physiological Chemistry 2, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm, 171-77, Sweden
    *E-mail: [email protected]
  • , and 
  • Masanori Arita*
    Masanori Arita
    RIKEN Center for Sustainable Resource Science, 1-7-22, Suehiro, Tsurumi, Yokohama, Kanagawa 240-0045, Japan
    National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
    *E-mail: [email protected]. Phone: +81-55-981-9449.
Cite this: Anal. Chem. 2020, 92, 16, 11310–11317
Publication Date (Web):July 10, 2020
https://doi.org/10.1021/acs.analchem.0c01980

Copyright © 2020 American Chemical Society. This publication is licensed under CC-BY.

  • Open Access

Article Views

3654

Altmetric

-

Citations

LEARN ABOUT THESE METRICS
PDF (5 MB)
Supporting Info (2)»

Abstract

Data-independent acquisition mass spectrometry (DIA-MS) is essential for information-rich spectral annotations in untargeted metabolomics. However, the acquired MS2 spectra are highly complex, posing significant annotation challenges. We have developed a correlation-based deconvolution (CorrDec) method that uses ion abundance correlations in multisample studies using DIA-MS as an update of our MS-DIAL software. CorrDec is based on the assumption that peak intensities of precursor and fragment ions correlate across samples and exploits this quantitative information to deconvolute complex DIA spectra. CorrDec clearly improved deconvolution of the original MS-DIAL deconvolution method (MS2Dec) in a dilution series of chemical standards and a 224-sample urinary metabolomics study. The primary advantage of CorrDec over MS2Dec is the ability to discriminate coeluting low-abundance compounds. CorrDec requires the measurement of multiple samples to successfully deconvolute DIA spectra; however, our randomized assessment demonstrated that CorrDec can contribute to studies with as few as 10 unique samples. The presented methodology improves compound annotation and identification in multisample studies and will be useful for applications in large cohort studies.

For compound identification, high-resolution tandem mass spectra (MS2) with public spectral library and associated computational tools are indispensable. A number of resources are available including MassBank, (1) GNPS, (2) CSI:FingerID, (3) and MS-FINDER. (4) In the classical data-dependent acquisition mass spectrometry (DDA-MS), ions are isolated in a narrow m/z window to obtain clean spectra (typically 1 Da, sometimes up to 9 Da). (5,6) In contrast, for data-independent acquisition mass spectrometry (DIA-MS), wider m/z windows of 10–1000 Da are used to obtain complex spectra from coeluting precursors, thereby requiring computational approaches to interpret. (7)
To overcome the trade-off between cleanness and comprehensiveness of DIA spectra, various deconvolution tools have been proposed, such as OpenSWATH, (8) Specter, (9) MetDIA, (10) MS-DIAL, (11) and decoMetDIA. (12) The first three tools were designed for targeted analyses that utilize predefined spectral libraries to deconvolute spectra. The latter two can deconvolute MS2 spectra de novo by fitting MS2 chromatograms to their precursor chromatogram in a single sample (i.e., using retention time). These powerful methods are suitable for the SWATH (Sequential Window Acquisition of all THeoretical fragment ion spectra) type of DIA data. (7) However, MS2 spectra become highly complex when precursor ions of all m/z are fragmented together: e.g., all ion fragmentation (AIF), MSALL, or MSE. (7) In particular, busy chromatographic regions with multiple coeluting compounds pose a significant challenge. In the case of the original MS-DIAL, at least two data-point differences between the chromatographic peak tops is required for deconvolution, which is a challenging condition for AIF data. Previous tools are therefore not suitable for untangling complex MS2 spectra from the AIF acquisition and its equivalent.
We present a new MS2 deconvolution method based on the correlation of ion abundances between precursor and product ions among biological samples, named CorrDec (Correlation-based Deconvolution). This method, implemented in MS-DIAL version 3.22 and later, is designed to deconvolute MS2 spectra from untargeted, multisample AIF metabolomics without requiring a predefined spectral library. The method is based on three assumptions: (1) metabolite concentrations differ across study samples in multisample studies; (2) the MS2 fragmentation pattern is identical under identical experimental conditions; (3) intensities of fragment ions correlate with those of their precursors.
Correlation has been widely used in mass spectrometry-based metabolomics. (13,14) For example, the Pearson correlation is used in CAMERA to estimate the similarity of different mass chromatograms to extract compound spectra and to annotate adduct ions and isotopic peaks. (15) For DIA, data correlation-based approaches such as RAMClust assigns precursor-product relationships based on detected features in MS1 and MS2. (16,17) In contrast to the previous approaches, CorrDec is not designed to retrieve as many characteristic product ions as possible from the DIA-MS2 spectra. Rather, it excludes noise peaks effectively by integrating multisample profiles. We demonstrate the concept and utility of CorrDec in a dilution series of chemical standards in urine and a case study from a urinary metabolomics cohort.

Experimental Section

ARTICLE SECTIONS
Jump To

Correlation-Based Deconvolution

CorrDec starts with the aligned peak list from multiple samples. The peak list consists of “aligned features”, which include the averages of retention time, m/z, peak height, and width obtained from the detected peaks in the samples, their ion abundances, and corresponding MS2 spectra. The peak height is used for the quantification of MS1 and MS2 peaks. The MS2 deconvolution is performed as follows (Figure 1).

Figure 1

Figure 1. Flowchart of the CorrDec method for a target feature Ft1. A. For each feature, the Pearson correlations are calculated for all pairs of precursor (MS1 vector) and product ions (MS2 matrix). B. All correlation values of all features are merged into a single matrix. C. Product ions satisfying the three criteria (see the main text for details) are selected to produce the deconvoluted MS2 spectrum of Ft1.

Step 1: For each aligned feature Ft1, Pearson correlations are calculated between all product ions and their precursors. The MS2 spectra of Ft1 for all samples are retrieved to create a “MS2Mat” data matrix, consisting of the ion abundances of each product ion (P) binned by an m/z threshold in all samples (0.01 in this study). The precursor ion abundances of all samples are retrieved to create a “MS1Vec” data vector, and Pearson correlations are calculated for all pairs of the features in MS1Vec and product ions in MS2Mat (Figure 1A). For each product ion, its existence ratio within the samples (the number of samples having the product ion above the threshold (1000 in this study) divided by the number of all samples) is also recorded.
Step 2: All correlation values in all features are integrated into a matrix based on the m/z of the product ion using the same m/z threshold (0.01 in this study) as MS2Mat (Figure 1B).
Step 3: Each product ion is assessed using the correlation value CorrMS1vsMS2 for its inclusion to the deconvoluted spectrum of Ft1. Three criteria are applied (Figure 1C):
  • (Criterion 1) CorrMS1vsMS2 > minimum threshold,

  • (Criterion 2) CorrMS1vsMS2> MaxCorrFt-margin1, and

  • (Criterion 3) CorrMS1vsMS2> MaxCorrP-margin2.

Criterion 1 is an overall cutoff to suppress noise signals. Correlations between the ion abundances of a MS1 precursor ion and the ion abundances of a MS2 product ion must be higher than a predefined minimum correlation threshold for all peaks. The recommended threshold is between 0.3 and 0.7, and we used 0.7 in this study. A lower threshold indicates a higher possibility to introduce noise peaks into spectra.
Criterion 2 is a threshold to filter product ions for fragments from each MS1 feature Ft1. MaxCorrFt is the maximum of all correlations for Ft1, and relatively low-correlating peaks from ionization enhancement and/or biochemical proximation are removed. The recommended margin1 is between 0.1 and 0.3, and we used 0.2 in this study. A larger margin indicates a higher possibility to introduce noise peaks into spectra. For example, in Figure 1B, the MS2 peak P1 (0.73) is removed because MaxCorrFt for the feature is 0.98.
Criterion 3 is used to avoid false-positive assignments by Criterion 2 when the same product ion shows high correlation values for multiple precursor ions. For each product ion Px, a maximum correlation MaxCorrP with its neighboring features (eluting within ±0.5 × peak width of Ft1) is determined. When the correlation value between the Ft1 and Px is less than MaxCorrP-margin2, Px is excluded from the deconvoluted spectrum of Ft1. The recommended range is between 0.1 and 0.3, and we used 0.1 in this study. A larger margin indicates a higher possibility to introduce noise peaks into spectra. For example, the product ion P2 is excluded from the Ft1 deconvoluted spectrum because the value of 0.81 is less than MaxCorrP (0.93) – 0.1 (Figure 1B).
These threshold values require tuning when applied to different data sets. The m/z value and the intensity in a deconvoluted spectrum are represented by their respective median value of m/z and intensities in biological samples, where the intensities are normalized by the abundance of the precursor ion in each sample.

Sample Information and Data Acquisition

Information on samples and experiments are detailed in Supporting Information. Liquid chromatography (LC)-MS measurements in AIF mode were performed as described previously. (18,19) Metabolites were separated on a 15 min gradient using HILIC chromatography with acidified water and acetonitrile. Data were acquired in positive ionization mode on an Agilent 6550 Q-TOF-MS system with a mass range of 40–1200 m/z in AIF mode with three alternating collision energies (0, 10, and 30 eV). The data acquisition rate was 2 scans/s for each segment.
A dilution series of eight chemical standards (proline betaine, trigonelline, dimethylglycine, trimethylamine N-oxide, tyrosine, glycine betaine, proline, 3-hydroxy-kynurenine; Table S1) was prepared using 10-fold diluted urine as a matrix. The starting spiked-in concentration of 4 μM in urine was diluted 1.5-fold with an equal amount of urine 10 times, resulting in an 11-point series to the final concentration of 69 nM (Figure S1). In addition, we also acquired data with a smaller dilution step (1.07-fold, 3.27–4.00 μM) for tyrosine. Two dilution series of trimethoprim (1.07-fold starting at 0.06 μM and 1.5-fold starting at 0.3 μM and) were acquired in urine samples with differing matrix composition.
Urine samples (n = 224) were used as the proof of concept for assessing the CorrDec performance. A detailed description of the full study is given in the original publication. (20) Samples were measured in four analytical batches, with pooled quality control (QC) sample injections every five samples and a water blank at the end of the batch sequence. The data sets have been deposited in the EMBL-EBI MetaboLights repository (21) with the identifiers MTBLS787 (chemical standards) and MTBLS816 (urine metabolomics).

Chemical Standard Library

An in-house MS2 spectral library combined from various open and closed sources containing 128,039 experimental MS2 spectra (high-resolution, mostly DDA) for 13 597 compounds was used for identification. The retention times (RT) for 280 compounds were obtained from purchased chemical standards. (18,22)

Data Processing and Analysis

The CorrDec method was implemented into MS-DIAL. (11) Data were processed in MS-DIAL version 4.12 (peak detection, alignment, and deconvolution). Important parameters were as follows: minimum peak height MS1:3000, noise level of MS2:1000, total identified score cutoff: 80%, detected in at least 20% of all samples, not in blank (maximum sample intensity/average blank intensity >5). As our library contained records of both DDA and DIA spectra, we used the deconvoluted spectra with and without the ions heavier than the precursor during the identification process; the higher matching score was kept. Detailed data processing settings of MS-DIAL can be found in the Tables S2 and S3. The MS2 spectra were deconvoluted independently using the MS2Dec (11) and the CorrDec methods (after the alignment of features).
For the urine data, we manually confirmed and curated the alignment results to correct missed or doubtful peak picking, feature alignment, and compound identification. We also annotated all features using three criteria: (i) accurate mass (AM) match (tolerance: 0.01 Da), (ii) RT match (tolerance: 1 min), and (iii) MS2 spectrum match (similarity >80%). The MS2 similarity was scored by the simple dot product without any weighting, (23) for clearer understanding of our method:
where Am and Ar are the arrays of m/z intensities in a measured and reference mass spectrum, respectively. To avoid erroneous high similarity matches resulting from only a few peaks, we adopted the following additional criteria for MS2 spectrum match: (1) if AMRT, a match of at least two MS2 peaks with the reference spectra, and (2) if AM only, a match of at least three MS2 peaks with the reference spectra. The MS2 similarities with reference spectra were compared between the CorrDec and the MS2Dec using three different collision energies (0, 10, and 30 eV).

Random Sampling Analysis

We evaluated the performance of CorrDec for different sample sizes by randomized resampling analysis of the urine metabolomics data set. After chromatographic alignment was performed using all samples, we reselected the study and QC samples for deconvolution by the CorrDec. The number of samples varied from four to the number of detected samples (depending upon the chosen compound) with 100 iterations. For each iteration, we calculated the MS2 similarity between the deconvoluted spectrum from the resampling and the reference spectrum. The MS2 similarity of resampling was the average of 100 iterations.

Results and Discussion

ARTICLE SECTIONS
Jump To

CorrDec Demonstration Using Compound Dilution Series in Urine

Using a dilution series of chemical standards, we verified high correlation of the intensities of MS2 fragments with those of their precursors. We measured the 11-point dilution series (0.069–4 μM) of eight chemical standards in AIF mode with diluted urine as the matrix. In such a setup, only the concentrations of the spiked compound vary (partially masked by the endogenous compounds present in the matrix) while concentrations of other compounds in the urine matrix remain stable. In the case of the tyrosine dilution series (Figure 2), the MS2 spectra of tyrosine contained 193 and 280 peaks for 10 and 30 eV, respectively. The similarity scores (simple dot product) of all raw MS2 spectra with the reference spectra were less than 30%. When processed by CorrDec, 12 peaks in 10 eV and 13 peaks in 30 eV showed >0.9 correlations with their precursors, clearly deviating from the normal distribution formed by the correlation values of the other peaks (Figure 2B bottom). These highly correlated peaks exhibited intensities proportional to the dilution (Figure 2B top in the log scale), and the MS2 similarity scores with the reference spectra were 90.5% and 86.5% for 10 and 30 eV, respectively. Similar results were obtained for the other 7 compounds: MS2 spectra were successfully generated with high MS2 matches (1 compound >80%, 6 compounds >90% for at least one collision energy) by CorrDec (Table S4 and Figure S2).

Figure 2

Figure 2. Demonstration of the CorrDec method using tyrosine dilution series spiked into diluted urine as background matrix. A. Raw MS2 spectra of tyrosine [M + H]+ (m/z: 182.082) at the lowest (69 nM) and the highest (4 μM) spiked concentrations in dilution series. Raw MS2 spectra contain over one hundred peaks masking the ions derived from tyrosine, especially at low spiked-in concentrations. B. Linked scatter plots visualizing the intensity correlations between the MS1 m/z 182.082 and MS2 peaks in 11 dilution series samples. Only 12 out of 193 (10 eV) and 13 out of 280 peaks (30 eV) correlated >0.9 (highlighted lines). C. Deconvoluted MS2 spectra (above, in black) matched well with the library reference spectra (below, in red). The MS2 similarities of deconvoluted spectra were 90.5% (10 eV) and 86.5% (30 eV), while the MS2 similarities of raw spectra at 0, 10, and 30 eV were less than 30% in the all samples.

In addition to the MS2 spectra at 10 and 30 eV, deconvoluted spectra were also obtained for 0 eV, justifying the use of in-source fragmentation for metabolite identification. (16,17,24,25) The degree of in-source fragmentation depends on the ionization source setting. In this study, in-source fragmentation is facilitated by a high fragmentor voltage (380 V), and the MS2 spectra at 0 eV for all eight chemical standards provided >80% matches in the library (Table S4), corroborating the usability of the in-source fragmentation.
To further confirm the usability of CorrDec for the cases where the concentration of the compounds varies only little across samples, we measured a 1.07-fold dilution series of tyrosine and trimethoprim. For tyrosine, the same QC background was used because the variation of endogenous tyrosine overwhelmed the 1.07-fold variation; we could obtain clean CorrDec spectra in such cases. With the constant QC background, CorrDec could generate MS2 spectra, showing >80% MS2 match at 10 eV using four samples (spiked-in concentration range 3.27–4.00 μM; Figure S3). To ensure the performance of CorrDec with the minimum concentration variations on different urine backgrounds, we measured an exogenous compound, trimethoprim, under different background matrixes. CorrDec could again deconvolute spectra using four samples (Figure S3). We could confirm that small concentration changes between the samples (<25%) suffice for the correlation-based method, (16) when heavy coelution is avoided.

Urine Metabolomics Data Set

To verify the practical performance of CorrDec, we analyzed a LC-MS (HILIC chromatography) metabolomics data set consisting of 224 unique urine samples, 58 pooled QCs, and 4 blanks acquired in positive ionization AIF mode. Data were processed by MS-DIAL version 4.12. In the CorrDec deconvolution process, we discarded product ions that appeared in <50% of all samples for computational efficiency. This threshold of 50% is arbitrary and should be set for each study considering the sample number and the desired level of reliability. The remaining 4159 features were aligned, among which the alignment of 64 features was manually corrected to separate fortuitously merged, coeluting compounds. By matching AM, RT, and MS2 spectra to the reference library, 105 compounds were confidently identified at the MSI level 1. (26)
For all of the 105 compounds, both MS2Dec and CorrDec could generate MS2 spectra. The number of spectra achieving >80% match, however, were 34 and 85 for MS2Dec and CorrDec, respectively (Figure 3A). Furthermore, the distribution of MS2 similarity scores reveals that MS2Dec spectra showed <60% match for 50 compounds. Median similarity values were 59.1% and 91.3% for MS2Dec and CorrDec, respectively (Figure 3B). The reason for the disparity is that CorrDec is especially effective in obtaining cleaner spectra for compounds of low abundance or smaller peak intensity (Figure 3C). (27)

Figure 3

Figure 3. CorrDec MS2 spectra provide increased confidence in compound identification than those obtained by MS2Dec in the urinary metabolomics DIA data set. A. Number of compounds in each identification category identified using MS2Dec and CorrDec. B. Distribution of the MS2 similarity scores for the MSI level-1 compounds spectra deconvoluted by the CorrDec and MS2Dec. C. MS2 similarity scores from CorrDec were higher than MS2Dec, especially for low-intensity peaks.

In addition to the 105 compounds identified at the AMRT and MS2 match level, we could also identify six metabolites as high match (>80%) to the applied MS2 library using CorrDec spectra but not with MS2Dec spectra. These compounds have been previously reported in human urine: imidazole acetic acid, (28) homocitrulline, (29) aminohippuric acid, (30) isobutyryl (C4) carnitine, (30) liquiritigenin, (31) and AICA-riboside. (32) Among the 111 identified compounds, over half (61) were amino acids and their metabolites: standard amino acids (13), methylated (9), acetylated (6), other amino acid metabolites (22), and conjugates (11). The other major compound groups include products of nucleic acid metabolism (13), and food/drug metabolites (8) (Table S5 and Figure S4).
In addition to the MS2 library matching, CorrDec can provide a more reliable MS2 spectra for structure-prediction tools such as MS-FINDER. (4) For example, we could annotate two features based on their CorrDec spectra as acetaminophen sulfate and valerylcarnitine (Figure S5), two compounds not present in our MS2 spectral library but likely present in urine. (30,33) The method is particularly suitable for compounds with variable levels in the samples, such as drugs and dietary components. Indeed, among the confirmed 85 AMRT+MS2 compounds (Figure 3A), 25 were first annotated by MS2 match only and were later purchased for confirmation.

MS2 Spectra Deconvolution of Coeluting Compounds

Nontargeted LC methods often contain regions with multiple coeluting compounds. In our analytical method, the distribution of the 4159 features ranged from a few to over 250 peaks per 20 s (approximate average peak width at base) across the 0.8–15 min of gradient elution (Figure S6A,B). Such coeluting peaks pose a challenge to deconvolution methods relying on mass chromatograms. With CorrDec, even completely coeluting compounds could be deconvoluted, such as abundant glutamine and sparse N-acetylcarnosine (Figure 4A,B). The relative peak intensities of the two compounds fit well with the reported average concentrations in the literature: 18–72 and 1–2 μM/mmol creatinine for glutamine (30) and N-acetylcarnosine (see Supporting Information), respectively. Using MS2Dec, the deconvoluted spectrum of N-acetylcarnosine contained all fragment peaks of glutamine, reducing the MS2 match to only 29.3%. The MS2Dec deconvoluted spectrum of glutamine showed the MS2 match of 86.5% (Figure 4C). With the same data set, CorrDec could deconvolute the MS2 spectrum of N-acetylcarnosine with 95.1% match and provided an equivalent high match for glutamine (84.1%) as well (Figure 4D). Low abundance metabolites such as N-acetylcarnosine arguably constitute the larger part of most metabolomics data sets. (27) The high-quality MS2 spectra deconvoluted by the CorrDec enabled us to untangle the complex AIF data set, by improving the identifications and annotations of smaller peaks in chromatographically dense sections.

Figure 4

Figure 4. CorrDec can successfully deconvolute the MS2 spectra of completely coeluting compounds, glutamine and N-acetylcarnosine. A. The raw MS2 spectrum and extracted ion chromatograms in MS1 (0 eV) of completely coeluting glutamine and N-acetylcarnosine as well as B. their fragments in MS2 (10 eV) from the urine data (QC1 sample in batch 1). C. MS2 spectra of glutamine and N-acetylcarnosine deconvoluted by the MS2Dec. D. MS2 spectra of glutamine and N-acetylcarnosine deconvoluted by the CorrDec.

The benefits of CorrDec are summarized as (1) cleaner MS2 spectra, and (2) statistical annotations (frequency and correlation) for MS2 peaks. CorrDec can generate clean spectra without noise signals from the matrix, mobile phase, or mass spectrometer artifacts, enabling better match to spectral databases or libraries. In the deconvolution process, each MS2 peak is assigned with a correlation value and frequency among samples. Using these statistical annotations, advanced users can manually interpret deconvoluted MS2 spectra of unknown or marginally matching metabolites.
On the other hand, CorrDec has two disadvantages: (1) the requirement of multiple samples with varying compound concentrations, and (2) the possibility of removing shared fragments among coeluting compounds. First, in principle, CorrDec cannot be performed on a single sample and at least three samples are required to calculate the correlation. While we observed that four spiked samples were sufficient to obtain >80% similarity match (Figure S3), we investigated further to estimate the required sample size using random resampling of the urine metabolomics data set in the next section. Second, if coeluting compounds produce the same m/z product ions, their intensity correlations become small enough to be removed from deconvoluted spectra depending on the CorrDec parameters. MS2Dec spectra are useful to complement such missing peaks, and advanced users can recover them through careful interpretation of statistical annotations and MS2 chromatograms. One such example for a group of betaines is provided in Figure S7.

Verification by Random Resampling

Estimating the number of samples required for CorrDec is difficult; it depends on multiple factors (study design, sample matrix, metabolite, etc.). Here for a rough estimation, we used 85 compounds confidently annotated (AMRT and MS2 match) in the urine study to perform random resampling analysis. For each of the 85 compounds, we created a scatter plot between the number of samples and MS2 similarity with the preselected library reference spectrum (Figure S8). On the basis of the median MS2 similarity from 100 iterations for each resampling, we plotted the number of compounds (total 85) of high MS2 similarity scores for each sampling size (Figure 5). Already with 10 samples, 47% (40 of 85) of the compounds showed >80% MS2 similarity; when using 30 samples, the number rose to 85% (72 of 85). Therefore, small studies with tens of samples can benefit from the CorrDec method. Note that urine is more variable compared to homeostatic fluids such as blood. A larger number of samples might be required for successful application of CorrDec in studies with less metabolite variations. On the other hand, the quality of MS2 spectra are largely dependent on compound classes and study designs. Defining the best parameters or the minimum sample number required for all studies is therefore difficult.

Figure 5

Figure 5. Summary of the randomized resampling analysis for the 85 CorrDec AMRT+MS2 compounds (Figure 3) to assess the relationship between the number of samples (urinary metabolomics data set) used for the CorrDec and quality of the deconvoluted MS2 spectra compared library MS2 spectrum.

In MS-DIAL, CorrDec is not intended to replace MS2Dec. Both deconvolution methods are based on different concepts and have different usage scenarios. The CorrDec method provides a reasonably clean deconvoluted MS2 spectrum per feature and sample set, and it is suitable for annotating and identifying a feature at the level of the whole sample set. MS2Dec can deconvolute MS2 spectra for each feature in a single sample; therefore, while noisier, the MS2Dec can be utilized to evaluate the feature identification for each sample in the data set. In DIA metabolomics, MS2 spectra are obtained from only a small number of MS scans. For such complex and noisy data, traditional deconvolution methods such as multivariate curve resolution (MCR) are difficult to apply because the multivariate method requires proper constraints to deconvolute spectra. When the number of coeluting compounds and peak shapes are much interfered by noise, error-minimization is not a good algorithmic choice. For such a data set, MS2Dec and CorrDec methods can function in complement to clean MS2 spectra. Lastly, regardless of how clean the MS2 spectra or how good the MS2 library similarity matches are, it is still necessary to further confirm compound annotations with chemical standards.

Conclusions

ARTICLE SECTIONS
Jump To

We have developed CorrDec, a new MS2 spectra deconvolution method for DIA data based on the correlations of the peak intensities across samples. CorrDec has been implemented in MS-DIAL and is available in version 3.22 or later (version 4 also covers ion-mobility data processing (34)). The improved quality of the MS2 spectra and the ability to deconvolute completely coeluting compounds are the main advantages over retention-time based deconvolution methods. Therefore, CorrDec enables more reliable compound annotations and identifications in multisample studies.

Supporting Information

ARTICLE SECTIONS
Jump To

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.0c01980.

  • Figures S1–S8, legends of Tables S1–S8 (PDF)

  • Tables S1–S8 (XLSX)

Terms & Conditions

Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.

Author Information

ARTICLE SECTIONS
Jump To

  • Corresponding Authors
    • Craig E. Wheelock - Gunma University Initiative for Advanced Research (GIAR), Gunma University, 3-39-22, Showa-machi, Maebashi, Gunma 371-8511, JapanDivision of Physiological Chemistry 2, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm, 171-77, SwedenOrcidhttp://orcid.org/0000-0002-8113-0653 Email: [email protected]
    • Masanori Arita - RIKEN Center for Sustainable Resource Science, 1-7-22, Suehiro, Tsurumi, Yokohama, Kanagawa 240-0045, JapanNational Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, JapanOrcidhttp://orcid.org/0000-0001-6706-0487 Email: [email protected]
  • Authors
    • Ipputa Tada - Department of Genetics, The Graduate University for Advanced Studies, SOKENDAI, 1111 Yata, Mishima, Shizuoka 411-8540, JapanOrcidhttp://orcid.org/0000-0003-4149-7191
    • Romanas Chaleckis - Gunma University Initiative for Advanced Research (GIAR), Gunma University, 3-39-22, Showa-machi, Maebashi, Gunma 371-8511, JapanDivision of Physiological Chemistry 2, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm, 171-77, SwedenOrcidhttp://orcid.org/0000-0001-8042-1005
    • Hiroshi Tsugawa - RIKEN Center for Sustainable Resource Science, 1-7-22, Suehiro, Tsurumi, Yokohama, Kanagawa 240-0045, JapanRIKEN Center for Integrative Medical Sciences, 1-7-22, Suehiro, Tsurumi, Yokohama, Kanagawa 240-0045, JapanOrcidhttp://orcid.org/0000-0002-2015-3958
    • Isabel Meister - Gunma University Initiative for Advanced Research (GIAR), Gunma University, 3-39-22, Showa-machi, Maebashi, Gunma 371-8511, JapanDivision of Physiological Chemistry 2, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm, 171-77, SwedenOrcidhttp://orcid.org/0000-0001-9063-0492
    • Pei Zhang - Gunma University Initiative for Advanced Research (GIAR), Gunma University, 3-39-22, Showa-machi, Maebashi, Gunma 371-8511, JapanDivision of Physiological Chemistry 2, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Stockholm, 171-77, SwedenOrcidhttp://orcid.org/0000-0003-2054-928X
    • Nikolaos Lazarinis - Division of Respiratory Medicine and Allergy, Department of Medicine, Karolinska University Hospital Huddinge, Stockholm, 141-86, Sweden
    • Barbro Dahlén - Division of Respiratory Medicine and Allergy, Department of Medicine, Karolinska University Hospital Huddinge, Stockholm, 141-86, Sweden
  • Notes
    The authors declare no competing financial interest.

Acknowledgments

ARTICLE SECTIONS
Jump To

We thank Hideru Obinata, Stacey Reinke, and David Broadhurst for discussions and suggestions. This work was supported by JSPS KAKENHI grant numbers JP18J23133, 17H03621, 15H05898, 15K21738, 18H02432, 18K19155, and 19K17662. We acknowledge support from the Gunma University Initiative for Advanced Research (GIAR), the STINT Foundation, the Swedish Heart Lung Foundation (HLF 20170734, HLF 20180290), the Swedish Research Council (2016-02798), the Stockholm County Council Research Funds (ALF), the Asthma and Allergy Research Foundation, the Centre for Allergy Research and Karolinska Institutet, NBDC, and AMED (JP17gm1010006, JP18gm0910005). This work was supported in part by The Environment Research and Technology Development Fund (ERTDF) (grant no. 5-1752). I.M. was supported by Japan Society for the Promotion of Science (JSPS) postdoctoral fellowship (P17774).

References

ARTICLE SECTIONS
Jump To

This article references 34 other publications.

  1. 1
    Horai, H. MassBank: a public repository for sharing mass spectral data for life sciences. J. Mass Spectrom. 2010, 45, 703714,  DOI: 10.1002/jms.1777
  2. 2
    Wang, M. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 2016, 34, 828837,  DOI: 10.1038/nbt.3597
  3. 3
    Dührkop, K.; Shen, H.; Meusel, M.; Rousu, J.; Böcker, S. Searching molecular structure databases with tandem mass spectra using CSI:FingerID. Proc. Natl. Acad. Sci. U. S. A. 2015, 112, 1258012585,  DOI: 10.1073/pnas.1509788112
  4. 4
    Tsugawa, H.; Nakabayashi, R.; Mori, T.; Yamada, Y.; Takahashi, M.; Rai, A.; Sugiyama, R.; Yamamoto, H.; Nakaya, T.; Yamazaki, M.; Kooke, R.; Bac-Molenaar, J. A.; Oztolan-Erol, N.; Keurentjes, J. J. B.; Arita, M.; Saito, K. A cheminformatics approach to characterize metabolomes in stable-isotope-labeled organisms. Nat. Methods 2019, 16, 295298,  DOI: 10.1038/s41592-019-0358-2
  5. 5
    Nikolskiy, I.; Mahieu, N. G.; Chen, Y. J.; Tautenhahn, R.; Patti, G. J. An untargeted metabolomic workflow to improve structural characterization of metabolites. Anal. Chem. 2013, 85, 77137719,  DOI: 10.1021/ac400751j
  6. 6
    Lawson, T. N.; Weber, R. J. M.; Jones, M. R.; Chetwynd, A. J.; Rodríguez-Blanco, G.; Di Guida, R.; Viant, M. R.; Dunn, W. B. msPurity: Automated evaluation of precursor ion purity for mass spectrometry-based fragmentation in metabolomics. Anal. Chem. 2017, 89, 24322439,  DOI: 10.1021/acs.analchem.6b04358
  7. 7
    Zhu, X.; Chen, Y.; Subramanian, R. Comparison of information-dependent acquisition, SWATH, and MS(All) techniques in metabolite identification study employing ultrahigh-performance liquid chromatography-quadrupole time-of-flight mass spectrometry. Anal. Chem. 2014, 86, 12021209,  DOI: 10.1021/ac403385y
  8. 8
    Röst, H. L.; Rosenberger, G.; Navarro, P.; Gillet, L.; Miladinović, S. M.; Schubert, O. T.; Wolski, W.; Collins, B. C.; Malmström, J.; Malmström, L.; Aebersold, R. OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat. Biotechnol. 2014, 32, 219223,  DOI: 10.1038/nbt.2841
  9. 9
    Peckner, R.; Myers, S. A.; Jacome, A. S. V.; Egertson, J. D.; Abelin, J. G.; MacCoss, M. J.; Carr, S. A.; Jaffe, J. D. Specter: linear deconvolution for targeted analysis of data-independent acquisition mass spectrometry proteomics. Nat. Methods 2018, 15, 371378,  DOI: 10.1038/nmeth.4643
  10. 10
    Li, H.; Cai, Y.; Guo, Y.; Chen, F.; Zhu, Z. J. MetDIA: Targeted metabolite extraction of multiplexed MS/MS spectra generated by data-independent acquisition. Anal. Chem. 2016, 88, 87578764,  DOI: 10.1021/acs.analchem.6b02122
  11. 11
    Tsugawa, H.; Cajka, T.; Kind, T.; Ma, Y.; Higgins, B.; Ikeda, K.; Kanazawa, M.; VanderGheynst, J.; Fiehn, O.; Arita, M. MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nat. Methods 2015, 12, 523526,  DOI: 10.1038/nmeth.3393
  12. 12
    Yin, Y.; Wang, R.; Cai, Y.; Wang, Z.; Zhu, Z.-J. DecoMetDIA: Deconvolution of multiplexed MS/MS spectra for metabolite identification in SWATH-MS-based untargeted metabolomics. Anal. Chem. 2019, 91, 1189711904,  DOI: 10.1021/acs.analchem.9b02655
  13. 13
    Brown, M.; Wedge, D. C.; Goodacre, R.; Kell, D. B.; Baker, P. N.; Kenny, L. C.; Mamas, M. A.; Neyses, L.; Dunn, W. B. Automated workflows for accurate mass-based putative metabolite identification in LC/MS-derived metabolomic datasets. Bioinformatics 2011, 27, 11081112,  DOI: 10.1093/bioinformatics/btr079
  14. 14
    Alonso, A.; Marsal, S.; Julià, A. Analytical methods in untargeted metabolomics: state of the art in 2015. Front. Bioeng. Biotechnol. 2015, 3, 23,  DOI: 10.3389/fbioe.2015.00023
  15. 15
    Kuhl, C.; Tautenhahn, R.; Böttcher, C.; Larson, T. R.; Neumann, S. CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. Anal. Chem. 2012, 84, 283289,  DOI: 10.1021/ac202450g
  16. 16
    Broeckling, C. D.; Heuberger, A. L.; Prince, J. A.; Ingelsson, E.; Prenni, J. E. Assigning precursor–product ion relationships in indis. criminant MS/MS data from non-targeted metabolite profiling studies. Metabolomics 2013, 9, 3343,  DOI: 10.1007/s11306-012-0426-4
  17. 17
    Broeckling, C. D.; Afsar, F. A.; Neumann, S.; Ben-Hur, A.; Prenni, J. E. RAMClust: A novel feature clustering method enables spectral-matching-based annotation for metabolomics data. Anal. Chem. 2014, 86, 68126817,  DOI: 10.1021/ac501530d
  18. 18
    Naz, S.; Gallart-Ayala, H.; Reinke, S. N.; Mathon, C.; Blankley, R.; Chaleckis, R.; Wheelock, C. E. Development of a liquid chromatography-high resolution mass spectrometry metabolomics method with high specificity for metabolite identification using all ion fragmentation acquisition. Anal. Chem. 2017, 89, 79337942,  DOI: 10.1021/acs.analchem.7b00925
  19. 19
    Chaleckis, R.; Naz, S.; Meister, I.; Wheelock, C. E. LC-MS-based metabolomics of biofluids using All-Ion Fragmentation (AIF) acquisition. Methods Mol. Biol. 2018, 1730, 4558,  DOI: 10.1007/978-1-4939-7592-1_3
  20. 20
    Lazarinis, N.; Bood, J.; Gomez, C.; Kolmert, J.; Lantz, A. S.; Gyllfors, P.; Davis, A.; Wheelock, C. E.; Dahlén, S. E.; Dahlén, B. Leukotriene E4 induces airflow obstruction and mast cell activation through the cysteinyl leukotriene type 1 receptor. J. Allergy Clin. Immunol. 2018, 142, 10801089,  DOI: 10.1016/j.jaci.2018.02.024
  21. 21
    Haug, K.; Salek, R. M.; Conesa, P.; Hastings, J.; de Matos, P.; Rijnbeek, M.; Mahendraker, T.; Williams, M.; Neumann, S.; Rocca-Serra, P.; Maguire, E.; González-Beltrán, A.; Sansone, S. A.; Griffin, J. L.; Steinbeck, C. MetaboLights--an open-access general-purpose repository for metabolomics studies and associated meta-data. Nucleic Acids Res. 2013, 41, D781786,  DOI: 10.1093/nar/gks1004
  22. 22
    Tada, I.; Tsugawa, H.; Meister, I.; Zhang, P.; Shu, R.; Katsumi, R.; Wheelock, C. E.; Arita, M.; Chaleckis, R. Creating a reliable mass spectral–retention time library for all ion fragmentation-based metabolomics. Metabolites 2019, 9, 251,  DOI: 10.3390/metabo9110251
  23. 23
    Moorthy, A. S.; Wallace, W. E.; Kearsley, A. J.; Tchekhovskoi, D. V.; Stein, S. E. Combining fragment-ion and neutral-loss matching during mass spectral library searching: A new general purpose algorithm applicable to illicit drug identification. Anal. Chem. 2017, 89, 1326113268,  DOI: 10.1021/acs.analchem.7b03320
  24. 24
    Domingo-Almenara, X.; Montenegro-Burke, J. R.; Benton, H. P.; Siuzdak, G. Annotation: A computational solution for streamlining metabolomics analysis. Anal. Chem. 2018, 90, 480489,  DOI: 10.1021/acs.analchem.7b03929
  25. 25
    Domingo-Almenara, X.; Montenegro-Burke, J. R.; Guijas, C.; Majumder, E. L.; Benton, H. P.; Siuzdak, G. Autonomous METLIN-guided in-source fragment annotation for untargeted metabolomics. Anal. Chem. 2019, 91, 32463253,  DOI: 10.1021/acs.analchem.8b03126
  26. 26
    Sumner, L. W. Proposed minimum reporting standards for chemical analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI). Metabolomics 2007, 3, 211221,  DOI: 10.1007/s11306-007-0082-2
  27. 27
    Chaleckis, R.; Meister, I.; Zhang, P.; Wheelock, C. E. Challenges, progress and promises of metabolite annotation for LC-MS-based metabolomics. Curr. Opin. Biotechnol. 2019, 55, 4450,  DOI: 10.1016/j.copbio.2018.07.010
  28. 28
    Tsuruta, Y.; Tomida, H.; Kohashi, K.; Ohkura, Y. Simultaneous determination of imidazoleacetic acid and N tau- and N pi-methylimidazoleacetic acids in human urine by high-performance liquid chromatography with fluorescence detection. J. Chromatogr., Biomed. Appl. 1987, 416, 6369,  DOI: 10.1016/0378-4347(87)80485-1
  29. 29
    Pohjanpelto, P.; Niemi, K.; Sarmela, T. Anterior chamber haemorrhage in the newborn after spontaneous delivery. A case report. Acta Ophthalmol. 1979, 57, 443446,  DOI: 10.1111/j.1755-3768.1979.tb01827.x
  30. 30
    Bouatra, S.; Aziat, F.; Mandal, R.; Guo, A. C.; Wilson, M. R.; Knox, C.; Bjorndahl, T. C.; Krishnamurthy, R.; Saleem, F.; Liu, P.; Dame, Z. T.; Poelzer, J.; Huynh, J.; Yallou, F. S.; Psychogios, N.; Dong, E.; Bogumil, R.; Roehring, C.; Wishart, D. S. The human urine metabolome. PLoS One 2013, 8, e73076  DOI: 10.1371/journal.pone.0073076
  31. 31
    Li, C.; Homma, M.; Oka, K. Characteristics of delayed excretion of flavonoids in human urine after administration of Shosaiko-to, a herbal medicine. Biol. Pharm. Bull. 1998, 21, 12511257,  DOI: 10.1248/bpb.21.1251
  32. 32
    Hornik, P.; Vyskocilová, P.; Friedecký, D.; Adam, T. Diagnosing AICA-ribosiduria by capillary electrophoresis. J. Chromatogr. B: Anal. Technol. Biomed. Life Sci. 2006, 843, 1519,  DOI: 10.1016/j.jchromb.2006.05.020
  33. 33
    Bales, J. R.; Sadler, P. J.; Nicholson, J. K.; Timbrell, J. A. Urinary excretion of acetaminophen and its metabolites as studied by proton NMR spectroscopy. Clin. Chem. 1984, 30, 16311636,  DOI: 10.1093/clinchem/30.10.1631
  34. 34
    Tsugawa, H., Ikeda, K., Takahashi, M., Satoh, A., Mori, Y., Uchino, H., Okahashi, N., Yamada, Y., Tada, I., Bonini, P., Higashi, Y., Okazaki, Y., Zhou, Z., Zhu, Z. J., Koelmel, J., Cajka, T., Fiehn, O., Saito, K., Arita, M., Arita, M. A lipidome atlas in MS-DIAL 4. Nat. Biotechnol. 2020,  DOI: 10.1038/s41587-020-0531-2 .

Cited By

ARTICLE SECTIONS
Jump To

This article is cited by 31 publications.

  1. Laihui Li, Rongjun Gao, Xuebing Wang, Yiyan Deng, Hong Sun, Huijing Sun, Beibei Zhang, Nanyang Yu, Cheng Gu, Bingcai Pan, Hongxia Yu, Si Wei. SWATH-F: A Novel Nontarget Strategy Based on the SWATH-MS Deconvolution Method Assisting in Annotating PFAS Homologues in Multisample Studies. Analytical Chemistry 2023, 95 (39) , 14551-14557. https://doi.org/10.1021/acs.analchem.3c01680
  2. Tingting Zhao, Shipei Xing, Huaxu Yu, Tao Huan. De Novo Cleaning of Chimeric MS/MS Spectra for LC-MS/MS-Based Metabolomics. Analytical Chemistry 2023, 95 (35) , 13018-13028. https://doi.org/10.1021/acs.analchem.3c00736
  3. Denice van Herwerden, Jake W. O’Brien, Sascha Lege, Bob W. J. Pirok, Kevin V. Thomas, Saer Samanipour. Cumulative Neutral Loss Model for Fragment Deconvolution in Electrospray Ionization High-Resolution Mass Spectrometry Data. Analytical Chemistry 2023, 95 (33) , 12247-12255. https://doi.org/10.1021/acs.analchem.3c00896
  4. Sadjad Fakouri Baygi, Yashwant Kumar, Dinesh Kumar Barupal. IDSL.CSA: Composite Spectra Analysis for Chemical Annotation of Untargeted Metabolomics Datasets. Analytical Chemistry 2023, 95 (25) , 9480-9487. https://doi.org/10.1021/acs.analchem.3c00376
  5. Robert M. Samples, Sara P. Puckett, Marcy J. Balunas. Metabolomics Peak Analysis Computational Tool (MPACT): An Advanced Informatics Tool for Metabolomics and Data Visualization of Molecules from Complex Biological Samples. Analytical Chemistry 2023, 95 (23) , 8770-8779. https://doi.org/10.1021/acs.analchem.2c04632
  6. Carlos Pérez-López, Bernat Oró-Nolla, Silvia Lacorte, Romà Tauler. Regions of Interest Multivariate Curve Resolution Liquid Chromatography with Data-Independent Acquisition Tandem Mass Spectrometry. Analytical Chemistry 2023, 95 (19) , 7519-7527. https://doi.org/10.1021/acs.analchem.2c05704
  7. Chih-Wei Chang, Jen-Yi Hsu, Ping-Zu Hsiao, Yuan-Chih Chen, Pao-Chi Liao. Identifying Hair Biomarker Candidates for Alzheimer’s Disease Using Three High Resolution Mass Spectrometry-Based Untargeted Metabolomics Strategies. Journal of the American Society for Mass Spectrometry 2023, 34 (4) , 550-561. https://doi.org/10.1021/jasms.2c00294
  8. Sean M. Colby, Christine H. Chang, Jessica L. Bade, Jamie R. Nunez, Madison R. Blumer, Daniel J. Orton, Kent J. Bloodsworth, Ernesto S. Nakayasu, Richard D. Smith, Yehia M. Ibrahim, Ryan S. Renslow, Thomas O. Metz. DEIMoS: An Open-Source Tool for Processing High-Dimensional Mass Spectrometry Data. Analytical Chemistry 2022, 94 (16) , 6130-6138. https://doi.org/10.1021/acs.analchem.1c05017
  9. Yuan-Chih Chen, Hsin-Yi Wu, Chih-Wei Chang, Pao-Chi Liao. Post-Deconvolution MS/MS Spectra Extraction with Data-Independent Acquisition for Comprehensive Profiling of Urinary Glucuronide-Conjugated Metabolome. Analytical Chemistry 2022, 94 (6) , 2740-2748. https://doi.org/10.1021/acs.analchem.1c03557
  10. Pei Zhang, Christopher Carlsten, Romanas Chaleckis, Kati Hanhineva, Mengna Huang, Tomohiko Isobe, Ville M. Koistinen, Isabel Meister, Stefano Papazian, Kalliroi Sdougkou, Hongyu Xie, Jonathan W. Martin, Stephen M. Rappaport, Hiroshi Tsugawa, Douglas I. Walker, Tracey J. Woodruff, Robert O. Wright, Craig E. Wheelock. Defining the Scope of Exposome Studies and Research Needs from a Multidisciplinary Perspective. Environmental Science & Technology Letters 2021, 8 (10) , 839-852. https://doi.org/10.1021/acs.estlett.1c00648
  11. Lichao Wang, Wangjie Lv, Xiaoshan Sun, Fujian Zheng, Tianrun Xu, Xinyu Liu, Hang Li, Xin Lu, Xiaojun Peng, Chunxiu Hu, Guowang Xu. Strategy for Nontargeted Metabolomic Annotation and Quantitation Using a High-Resolution Spectral-Stitching Nanoelectrospray Direct-Infusion Mass Spectrometry with Data-Independent Acquisition. Analytical Chemistry 2021, 93 (30) , 10528-10537. https://doi.org/10.1021/acs.analchem.1c01480
  12. Isabel Meister, Pei Zhang, Anirban Sinha, C. Magnus Sköld, Åsa M. Wheelock, Takashi Izumi, Romanas Chaleckis, Craig E. Wheelock. High-Precision Automated Workflow for Urinary Untargeted Metabolomic Epidemiology. Analytical Chemistry 2021, 93 (12) , 5248-5258. https://doi.org/10.1021/acs.analchem.1c00203
  13. Jonathan Zweigle, Boris Bugsel, Joel Fabregat-Palau, Christian Zwiener. PFΔScreen — an open-source tool for automated PFAS feature prioritization in non-target HRMS data. Analytical and Bioanalytical Chemistry 2023, 45 https://doi.org/10.1007/s00216-023-05070-2
  14. Vera Plekhova, Kimberly De Windt, Margot De Spiegeleer, Marilyn De Graeve, Lynn Vanhaecke. Recent advances in high-throughput biofluid metabotyping by direct infusion and ambient ionization mass spectrometry. TrAC Trends in Analytical Chemistry 2023, 168 , 117287. https://doi.org/10.1016/j.trac.2023.117287
  15. Ross McBride, Joe Wandy, Stefan Weidt, Simon Rogers, Vinny Davies, Rónán Daly, Kevin Bryson, . TopNEXt: automatic DDA exclusion framework for multi-sample mass spectrometry experiments. Bioinformatics 2023, 39 (7) https://doi.org/10.1093/bioinformatics/btad406
  16. Shipei Xing, Sam Shen, Banghua Xu, Xiaoxiao Li, Tao Huan. BUDDY: molecular formula discovery via bottom-up MS/MS interrogation. Nature Methods 2023, 20 (6) , 881-890. https://doi.org/10.1038/s41592-023-01850-x
  17. Xiang Zhang, Ruitao Wu, Zhijian Qu. A Cosine-Similarity-Based Deconvolution Method for Analyzing Data-Independent Acquisition Mass Spectrometry Data. Applied Sciences 2023, 13 (10) , 5969. https://doi.org/10.3390/app13105969
  18. Joe Wandy, Ross McBride, Simon Rogers, Nikolaos Terzis, Stefan Weidt, Justin J. J. van der Hooft, Kevin Bryson, Rónán Daly, Vinny Davies. Simulated-to-real benchmarking of acquisition methods in untargeted metabolomics. Frontiers in Molecular Biosciences 2023, 10 https://doi.org/10.3389/fmolb.2023.1130781
  19. Marie Valmori, Vincent Marie, François Fenaille, Benoit Colsch, David Touboul. Recent methodological developments in data-dependent analysis and data-independent analysis workflows for exhaustive lipidome coverage. Frontiers in Analytical Science 2023, 3 https://doi.org/10.3389/frans.2023.1118742
  20. Stanislava Rakusanova, Oliver Fiehn, Tomas Cajka. Toward building mass spectrometry-based metabolomics and lipidomics atlases for biological and clinical research. TrAC Trends in Analytical Chemistry 2023, 158 , 116825. https://doi.org/10.1016/j.trac.2022.116825
  21. Ajay Kumar, Nikita Mittal. Plant Cell Factory for Production of Biomolecules. 2023, 253-272. https://doi.org/10.1007/978-981-19-7911-8_12
  22. Jonathan He, Olivia Liu, Xuan Guo. Deep Learning Based MS2 Feature Detection for Data-Independent Shotgun Proteomics. 2022, 2342-2348. https://doi.org/10.1109/BIBM55620.2022.9995258
  23. Wei Taicheng, Chen Hao, Zhu Haoran, . Fuzzy Application in the Design of QR Code Data Binding Based on the GD Packaging Machine. Mobile Information Systems 2022, 2022 , 1-6. https://doi.org/10.1155/2022/6447578
  24. Chenxi Wang, Xu Pang, Tongtong Zhu, Shuhua Ma, Yunfei Liang, Yi Zhang, Xing Lan, Tao Wang, Lifeng Han. Rapid discovery of potential ADR compounds from injection of total saponins from Panax notoginseng using data-independent acquisition untargeted metabolomics. Analytical and Bioanalytical Chemistry 2022, 414 (2) , 1081-1093. https://doi.org/10.1007/s00216-021-03734-5
  25. Jarrod Moore, Andrew Emili. Mass-Spectrometry-Based Functional Proteomic and Phosphoproteomic Technologies and Their Application for Analyzing Ex Vivo and In Vitro Models of Hypertrophic Cardiomyopathy. International Journal of Molecular Sciences 2021, 22 (24) , 13644. https://doi.org/10.3390/ijms222413644
  26. Rina Agustina, Yusuke Masuo, Yasuto Kido, Kyosuke Shinoda, Takahiro Ishimoto, Yukio Kato. Identification of Food-Derived Isoflavone Sulfates as Inhibition Markers for Intestinal Breast Cancer Resistance Proteins. Drug Metabolism and Disposition 2021, 49 (11) , 972-984. https://doi.org/10.1124/dmd.121.000534
  27. Armando Alcázar Magaña, Naofumi Kamimura, Amala Soumyanath, Jan F. Stevens, Claudia S. Maier. Caffeoylquinic acids: chemistry, biosynthesis, occurrence, analytical challenges, and bioactivity. The Plant Journal 2021, 107 (5) , 1299-1319. https://doi.org/10.1111/tpj.15390
  28. Rodi Abdalkader, Romanas Chaleckis, Craig E. Wheelock, Ken-ichiro Kamei. Spatiotemporal determination of metabolite activities in the corneal epithelium on a chip. Experimental Eye Research 2021, 209 , 108646. https://doi.org/10.1016/j.exer.2021.108646
  29. Rodi Abdalkader, Romanas Chaleckis, Isabel Meister, Pei Zhang, Craig E. Wheelock, Ken-ichiro Kamei. Untargeted LC-MS Metabolomics for the Analysis of Micro-scaled Extracellular Metabolites from Hepatocytes. Analytical Sciences 2021, 37 (7) , 1049-1052. https://doi.org/10.2116/analsci.20N032
  30. Stephanie L. Collins, Imhoi Koo, Jeffrey M. Peters, Philip B. Smith, Andrew D. Patterson. Current Challenges and Recent Developments in Mass Spectrometry–Based Metabolomics. Annual Review of Analytical Chemistry 2021, 14 (1) , 467-487. https://doi.org/10.1146/annurev-anchem-091620-015205
  31. Jiaqian Qiu, Tongzhou Li, Zheng-Jiang Zhu. Multi-dimensional characterization and identification of sterols in untargeted LC-MS analysis using all ion fragmentation technology. Analytica Chimica Acta 2021, 1142 , 108-117. https://doi.org/10.1016/j.aca.2020.10.058
  • Abstract

    Figure 1

    Figure 1. Flowchart of the CorrDec method for a target feature Ft1. A. For each feature, the Pearson correlations are calculated for all pairs of precursor (MS1 vector) and product ions (MS2 matrix). B. All correlation values of all features are merged into a single matrix. C. Product ions satisfying the three criteria (see the main text for details) are selected to produce the deconvoluted MS2 spectrum of Ft1.

    Figure 2

    Figure 2. Demonstration of the CorrDec method using tyrosine dilution series spiked into diluted urine as background matrix. A. Raw MS2 spectra of tyrosine [M + H]+ (m/z: 182.082) at the lowest (69 nM) and the highest (4 μM) spiked concentrations in dilution series. Raw MS2 spectra contain over one hundred peaks masking the ions derived from tyrosine, especially at low spiked-in concentrations. B. Linked scatter plots visualizing the intensity correlations between the MS1 m/z 182.082 and MS2 peaks in 11 dilution series samples. Only 12 out of 193 (10 eV) and 13 out of 280 peaks (30 eV) correlated >0.9 (highlighted lines). C. Deconvoluted MS2 spectra (above, in black) matched well with the library reference spectra (below, in red). The MS2 similarities of deconvoluted spectra were 90.5% (10 eV) and 86.5% (30 eV), while the MS2 similarities of raw spectra at 0, 10, and 30 eV were less than 30% in the all samples.

    Figure 3

    Figure 3. CorrDec MS2 spectra provide increased confidence in compound identification than those obtained by MS2Dec in the urinary metabolomics DIA data set. A. Number of compounds in each identification category identified using MS2Dec and CorrDec. B. Distribution of the MS2 similarity scores for the MSI level-1 compounds spectra deconvoluted by the CorrDec and MS2Dec. C. MS2 similarity scores from CorrDec were higher than MS2Dec, especially for low-intensity peaks.

    Figure 4

    Figure 4. CorrDec can successfully deconvolute the MS2 spectra of completely coeluting compounds, glutamine and N-acetylcarnosine. A. The raw MS2 spectrum and extracted ion chromatograms in MS1 (0 eV) of completely coeluting glutamine and N-acetylcarnosine as well as B. their fragments in MS2 (10 eV) from the urine data (QC1 sample in batch 1). C. MS2 spectra of glutamine and N-acetylcarnosine deconvoluted by the MS2Dec. D. MS2 spectra of glutamine and N-acetylcarnosine deconvoluted by the CorrDec.

    Figure 5

    Figure 5. Summary of the randomized resampling analysis for the 85 CorrDec AMRT+MS2 compounds (Figure 3) to assess the relationship between the number of samples (urinary metabolomics data set) used for the CorrDec and quality of the deconvoluted MS2 spectra compared library MS2 spectrum.

  • References

    ARTICLE SECTIONS
    Jump To

    This article references 34 other publications.

    1. 1
      Horai, H. MassBank: a public repository for sharing mass spectral data for life sciences. J. Mass Spectrom. 2010, 45, 703714,  DOI: 10.1002/jms.1777
    2. 2
      Wang, M. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 2016, 34, 828837,  DOI: 10.1038/nbt.3597
    3. 3
      Dührkop, K.; Shen, H.; Meusel, M.; Rousu, J.; Böcker, S. Searching molecular structure databases with tandem mass spectra using CSI:FingerID. Proc. Natl. Acad. Sci. U. S. A. 2015, 112, 1258012585,  DOI: 10.1073/pnas.1509788112
    4. 4
      Tsugawa, H.; Nakabayashi, R.; Mori, T.; Yamada, Y.; Takahashi, M.; Rai, A.; Sugiyama, R.; Yamamoto, H.; Nakaya, T.; Yamazaki, M.; Kooke, R.; Bac-Molenaar, J. A.; Oztolan-Erol, N.; Keurentjes, J. J. B.; Arita, M.; Saito, K. A cheminformatics approach to characterize metabolomes in stable-isotope-labeled organisms. Nat. Methods 2019, 16, 295298,  DOI: 10.1038/s41592-019-0358-2
    5. 5
      Nikolskiy, I.; Mahieu, N. G.; Chen, Y. J.; Tautenhahn, R.; Patti, G. J. An untargeted metabolomic workflow to improve structural characterization of metabolites. Anal. Chem. 2013, 85, 77137719,  DOI: 10.1021/ac400751j
    6. 6
      Lawson, T. N.; Weber, R. J. M.; Jones, M. R.; Chetwynd, A. J.; Rodríguez-Blanco, G.; Di Guida, R.; Viant, M. R.; Dunn, W. B. msPurity: Automated evaluation of precursor ion purity for mass spectrometry-based fragmentation in metabolomics. Anal. Chem. 2017, 89, 24322439,  DOI: 10.1021/acs.analchem.6b04358
    7. 7
      Zhu, X.; Chen, Y.; Subramanian, R. Comparison of information-dependent acquisition, SWATH, and MS(All) techniques in metabolite identification study employing ultrahigh-performance liquid chromatography-quadrupole time-of-flight mass spectrometry. Anal. Chem. 2014, 86, 12021209,  DOI: 10.1021/ac403385y
    8. 8
      Röst, H. L.; Rosenberger, G.; Navarro, P.; Gillet, L.; Miladinović, S. M.; Schubert, O. T.; Wolski, W.; Collins, B. C.; Malmström, J.; Malmström, L.; Aebersold, R. OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat. Biotechnol. 2014, 32, 219223,  DOI: 10.1038/nbt.2841
    9. 9
      Peckner, R.; Myers, S. A.; Jacome, A. S. V.; Egertson, J. D.; Abelin, J. G.; MacCoss, M. J.; Carr, S. A.; Jaffe, J. D. Specter: linear deconvolution for targeted analysis of data-independent acquisition mass spectrometry proteomics. Nat. Methods 2018, 15, 371378,  DOI: 10.1038/nmeth.4643
    10. 10
      Li, H.; Cai, Y.; Guo, Y.; Chen, F.; Zhu, Z. J. MetDIA: Targeted metabolite extraction of multiplexed MS/MS spectra generated by data-independent acquisition. Anal. Chem. 2016, 88, 87578764,  DOI: 10.1021/acs.analchem.6b02122
    11. 11
      Tsugawa, H.; Cajka, T.; Kind, T.; Ma, Y.; Higgins, B.; Ikeda, K.; Kanazawa, M.; VanderGheynst, J.; Fiehn, O.; Arita, M. MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nat. Methods 2015, 12, 523526,  DOI: 10.1038/nmeth.3393
    12. 12
      Yin, Y.; Wang, R.; Cai, Y.; Wang, Z.; Zhu, Z.-J. DecoMetDIA: Deconvolution of multiplexed MS/MS spectra for metabolite identification in SWATH-MS-based untargeted metabolomics. Anal. Chem. 2019, 91, 1189711904,  DOI: 10.1021/acs.analchem.9b02655
    13. 13
      Brown, M.; Wedge, D. C.; Goodacre, R.; Kell, D. B.; Baker, P. N.; Kenny, L. C.; Mamas, M. A.; Neyses, L.; Dunn, W. B. Automated workflows for accurate mass-based putative metabolite identification in LC/MS-derived metabolomic datasets. Bioinformatics 2011, 27, 11081112,  DOI: 10.1093/bioinformatics/btr079
    14. 14
      Alonso, A.; Marsal, S.; Julià, A. Analytical methods in untargeted metabolomics: state of the art in 2015. Front. Bioeng. Biotechnol. 2015, 3, 23,  DOI: 10.3389/fbioe.2015.00023
    15. 15
      Kuhl, C.; Tautenhahn, R.; Böttcher, C.; Larson, T. R.; Neumann, S. CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. Anal. Chem. 2012, 84, 283289,  DOI: 10.1021/ac202450g
    16. 16
      Broeckling, C. D.; Heuberger, A. L.; Prince, J. A.; Ingelsson, E.; Prenni, J. E. Assigning precursor–product ion relationships in indis. criminant MS/MS data from non-targeted metabolite profiling studies. Metabolomics 2013, 9, 3343,  DOI: 10.1007/s11306-012-0426-4
    17. 17
      Broeckling, C. D.; Afsar, F. A.; Neumann, S.; Ben-Hur, A.; Prenni, J. E. RAMClust: A novel feature clustering method enables spectral-matching-based annotation for metabolomics data. Anal. Chem. 2014, 86, 68126817,  DOI: 10.1021/ac501530d
    18. 18
      Naz, S.; Gallart-Ayala, H.; Reinke, S. N.; Mathon, C.; Blankley, R.; Chaleckis, R.; Wheelock, C. E. Development of a liquid chromatography-high resolution mass spectrometry metabolomics method with high specificity for metabolite identification using all ion fragmentation acquisition. Anal. Chem. 2017, 89, 79337942,  DOI: 10.1021/acs.analchem.7b00925
    19. 19
      Chaleckis, R.; Naz, S.; Meister, I.; Wheelock, C. E. LC-MS-based metabolomics of biofluids using All-Ion Fragmentation (AIF) acquisition. Methods Mol. Biol. 2018, 1730, 4558,  DOI: 10.1007/978-1-4939-7592-1_3
    20. 20
      Lazarinis, N.; Bood, J.; Gomez, C.; Kolmert, J.; Lantz, A. S.; Gyllfors, P.; Davis, A.; Wheelock, C. E.; Dahlén, S. E.; Dahlén, B. Leukotriene E4 induces airflow obstruction and mast cell activation through the cysteinyl leukotriene type 1 receptor. J. Allergy Clin. Immunol. 2018, 142, 10801089,  DOI: 10.1016/j.jaci.2018.02.024
    21. 21
      Haug, K.; Salek, R. M.; Conesa, P.; Hastings, J.; de Matos, P.; Rijnbeek, M.; Mahendraker, T.; Williams, M.; Neumann, S.; Rocca-Serra, P.; Maguire, E.; González-Beltrán, A.; Sansone, S. A.; Griffin, J. L.; Steinbeck, C. MetaboLights--an open-access general-purpose repository for metabolomics studies and associated meta-data. Nucleic Acids Res. 2013, 41, D781786,  DOI: 10.1093/nar/gks1004
    22. 22
      Tada, I.; Tsugawa, H.; Meister, I.; Zhang, P.; Shu, R.; Katsumi, R.; Wheelock, C. E.; Arita, M.; Chaleckis, R. Creating a reliable mass spectral–retention time library for all ion fragmentation-based metabolomics. Metabolites 2019, 9, 251,  DOI: 10.3390/metabo9110251
    23. 23
      Moorthy, A. S.; Wallace, W. E.; Kearsley, A. J.; Tchekhovskoi, D. V.; Stein, S. E. Combining fragment-ion and neutral-loss matching during mass spectral library searching: A new general purpose algorithm applicable to illicit drug identification. Anal. Chem. 2017, 89, 1326113268,  DOI: 10.1021/acs.analchem.7b03320
    24. 24
      Domingo-Almenara, X.; Montenegro-Burke, J. R.; Benton, H. P.; Siuzdak, G. Annotation: A computational solution for streamlining metabolomics analysis. Anal. Chem. 2018, 90, 480489,  DOI: 10.1021/acs.analchem.7b03929
    25. 25
      Domingo-Almenara, X.; Montenegro-Burke, J. R.; Guijas, C.; Majumder, E. L.; Benton, H. P.; Siuzdak, G. Autonomous METLIN-guided in-source fragment annotation for untargeted metabolomics. Anal. Chem. 2019, 91, 32463253,  DOI: 10.1021/acs.analchem.8b03126
    26. 26
      Sumner, L. W. Proposed minimum reporting standards for chemical analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI). Metabolomics 2007, 3, 211221,  DOI: 10.1007/s11306-007-0082-2
    27. 27
      Chaleckis, R.; Meister, I.; Zhang, P.; Wheelock, C. E. Challenges, progress and promises of metabolite annotation for LC-MS-based metabolomics. Curr. Opin. Biotechnol. 2019, 55, 4450,  DOI: 10.1016/j.copbio.2018.07.010
    28. 28
      Tsuruta, Y.; Tomida, H.; Kohashi, K.; Ohkura, Y. Simultaneous determination of imidazoleacetic acid and N tau- and N pi-methylimidazoleacetic acids in human urine by high-performance liquid chromatography with fluorescence detection. J. Chromatogr., Biomed. Appl. 1987, 416, 6369,  DOI: 10.1016/0378-4347(87)80485-1
    29. 29
      Pohjanpelto, P.; Niemi, K.; Sarmela, T. Anterior chamber haemorrhage in the newborn after spontaneous delivery. A case report. Acta Ophthalmol. 1979, 57, 443446,  DOI: 10.1111/j.1755-3768.1979.tb01827.x
    30. 30
      Bouatra, S.; Aziat, F.; Mandal, R.; Guo, A. C.; Wilson, M. R.; Knox, C.; Bjorndahl, T. C.; Krishnamurthy, R.; Saleem, F.; Liu, P.; Dame, Z. T.; Poelzer, J.; Huynh, J.; Yallou, F. S.; Psychogios, N.; Dong, E.; Bogumil, R.; Roehring, C.; Wishart, D. S. The human urine metabolome. PLoS One 2013, 8, e73076  DOI: 10.1371/journal.pone.0073076
    31. 31
      Li, C.; Homma, M.; Oka, K. Characteristics of delayed excretion of flavonoids in human urine after administration of Shosaiko-to, a herbal medicine. Biol. Pharm. Bull. 1998, 21, 12511257,  DOI: 10.1248/bpb.21.1251
    32. 32
      Hornik, P.; Vyskocilová, P.; Friedecký, D.; Adam, T. Diagnosing AICA-ribosiduria by capillary electrophoresis. J. Chromatogr. B: Anal. Technol. Biomed. Life Sci. 2006, 843, 1519,  DOI: 10.1016/j.jchromb.2006.05.020
    33. 33
      Bales, J. R.; Sadler, P. J.; Nicholson, J. K.; Timbrell, J. A. Urinary excretion of acetaminophen and its metabolites as studied by proton NMR spectroscopy. Clin. Chem. 1984, 30, 16311636,  DOI: 10.1093/clinchem/30.10.1631
    34. 34
      Tsugawa, H., Ikeda, K., Takahashi, M., Satoh, A., Mori, Y., Uchino, H., Okahashi, N., Yamada, Y., Tada, I., Bonini, P., Higashi, Y., Okazaki, Y., Zhou, Z., Zhu, Z. J., Koelmel, J., Cajka, T., Fiehn, O., Saito, K., Arita, M., Arita, M. A lipidome atlas in MS-DIAL 4. Nat. Biotechnol. 2020,  DOI: 10.1038/s41587-020-0531-2 .
  • Supporting Information

    Supporting Information

    ARTICLE SECTIONS
    Jump To

    The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.0c01980.

    • Figures S1–S8, legends of Tables S1–S8 (PDF)

    • Tables S1–S8 (XLSX)


    Terms & Conditions

    Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.

Pair your accounts.

Export articles to Mendeley

Get article recommendations from ACS based on references in your Mendeley library.

Pair your accounts.

Export articles to Mendeley

Get article recommendations from ACS based on references in your Mendeley library.

You’ve supercharged your research process with ACS and Mendeley!

STEP 1:
Click to create an ACS ID

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

MENDELEY PAIRING EXPIRED
Your Mendeley pairing has expired. Please reconnect