Early Diagnosis: End-to-End CNN–LSTM Models for Mass Spectrometry Data ClassificationClick to copy article linkArticle link copied!
- Khawla SeddikiKhawla SeddikiCentre de Recherche du CHU de Québec-Université Laval, Québec City, Québec G1V 4G2, CanadaUniv. Lille, Inserm, CHU Lille, U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM, Lille F-59000, FranceMore by Khawla Seddiki
- Fŕed́eric PreciosoFŕed́eric PreciosoUniversité Ĉote d’Azur, CNRS, INRIA, I3S, Sophia Antipolis 06900, FranceMore by Fŕed́eric Precioso
- Melissa SanabriaMelissa SanabriaUniversité Ĉote d’Azur, CNRS, INRIA, I3S, Sophia Antipolis 06900, FranceMore by Melissa Sanabria
- Michel SalzetMichel SalzetUniv. Lille, Inserm, CHU Lille, U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM, Lille F-59000, FranceMore by Michel Salzet
- Isabelle FournierIsabelle FournierUniv. Lille, Inserm, CHU Lille, U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM, Lille F-59000, FranceMore by Isabelle Fournier
- Arnaud Droit*Arnaud Droit*Email: [email protected]Centre de Recherche du CHU de Québec-Université Laval, Québec City, Québec G1V 4G2, CanadaMore by Arnaud Droit
Abstract
Liquid chromatography–mass spectrometry (LC–MS) is a powerful method for cell profiling. The use of LC–MS technology is a tool of choice for cancer research since it provides molecular fingerprints of analyzed tissues. However, the ubiquitous presence of noise, the peaks shift between acquisitions, and the huge amount of information owing to the high dimensionality of the data make rapid and accurate cancer diagnosis a challenging task. Deep learning (DL) models are not only effective classifiers but are also well suited to jointly learn feature representation and classification tasks. This is particularly relevant when applied to raw LC–MS data and hence avoid the need for costly preprocessing and complicated feature selection. In this study, we propose a new end-to-end DL methodology that addresses all of the above challenges at once, while preserving the high potential of LC–MS data. Our DL model is designed to early discriminate between tumoral and normal tissues. It is a combination of a convolutional neural network (CNN) and a long short-term memory (LSTM) Network. The CNN network allows for significantly reducing the high dimensionality of the data while learning spatially relevant features. The LSTM network enables our model to capture temporal patterns. We show that our model outperforms not only benchmark models but also state-of-the-art models developed on the same data. Our framework is a promising strategy for improving early cancer detection during a diagnostic process.
This publication is licensed under
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
Introduction
Materials and Methods
Datasets
Experimental Design
Our Two-Stage CNN–LSTM Models
Comparison of Our Models with LSTM Models
Comparison of Our Models with Hybrid CNN–LSTM Models
Comparison of Our Models with State-of-the-Art Models
Early Classification with Our Two-Stage CNN–LSTM Models
Results
Our Two-Stage CNN–LSTM Models Performances
Figure 1
Figure 1. Accuracies of LSTM models on embedded (a) hepatic, (b) positive renal, and (c) negative renal datasets at three RT binning time (10, 5, and 3 or 2 s). (30) means one LSTM layer of 30 neurons and (30,30) means two LSTM layers of 30 neurons each.
Comparison of Our Models with LSTM Models Performances
Figure 2
Figure 2. Accuracies of LSTM models on raw (a) hepatic, (b) positive renal, and (c) negative renal datasets at three RT binning time (10, 5, and 3 or 2 s). (30) means one LSTM layer of 30 neurons and (30,30) means two LSTM layers of 30 neurons each.
Comparison of Our Models with Hybrid CNN–LSTM Models Performances
Figure 3
Figure 3. Accuracies of hybrid CNN–LSTM models on raw (a) hepatic, (b) positive renal, and (c) negative renal datasets at three RT binning time (10, 5, and 3 or 2 s). For hepatic and positive renal data, M1: Conv(32,21)(16,11)-LSTM(30,30), M2: Conv(32,21)-LSTM(30,30) and M3: Conv(32,11)-LSTM(30,30). For negative renal data, M1: Conv(32,21)(16,11)-LSTM(60)(10 s)/LSTM(60,60)(5 s)/LSTM(120)(2 s), M2: Conv(32,21)-LSTM(60)(10 s)/LSTM(60,60)(5 s)/LSTM(120)(2 s) and M3: Conv(32,11)-LSTM(60)(10 s)/LSTM(60,60)(5 s)/LSTM(120)(2 s). Conv(32,21) means 32 kernels in the convolutional layer with size 21. LSTM(30,30) means two LSTM layers of 30 neurons each.
Comparison of Our Models with State-of-the-Art Models
Early Classification Performances with Our Two-Stage CNN–LSTM Models
Figure 4
Figure 4. Early classification accuracies of our two-stage CNN–LSTM model on (a) hepatic, (b) positive renal, and (c) negative renal datasets at three RT binning time (10, 5, and 3 or 2 s).
Discussion
Conclusions
Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.3c00613.
Additional results and figures including matrix construction, our two-stage CNN–LSTM classification model, hybrid CNN–LSTM models architectures, 1D-CNN embedding models, classification performances and early classification performances with our two-stage CNN-LSTM model (PDF)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.
Acknowledgments
This research is supported by funding from the Fonds de recherche du Québec-Santé (FRQS), Ministère de l’Enseignement Supérieur, de la Recherche et de l’Innovation (MESRI), Institut National de la Santé et de la Recherche Médicale (Inserm), Agence Nationale de la Recherche (ANR), SATT Nord, Institut Nationale du Cancer (INCA), and Université de Lille, with the support of ”Service de coopération et d’action culturelle du Consulat général de France à Québec”
References
This article references 30 other publications.
- 1Feider, C. L.; Krieger, A.; DeHoog, R. J.; Eberlin, L. S. Ambient Ionization Mass Spectrometry: Recent Developments and Applications. Anal. Chem. 2019, 91, 4266– 4290, DOI: 10.1021/acs.analchem.9b00807Google Scholar1Ambient Ionization Mass Spectrometry: Recent Developments and ApplicationsFeider, Clara L.; Krieger, Anna; DeHoog, Rachel J.; Eberlin, Livia S.Analytical Chemistry (Washington, DC, United States) (2019), 91 (7), 4266-4290CODEN: ANCHAM; ISSN:0003-2700. (American Chemical Society)A review. The early 2000s brought a new age in the field of mass spectrometry (MS) with the introduction of ambient ionization MS techniques. As defined by 1 of its early visionaries, Prof. R. Graham Cooks, ambient ionization refers to the ionization of unprocessed or minimally modified samples in their native environment, and it typically refers to the ionization of condensed phase samples in air. Since its inception in 2004, many researchers and labs. have contributed with approaches for sampling and ionization at atm. conditions, greatly decreasing exptl. complexity and time required for mass spectrometry analyses. In the last 15 years, innovations in the field of ambient ionization MS have grown expansively, pushing these technologies far past their point of conception and integrating them into the broader scientific community in creative and stimulating ways.
- 2Berrar, D.; Dubitzky, W. Deep learning in bioinformatics and biomedicine. Briefings Bioinf. 2021, 22, 1513– 1514, DOI: 10.1093/bib/bbab087Google Scholar2Deep learning in bioinformatics and biomedicineBerrar Daniel; Dubitzky WernerBriefings in bioinformatics (2021), 22 (2), 1513-1514 ISSN:.There is no expanded citation for this reference.
- 3LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. nature 2015, 521, 436– 444, DOI: 10.1038/nature14539Google Scholar3Deep learningLeCun, Yann; Bengio, Yoshua; Hinton, GeoffreyNature (London, United Kingdom) (2015), 521 (7553), 436-444CODEN: NATUAS; ISSN:0028-0836. (Nature Publishing Group)Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
- 4Chollet, F. Deep Learning with Python; Simon & Schuster, 2021.Google ScholarThere is no corresponding record for this reference.
- 5Wen, B.; Zeng, W.-F.; Liao, Y.; Shi, Z.; Savage, S. R.; Jiang, W.; Zhang, B. Deep Learning in Proteomics. Proteomics 2020, 20, 1900335, DOI: 10.1002/pmic.201900335Google Scholar5Deep Learning in ProteomicsWen, Bo; Zeng, Wen-Feng; Liao, Yuxing; Shi, Zhiao; Savage, Sara R.; Jiang, Wen; Zhang, BingProteomics (2020), 20 (21-22), 1900335CODEN: PROTC7; ISSN:1615-9853. (Wiley-VCH Verlag GmbH & Co. KGaA)Proteomics, the study of all the proteins in biol. systems, is becoming a data-rich science. Protein sequences and structures are comprehensively catalogued in online databases. With recent advancements in tandem mass spectrometry (MS) technol., protein expression and post-translational modifications (PTMs) can be studied in a variety of biol. systems at the global scale. Sophisticated computational algorithms are needed to translate the vast amt. of data into novel biol. insights. Deep learning automatically exts. data representations at high levels of abstraction from data, and it thrives in data-rich scientific research domains. Here, a comprehensive overview of deep learning applications in proteomics, including retention time prediction, MS/MS spectrum prediction, de novo peptide sequencing, PTM prediction, major histocompatibility complex-peptide binding prediction, and protein structure prediction, is provided. Limitations and the future directions of deep learning in proteomics are also discussed. This review will provide readers an overview of deep learning and how it can be used to analyze proteomics data.
- 6Cortes, C.; Vapnik, V. Support-vector Networks, 3rd ed.; Springer, 1995; Vol. 20, pp 273– 297.Google ScholarThere is no corresponding record for this reference.
- 7Breiman, L. Mach. Learn. 2001, 45, 5– 32, DOI: 10.1023/a:1010933404324Google ScholarThere is no corresponding record for this reference.
- 8Wu, B.; Abbott, T.; Fishman, D.; McMurray, W.; Mor, G.; Stone, K.; Ward, D.; Williams, K.; Zhao, H. Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data. Bioinformatics 2003, 19, 1636– 1643, DOI: 10.1093/bioinformatics/btg210Google Scholar8Comparison of statistical methods for classification of ovarian cancer using mass spectrometry dataWu, Baolin; Abbott, Tom; Fishman, David; McMurray, Walter; Mor, Gil; Stone, Kathryn; Ward, David; Williams, Kenneth; Zhao, HongyuBioinformatics (2003), 19 (13), 1636-1643CODEN: BOINFP; ISSN:1367-4803. (Oxford University Press)Motivation: Novel methods, both mol. and statistical, are urgently needed to take advantage of recent advances in biotechnol. and the human genome project for disease diagnosis and prognosis. Mass spectrometry (MS) holds great promise for biomarker identification and genome-wide protein profiling. It has been demonstrated in the literature that biomarkers can be identified to distinguish normal individuals from cancer patients using MS data. Such progress is esp. exciting for the detection of early-stage ovarian cancer patients. Although various statistical methods have been utilized to identify biomarkers from MS data, there has been no systematic comparison among these approaches in their relative ability to analyze MS data. Results: We compare the performance of several classes of statistical methods for the classification of cancer based on MS spectra. These methods include: linear discriminant anal., quadratic discriminant anal., k-nearest neighbor classifier, bagging and boosting classification trees, support vector machine, and random forest (RF). The methods are applied to ovarian cancer and control serum samples from the National Ovarian Cancer Early Detection Program clinic at Northwestern University Hospital. We found that RF outperforms other methods in the anal. of MS data.
- 9Gredell, D. A.; Schroeder, A. R.; Belk, K. E.; Broeckling, C. D.; Heuberger, A. L.; Kim, S.-Y.; King, D. A.; Shackelford, S. D.; Sharp, J. L.; Wheeler, T. L. Comparison of Machine Learning Algorithms for Predictive Modeling of Beef Attributes Using Rapid Evaporative Ionization Mass Spectrometry (REIMS) Data. Sci. Rep. 2019, 9, 5721, DOI: 10.1038/s41598-019-40927-6Google Scholar9Comparison of Machine Learning Algorithms for Predictive Modeling of Beef Attributes Using Rapid Evaporative Ionization Mass Spectrometry (REIMS) DataGredell Devin A; Belk Keith E; Woerner Dale R; Schroeder Amelia R; Broeckling Corey D; Heuberger Adam L; Prenni Jessica E; Kim Soo-Young; Sharp Julia L; King D Andy; Shackelford Steven D; Wheeler Tommy LScientific reports (2019), 9 (1), 5721 ISSN:.Ambient mass spectrometry is an analytical approach that enables ionization of molecules under open-air conditions with no sample preparation and very fast sampling times. Rapid evaporative ionization mass spectrometry (REIMS) is a relatively new type of ambient mass spectrometry that has demonstrated applications in both human health and food science. Here, we present an evaluation of REIMS as a tool to generate molecular scale information as an objective measure for the assessment of beef quality attributes. Eight different machine learning algorithms were compared to generate predictive models using REIMS data to classify beef quality attributes based on the United States Department of Agriculture (USDA) quality grade, production background, breed type and muscle tenderness. The results revealed that the optimal machine learning algorithm, as assessed by predictive accuracy, was different depending on the classification problem, suggesting that a "one size fits all" approach to developing predictive models from REIMS data is not appropriate. The highest performing models for each classification achieved prediction accuracies between 81.5-99%, indicating the potential of the approach to complement current methods for classifying quality attributes in beef.
- 10Datta, S.; DePadilla, L. M. Feature selection and machine learning with mass spectrometry data for distinguishing cancer and non-cancer samples. Stat. Methodol. 2006, 3, 79– 92, DOI: 10.1016/j.stamet.2005.09.006Google ScholarThere is no corresponding record for this reference.
- 11Cannataro, M.; Guzzi, P. H.; Mazza, T.; Veltri, P. ACM SIGIR 2005, 112, 17– 26Google ScholarThere is no corresponding record for this reference.
- 12Engel, J.; Gerretzen, J.; Szymańska, E.; Jansen, J. J.; Downey, G.; Blanchet, L.; Buydens, L. M. TrAC, Trends Anal. Chem. 2013, 50, 96– 106, DOI: 10.1016/j.trac.2013.04.015Google Scholar12Breaking with trends in pre-processing?Engel, Jasper; Gerretzen, Jan; Szymanska, Ewa; Jansen, Jeroen J.; Downey, Gerard; Blanchet, Lionel; Buydens, Lutgarde M. C.TrAC, Trends in Analytical Chemistry (2013), 50 (), 96-106CODEN: TTAEDJ; ISSN:0165-9936. (Elsevier B. V.)A review. Data pre-processing is an essential part of chemometric data anal., which aims to remove unwanted variation (such as instrumental artifacts) and thereby focusing on the variation of interest. The choice of an optimal pre-processing method or combination of methods may strongly influence the anal. results, but is far from straightforward, since it depends on the characteristics of the data set and the goal of data anal. This 1st crit. review is devoted to the selection procedure for appropriate pre-processing strategies. Breaking with current trends in pre-processing is essential, as all selection approaches have serious drawbacks and cannot be properly used.
- 13Seddiki, K.; Saudemont, P.; Precioso, F.; Ogrinc, N.; Wisztorski, M.; Salzet, M.; Fournier, I.; Droit, A. Nat. Commun. 2020, 11, 5595, DOI: 10.1038/s41467-020-19354-zGoogle Scholar13Cumulative learning enables convolutional neural network representations for small mass spectrometry data classificationSeddiki, Khawla; Saudemont, Philippe; Precioso, Frederic; Ogrinc, Nina; Wisztorski, Maxence; Salzet, Michel; Fournier, Isabelle; Droit, ArnaudNature Communications (2020), 11 (1), 5595CODEN: NCAOBW; ISSN:2041-1723. (Nature Research)Rapid and accurate clin. diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps to remove artifacts, making them unsuitable for rapid anal. Convolutional Neural Networks (CNNs) have been found to perform well under such circumstances since they can learn representations from raw data. However, their effectiveness decreases when the no. of available training samples is small, which is a common situation in medicine. In this work, we investigate transfer learning on 1D-CNNs, then we develop a cumulative learning method when transfer learning is not powerful enough. We propose to train the same model through several classification tasks over various small datasets to accumulate knowledge in the resulting representation. By using rat brain as the initial training dataset, a cumulative learning approach can have a classification accuracy exceeding 98% for 1D clin. MS-data. We show the use of cumulative learning using datasets generated in different biol. contexts, on different organisms, and acquired by different instruments. Here we show a promising strategy for improving MS data classification accuracy when only small nos. of samples are available.
- 14Gupta, A.; Gupta, H. P.; Biswas, B.; Dutta, T. IEEE Transactions on Artificial Intelligence; IEEE, 2020; Vol. 1, pp 47– 61.Google ScholarThere is no corresponding record for this reference.
- 15Mori, U.; Mendiburu, A.; Dasgupta, S.; Lozano, J. A. Early Classification of Time Series by Simultaneously Optimizing the Accuracy and Earliness. IEEE Transact. Neural Networks Learn. Syst. 2018, 29, 4569– 4578, DOI: 10.1109/tnnls.2017.2764939Google Scholar15Early Classification of Time Series by Simultaneously Optimizing the Accuracy and EarlinessMori Usue; Mendiburu Alexander; Dasgupta Sanjoy; Lozano Jose AIEEE transactions on neural networks and learning systems (2018), 29 (10), 4569-4578 ISSN:.The problem of early classification of time series appears naturally in contexts where the data, of temporal nature, are collected over time, and early class predictions are interesting or even required. The objective is to classify the incoming sequence as soon as possible, while maintaining suitable levels of accuracy in the predictions. Thus, we can say that the problem of early classification consists of optimizing two objectives simultaneously: accuracy and earliness. In this context, we present a method for early classification based on combining a set of probabilistic classifiers together with a stopping rule (SR). This SR will act as a trigger and will tell us when to output a prediction or when to wait for more data, and its main novelty lies in the fact that it is built by explicitly optimizing a cost function based on accuracy and earliness. We have selected a large set of benchmark data sets and four other state-of-the-art early classification methods, and we have evaluated and compared our framework obtaining superior results in terms of both earliness and accuracy.
- 16Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Proceedings of the IEEE Conference on Computer Vision and Pattern Recogni-tion; IEEE, 2015, pp 1– 9.Google ScholarThere is no corresponding record for this reference.
- 17Papagiannopoulou, C.; Parchen, R.; Rubbens, P.; Waegeman, W. Fast Pathogen Identification Using Single-Cell Matrix-Assisted Laser Desorption/Ionization-Aerosol Time-of-Flight Mass Spectrometry Data and Deep Learning Methods. Anal. Chem. 2020, 92, 7523– 7531, DOI: 10.1021/acs.analchem.9b05806Google Scholar17Fast Pathogen Identification Using Single-Cell Matrix-Assisted Laser Desorption/Ionization-Aerosol Time-of-Flight Mass Spectrometry Data and Deep Learning MethodsPapagiannopoulou, Christina; Parchen, Rene; Rubbens, Peter; Waegeman, WillemAnalytical Chemistry (Washington, DC, United States) (2020), 92 (11), 7523-7531CODEN: ANCHAM; ISSN:0003-2700. (American Chemical Society)In diagnostics of infectious diseases, matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry (MALDI-TOF MS) can be applied for the identification of pathogenic microorganisms. However, to achieve a trustworthy identification from MALDI-TOF MS data, a significant amt. of biomass should be considered. The bacterial load that potentially occurs in a sample is therefore routinely amplified by culturing, which is a time-consuming procedure. In this paper, the authors show that culturing can be avoided by conducting MALDI-TOF MS on individual bacterial cells. This results in a more rapid identification of species with an acceptable accuracy. The authors propose a deep learning architecture to analyze the data and compare its performance with traditional supervised machine learning algorithms. The authors illustrate the workflow on a large data set that contains bacterial species related to urinary tract infections. Overall the authors obtain accuracies up to 85% in discriminating five different species.
- 18Skarysz, A.; Salman, D.; Eddleston, M.; Sykora, M.; Hunsicker, E.; Nailon, W. H.; Darnley, K.; McLaren, D. B.; Thomas, C.; Soltoggio, A. 2020, arXiv preprint arXiv:2006.01772.Google ScholarThere is no corresponding record for this reference.
- 19Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735– 1780, DOI: 10.1162/neco.1997.9.8.1735Google Scholar19Long short-term memoryHochreiter S; Schmidhuber JNeural computation (1997), 9 (8), 1735-80 ISSN:0899-7667.Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.
- 20Van Houdt, G.; Mosquera, C.; Nápoles, G. A review on the long short-term memory model. Artif. Intell. Rev. 2020, 53, 5929– 5955, DOI: 10.1007/s10462-020-09838-1Google ScholarThere is no corresponding record for this reference.
- 21Shickel, B.; Tighe, P. J.; Bihorac, A.; Rashidi, P. Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis. IEEE J. Biomed. Health Inf. 2018, 22, 1589– 1604, DOI: 10.1109/jbhi.2017.2767063Google Scholar21Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) AnalysisShickel Benjamin; Tighe Patrick James; Bihorac Azra; Rashidi ParisaIEEE journal of biomedical and health informatics (2018), 22 (5), 1589-1604 ISSN:.The past decade has seen an explosion in the amount of digital information stored in electronic health records (EHRs). While primarily designed for archiving patient information and performing administrative healthcare tasks like billing, many researchers have found secondary use of these records for various clinical informatics applications. Over the same period, the machine learning community has seen widespread advances in the field of deep learning. In this review, we survey the current research on applying deep learning to clinical tasks based on EHR data, where we find a variety of deep learning techniques and frameworks being applied to several types of clinical applications including information extraction, representation learning, outcome prediction, phenotyping, and deidentification. We identify several limitations of current research involving topics such as model interpretability, data heterogeneity, and lack of universal benchmarks. We conclude by summarizing the state of the field and identifying avenues of future deep EHR research.
- 22Nguyen, P.; Tran, T.; Wickramasinghe, N.; Venkatesh, S. Deepr: A Convolutional Net for Med-ical Records; arXiv. org, 2016.Google ScholarThere is no corresponding record for this reference.
- 23Zanjani, F. G.; Panteli, A.; Zinger, S.; van der Sommen, F.; Tan, T.; Balluff, B.; Vos, D. N.; Ellis, S. R.; Heeren, R. M.; Lucas, M.; IEEE 16th International Symposium on Biomedical Imaging; ISBI 2019, 2019, pp 674– 678.Google ScholarThere is no corresponding record for this reference.
- 24Liu, J.; Zhang, J.; Luo, Y.; Yang, S.; Wang, J.; Fu, Q. Mass Spectral Substance Detections Using Long Short-Term Memory Networks. IEEE Access 2019, 7, 10734– 10744, DOI: 10.1109/access.2019.2891548Google ScholarThere is no corresponding record for this reference.
- 25Zhang, J.; Liu, J.; Luo, Y.; Fu, Q.; Bi, J.; Qiu, S.; Cao, Y.; Ding, X. IEEE 17th International Conference on Communication Technology; ICCT, 2017, pp 1994– 1997.Google ScholarThere is no corresponding record for this reference.
- 26Jiang, Y.; Sun, A.; Zhao, Y.; Ying, W.; Sun, H.; Yang, X.; Xing, B.; Sun, W.; Ren, L.; Hu, B. Proteomics identifies new therapeutic targets of early-stage hepatocellular carcinoma. Nature 2019, 567, 257– 261, DOI: 10.1038/s41586-019-0987-8Google Scholar26Proteomics identifies new therapeutic targets of early-stage hepatocellular carcinomaJiang, Ying; Sun, Aihua; Zhao, Yang; Ying, Wantao; Sun, Huichuan; Yang, Xinrong; Xing, Baocai; Sun, Wei; Ren, Liangliang; Hu, Bo; Li, Chaoying; Zhang, Li; Qin, Guangrong; Zhang, Menghuan; Chen, Ning; Zhang, Manli; Huang, Yin; Zhou, Jinan; Zhao, Yan; Liu, Mingwei; Zhu, Xiaodong; Qiu, Yang; Sun, Yanjun; Huang, Cheng; Yan, Meng; Wang, Mingchao; Liu, Wei; Tian, Fang; Xu, Huali; Zhou, Jian; Wu, Zhenyu; Shi, Tieliu; Zhu, Weimin; Qin, Jun; Xie, Lu; Fan, Jia; Qian, Xiaohong; He, Fuchu; Chinese Human Proteome Project ConsortiumNature (London, United Kingdom) (2019), 567 (7747), 257-261CODEN: NATUAS; ISSN:0028-0836. (Nature Research)Hepatocellular carcinoma is the third leading cause of deaths from cancer worldwide. Infection with the hepatitis B virus is one of the leading risk factors for developing hepatocellular carcinoma, particularly in East Asia. Although surgical treatment may be effective in the early stages, the five-year overall rate of survival after developing this cancer is only 50-70%. Here, using proteomic and phospho-proteomic profiling, we characterize 110 paired tumor and non-tumor tissues of clin. early-stage hepatocellular carcinoma related to hepatitis B virus infection. Our quant. proteomic data highlight heterogeneity in early-stage hepatocellular carcinoma: we used this to stratify the cohort into the subtypes S-I, S-II and S-III, each of which has a different clin. outcome. S-III, which is characterized by disrupted cholesterol homeostasis, is assocd. with the lowest overall rate of survival and the greatest risk of a poor prognosis after first-line surgery. The knockdown of sterol O-acyltransferase 1 (SOAT1) - high expression of which is a signature specific to the S-III subtype - alters the distribution of cellular cholesterol, and effectively suppresses the proliferation and migration of hepatocellular carcinoma. Finally, on the basis of a patient-derived tumor xenograft mouse model of hepatocellular carcinoma, we found that treatment with avasimibe, an inhibitor of SOAT1, markedly reduced the size of tumors that had high levels of SOAT1 expression. The proteomic stratification of early-stage hepatocellular carcinoma presented in this study provides insight into the tumor biol. of this cancer, and suggests opportunities for personalized therapies that target it.
- 27Bifarin, O. O.; Gaul, D. A.; Sah, S.; Arnold, R. S.; Ogan, K.; Master, V. A.; Roberts, D. L.; Bergquist, S. H.; Petros, J. A.; Fernandez, F. M. Machine Learning-Enabled Renal Cell Carcinoma Status Prediction Using Multiplatform Urine-Based Metabolomics. J. Proteome Res. 2021, 20, 3629– 3641, DOI: 10.1021/acs.jproteome.1c00213Google Scholar27Machine Learning-Enabled Renal Cell Carcinoma Status Prediction Using Multiplatform Urine-Based MetabolomicsBifarin, Olatomiwa O.; Gaul, David A.; Sah, Samyukta; Arnold, Rebecca S.; Ogan, Kenneth; Master, Viraj A.; Roberts, David L.; Bergquist, Sharon H.; Petros, John A.; Fernandez, Facundo M.; Edison, Arthur S.Journal of Proteome Research (2021), 20 (7), 3629-3641CODEN: JPROBS; ISSN:1535-3893. (American Chemical Society)Renal cell carcinoma (RCC) is diagnosed through expensive cross-sectional imaging, frequently followed by renal mass biopsy, which is not only invasive but also prone to sampling errors. Hence, there is a crit. need for a noninvasive diagnostic assay. RCC exhibits altered cellular metab. combined with the close proximity of the tumor(s) to the urine in the kidney, suggesting that urine metabolomic profiling is an excellent choice for assay development. Here, we acquired liq. chromatog.-mass spectrometry (LC-MS) and NMR (NMR) data followed by the use of machine learning (ML) to discover candidate metabolomic panels for RCC. The study cohort consisted of 105 RCC patients and 179 controls sepd. into two subcohorts: the model cohort and the test cohort. Univariate, wrapper, and embedded methods were used to select discriminatory features using the model cohort. Three ML techniques, each with different induction biases, were used for training and hyperparameter tuning. Assessment of RCC status prediction was evaluated using the test cohort with the selected biomarkers and the optimally tuned ML algorithms. A seven-metabolite panel predicted RCC in the test cohort with 88% accuracy, 94% sensitivity, 85% specificity, and 0.98 AUC. Metabolomics Workbench Study IDs are ST001705 and ST001706.
- 28Su, Z.; Xie, H.; Han, L. Multi-Factor RFG-LSTM Algorithm for Stock Sequence Predicting. Comput. Econ. 2021, 57, 1041– 1058, DOI: 10.1007/s10614-020-10008-2Google ScholarThere is no corresponding record for this reference.
- 29Ma, F.; Zhang, J.; Chen, W.; Liang, W.; Yang, W. Discrete Dynamics in Nature and Society; Hindawi, 2020.Google ScholarThere is no corresponding record for this reference.
- 30Dong, H.; Liu, Y.; Zeng, W.-F.; Shu, K.; Zhu, Y.; Chang, C. A Deep Learning-Based Tumor Classifier Directly Using MS Raw Data. Proteomics 2020, 20, 1900344, DOI: 10.1002/pmic.201900344Google Scholar30A Deep Learning-Based Tumor Classifier Directly Using MS Raw DataDong, Hao; Liu, Yi; Zeng, Wen-Feng; Shu, Kunxian; Zhu, Yunping; Chang, ChengProteomics (2020), 20 (21-22), 1900344CODEN: PROTC7; ISSN:1615-9853. (Wiley-VCH Verlag GmbH & Co. KGaA)Since the launch of Chinese Human Proteome Project (CNHPP) and Clin. Proteomic Tumor Anal. Consortium (CPTAC), large-scale mass spectrometry (MS) based proteomic profiling of different kinds of human tumor samples have provided huge amt. of valuable data for both basic and clin. researchers. Accurate prediction for tumor and non-tumor samples, as well as the tumor types has become a key step for biol. and medical research, such as biomarker discovery, diagnosis, and monitoring of diseases. The traditional MS-based classification strategy mainly depends on the identification and quantification results of MS data, which has some inherent limitations, such as the low identification rate of MS data. Here, a deep learning-based tumor classifier directly using MS raw data is proposed, which is independent of the identification and quantification results of MS data. The potential precursors with intensities and retention times from MS data as input is first detected and extd. Then, a deep learning-based classifier is trained, which can accurately distinguish between the tumor and non-tumor samples. Finally, it is demonstrated the deep learning-based classifier has a good performance compared with other machine learning methods and may help researchers find the potential biomarkers which are likely to be missed by the traditional strategy.
Cited By
Smart citations by scite.ai include citation statements extracted from the full text of the citing article. The number of the statements may be higher than the number of citations provided by ACS Publications if one paper cites another multiple times or lower if scite has not yet processed some of the citing articles.
This article is cited by 12 publications.
- Chenyu Yang, He Chen, Yun Wu, Xiangguo Shen, Hongchun Liu, Taotao Liu, Xizhong Shen, Ruyi Xue, Nianrong Sun, Chunhui Deng. Deep Learning-Enabled Rapid Metabolic Decoding of Small Extracellular Vesicles via Dual-Use Mass Spectroscopy Chip Array. Analytical Chemistry 2025, 97
(1)
, 271-280. https://doi.org/10.1021/acs.analchem.4c04106
- Xin-Yu Lu, Hao-Ping Wu, Hao Ma, Hui Li, Jia Li, Yan-Ti Liu, Zheng-Yan Pan, Yi Xie, Lei Wang, Bin Ren, Guo-Kun Liu. Deep Learning-Assisted Spectrum–Structure Correlation: State-of-the-Art and Perspectives. Analytical Chemistry 2024, 96
(20)
, 7959-7975. https://doi.org/10.1021/acs.analchem.4c01639
- Andre Nicolle, Sili Deng, Matthias Ihme, Nursulu Kuzhagaliyeva, Emad Al Ibrahim, Aamir Farooq. Mixtures Recomposition by Neural Nets: A Multidisciplinary Overview. Journal of Chemical Information and Modeling 2024, 64
(3)
, 597-620. https://doi.org/10.1021/acs.jcim.3c01633
- Yuxia Zheng, Limei Yin, Heera Jayan, Shuiquan Jiang, Hesham R. El-Seedi, Xiaobo Zou, Zhiming Guo. In situ self-cleaning PAN/Cu2O@Ag/Au@Ag flexible SERS sensor coupled with chemometrics for quantitative detection of thiram residues on apples. Food Chemistry 2025, 473 , 143032. https://doi.org/10.1016/j.foodchem.2025.143032
- Yijiang Liu, Feifan Zhang, Yifei Ge, Qiao Liu, Siyu He, Xiaotao Shen. Application of LLMs/Transformer-Based Models for Metabolite Annotation in Metabolomics. Health and Metabolism 2025, , 7. https://doi.org/10.53941/hm.2025.100014
- Blessina Preethi R, Berin Shalu S, Saranya Nair M, Vergin Raja Sarobin M. EE-SAMS: An adaptive, SNN based energy-efficient data aggregation framework for agrovoltaic monitoring systems. Results in Engineering 2025, 25 , 104053. https://doi.org/10.1016/j.rineng.2025.104053
- Tara Menon Pattilachan, Maria Christodoulou, Sharona Ross. Diagnosis to dissection: AI’s role in early detection and surgical intervention for gastric cancer. Journal of Robotic Surgery 2024, 18
(1)
https://doi.org/10.1007/s11701-024-02005-6
- Olga Kapustina, Polina Burmakina, Nina Gubina, Nikita Serov, Vladimir Vinogradov. User-friendly and industry-integrated AI for medicinal chemists and pharmaceuticals. Artificial Intelligence Chemistry 2024, 2
(2)
, 100072. https://doi.org/10.1016/j.aichem.2024.100072
- Huanhuan Li, Wei Sheng, Selorm Yao-Say Solomon Adade, Xorlali Nunekpeku, Quansheng Chen. Investigation of heat-induced pork batter quality detection and change mechanisms using Raman spectroscopy coupled with deep learning algorithms. Food Chemistry 2024, 461 , 140798. https://doi.org/10.1016/j.foodchem.2024.140798
- Muhammad Shalahuddin Al Ja’farawy, Vo Thi Nhat Linh, Jun-Yeong Yang, Chaewon Mun, Seunghun Lee, Sung-Gyu Park, In Woong Han, Samjin Choi, Min-Young Lee, Dong-Ho Kim, Ho Sang Jung. Whole urine-based multiple cancer diagnosis and metabolite profiling using 3D evolutionary gold nanoarchitecture combined with machine learning-assisted SERS. Sensors and Actuators B: Chemical 2024, 412 , 135828. https://doi.org/10.1016/j.snb.2024.135828
- Ushas A K, Dhanya M B, Jerrin Thomas Panachakel, Deepak S, Viji H, Smitha V Thampi, R Satheesh Thampi. Peak Pattern Classification in Mass Spectrometer Data using Deep Learning Techniques. 2024, 1-5. https://doi.org/10.1109/ICECCC61767.2024.10593905
- Ming Wei. Performance comparison of multiple neural networks for brain tumor classification. 2024, 12-20. https://doi.org/10.1109/ICBEA62825.2024.00012
Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.
Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.
The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.
Recommended Articles
Abstract
Figure 1
Figure 1. Accuracies of LSTM models on embedded (a) hepatic, (b) positive renal, and (c) negative renal datasets at three RT binning time (10, 5, and 3 or 2 s). (30) means one LSTM layer of 30 neurons and (30,30) means two LSTM layers of 30 neurons each.
Figure 2
Figure 2. Accuracies of LSTM models on raw (a) hepatic, (b) positive renal, and (c) negative renal datasets at three RT binning time (10, 5, and 3 or 2 s). (30) means one LSTM layer of 30 neurons and (30,30) means two LSTM layers of 30 neurons each.
Figure 3
Figure 3. Accuracies of hybrid CNN–LSTM models on raw (a) hepatic, (b) positive renal, and (c) negative renal datasets at three RT binning time (10, 5, and 3 or 2 s). For hepatic and positive renal data, M1: Conv(32,21)(16,11)-LSTM(30,30), M2: Conv(32,21)-LSTM(30,30) and M3: Conv(32,11)-LSTM(30,30). For negative renal data, M1: Conv(32,21)(16,11)-LSTM(60)(10 s)/LSTM(60,60)(5 s)/LSTM(120)(2 s), M2: Conv(32,21)-LSTM(60)(10 s)/LSTM(60,60)(5 s)/LSTM(120)(2 s) and M3: Conv(32,11)-LSTM(60)(10 s)/LSTM(60,60)(5 s)/LSTM(120)(2 s). Conv(32,21) means 32 kernels in the convolutional layer with size 21. LSTM(30,30) means two LSTM layers of 30 neurons each.
Figure 4
Figure 4. Early classification accuracies of our two-stage CNN–LSTM model on (a) hepatic, (b) positive renal, and (c) negative renal datasets at three RT binning time (10, 5, and 3 or 2 s).
References
This article references 30 other publications.
- 1Feider, C. L.; Krieger, A.; DeHoog, R. J.; Eberlin, L. S. Ambient Ionization Mass Spectrometry: Recent Developments and Applications. Anal. Chem. 2019, 91, 4266– 4290, DOI: 10.1021/acs.analchem.9b008071Ambient Ionization Mass Spectrometry: Recent Developments and ApplicationsFeider, Clara L.; Krieger, Anna; DeHoog, Rachel J.; Eberlin, Livia S.Analytical Chemistry (Washington, DC, United States) (2019), 91 (7), 4266-4290CODEN: ANCHAM; ISSN:0003-2700. (American Chemical Society)A review. The early 2000s brought a new age in the field of mass spectrometry (MS) with the introduction of ambient ionization MS techniques. As defined by 1 of its early visionaries, Prof. R. Graham Cooks, ambient ionization refers to the ionization of unprocessed or minimally modified samples in their native environment, and it typically refers to the ionization of condensed phase samples in air. Since its inception in 2004, many researchers and labs. have contributed with approaches for sampling and ionization at atm. conditions, greatly decreasing exptl. complexity and time required for mass spectrometry analyses. In the last 15 years, innovations in the field of ambient ionization MS have grown expansively, pushing these technologies far past their point of conception and integrating them into the broader scientific community in creative and stimulating ways.
- 2Berrar, D.; Dubitzky, W. Deep learning in bioinformatics and biomedicine. Briefings Bioinf. 2021, 22, 1513– 1514, DOI: 10.1093/bib/bbab0872Deep learning in bioinformatics and biomedicineBerrar Daniel; Dubitzky WernerBriefings in bioinformatics (2021), 22 (2), 1513-1514 ISSN:.There is no expanded citation for this reference.
- 3LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. nature 2015, 521, 436– 444, DOI: 10.1038/nature145393Deep learningLeCun, Yann; Bengio, Yoshua; Hinton, GeoffreyNature (London, United Kingdom) (2015), 521 (7553), 436-444CODEN: NATUAS; ISSN:0028-0836. (Nature Publishing Group)Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
- 4Chollet, F. Deep Learning with Python; Simon & Schuster, 2021.There is no corresponding record for this reference.
- 5Wen, B.; Zeng, W.-F.; Liao, Y.; Shi, Z.; Savage, S. R.; Jiang, W.; Zhang, B. Deep Learning in Proteomics. Proteomics 2020, 20, 1900335, DOI: 10.1002/pmic.2019003355Deep Learning in ProteomicsWen, Bo; Zeng, Wen-Feng; Liao, Yuxing; Shi, Zhiao; Savage, Sara R.; Jiang, Wen; Zhang, BingProteomics (2020), 20 (21-22), 1900335CODEN: PROTC7; ISSN:1615-9853. (Wiley-VCH Verlag GmbH & Co. KGaA)Proteomics, the study of all the proteins in biol. systems, is becoming a data-rich science. Protein sequences and structures are comprehensively catalogued in online databases. With recent advancements in tandem mass spectrometry (MS) technol., protein expression and post-translational modifications (PTMs) can be studied in a variety of biol. systems at the global scale. Sophisticated computational algorithms are needed to translate the vast amt. of data into novel biol. insights. Deep learning automatically exts. data representations at high levels of abstraction from data, and it thrives in data-rich scientific research domains. Here, a comprehensive overview of deep learning applications in proteomics, including retention time prediction, MS/MS spectrum prediction, de novo peptide sequencing, PTM prediction, major histocompatibility complex-peptide binding prediction, and protein structure prediction, is provided. Limitations and the future directions of deep learning in proteomics are also discussed. This review will provide readers an overview of deep learning and how it can be used to analyze proteomics data.
- 6Cortes, C.; Vapnik, V. Support-vector Networks, 3rd ed.; Springer, 1995; Vol. 20, pp 273– 297.There is no corresponding record for this reference.
- 7Breiman, L. Mach. Learn. 2001, 45, 5– 32, DOI: 10.1023/a:1010933404324There is no corresponding record for this reference.
- 8Wu, B.; Abbott, T.; Fishman, D.; McMurray, W.; Mor, G.; Stone, K.; Ward, D.; Williams, K.; Zhao, H. Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data. Bioinformatics 2003, 19, 1636– 1643, DOI: 10.1093/bioinformatics/btg2108Comparison of statistical methods for classification of ovarian cancer using mass spectrometry dataWu, Baolin; Abbott, Tom; Fishman, David; McMurray, Walter; Mor, Gil; Stone, Kathryn; Ward, David; Williams, Kenneth; Zhao, HongyuBioinformatics (2003), 19 (13), 1636-1643CODEN: BOINFP; ISSN:1367-4803. (Oxford University Press)Motivation: Novel methods, both mol. and statistical, are urgently needed to take advantage of recent advances in biotechnol. and the human genome project for disease diagnosis and prognosis. Mass spectrometry (MS) holds great promise for biomarker identification and genome-wide protein profiling. It has been demonstrated in the literature that biomarkers can be identified to distinguish normal individuals from cancer patients using MS data. Such progress is esp. exciting for the detection of early-stage ovarian cancer patients. Although various statistical methods have been utilized to identify biomarkers from MS data, there has been no systematic comparison among these approaches in their relative ability to analyze MS data. Results: We compare the performance of several classes of statistical methods for the classification of cancer based on MS spectra. These methods include: linear discriminant anal., quadratic discriminant anal., k-nearest neighbor classifier, bagging and boosting classification trees, support vector machine, and random forest (RF). The methods are applied to ovarian cancer and control serum samples from the National Ovarian Cancer Early Detection Program clinic at Northwestern University Hospital. We found that RF outperforms other methods in the anal. of MS data.
- 9Gredell, D. A.; Schroeder, A. R.; Belk, K. E.; Broeckling, C. D.; Heuberger, A. L.; Kim, S.-Y.; King, D. A.; Shackelford, S. D.; Sharp, J. L.; Wheeler, T. L. Comparison of Machine Learning Algorithms for Predictive Modeling of Beef Attributes Using Rapid Evaporative Ionization Mass Spectrometry (REIMS) Data. Sci. Rep. 2019, 9, 5721, DOI: 10.1038/s41598-019-40927-69Comparison of Machine Learning Algorithms for Predictive Modeling of Beef Attributes Using Rapid Evaporative Ionization Mass Spectrometry (REIMS) DataGredell Devin A; Belk Keith E; Woerner Dale R; Schroeder Amelia R; Broeckling Corey D; Heuberger Adam L; Prenni Jessica E; Kim Soo-Young; Sharp Julia L; King D Andy; Shackelford Steven D; Wheeler Tommy LScientific reports (2019), 9 (1), 5721 ISSN:.Ambient mass spectrometry is an analytical approach that enables ionization of molecules under open-air conditions with no sample preparation and very fast sampling times. Rapid evaporative ionization mass spectrometry (REIMS) is a relatively new type of ambient mass spectrometry that has demonstrated applications in both human health and food science. Here, we present an evaluation of REIMS as a tool to generate molecular scale information as an objective measure for the assessment of beef quality attributes. Eight different machine learning algorithms were compared to generate predictive models using REIMS data to classify beef quality attributes based on the United States Department of Agriculture (USDA) quality grade, production background, breed type and muscle tenderness. The results revealed that the optimal machine learning algorithm, as assessed by predictive accuracy, was different depending on the classification problem, suggesting that a "one size fits all" approach to developing predictive models from REIMS data is not appropriate. The highest performing models for each classification achieved prediction accuracies between 81.5-99%, indicating the potential of the approach to complement current methods for classifying quality attributes in beef.
- 10Datta, S.; DePadilla, L. M. Feature selection and machine learning with mass spectrometry data for distinguishing cancer and non-cancer samples. Stat. Methodol. 2006, 3, 79– 92, DOI: 10.1016/j.stamet.2005.09.006There is no corresponding record for this reference.
- 11Cannataro, M.; Guzzi, P. H.; Mazza, T.; Veltri, P. ACM SIGIR 2005, 112, 17– 26There is no corresponding record for this reference.
- 12Engel, J.; Gerretzen, J.; Szymańska, E.; Jansen, J. J.; Downey, G.; Blanchet, L.; Buydens, L. M. TrAC, Trends Anal. Chem. 2013, 50, 96– 106, DOI: 10.1016/j.trac.2013.04.01512Breaking with trends in pre-processing?Engel, Jasper; Gerretzen, Jan; Szymanska, Ewa; Jansen, Jeroen J.; Downey, Gerard; Blanchet, Lionel; Buydens, Lutgarde M. C.TrAC, Trends in Analytical Chemistry (2013), 50 (), 96-106CODEN: TTAEDJ; ISSN:0165-9936. (Elsevier B. V.)A review. Data pre-processing is an essential part of chemometric data anal., which aims to remove unwanted variation (such as instrumental artifacts) and thereby focusing on the variation of interest. The choice of an optimal pre-processing method or combination of methods may strongly influence the anal. results, but is far from straightforward, since it depends on the characteristics of the data set and the goal of data anal. This 1st crit. review is devoted to the selection procedure for appropriate pre-processing strategies. Breaking with current trends in pre-processing is essential, as all selection approaches have serious drawbacks and cannot be properly used.
- 13Seddiki, K.; Saudemont, P.; Precioso, F.; Ogrinc, N.; Wisztorski, M.; Salzet, M.; Fournier, I.; Droit, A. Nat. Commun. 2020, 11, 5595, DOI: 10.1038/s41467-020-19354-z13Cumulative learning enables convolutional neural network representations for small mass spectrometry data classificationSeddiki, Khawla; Saudemont, Philippe; Precioso, Frederic; Ogrinc, Nina; Wisztorski, Maxence; Salzet, Michel; Fournier, Isabelle; Droit, ArnaudNature Communications (2020), 11 (1), 5595CODEN: NCAOBW; ISSN:2041-1723. (Nature Research)Rapid and accurate clin. diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps to remove artifacts, making them unsuitable for rapid anal. Convolutional Neural Networks (CNNs) have been found to perform well under such circumstances since they can learn representations from raw data. However, their effectiveness decreases when the no. of available training samples is small, which is a common situation in medicine. In this work, we investigate transfer learning on 1D-CNNs, then we develop a cumulative learning method when transfer learning is not powerful enough. We propose to train the same model through several classification tasks over various small datasets to accumulate knowledge in the resulting representation. By using rat brain as the initial training dataset, a cumulative learning approach can have a classification accuracy exceeding 98% for 1D clin. MS-data. We show the use of cumulative learning using datasets generated in different biol. contexts, on different organisms, and acquired by different instruments. Here we show a promising strategy for improving MS data classification accuracy when only small nos. of samples are available.
- 14Gupta, A.; Gupta, H. P.; Biswas, B.; Dutta, T. IEEE Transactions on Artificial Intelligence; IEEE, 2020; Vol. 1, pp 47– 61.There is no corresponding record for this reference.
- 15Mori, U.; Mendiburu, A.; Dasgupta, S.; Lozano, J. A. Early Classification of Time Series by Simultaneously Optimizing the Accuracy and Earliness. IEEE Transact. Neural Networks Learn. Syst. 2018, 29, 4569– 4578, DOI: 10.1109/tnnls.2017.276493915Early Classification of Time Series by Simultaneously Optimizing the Accuracy and EarlinessMori Usue; Mendiburu Alexander; Dasgupta Sanjoy; Lozano Jose AIEEE transactions on neural networks and learning systems (2018), 29 (10), 4569-4578 ISSN:.The problem of early classification of time series appears naturally in contexts where the data, of temporal nature, are collected over time, and early class predictions are interesting or even required. The objective is to classify the incoming sequence as soon as possible, while maintaining suitable levels of accuracy in the predictions. Thus, we can say that the problem of early classification consists of optimizing two objectives simultaneously: accuracy and earliness. In this context, we present a method for early classification based on combining a set of probabilistic classifiers together with a stopping rule (SR). This SR will act as a trigger and will tell us when to output a prediction or when to wait for more data, and its main novelty lies in the fact that it is built by explicitly optimizing a cost function based on accuracy and earliness. We have selected a large set of benchmark data sets and four other state-of-the-art early classification methods, and we have evaluated and compared our framework obtaining superior results in terms of both earliness and accuracy.
- 16Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Proceedings of the IEEE Conference on Computer Vision and Pattern Recogni-tion; IEEE, 2015, pp 1– 9.There is no corresponding record for this reference.
- 17Papagiannopoulou, C.; Parchen, R.; Rubbens, P.; Waegeman, W. Fast Pathogen Identification Using Single-Cell Matrix-Assisted Laser Desorption/Ionization-Aerosol Time-of-Flight Mass Spectrometry Data and Deep Learning Methods. Anal. Chem. 2020, 92, 7523– 7531, DOI: 10.1021/acs.analchem.9b0580617Fast Pathogen Identification Using Single-Cell Matrix-Assisted Laser Desorption/Ionization-Aerosol Time-of-Flight Mass Spectrometry Data and Deep Learning MethodsPapagiannopoulou, Christina; Parchen, Rene; Rubbens, Peter; Waegeman, WillemAnalytical Chemistry (Washington, DC, United States) (2020), 92 (11), 7523-7531CODEN: ANCHAM; ISSN:0003-2700. (American Chemical Society)In diagnostics of infectious diseases, matrix-assisted laser desorption/ionization-time-of-flight mass spectrometry (MALDI-TOF MS) can be applied for the identification of pathogenic microorganisms. However, to achieve a trustworthy identification from MALDI-TOF MS data, a significant amt. of biomass should be considered. The bacterial load that potentially occurs in a sample is therefore routinely amplified by culturing, which is a time-consuming procedure. In this paper, the authors show that culturing can be avoided by conducting MALDI-TOF MS on individual bacterial cells. This results in a more rapid identification of species with an acceptable accuracy. The authors propose a deep learning architecture to analyze the data and compare its performance with traditional supervised machine learning algorithms. The authors illustrate the workflow on a large data set that contains bacterial species related to urinary tract infections. Overall the authors obtain accuracies up to 85% in discriminating five different species.
- 18Skarysz, A.; Salman, D.; Eddleston, M.; Sykora, M.; Hunsicker, E.; Nailon, W. H.; Darnley, K.; McLaren, D. B.; Thomas, C.; Soltoggio, A. 2020, arXiv preprint arXiv:2006.01772.There is no corresponding record for this reference.
- 19Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735– 1780, DOI: 10.1162/neco.1997.9.8.173519Long short-term memoryHochreiter S; Schmidhuber JNeural computation (1997), 9 (8), 1735-80 ISSN:0899-7667.Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.
- 20Van Houdt, G.; Mosquera, C.; Nápoles, G. A review on the long short-term memory model. Artif. Intell. Rev. 2020, 53, 5929– 5955, DOI: 10.1007/s10462-020-09838-1There is no corresponding record for this reference.
- 21Shickel, B.; Tighe, P. J.; Bihorac, A.; Rashidi, P. Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis. IEEE J. Biomed. Health Inf. 2018, 22, 1589– 1604, DOI: 10.1109/jbhi.2017.276706321Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) AnalysisShickel Benjamin; Tighe Patrick James; Bihorac Azra; Rashidi ParisaIEEE journal of biomedical and health informatics (2018), 22 (5), 1589-1604 ISSN:.The past decade has seen an explosion in the amount of digital information stored in electronic health records (EHRs). While primarily designed for archiving patient information and performing administrative healthcare tasks like billing, many researchers have found secondary use of these records for various clinical informatics applications. Over the same period, the machine learning community has seen widespread advances in the field of deep learning. In this review, we survey the current research on applying deep learning to clinical tasks based on EHR data, where we find a variety of deep learning techniques and frameworks being applied to several types of clinical applications including information extraction, representation learning, outcome prediction, phenotyping, and deidentification. We identify several limitations of current research involving topics such as model interpretability, data heterogeneity, and lack of universal benchmarks. We conclude by summarizing the state of the field and identifying avenues of future deep EHR research.
- 22Nguyen, P.; Tran, T.; Wickramasinghe, N.; Venkatesh, S. Deepr: A Convolutional Net for Med-ical Records; arXiv. org, 2016.There is no corresponding record for this reference.
- 23Zanjani, F. G.; Panteli, A.; Zinger, S.; van der Sommen, F.; Tan, T.; Balluff, B.; Vos, D. N.; Ellis, S. R.; Heeren, R. M.; Lucas, M.; IEEE 16th International Symposium on Biomedical Imaging; ISBI 2019, 2019, pp 674– 678.There is no corresponding record for this reference.
- 24Liu, J.; Zhang, J.; Luo, Y.; Yang, S.; Wang, J.; Fu, Q. Mass Spectral Substance Detections Using Long Short-Term Memory Networks. IEEE Access 2019, 7, 10734– 10744, DOI: 10.1109/access.2019.2891548There is no corresponding record for this reference.
- 25Zhang, J.; Liu, J.; Luo, Y.; Fu, Q.; Bi, J.; Qiu, S.; Cao, Y.; Ding, X. IEEE 17th International Conference on Communication Technology; ICCT, 2017, pp 1994– 1997.There is no corresponding record for this reference.
- 26Jiang, Y.; Sun, A.; Zhao, Y.; Ying, W.; Sun, H.; Yang, X.; Xing, B.; Sun, W.; Ren, L.; Hu, B. Proteomics identifies new therapeutic targets of early-stage hepatocellular carcinoma. Nature 2019, 567, 257– 261, DOI: 10.1038/s41586-019-0987-826Proteomics identifies new therapeutic targets of early-stage hepatocellular carcinomaJiang, Ying; Sun, Aihua; Zhao, Yang; Ying, Wantao; Sun, Huichuan; Yang, Xinrong; Xing, Baocai; Sun, Wei; Ren, Liangliang; Hu, Bo; Li, Chaoying; Zhang, Li; Qin, Guangrong; Zhang, Menghuan; Chen, Ning; Zhang, Manli; Huang, Yin; Zhou, Jinan; Zhao, Yan; Liu, Mingwei; Zhu, Xiaodong; Qiu, Yang; Sun, Yanjun; Huang, Cheng; Yan, Meng; Wang, Mingchao; Liu, Wei; Tian, Fang; Xu, Huali; Zhou, Jian; Wu, Zhenyu; Shi, Tieliu; Zhu, Weimin; Qin, Jun; Xie, Lu; Fan, Jia; Qian, Xiaohong; He, Fuchu; Chinese Human Proteome Project ConsortiumNature (London, United Kingdom) (2019), 567 (7747), 257-261CODEN: NATUAS; ISSN:0028-0836. (Nature Research)Hepatocellular carcinoma is the third leading cause of deaths from cancer worldwide. Infection with the hepatitis B virus is one of the leading risk factors for developing hepatocellular carcinoma, particularly in East Asia. Although surgical treatment may be effective in the early stages, the five-year overall rate of survival after developing this cancer is only 50-70%. Here, using proteomic and phospho-proteomic profiling, we characterize 110 paired tumor and non-tumor tissues of clin. early-stage hepatocellular carcinoma related to hepatitis B virus infection. Our quant. proteomic data highlight heterogeneity in early-stage hepatocellular carcinoma: we used this to stratify the cohort into the subtypes S-I, S-II and S-III, each of which has a different clin. outcome. S-III, which is characterized by disrupted cholesterol homeostasis, is assocd. with the lowest overall rate of survival and the greatest risk of a poor prognosis after first-line surgery. The knockdown of sterol O-acyltransferase 1 (SOAT1) - high expression of which is a signature specific to the S-III subtype - alters the distribution of cellular cholesterol, and effectively suppresses the proliferation and migration of hepatocellular carcinoma. Finally, on the basis of a patient-derived tumor xenograft mouse model of hepatocellular carcinoma, we found that treatment with avasimibe, an inhibitor of SOAT1, markedly reduced the size of tumors that had high levels of SOAT1 expression. The proteomic stratification of early-stage hepatocellular carcinoma presented in this study provides insight into the tumor biol. of this cancer, and suggests opportunities for personalized therapies that target it.
- 27Bifarin, O. O.; Gaul, D. A.; Sah, S.; Arnold, R. S.; Ogan, K.; Master, V. A.; Roberts, D. L.; Bergquist, S. H.; Petros, J. A.; Fernandez, F. M. Machine Learning-Enabled Renal Cell Carcinoma Status Prediction Using Multiplatform Urine-Based Metabolomics. J. Proteome Res. 2021, 20, 3629– 3641, DOI: 10.1021/acs.jproteome.1c0021327Machine Learning-Enabled Renal Cell Carcinoma Status Prediction Using Multiplatform Urine-Based MetabolomicsBifarin, Olatomiwa O.; Gaul, David A.; Sah, Samyukta; Arnold, Rebecca S.; Ogan, Kenneth; Master, Viraj A.; Roberts, David L.; Bergquist, Sharon H.; Petros, John A.; Fernandez, Facundo M.; Edison, Arthur S.Journal of Proteome Research (2021), 20 (7), 3629-3641CODEN: JPROBS; ISSN:1535-3893. (American Chemical Society)Renal cell carcinoma (RCC) is diagnosed through expensive cross-sectional imaging, frequently followed by renal mass biopsy, which is not only invasive but also prone to sampling errors. Hence, there is a crit. need for a noninvasive diagnostic assay. RCC exhibits altered cellular metab. combined with the close proximity of the tumor(s) to the urine in the kidney, suggesting that urine metabolomic profiling is an excellent choice for assay development. Here, we acquired liq. chromatog.-mass spectrometry (LC-MS) and NMR (NMR) data followed by the use of machine learning (ML) to discover candidate metabolomic panels for RCC. The study cohort consisted of 105 RCC patients and 179 controls sepd. into two subcohorts: the model cohort and the test cohort. Univariate, wrapper, and embedded methods were used to select discriminatory features using the model cohort. Three ML techniques, each with different induction biases, were used for training and hyperparameter tuning. Assessment of RCC status prediction was evaluated using the test cohort with the selected biomarkers and the optimally tuned ML algorithms. A seven-metabolite panel predicted RCC in the test cohort with 88% accuracy, 94% sensitivity, 85% specificity, and 0.98 AUC. Metabolomics Workbench Study IDs are ST001705 and ST001706.
- 28Su, Z.; Xie, H.; Han, L. Multi-Factor RFG-LSTM Algorithm for Stock Sequence Predicting. Comput. Econ. 2021, 57, 1041– 1058, DOI: 10.1007/s10614-020-10008-2There is no corresponding record for this reference.
- 29Ma, F.; Zhang, J.; Chen, W.; Liang, W.; Yang, W. Discrete Dynamics in Nature and Society; Hindawi, 2020.There is no corresponding record for this reference.
- 30Dong, H.; Liu, Y.; Zeng, W.-F.; Shu, K.; Zhu, Y.; Chang, C. A Deep Learning-Based Tumor Classifier Directly Using MS Raw Data. Proteomics 2020, 20, 1900344, DOI: 10.1002/pmic.20190034430A Deep Learning-Based Tumor Classifier Directly Using MS Raw DataDong, Hao; Liu, Yi; Zeng, Wen-Feng; Shu, Kunxian; Zhu, Yunping; Chang, ChengProteomics (2020), 20 (21-22), 1900344CODEN: PROTC7; ISSN:1615-9853. (Wiley-VCH Verlag GmbH & Co. KGaA)Since the launch of Chinese Human Proteome Project (CNHPP) and Clin. Proteomic Tumor Anal. Consortium (CPTAC), large-scale mass spectrometry (MS) based proteomic profiling of different kinds of human tumor samples have provided huge amt. of valuable data for both basic and clin. researchers. Accurate prediction for tumor and non-tumor samples, as well as the tumor types has become a key step for biol. and medical research, such as biomarker discovery, diagnosis, and monitoring of diseases. The traditional MS-based classification strategy mainly depends on the identification and quantification results of MS data, which has some inherent limitations, such as the low identification rate of MS data. Here, a deep learning-based tumor classifier directly using MS raw data is proposed, which is independent of the identification and quantification results of MS data. The potential precursors with intensities and retention times from MS data as input is first detected and extd. Then, a deep learning-based classifier is trained, which can accurately distinguish between the tumor and non-tumor samples. Finally, it is demonstrated the deep learning-based classifier has a good performance compared with other machine learning methods and may help researchers find the potential biomarkers which are likely to be missed by the traditional strategy.
Supporting Information
Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.3c00613.
Additional results and figures including matrix construction, our two-stage CNN–LSTM classification model, hybrid CNN–LSTM models architectures, 1D-CNN embedding models, classification performances and early classification performances with our two-stage CNN-LSTM model (PDF)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.