Autonomous Chemical Experiments: Challenges and Perspectives on Establishing a Self-Driving LabClick to copy article linkArticle link copied!
- Martin SeifridMartin SeifridDepartment of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaMore by Martin Seifrid
- Robert PolliceRobert PolliceDepartment of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaMore by Robert Pollice
- Andrés Aguilar-GrandaAndrés Aguilar-GrandaDepartment of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaMore by Andrés Aguilar-Granda
- Zamyla Morgan ChanZamyla Morgan ChanDepartment of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaAcceleration Consortium, University of Toronto, Toronto, Ontario M5S 3H6, CanadaMore by Zamyla Morgan Chan
- Kazuhiro HottaKazuhiro HottaDepartment of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaScience & Innovation Center, Mitsubishi Chemical Corporation, 1000 Kamoshidacho, Aoba, Yokohama, Kanagawa 227-8502, JapanMore by Kazuhiro Hotta
- Cher Tian SerCher Tian SerDepartment of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaMore by Cher Tian Ser
- Jenya VestfridJenya VestfridDepartment of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaMore by Jenya Vestfrid
- Tony C. WuTony C. WuDepartment of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaMore by Tony C. Wu
- Alán Aspuru-Guzik*Alán Aspuru-Guzik*Email: [email protected]Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, CanadaDepartment of Computer Science, University of Toronto, Toronto, Ontario M5S 3H6, CanadaDepartment of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, CanadaDepartment of Materials Science, University of Toronto, Toronto, Ontario M5S 3E4, CanadaVector Institute for Artificial Intelligence, Toronto, Ontario M5S 1M1, CanadaLebovic Fellow, Canadian Institute for Advanced Research, Toronto, Ontario M5S 1M1, CanadaMore by Alán Aspuru-Guzik
Abstract
Conspectus
We must accelerate the pace at which we make technological advancements to address climate change and disease risks worldwide. This swifter pace of discovery requires faster research and development cycles enabled by better integration between hypothesis generation, design, experimentation, and data analysis. Typical research cycles take months to years. However, data-driven automated laboratories, or self-driving laboratories, can significantly accelerate molecular and materials discovery. Recently, substantial advancements have been made in the areas of machine learning and optimization algorithms that have allowed researchers to extract valuable knowledge from multidimensional data sets. Machine learning models can be trained on large data sets from the literature or databases, but their performance can often be hampered by a lack of negative results or metadata. In contrast, data generated by self-driving laboratories can be information-rich, containing precise details of the experimental conditions and metadata. Consequently, much larger amounts of high-quality data are gathered in self-driving laboratories. When placed in open repositories, this data can be used by the research community to reproduce experiments, for more in-depth analysis, or as the basis for further investigation. Accordingly, high-quality open data sets will increase the accessibility and reproducibility of science, which is sorely needed.
In this Account, we describe our efforts to build a self-driving lab for the development of a new class of materials: organic semiconductor lasers (OSLs). Since they have only recently been demonstrated, little is known about the molecular and material design rules for thin-film, electrically-pumped OSL devices as compared to other technologies such as organic light-emitting diodes or organic photovoltaics. To realize high-performing OSL materials, we are developing a flexible system for automated synthesis via iterative Suzuki–Miyaura cross-coupling reactions. This automated synthesis platform is directly coupled to the analysis and purification capabilities. Subsequently, the molecules of interest can be transferred to an optical characterization setup. We are currently limited to optical measurements of the OSL molecules in solution. However, material properties are ultimately most important in the solid state (e.g., as a thin-film device). To that end and for a different scientific goal, we are developing a self-driving lab for inorganic thin-film materials focused on the oxygen evolution reaction.
While the future of self-driving laboratories is very promising, numerous challenges still need to be overcome. These challenges can be split into cognition and motor function. Generally, the cognitive challenges are related to optimization with constraints or unexpected outcomes for which general algorithmic solutions have yet to be developed. A more practical challenge that could be resolved in the near future is that of software control and integration because few instrument manufacturers design their products with self-driving laboratories in mind. Challenges in motor function are largely related to handling heterogeneous systems, such as dispensing solids or performing extractions. As a result, it is critical to understand that adapting experimental procedures that were designed for human experimenters is not as simple as transferring those same actions to an automated system, and there may be more efficient ways to achieve the same goal in an automated fashion. Accordingly, for self-driving laboratories, we need to carefully rethink the translation of manual experimental protocols.
This publication is licensed under
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share(copy and redistribute) this article in any medium or format within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
Non-Commercial (NC): Only non-commercial uses of the work are permitted.
No Derivatives (ND): Derivative works may be created for non-commercial purposes, but sharing is prohibited.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
Key references
Roch, L. M.; Häse, F.; Kreisbeck, C.; Tamayo-Mendoza, T.; Yunker, L. P. E.; Hein, J. E.; Aspuru-Guzik, A. ChemOS: An Orchestration Software to Democratize Autonomous Discovery. PLoS One 2020, 15 (4), e0229862. (1) The “ChemOS” orchestration software for autonomous laboratories, featuring machine learning algorithms, online analysis, and interaction with researchers, automated instrumentation, and databases, is introduced and used to optimize color, robotic HPLC sampling and a cocktail recipe.
Langner, S.; Häse, F.; Perea, J. D.; Stubhan, T.; Hauch, J.; Roch, L. M.; Heumueller, T.; Aspuru-Guzik, A.; Brabec, C. J. Beyond Ternary OPV: High-Throughput Experimentation and Self-Driving Laboratories Optimize Multicomponent Systems. Adv. Mater. 2020, 32 (14, 1907801. (2) The “Phoenics” algorithm guides an automated thin-film fabrication and characterization platform to optimize the photostability of a multicomponent blend of organic photovoltaic materials.
Christensen, M.; Yunker, L. P. E.; Adedeji, F.; Häse, F.; Roch, L. M.; Gensch, T.; dos Passos Gomes, G.; Zepel, T.; Sigman, M. S.; Aspuru-Guzik, A.; Hein, J. E. Data-Science Driven Autonomous Process Optimization. Commun. Chem. 2021, 4 (1), 112. (3) Bayesian optimization of discrete and continuous variables enables automated synthesis and characterization to find optimal ligands and conditions for a stereoselective Suzuki–Miyaura coupling.
Seifrid, M.; Hickman, R. J.; Aguilar-Granda, A.; Lavigne, C.; Vestfrid, J.; Wu, T. C.; Gaudin, T.; Hopkins, E. J.; Aspuru-Guzik, A. Routescore: Punching the Ticket to More Efficient Materials Development. ACS Cent. Sci. 2022, 8 (1), 122–131. (4) Quantifying the costs of combined manual and automated synthetic routes enables the inverse design of materials that are cheaper to synthesize without sacrificing important properties.
1. Introduction
Figure 1
Figure 1. Diagram of the design–make–test–analyze cycle in our self-driving laboratory, showing the process for the development of new organic semiconductor laser materials.
2. Why Build a Self-Driving Lab?
3. Our Approach
3.1. Experiment Planning and Digital Infrastructure
Figure 2
Figure 2. Integration of ChemOS and its most important algorithms into the process or material optimization workflow.
3.2. Synthesis
Figure 3
Figure 3. Top: Photo of the Chemspeed deck. The inset shows the top of the ISYNTH with one of the drawers (vertical row of wells) highlighted. Bottom: (Right) Icons for liquid dispensing, solid dispensing, and solid-phase extraction actions. (Left) Diagram of the iSMcc process along with icons indicating where different capabilities are used. Cross-coupling (C): X-Ar-BMIDA (1 equiv), Ar–B(OH)2 (3 equiv), Pd-XPhos G2 (5 mol %), K3PO4 (2 equiv), THF, 16 h, 65 °C. Purification (P): precipitation from hexanes/THF 3:1. Deprotection (D): aqueous NaOH (1 M), 20 min, room temperature.
3.3. Analysis and Purification
Figure 4
Figure 4. Schematic diagram of our analysis, purification, and optical characterization setup. The gray box is a schematic diagram of how a specific HPLC fraction is selected for further evaluation and how its properties are measured. Absorption measurements are carried out in the “absorption” flow cell. Photoluminescence (PL), PL quantum yield (PLQY), and photodegradation rate measurements are carried out in the “emission” flow cell. PL lifetime is measured in the “PL lifetime” flow cell. Gray polygons represent valves with the number of ports corresponding to the number of sides, and arrows represent the directions of sample transport.
3.4. Characterization
3.5. Solid-State Materials
Figure 5
Figure 5. Autonomous robotic workflows can accelerate the discovery of solid-state inorganic materials using proxy experiments. These can then be used in conjunction with more accurate full experiments to perform multifidelity optimization of the inorganic materials.
4. Challenges and Lessons Learned
4.1. Replacing Cognitive Processes
4.2. Replacing or Replicating Motor Function
5. Future Perspectives
Biographies
Martin Seifrid
Martin Seifrid received a Ph.D. in Chemistry from the University of California, Santa Barbara, where he studied the relationship between molecular structure, processing, and solid-state structure in organic semiconducting materials under the supervision of Professor Guillermo Bazan. He is currently a postdoctoral fellow in the group of Professor Alán Aspuru-Guzik at the University of Toronto, where he is building a self-driving lab for the design of organic semiconductor laser materials.
Robert Pollice
Robert Pollice received his Ph.D. in Chemistry in 2019 under the supervision of Professor Peter Chen at ETH Zurich, where he investigated London dispersion in molecular systems. Subsequently, he joined the group of Professor Alán Aspuru-Guzik as an SNSF postdoctoral fellow at the University of Toronto to work on the inverse design of organic electronic materials and molecular catalysts.
Andrés Aguilar-Granda
Andrés Aguilar-Granda studied Industrial Chemistry at the University of Veracruz and received his Ph.D. from the Institute of Chemistry, Universidad Nacional Autónoma de México (UNAM). Afterward, he completed a postdoctoral fellowship at the University of Toronto (in the group of Prof. Alán Aspuru-Guzik). In April 2021, he began his independent career as an associate professor in the Department of Organic Chemistry at the School of Chemistry at UNAM. His research interests are in the areas of automated organic synthesis for functional materials and the digitization of organic chemistry.
Zamyla Morgan Chan
Zamyla Morgan Chan is the Associate Director of the Acceleration Consortium at the University of Toronto. She received a Ph.D. in Chemistry from Harvard University and joined the Vector Institute for Artificial Intelligence as a Postdoctoral Research Fellow. Her research seeks to facilitate accelerated discovery of scalable and stable materials by leveraging robotics, machine learning, and fundamental science for inverse design.
Kazuhiro Hotta
Kazuhiro Hotta received a Ph.D. in Chemistry from Tohoku University, where he studied optical biosensors based on nanoporous materials as a sensing platform. He then joined Mitsubishi Chemical Corporation where he is currently a senior scientist working on laboratory automation and the development of a self-driving laboratory.
Cher Tian Ser
Cher Tian Ser is a Ph.D. student at the University of Toronto supervised by Prof. Alán Aspuru-Guzik. His research interests involve the application of machine learning methods for the discovery of catalytic and energy materials.
Jenya Vestfrid
Jenya Vestfrid received her Ph.D. in Chemistry from Technion – Israel Institute of Technology. Afterward, she completed postdoctoral fellowships in the departments of Chemical Engineering & Applied Chemistry and Chemistry at the University of Toronto. Currently, she is an experienced researcher at StoreDot, a motor vehicle parts manufacturing company, developing extreme fast charging batteries for electric vehicles.
Tony C. Wu
Tony C. Wu is a postdoctoral fellow at the University of Toronto and the Vector Institute. With a cross-disciplinary background, his current research passion is in the development of autonomous chemistry and machine learning models for accelerated materials development. He received his B.Sc. in both Electrical Engineering and Physics from the National Taiwan University in 2011 and his M.A. and Ph.D. in Electrical Engineering from the Massachusetts Institute of Technology in 2018. During his Ph.D., his research focused on optoelectronics and excitonics engineering with applications in OLEDs and organic solar cells.
Alán Aspuru-Guzik
Alán Aspuru-Guzik is a Professor of Chemistry and Computer Science at the University of Toronto, a Canada 150 Research Chair in Theoretical Chemistry, a Canada CIFAR AI Chair at the Vector Institute, a CIFAR Lebovic Fellow in the Biologically Inspired Solar Energy program, and a Google Industrial Research Chair in Quantum Computing. He received a Ph.D. in Chemistry from the University of California, Berkeley, where he was also a postdoctoral fellow. He began his independent career at Harvard University and became a Full Professor before moving to the University of Toronto. His research interests span chemistry, automation, machine learning, and quantum information.
Acknowledgments
We thank all our co-workers and collaborators who contributed to the projects highlighted in this Account. R.P. acknowledges funding through a Postdoc.Mobility fellowship by the Swiss National Science Foundation (SNSF; Project No. 191127). T.C.W. acknowledges funding through University of Toronto Arts & Science Postdoctoral Fellowship. We acknowledge the Defense Advanced Research Projects Agency (DARPA) under the Accelerated Molecular Discovery Program under Cooperative Agreement No. HR00111920027 dated August 1, 2019. The content of the information presented in this work does not necessarily reflect the position or the policy of the Government. We also acknowledge funding from the National Research Council Canada (MCF-106) as part of the Materials for Clean Fuels Challenge program. A. Aspuru-Guzik thanks Anders G. Frøseth for his generous support. A. Aspuru-Guzik also acknowledges the generous support of Natural Resources Canada and the Canada 150 Research Chairs program. We also acknowledge the Department of Navy awards (N00014-19-1-2134 and N00014-21-1-2137) issued by the Office of Naval Research. The United States Government has a royalty-free license throughout the world in all copyrightable material contained herein. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the Office of Naval Research.
References
This article references 80 other publications.
- 1Roch, L. M.; Häse, F.; Kreisbeck, C.; Tamayo-Mendoza, T.; Yunker, L. P. E.; Hein, J. E.; Aspuru-Guzik, A. ChemOS: An Orchestration Software to Democratize Autonomous Discovery. PLoS One 2020, 15, e0229862, DOI: 10.1371/journal.pone.0229862Google Scholar1ChemOS: An orchestration software to democratize autonomous discoveryRoch, Loic M.; Hase, Florian; Kreisbeck, Christoph; Tamayo-Mendoza, Teresa; Yunker, Lars P. E.; Hein, Jason E.; Aspuru-Guzik, AlanPLoS One (2020), 15 (4), e0229862CODEN: POLNCL; ISSN:1932-6203. (Public Library of Science)The current Edisonian approach to discovery requires up to two decades of fundamental and applied research for materials technologies to reach the market. Such a slow and capital-intensive turnaround calls for disruptive strategies to expedite innovation. Self-driving labs. have the potential to provide the means to revolutionize experimentation by empowering automation with artificial intelligence to enable autonomous discovery. However, the lack of adequate software solns. significantly impedes the development of self-driving labs. In this paper, we make progress towards addressing this challenge, and we propose and develop an implementation of ChemOS; a portable, modular and versatile software package which supplies the structured layers necessary for the deployment and operation of self-driving labs. ChemOS facilitates the integration of automated equipment, and it enables remote control of automated labs. ChemOS can operate at various degrees of autonomy; from fully unsupervised experimentation to actively including inputs and feedbacks from researchers into the experimentation loop. The flexibility of ChemOS provides a broad range of functionality as demonstrated on five applications, which were executed on different automated equipment, highlighting various aspects of the software package.
- 2Langner, S.; Häse, F.; Perea, J. D.; Stubhan, T.; Hauch, J.; Roch, L. M.; Heumueller, T.; Aspuru-Guzik, A.; Brabec, C. J. Beyond Ternary OPV: High-Throughput Experimentation and Self-Driving Laboratories Optimize Multicomponent Systems. Adv. Mater. 2020, 32, 1907801, DOI: 10.1002/adma.201907801Google Scholar2Beyond Ternary OPV: High-Throughput Experimentation and Self-Driving Laboratories Optimize Multicomponent SystemsLangner, Stefan; Haese, Florian; Perea, Jose Dario; Stubhan, Tobias; Hauch, Jens; Roch, Loic M.; Heumueller, Thomas; Aspuru-Guzik, Alan; Brabec, Christoph J.Advanced Materials (Weinheim, Germany) (2020), 32 (14), 1907801CODEN: ADVMEW; ISSN:0935-9648. (Wiley-VCH Verlag GmbH & Co. KGaA)Fundamental advances to increase the efficiency as well as stability of org. photovoltaics (OPVs) are achieved by designing ternary blends, which represents a clear trend toward multicomponent active layer blends. The development of high-throughput and autonomous experimentation methods is reported for the effective optimization of multicomponent polymer blends for OPVs. A method for automated film formation enabling the fabrication of ≤ 6048 films per day is introduced. Equipping this automated experimentation platform with a Bayesian optimization, a self-driving lab. is constructed that autonomously evaluates measurements to design and execute the next expts. To demonstrate the potential of these methods, a 4D parameter space of quaternary OPV blends is mapped and optimized for photostability. While with conventional approaches, roughly 100 mg of material would be necessary, the robot-based platform can screen 2000 combinations with < 10 mg, and machine-learning-enabled autonomous experimentation identifies stable compns. with < 1 mg.
- 3Christensen, M.; Yunker, L. P. E.; Adedeji, F.; Häse, F.; Roch, L. M.; Gensch, T.; dos Passos Gomes, G.; Zepel, T.; Sigman, M. S.; Aspuru-Guzik, A.; Hein, J. E. Data-Science Driven Autonomous Process Optimization. Commun. Chem. 2021, 4, 112, DOI: 10.1038/s42004-021-00550-xGoogle Scholar3Data-science driven autonomous process optimizationChristensen Melodie; Yunker Lars P E; Zepel Tara; Hein Jason E; Christensen Melodie; Adedeji Folarin; Hase Florian; Roch Loic M; Aspuru-Guzik Alan; Hase Florian; Roch Loic M; Dos Passos Gomes Gabriel; Aspuru-Guzik Alan; Hase Florian; Roch Loic M; Dos Passos Gomes Gabriel; Aspuru-Guzik Alan; Hase Florian; Dos Passos Gomes Gabriel; Aspuru-Guzik Alan; Hase Florian; Roch Loic M; Gensch Tobias; Sigman Matthew S; Aspuru-Guzik AlanCommunications chemistry (2021), 4 (1), 112 ISSN:.Autonomous process optimization involves the human intervention-free exploration of a range process parameters to improve responses such as product yield and selectivity. Utilizing off-the-shelf components, we develop a closed-loop system for carrying out parallel autonomous process optimization experiments in batch. Upon implementation of our system in the optimization of a stereoselective Suzuki-Miyaura coupling, we find that the definition of a set of meaningful, broad, and unbiased process parameters is the most critical aspect of successful optimization. Importantly, we discern that phosphine ligand, a categorical parameter, is vital to determination of the reaction outcome. To date, categorical parameter selection has relied on chemical intuition, potentially introducing bias into the experimental design. In seeking a systematic method for selecting a diverse set of phosphine ligands, we develop a strategy that leverages computed molecular feature clustering. The resulting optimization uncovers conditions to selectively access the desired product isomer in high yield.
- 4Seifrid, M.; Hickman, R. J.; Aguilar-Granda, A.; Lavigne, C.; Vestfrid, J.; Wu, T. C.; Gaudin, T.; Hopkins, E. J.; Aspuru-Guzik, A. Routescore: Punching the Ticket to More Efficient Materials Development. ACS Cent. Sci. 2022, 8, 122– 131, DOI: 10.1021/acscentsci.1c01002Google Scholar4Routescore: Punching the Ticket to More Efficient Materials DevelopmentSeifrid, Martin; Hickman, Riley J.; Aguilar-Granda, Andres; Lavigne, Cyrille; Vestfrid, Jenya; Wu, Tony C.; Gaudin, Theophile; Hopkins, Emily J.; Aspuru-Guzik, AlanACS Central Science (2022), 8 (1), 122-131CODEN: ACSCII; ISSN:2374-7951. (American Chemical Society)Self-driving labs., in the form of automated experimentation platforms guided by machine learning algorithms have emerged as a potential soln. to the need for accelerated science. While new tools for automated anal. and characterization are being developed at a steady rate, automated synthesis remains the bottleneck in the chem. space accessible to self-driving labs. Combining automated and manual synthesis efforts immediately significantly expands the explorable chem. space. To effectively direct the different capabilities of automated (higher throughput and less labor) and manual synthesis (greater chem. versatility), we describe a protocol, the RouteScore, that quantifies the cost of combined synthetic routes. In this work, the RouteScore is used to det. the most efficient synthetic route to a well-known pharmaceutical (structure-oriented optimization), and to simulate a self-driving lab that finds the most easily synthesizable org. laser mol. with specific photophys. properties from a space of ∼3500 possible mols. (property-oriented optimization). These two examples demonstrate the power and flexibility of our approach in mixed synthetic planning and optimization, and esp. in down-selecting promising candidates from a large chem. space via a priori estn. of the synthetic costs.
- 5Häse, F.; Roch, L. M.; Aspuru-Guzik, A. Next-Generation Experimentation with Self-Driving Laboratories. Trends Chem. 2019, 1, 282– 291, DOI: 10.1016/j.trechm.2019.02.007Google Scholar5Next-Generation Experimentation with Self-Driving LaboratoriesHase, Florian; Roch, Loic M.; Aspuru-Guzik, AlanTrends in Chemistry (2019), 1 (3), 282-291CODEN: TCRHBQ; ISSN:2589-5974. (Cell Press)A review. The ever-growing demand for advanced functional materials requires disruption of conventional approaches to experimentation and acceleration of the discovery process. State-of-the-art approaches to scientific discovery are inherently slow, capital intensive, and have arguably reached a plateau. Significant advances are possible when rethinking and redesigning the traditional experimentation process. Self-driving labs. promise to substantially accelerate the discovery process by augmenting automated experimentation platforms with artificial intelligence (AI). AI methods actively search for promising exptl. procedures by hypothesizing about their outcomes based on previous expts. This feedback loop is crucial to reduce the no. of expts. needed for discovery. Supplying automated platforms with AI enables self-driving labs. to fully embrace the vision of autonomous experimentation.
- 6MacLeod, B. P.; Parlane, F. G. L.; Rupnow, C. C.; Dettelbach, K. E.; Elliott, M. S.; Morrissey, T. D.; Haley, T. H.; Proskurin, O.; Rooney, M. B.; Taherimakhsousi, N.; Dvorak, D. J.; Chiu, H. N.; Waizenegger, C. E. B.; Ocean, K.; Mokhtari, M.; Berlinguette, C. P. A Self-Driving Laboratory Advances the Pareto Front for Material Properties. Nat. Commun. 2022, 13, 995, DOI: 10.1038/s41467-022-28580-6Google Scholar6A self-driving laboratory advances the Pareto front for material propertiesMacLeod, Benjamin P.; Parlane, Fraser G. L.; Rupnow, Connor C.; Dettelbach, Kevan E.; Elliott, Michael S.; Morrissey, Thomas D.; Haley, Ted H.; Proskurin, Oleksii; Rooney, Michael B.; Taherimakhsousi, Nina; Dvorak, David J.; Chiu, Hsi N.; Waizenegger, Christopher E. B.; Ocean, Karry; Mokhtari, Mehrdad; Berlinguette, Curtis P.Nature Communications (2022), 13 (1), 995CODEN: NCAOBW; ISSN:2041-1723. (Nature Portfolio)Useful materials must satisfy multiple objectives, where the optimization of one objective is often at the expense of another. The Pareto front reports the optimal trade-offs between these conflicting objectives. Here we use a self-driving lab., Ada, to define the Pareto front of conductivities and processing temps. for palladium films formed by combustion synthesis. Ada discovers new synthesis conditions that yield metallic films at lower processing temps. (below 200°C) relative to the prior art for this technique (250°C). This temp. difference makes possible the coating of different commodity plastic materials (e.g., Nafion, polyethersulfone). These combustion synthesis conditions enable us to to spray coat uniform palladium films with moderate cond. (1.1 x 105 S m-1) at 191°C. Spray coating at 226°C yields films with conductivities (2.0 x 106 S m-1) comparable to those of sputtered films (2.0 to 5.8 x 106 S m-1). This work shows how a self-driving laboratoy can discover materials that provide optimal trade-offs between conflicting objectives.
- 7Rooney, M. B.; MacLeod, B. P.; Oldford, R.; Thompson, Z. J.; White, K. L.; Tungjunyatham, J.; Stankiewicz, B. J.; Berlinguette, C. P. A Self-Driving Laboratory Designed to Accelerate the Discovery of Adhesive Materials. Digit. Discovery 2022, DOI: 10.1039/D2DD00029FGoogle ScholarThere is no corresponding record for this reference.
- 8Tao, H.; Wu, T.; Kheiri, S.; Aldeghi, M.; Aspuru-Guzik, A.; Kumacheva, E. Self-Driving Platform for Metal Nanoparticle Synthesis: Combining Microfluidics and Machine Learning. Adv. Funct. Mater. 2021, 31, 2106725, DOI: 10.1002/adfm.202106725Google Scholar8Self-Driving Platform for Metal Nanoparticle Synthesis: Combining Microfluidics and Machine LearningTao, Huachen; Wu, Tianyi; Kheiri, Sina; Aldeghi, Matteo; Aspuru-Guzik, Alan; Kumacheva, EugeniaAdvanced Functional Materials (2021), 31 (51), 2106725CODEN: AFMDC6; ISSN:1616-301X. (Wiley-VCH Verlag GmbH & Co. KGaA)Many applications of inorg. nanoparticles (NPs), including photocatalysis, photovoltaics, chem. and biochem. sensing, and theranostics, are governed by NP optical properties. Exploration and identification of reaction conditions for the synthesis of NPs with targeted spectroscopic characteristics is a time-, labor-, and resource-intensive task, as it involves the optimization of multiple interdependent reaction conditions. Integration of machine learning (ML) and microfluidics (MF) offers accelerated identification and optimization of reaction conditions for NP synthesis. Here, an autonomous ML-driven, oscillatory MF platform for the synthesis of NPs is reported. The platform utilized multiple recipes and reaction times for the synthesis of NPs with different dimensions, conducted spectroscopic NP characterization, and employed ML approaches to analyze multiple yet prioritized spectroscopic NP characteristics, and identified reaction conditions for the synthesis of NPs with targeted optical properties. The platform is also used to develop an understanding of the relationship between reaction conditions and NP properties. This study shows the strong potential of ML-driven oscillatory MF platforms in materials science and paves the way for automated NP development.
- 9Gao, W.; Raghavan, P.; Coley, C. W. Autonomous Platforms for Data-Driven Organic Synthesis. Nat. Commun. 2022, 13, 1075, DOI: 10.1038/s41467-022-28736-4Google Scholar9Autonomous platforms for data-driven organic synthesisGao, Wenhao; Raghavan, Priyanka; Coley, Connor W.Nature Communications (2022), 13 (1), 1075CODEN: NCAOBW; ISSN:2041-1723. (Nature Portfolio)Achieving autonomous multi-step synthesis of novel mol. structures in chem. discovery processes is a goal shared by many researchers. In this Comment, we discuss key considerations of what an ideal platform may look like and the apparent state of the art. While most hardware challenges can be overcome with clever engineering, other challenges will require advances in both algorithms and data curation.
- 10Gupta, A.; Ong, Y.; Feng, L. Insights on Transfer Optimization: Because Experience Is the Best Teacher. IEEE Trans. Emerg. Top. Comput. Intell. 2018, 2, 51– 64, DOI: 10.1109/TETCI.2017.2769104Google ScholarThere is no corresponding record for this reference.
- 11Shi, Y.; Prieto, P. L.; Zepel, T.; Grunert, S.; Hein, J. E. Automated Experimentation Powers Data Science in Chemistry. Acc. Chem. Res. 2021, 54, 546– 555, DOI: 10.1021/acs.accounts.0c00736Google Scholar11Automated Experimentation Powers Data Science in ChemistryShi, Yao; Prieto, Paloma L.; Zepel, Tara; Grunert, Shad; Hein, Jason E.Accounts of Chemical Research (2021), 54 (3), 546-555CODEN: ACHRE4; ISSN:0001-4842. (American Chemical Society)A review. Data science has revolutionized chem. research and continues to break down barriers with new interdisciplinary studies. The introduction of computational models and machine learning (ML) algorithms in combination with automation and traditional exptl. techniques has enabled scientific advancement across nearly every discipline of chem., from materials discovery, to process optimization, to synthesis planning. However, predictive tools powered by data science are only as good as their data sets and, currently, many of the data sets used to train models suffer from several limitations, including being sparse, limited in scope and requiring human curation. Likewise, computational data faces limitations in terms of accurate modeling of nonideal systems and can suffer from low translation fidelity from simulation to real conditions. The lack of diverse data and the need to be able to test it exptl. reduces both the accuracy and scope of the predictive models derived from data science. This Account contextualizes the need for more complex and diverse exptl. data and highlights how the seamless integration of robotics, machine learning, and data-rich monitoring techniques can be used to access it with minimal human labor. We propose three broad categories of data in chem.: data on fundamental properties, data on reaction outcomes, and data on reaction mechanics. We highlight flexible, automated platforms that can be deployed to acquire and leverage these data. The first platform combines solid- and liq.-dosing modules with computer vision to automate soly. screening, thereby gathering fundamental data that are necessary for almost every exptl. design. Using computer vision offers the addnl. benefit of creating a visual record, which can be referenced and used to further interrogate and gain insight on the data collected. The second platform iteratively tests reaction variables proposed by a ML algorithm in a closed-loop fashion. Exptl. data related to reaction outcomes are fed back into the algorithm to drive the discovery and optimization of new materials and chem. processes. The third platform uses automated process anal. technol. to gather real-time data related to reaction kinetics. This system allows the researcher to directly interrogate the reaction mechanisms in granular detail to det. exactly how and why a reaction proceeds, thereby enabling reaction optimization and deployment.
- 12Strieth-Kalthoff, F.; Sandfort, F.; Kühnemund, M.; Schäfer, F. R.; Kuchen, H.; Glorius, F. Machine Learning for Chemical Reactivity: The Importance of Failed Experiments. Angew. Chem., Int. Ed. 2022, 61, e202204647, DOI: 10.1002/anie.202204647Google Scholar12Machine Learning for Chemical Reactivity: The Importance of Failed ExperimentsStrieth-Kalthoff, Felix; Sandfort, Frederik; Kuehnemund, Marius; Schafer, Felix R.; Kuchen, Herbert; Glorius, FrankAngewandte Chemie, International Edition (2022), 61 (29), e202204647CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)A review. Assessing the outcomes of chem. reactions in a quant. fashion has been a cornerstone across all synthetic disciplines. Classically approached through empirical optimization, data-driven modeling bears an enormous potential to streamline this process. However, such predictive models require significant quantities of high-quality data, the availability of which is limited: Main reasons for this include exptl. errors and, importantly, human biases regarding expt. selection and result reporting. In a series of case studies, we investigate the impact of these biases for drawing general conclusions from chem. reaction data, revealing the utmost importance of "neg." examples. Eventually, case studies into data expansion approaches showcase directions to circumvent these limitations-and demonstrate perspectives towards a long-term data quality enhancement in chem.
- 13Beker, W.; Roszak, R.; Wołos, A.; Angello, N. H.; Rathore, V.; Burke, M. D.; Grzybowski, B. A. Machine Learning May Sometimes Simply Capture Literature Popularity Trends: A Case Study of Heterocyclic Suzuki–Miyaura Coupling. J. Am. Chem. Soc. 2022, 144, 4819– 4827, DOI: 10.1021/jacs.1c12005Google Scholar13Machine Learning May Sometimes Simply Capture Literature Popularity Trends: A Case Study of Heterocyclic Suzuki-Miyaura CouplingBeker, Wiktor; Roszak, Rafal; Wolos, Agnieszka; Angello, Nicholas H.; Rathore, Vandana; Burke, Martin D.; Grzybowski, Bartosz A.Journal of the American Chemical Society (2022), 144 (11), 4819-4827CODEN: JACSAT; ISSN:0002-7863. (American Chemical Society)A review. Applications of machine learning (ML) to synthetic chem. rely on the assumption that large nos. of literature-reported examples should enable construction of accurate and predictive models of chem. reactivity. This paper demonstrates that abundance of carefully curated literature data may be insufficient for this purpose. Using an example of Suzuki-Miyaura coupling with heterocyclic building blocks-and a carefully selected database of >10,000 literature examples-this article shows that ML models cannot offer any meaningful predictions of optimum reaction conditions, even if the search space is restricted to only solvents and bases. This result holds irresp. of the ML model applied (from simple feed-forward to state-of-the-art graph convolution neural networks) or the representation to describe the reaction partners (various fingerprints, chem. descriptors, latent representations, etc.). In all cases, the ML methods fail to perform significantly better than naive assignments based on the sheer frequency of certain reaction conditions reported in the literature. These unsatisfactory results likely reflect subjective preferences of various chemists to use certain protocols and, other biasing factors as mundane as availability of certain solvents/reagents, and/or a lack of neg. data. These findings highlight the likely importance of systematically generating reliable and standardized data sets for algorithm training.
- 14Wilbraham, L.; Mehr, S. H. M.; Cronin, L. Digitizing Chemistry Using the Chemical Processing Unit: From Synthesis to Discovery. Acc. Chem. Res. 2021, 54, 253– 262, DOI: 10.1021/acs.accounts.0c00674Google Scholar14Digitizing Chemistry Using the Chemical Processing Unit: From Synthesis to DiscoveryWilbraham, Liam; Mehr, S. Hessam M.; Cronin, LeroyAccounts of Chemical Research (2021), 54 (2), 253-262CODEN: ACHRE4; ISSN:0001-4842. (American Chemical Society)The digitization of chem. is not simply about using machine learning or artificial intelligence systems to process chem. data, or about the development of ever more capable automation hardware; instead, it is the creation of a hard link between an abstracted process ontol. of chem. and bespoke hardware for performing reactions or exploring reactivity. Chem. digitization is therefore about the unambiguous development of an architecture, a chem. state machine, that uses this ontol. to connect precise instruction sets to hardware that performs chem. transformations. This approach enables a universal std. for describing chem. procedures via a chem. programming language and facilitates unambiguous dissemination of these procedures. We predict that this std. will revolutionize the ability of chemists to collaborate, increase reproducibility and safety, as we all as optimize for cost and efficiency. Most importantly, the digitization of chem. will dramatically reduce the labor needed to make new compds. and broaden accessible chem. space. In recent years, the developments of automation in chem. have gone beyond flow chem. alone, with many bespoke workflows being developed not only for automating chem. synthesis but also for materials, nanomaterials, and formulation prodn. Indeed, the leap from fixed-configuration synthesis machines like peptide, nucleic acid, or dedicated cross-coupling engines is important for developing a truly universal approach to "dial-a-mol.". In this case, a key conceptual leap is the use of a batch system that can encode the chem. reagents, solvent, and products as packets which can be moved around the system, and a graph-based approach for the description of hardware modules that allows the compilation of chem. code that runs on, in principle, any hardware. Further, the integration of sensor systems for monitoring and controlling the state of the chem. synthesis machine, as well as high resoln. spectroscopic tools, is vital if these systems are to facilitate closed-loop autonomous expts. Systems that not only make mols. and materials, but also optimize their function, and use algorithms to assist with the development of new synthetic pathways and process optimization are also possible. Here, we discuss how the digitization of chem. is happening, building on the plethora of technol. developments in hardware and software. Importantly, digital-chem. robot systems need to integrate feedback from simple sensors, e.g., cond. or temp., as well as online analytics in order to navigate process space autonomously. This will open the door to accessing known mols. (synthesis), exploring whether known compds./reactions are possible under new conditions (optimization), and searching chem. space for unknown and unexpected new mols., reactions, and modes of reactivity (discovery). We will also discuss the role of chem. knowledge and how this can be used to challenge bias, as well as define and expand synthetically accessible chem. space using programmable robotic chem. state machines.
- 15Rohrbach, S.; Šiaučiulis, M.; Chisholm, G.; Pirvan, P.-A.; Saleeb, M.; Mehr, S. H. M.; Trushina, E.; Leonov, A. I.; Keenan, G.; Khan, A.; Hammer, A.; Cronin, L. Digitization and Validation of a Chemical Synthesis Literature Database in the ChemPU. Science 2022, 377, 172– 180, DOI: 10.1126/science.abo0058Google Scholar15Digitization and validation of a chemical synthesis literature database in the ChemPURohrbach, Simon; Siauciulis, Mindaugas; Chisholm, Greig; Pirvan, Petrisor-Alin; Saleeb, Michael; Mehr, S. Hessam M.; Trushina, Ekaterina; Leonov, Artem I.; Keenan, Graham; Khan, Aamir; Hammer, Alexander; Cronin, LeroyScience (Washington, DC, United States) (2022), 377 (6602), 172-180CODEN: SCIEAS; ISSN:1095-9203. (American Association for the Advancement of Science)Despite huge potential, automation of synthetic chem. has only made incremental progress over the past few decades. We present an automatically executable chem. reaction database of 100 mols. representative of the range of reactions found in contemporary org. synthesis. These reactions include transition metal-catalyzed coupling reactions, heterocycle formations, functional group interconversions, and multicomponent reactions. The chem. reaction codes or χDLs for the reactions have been stored in a database for version control, validation, collaboration, and data mining. Of these syntheses, more than 50 entries from the database have been downloaded and robotically run in seven modular ChemPU's with yields and purities comparable to those achieved by an expert chemist. We also demonstrate the automatic purifn. of a range of compds. using a chromatog. module seamlessly coupled to the platform and programmed with the same language.
- 16Bubliauskas, A.; Blair, D. J.; Powell-Davies, H.; Kitson, P. J.; Burke, M. D.; Cronin, L. Digitizing Chemical Synthesis in 3D Printed Reactionware. Angew. Chem., Int. Ed. 2022, 61, e202116108, DOI: 10.1002/anie.202116108Google Scholar16Digitizing Chemical Synthesis in 3D Printed ReactionwareBubliauskas, Andrius; Blair, Daniel J.; Powell-Davies, Henry; Kitson, Philip J.; Burke, Martin D.; Cronin, Leroy; AcknowAngewandte Chemie, International Edition (2022), 61 (24), e202116108CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)Chem. digitization requires an unambiguous link between expts. and the code used to generate the exptl. conditions and outcomes, yet this process is not standardized, limiting the portability of any chem. code. What is needed is a universal approach to aid this process using a well-defined std. that is composed of syntheses that are employed in modular hardware. Herein a new approach is presented to the digitization of org. synthesis that combines process chem. principles with 3D printed reactionware. This approach outlines the process for transforming unit operations into digitized hardware and well-defined instructions that ensure effective synthesis. To demonstrate this, the process is outlined for digitizing 3 MIDA boronate building blocks, an ester hydrolysis, a Wittig olefination, a Suzuki-Miyaura coupling reaction, and synthesis of the drug sulfanilamide.
- 17Nikolaev, P.; Hooper, D.; Webber, F.; Rao, R.; Decker, K.; Krein, M.; Poleski, J.; Barto, R.; Maruyama, B. Autonomy in Materials Research: A Case Study in Carbon Nanotube Growth. Npj Comput. Mater. 2016, 2, 16031, DOI: 10.1038/npjcompumats.2016.31Google ScholarThere is no corresponding record for this reference.
- 18Roch, L. M.; Häse, F.; Kreisbeck, C.; Tamayo-Mendoza, T.; Yunker, L. P. E.; Hein, J. E.; Aspuru-Guzik, A. ChemOS: Orchestrating Autonomous Experimentation. Sci. Robot. 2018, 3 (19), 1, DOI: 10.1126/scirobotics.aat5559Google ScholarThere is no corresponding record for this reference.
- 19Rahmanian, F.; Flowers, J.; Guevarra, D.; Richter, M.; Fichtner, M.; Donnely, P.; Gregoire, J. M.; Stein, H. S. Enabling Modular Autonomous Feedback-Loops in Materials Science through Hierarchical Experimental Laboratory Automation and Orchestration. Adv. Mater. Interfaces 2022, 9, 2101987, DOI: 10.1002/admi.202101987Google ScholarThere is no corresponding record for this reference.
- 20Gaudin, T.; Benlolo, I.; Cui, Z. Y.; Hickmann, R.; Tamblyn, I.; Aspuru-Guzik, A. Molar. In Zenodo ; 2022; https://zenodo.org/record/6809290.Google ScholarThere is no corresponding record for this reference.
- 21McKinney, W. Data Structures for Statistical Computing in Python. In Proc. of the 9th Python in Science Conference , Austin, Texas, 2010; pp 56– 61.Google ScholarThere is no corresponding record for this reference.
- 22Reback, J.; McKinney, W.; Jbrockmendel; ; Van den Bossche, J.; Roeschke, M.; Augspurger, T.; Hawkins, S.; Cloud, P.; Gfyoung; ; Sinhrks; ; Hoefler, P.; Klein, A.; Petersen, T.; Tratner, J.; She, C.; Ayd, W.; Naveh, S.; Darbyshire, J. H. M.; Shadrach, R.; Garcia, M.; Schendel, J.; Hayden, A.; Saxton, D.; Gorelli, M. E.; Li, F.; Wörtwein, T.; Zeitlin, M.; Jancauskas, V.; McMaster, A.; Li, T. Pandas-Dev/Pandas: Pandas 1.4.3. In Zenodo , 2022; https://zenodo.org/record/3509134.Google ScholarThere is no corresponding record for this reference.
- 23Häse, F.; Roch, L. M.; Kreisbeck, C.; Aspuru-Guzik, A. Phoenics: A Bayesian Optimizer for Chemistry. ACS Cent. Sci. 2018, 4, 1134– 1145, DOI: 10.1021/acscentsci.8b00307Google Scholar23Phoenics: A Bayesian Optimizer for ChemistryHase, Florian; Roch, Loic M.; Kreisbeck, Christoph; Aspuru-Guzik, AlanACS Central Science (2018), 4 (9), 1134-1145CODEN: ACSCII; ISSN:2374-7951. (American Chemical Society)We report Phoenics, a probabilistic global optimization algorithm identifying the set of conditions of an exptl. or computational procedure which satisfies desired targets. Phoenics combines ideas from Bayesian optimization with concepts from Bayesian kernel d. estn. As such, Phoenics allows to tackle typical optimization problems in chem. for which objective evaluations are limited, due to either budgeted resources or time-consuming evaluations of the conditions, including experimentation or enduring computations. Phoenics proposes new conditions based on all previous observations, avoiding, thus, redundant evaluations to locate the optimal conditions. It enables an efficient parallel search based on intuitive sampling strategies implicitly biasing toward exploration or exploitation of the search space. Our benchmarks indicate that Phoenics is less sensitive to the response surface than already established optimization algorithms. We showcase the applicability of Phoenics on the Oregonator, a complex case-study describing a nonlinear chem. reaction network. Despite the large search space, Phoenics quickly identifies the conditions which yield the desired target dynamic behavior. Overall, we recommend Phoenics for rapid optimization of unknown expensive-to-evaluate objective functions, such as experimentation or long-lasting computations.
- 24Häse, F.; Roch, L. M.; Aspuru-Guzik, A. Chimera: Enabling Hierarchy Based Multi-Objective Optimization for Self-Driving Laboratories. Chem. Sci. 2018, 9, 7642– 7655, DOI: 10.1039/C8SC02239AGoogle Scholar24Chimera: enabling hierarchy based multi-objective optimization for self-driving laboratoriesHase, Florian; Roch, Loic M.; Aspuru-Guzik, AlanChemical Science (2018), 9 (39), 7642-7655CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Finding the ideal conditions satisfying multiple pre-defined targets simultaneously is a challenging decision-making process, which impacts science, engineering, and economics. Addnl. complexity arises for tasks involving experimentation or expensive computations, as the no. of evaluated conditions must be kept low. We propose Chimera as a general purpose achievement scalarizing function for multi-target optimization where evaluations are the limiting factor. Chimera combines concepts of a priori scalarizing with lexicog. approaches and is applicable to any set of n unknown objectives. Importantly, it does not require detailed prior knowledge about individual objectives. The performance of Chimera is demonstrated on several well-established analytic multi-objective benchmark sets using different single-objective optimization algorithms. We further illustrate the applicability and performance of Chimera with two practical examples: (i) the auto-calibration of a virtual robotic sampling sequence for direct-injection, and (ii) the inverse-design of a four-pigment excitonic system for an efficient energy transport. The results indicate that Chimera enables a wide class of optimization algorithms to rapidly find ideal conditions. Addnl., the presented applications highlight the interpretability of Chimera to corroborate design choices for tailoring system parameters.
- 25Häse, F.; Aldeghi, M.; Hickman, R. J.; Roch, L. M.; Aspuru-Guzik, A. Gryffin: An Algorithm for Bayesian Optimization of Categorical Variables Informed by Expert Knowledge. Appl. Phys. Rev. 2021, 8, 031406, DOI: 10.1063/5.0048164Google Scholar25GRYFFIN: An algorithm for Bayesian optimization of categorical variables informed by expert knowledgeHase, Florian; Aldeghi, Matteo; Hickman, Riley J.; Roch, Loic M.; Aspuru-Guzik, AlanApplied Physics Reviews (2021), 8 (3), 031406CODEN: APRPG5; ISSN:1931-9401. (American Institute of Physics)A review. Designing functional mols. and advanced materials requires complex design choices: tuning continuous process parameters such as temps. or flow rates, while simultaneously selecting catalysts or solvents. To date, the development of data-driven expt. planning strategies for autonomous experimentation has largely focused on continuous process parameters, despite the urge to devise efficient strategies for the selection of categorical variables. Here, we introduce GRYFFIN, a general-purpose optimization framework for the autonomous selection of categorical variables driven by expert knowledge. GRYFFIN augments Bayesian optimization based on kernel d. estn. with smooth approxns. to categorical distributions. Leveraging domain knowledge in the form of physicochem. descriptors, GRYFFIN can significantly accelerate the search for promising mols. and materials. GRYFFIN can further highlight relevant correlations between the provided descriptors to inspire phys. insights and foster scientific intuition. In addn. to comprehensive benchmarks, we demonstrate the capabilities and performance of GRYFFIN on three examples in materials science and chem.: (i) the discovery of non-fullerene acceptors for org. solar cells, (ii) the design of hybrid org.-inorg. perovskites for light-harvesting, and (iii) the identification of ligands and process parameters for Suzuki-Miyaura reactions. Our results suggest that GRYFFIN, in its simplest form, is competitive with state-of-the-art categorical optimization algorithms. However, when leveraging domain knowledge provided via descriptors, GRYFFIN outperforms other approaches while simultaneously refining this domain knowledge to promote scientific understanding. (c) 2021 American Institute of Physics.
- 26Hickman, R. J.; Häse, F.; Roch, L. M.; Aspuru-Guzik, A. Gemini: Dynamic Bias Correction for Autonomous Experimentation and Molecular Simulation. arXiv , 2021, 2103.03391.Google ScholarThere is no corresponding record for this reference.
- 27Aldeghi, M.; Häse, F.; Hickman, R. J.; Tamblyn, I.; Aspuru-Guzik, A. Golem: An Algorithm for Robust Experiment and Process Optimization. arXiv , 2021, 2103.03716.Google ScholarThere is no corresponding record for this reference.
- 28MacLeod, B. P.; Parlane, F. G. L.; Morrissey, T. D.; Häse, F.; Roch, L. M.; Dettelbach, K. E.; Moreira, R.; Yunker, L. P. E.; Rooney, M. B.; Deeth, J. R.; Lai, V.; Ng, G. J.; Situ, H.; Zhang, R. H.; Elliott, M. S.; Haley, T. H.; Dvorak, D. J.; Aspuru-Guzik, A.; Hein, J. E.; Berlinguette, C. P. Self-Driving Laboratory for Accelerated Discovery of Thin-Film Materials. Sci. Adv. 2020, 6, eaaz8867, DOI: 10.1126/sciadv.aaz8867Google ScholarThere is no corresponding record for this reference.
- 29Glasnov, T. N.; Kappe, C. O. The Microwave-to-Flow Paradigm: Translating High-Temperature Batch Microwave Chemistry to Scalable Continuous-Flow Processes. Chem. – Eur. J. 2011, 17, 11956– 11968, DOI: 10.1002/chem.201102065Google Scholar29The Microwave-to-Flow Paradigm: Translating High-Temperature Batch Microwave Chemistry to Scalable Continuous-Flow ProcessesGlasnov, Toma N.; Kappe, C. OliverChemistry - A European Journal (2011), 17 (43), 11956-11968CODEN: CEUJED; ISSN:0947-6539. (Wiley-VCH Verlag GmbH & Co. KGaA)The popularity of dedicated microwave reactors in many academic and industrial labs. has produced a plethora of synthetic protocols that are based on this enabling technol. In the majority of examples, transformations that require several hours when performed using conventional heating under reflux conditions reach completion in a few minutes or even seconds in sealed-vessel, autoclave-type, microwave reactors. However, one severe drawback of microwave chem. is the difficulty in scaling this technol. to a prodn.-scale level. This concept article demonstrates that this limitation can be overcome by translating batch microwave chem. to scalable continuous-flow processes. For this purpose, conventionally heated micro- or mesofluidic flow devices fitted with a back-pressure regulator are employed, in which the high temps. and pressures attainable in a sealed-vessel microwave chem. batch expt. can be mimicked.
- 30Plutschack, M. B.; Pieber, B.; Gilmore, K.; Seeberger, P. H. The Hitchhiker’s Guide to Flow Chemistry. Chem. Rev. 2017, 117, 11796– 11893, DOI: 10.1021/acs.chemrev.7b00183Google Scholar30The Hitchhiker's Guide to Flow ChemistryPlutschack, Matthew B.; Pieber, Bartholomaeus; Gilmore, Kerry; Seeberger, Peter H.Chemical Reviews (Washington, DC, United States) (2017), 117 (18), 11796-11893CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)Flow chem. involves the use of channels or tubing to conduct a reaction in a continuous stream rather than in a flask. Flow equipment provides chemists with unique control over reaction parameters, enhancing reactivity or in some cases enabling new reactions. This relatively young technol. has received a remarkable amt. of attention in the past decade with many reports on what can be done in flow. Until recently, however, the question, "Should we do this in flow" has merely been an afterthought. This review introduces readers to the basic principles and fundamentals of flow chem. and critically discusses recent flow chem. accounts.
- 31Bianchi, P.; Williams, J. D.; Kappe, C. O. Oscillatory Flow Reactors for Synthetic Chemistry Applications. J. Flow Chem. 2020, 10, 475– 490, DOI: 10.1007/s41981-020-00105-6Google Scholar31Oscillatory flow reactors for synthetic chemistry applicationsBianchi, Pauline; Williams, Jason D.; Kappe, C. OliverJournal of Flow Chemistry (2020), 10 (3), 475-490CODEN: JFCOBJ; ISSN:2063-0212. (Akademiai Kiado)Abstr.: Oscillatory flow reactors (OFRs) superimpose an oscillatory flow to the net movement through a flow reactor. OFRs have been engineered to enable improved mixing, excellent heat- and mass transfer and good plug flow character under a broad range of operating conditions. Such features render these reactors appealing, since they are suitable for reactions that require long residence times, improved mass transfer (such as in biphasic liq.-liq. systems) or to homogeneously suspend solid particles. Various OFR configurations, offering specific features, have been developed over the past two decades, with significant progress still being made. This review outlines the principles and recent advances in OFR technol. and overviews the synthetic applications of OFRs for liq.-liq. and solid-liq. biphasic systems.
- 32Christensen, M.; Yunker, L. P. E.; Shiri, P.; Zepel, T.; Prieto, P. L.; Grunert, S.; Bork, F.; Hein, J. E. Automation Isn’t Automatic. Chem. Sci. 2021, 12, 15473, DOI: 10.1039/D1SC04588AGoogle Scholar32Automation isn't automaticChristensen, Melodie; Yunker, Lars P. E.; Shiri, Parisa; Zepel, Tara; Prieto, Paloma L.; Grunert, Shad; Bork, Finn; Hein, Jason E.Chemical Science (2021), 12 (47), 15473-15490CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Automation has become an increasingly popular tool for synthetic chemists over the past decade. Recent advances in robotics and computer science have led to the emergence of automated systems that execute common lab. procedures including parallel synthesis, reaction discovery, reaction optimization, time course studies, and crystn. development. While such systems offer many potential benefits, their implementation is rarely automatic due to the highly specialized nature of synthetic procedures. Each reaction category requires careful execution of a particular sequence of steps, the specifics of which change with different conditions and chem. systems. Careful assessment of these crit. procedural requirements and identification of the tools suitable for effective exptl. execution are key to developing effective automation workflows. Even then, it is often difficult to get all the components of an automated system integrated and operational. Data flows and specialized equipment present yet another level of challenge. Unfortunately, the pain points and process of implementing automated systems are often not shared or remain buried deep in the SI. This perspective provides an overview of the current state of automation of synthetic chem. at the benchtop scale with a particular emphasis on core considerations and the ensuing challenges of deploying a system. Importantly, we aim to reframe automation as decidedly not automatic but rather an iterative process that involves a series of careful decisions (both human and computational) and const. adjustment.
- 33Gillis, E. P.; Burke, M. D. Multistep Synthesis of Complex Boronic Acids from Simple MIDA Boronates. J. Am. Chem. Soc. 2008, 130, 14084– 14085, DOI: 10.1021/ja8063759Google Scholar33Multistep Synthesis of Complex Boronic Acids from Simple MIDA BoronatesGillis, Eric P.; Burke, Martin D.Journal of the American Chemical Society (2008), 130 (43), 14084-14085CODEN: JACSAT; ISSN:0002-7863. (American Chemical Society)Due to its sensitivity to most synthetic reagents, it is typically necessary to introduce the boronic acid functional group just prior to its use. Overcoming this important limitation, the authors herein report that air- and chromatog. stable N-methyliminodiacetic acid (MIDA) boronates are compatible with a wide range of common reagents which enables the multistep synthesis of complex boronic acid building blocks from simple B-contg. starting materials. X-ray and variable temp. NMR studies link the unique stability of MIDA boronates to a kinetic inaccessibility of the potentially reactive B p-orbital and/or N lone pair. These findings were collectively harnessed to achieve a short and modular total synthesis of (+)-crocacin C via the iterative cross-coupling of a structurally complex, MIDA-protected haloboronic acid building block.
- 34Li, J.; Ballmer, S. G.; Gillis, E. P.; Fujii, S.; Schmidt, M. J.; Palazzolo, A. M. E.; Lehmann, J. W.; Morehouse, G. F.; Burke, M. D. Synthesis of Many Different Types of Organic Small Molecules Using One Automated Process. Science 2015, 347, 1221– 1226, DOI: 10.1126/science.aaa5414Google Scholar34Synthesis of many different types of organic small molecules using one automated processLi, Junqi; Ballmer, Steven G.; Gillis, Eric P.; Fujii, Seiko; Schmidt, Michael J.; Palazzolo, Andrea M. E.; Lehmann, Jonathan W.; Morehouse, Greg F.; Burke, Martin D.Science (Washington, DC, United States) (2015), 347 (6227), 1221-1226CODEN: SCIEAS; ISSN:0036-8075. (American Association for the Advancement of Science)A wide variety of human-prepd. and natural product small mols. such as all-trans-retinal I, the phosphodiesterase inhibitor II, oblongolide III, and the secodaphnane core compd. IV were prepd. using an automated process. Alkyl, alkenyl, and aryl N-methyliminodiacetate-protected boronic acid esters (MIDA-boronates) were sepd. on silica gel from other mols. lacking MIDA-boronate moieties; the MIDA-boronates were retained on silica gel when methanol/diethyl ether was used as the eluent and were eluted when THF was used as the eluent. MIDA-boronates thus acted as phase tags which can be activated selectively to undergo Suzuki coupling reactions. Iterative coupling reactions of MIDA-boronates using solid-phase catch-and-release chromatog. purifn. in combination with stereoselective cyclization reactions allowed a variety of small mols. contg. polyene, biaryl, macrocyclic, and fused polycyclic structures to be prepd. using a multipurpose flow reactor (synthesizer).
- 35Anthony, J. E.; Heeney, M.; Ong, B. S. Synthetic Aspects of Organic Semiconductors. MRS Bull. 2008, 33, 698– 705, DOI: 10.1557/mrs2008.142Google Scholar35Synthetic aspects of organic semiconductorsAnthony, John E.; Heeney, Martin; Ong, Beng S.MRS Bulletin (2008), 33 (7), 698-705CODEN: MRSBEA; ISSN:0883-7694. (Materials Research Society)A review. This article discusses the importance of the choice of synthetic methodol. in the purity, and therefore performance, of both small-mol. and polymeric org. semiconductors. We discuss common methodologies used in the prepn. of org. semiconductors, paying particular attention to the impurities and byproducts that can arise during these synthetic approaches and how they can have an impact on semiconductor performance.
- 36Kuehne, A. J. C.; Gather, M. C. Organic Lasers: Recent Developments on Materials, Device Geometries, and Fabrication Techniques. Chem. Rev. 2016, 116, 12823– 12864, DOI: 10.1021/acs.chemrev.6b00172Google Scholar36Organic Lasers: Recent Developments on Materials, Device Geometries, and Fabrication TechniquesKuehne, Alexander J. C.; Gather, Malte C.Chemical Reviews (Washington, DC, United States) (2016), 116 (21), 12823-12864CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)Org. dyes have been used as gain medium for lasers since the 1960s, long before the advent of today's org. electronic devices. Org. gain materials are highly attractive for lasing due to their chem. tunability and large stimulated emission cross section. While the traditional dye laser has been largely replaced by solid-state lasers, a no. of new and miniaturized org. lasers have emerged that hold great potential for lab-on-chip applications, biointegration, low-cost sensing and related areas, which benefit from the unique properties of org. gain materials. On the fundamental level, these include high exciton binding energy, low refractive index (compared to inorg. semiconductors), and ease of spectral and chem. tuning. On a technol. level, mech. flexibility and compatibility with simple processing techniques such as printing, roll-to-roll, self-assembly, and soft-lithog. are most relevant. Here, the authors provide a comprehensive review of the developments in the field over the past decade, discussing recent advances in org. gain materials, which are today often based on solid-state org. semiconductors, as well as optical feedback structures, and device fabrication. Recent efforts toward continuous wave operation and elec. pumping of solid-state org. lasers are reviewed, and new device concepts and emerging applications are summarized.
- 37Wu, T. C.; Granda, A. A.; Hotta, K.; Yazdani, S. A.; Pollice, R.; Vestfrid, J.; Hao, H.; Lavigne, C.; Seifrid, M.; Angello, N.; Bencheikh, F.; Hein, J. E.; Burke, M.; Adachi, C.; Aspuru-Guzik, A. A Materials Acceleration Platform for Organic Laser Discovery. ChemRxiv , 2022; DOI: 10.26434/chemrxiv-2022-9zm65 .Google ScholarThere is no corresponding record for this reference.
- 38Yunker, L. P. E.; Donnecke, S.; Ting, M.; Yeung, D.; McIndoe, J. S. PythoMS: A Python Framework To Simplify and Assist in the Processing and Interpretation of Mass Spectrometric Data. J. Chem. Inf. Model. 2019, 59, 1295– 1300, DOI: 10.1021/acs.jcim.9b00055Google Scholar38PythoMS: A Python Framework To Simplify and Assist in the Processing and Interpretation of Mass Spectrometric DataYunker, Lars P. E.; Donnecke, Sofia; Ting, Michelle; Yeung, Darien; McIndoe, J. ScottJournal of Chemical Information and Modeling (2019), 59 (4), 1295-1300CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Mass spectrometric data are copious and generate a processing burden that is best dealt with programmatically. PythoMS is a collection of tools based on the Python programming language that assist researchers in creating figures and video output that is informative, clear, and visually compelling. The PythoMS framework introduces a library of classes and a variety of scripts that quickly perform time-consuming tasks: making proprietary output readable; binning intensity vs. time data to simulate longer scan times (and hence reduce noise); calcg. theor. isotope patterns and overlaying them in histogram form on exptl. data (an approach that works even for overlapping signals); rendering videos that enable zooming into the baseline of intensity vs time plots (useful to make sense of data collected over a large dynamic range) or that depict the evolution of different species in a time-lapse format; calcg. aggregates; and providing a quick first-pass at identifying fragments in MS/MS spectra. PythoMS is a living project that will continue to evolve as addnl. scripts are developed and deployed.
- 39Burger, B.; Maffettone, P. M.; Gusev, V. V.; Aitchison, C. M.; Bai, Y.; Wang, X.; Li, X.; Alston, B. M.; Li, B.; Clowes, R.; Rankin, N.; Harris, B.; Sprick, R. S.; Cooper, A. I. A Mobile Robotic Chemist. Nature 2020, 583, 237– 241, DOI: 10.1038/s41586-020-2442-2Google Scholar39A mobile robotic chemistBurger, Benjamin; Maffettone, Phillip M.; Gusev, Vladimir V.; Aitchison, Catherine M.; Bai, Yang; Wang, Xiaoyan; Li, Xiaobo; Alston, Ben M.; Li, Buyi; Clowes, Rob; Rankin, Nicola; Harris, Brandon; Sprick, Reiner Sebastian; Cooper, Andrew I.Nature (London, United Kingdom) (2020), 583 (7815), 237-241CODEN: NATUAS; ISSN:0028-0836. (Nature Research)Abstr.: Technologies such as batteries, biomaterials and heterogeneous catalysts have functions that are defined by mixts. of mol. and mesoscale components. As yet, this multi-length-scale complexity cannot be fully captured by atomistic simulations, and the design of such materials from first principles is still rare1-5. Likewise, exptl. complexity scales exponentially with the no. of variables, restricting most searches to narrow areas of materials space. Robots can assist in exptl. searches6-14 but their widespread adoption in materials research is challenging because of the diversity of sample types, operations, instruments and measurements required. Here we use a mobile robot to search for improved photocatalysts for hydrogen prodn. from water15. The robot operated autonomously over eight days, performing 688 expts. within a ten-variable exptl. space, driven by a batched Bayesian search algorithm16-18. This autonomous search identified photocatalyst mixts. that were six times more active than the initial formulations, selecting beneficial components and deselecting neg. ones. Our strategy uses a dexterous19,20 free-roaming robot21-24, automating the researcher rather than the instruments. This modular approach could be deployed in conventional labs. for a range of research problems beyond photocatalysis.
- 40Crosby, G. A.; Demas, J. N. Measurement of Photoluminescence Quantum Yields. Review. J. Phys. Chem. 1971, 75, 991– 1024, DOI: 10.1021/j100678a001Google Scholar40Measurement of photoluminescence quantum yields. ReviewCrosby, Glenn A.; Demas, James N.Journal of Physical Chemistry (1971), 75 (8), 991-1024CODEN: JPCHAX; ISSN:0022-3654.A review is given with 147 refs. Methods and apparatus for measuring photoluminescence quantum yields, standards, apparatus calibration, and data corrections are described.
- 41O’Connor, D. V. O.; Phillips, D. Time-Correlated Single Photon Counting; Academic Press: London, 1984.Google ScholarThere is no corresponding record for this reference.
- 42Cava, R. J.; DiSalvo, F. J.; Brus, L. E.; Dunbar, K. R.; Gorman, C. B.; Haile, S. M.; Interrante, L. V.; Musfeldt, J. L.; Navrotsky, A.; Nuzzo, R. G.; Pickett, W. E.; Wilkinson, A. P.; Ahn, C.; Allen, J. W.; Burns, P. C.; Ceder, G.; Chidsey, C. E. D.; Clegg, W.; Coronado, E.; Dai, H.; Deem, M. W.; Dunn, B. S.; Galli, G.; Jacobson, A. J.; Kanatzidis, M.; Lin, W.; Manthiram, A.; Mrksich, M.; Norris, D.; Nozik, A. J.; Peng, X.; Rawn, C.; Rolison, D.; Singh, D. J.; Toby, B. H.; Tolbert, S.; Wiesner, U. B.; Woodward, P. M.; Yang, P. Future Directions in Solid State Chemistry: Report of the NSF-Sponsored Workshop. Prog. Solid State Chem. 2002, 30, 1– 101, DOI: 10.1016/S0079-6786(02)00010-9Google Scholar42Future directions in solid state chemistry: report of the NSF-sponsored workshopCava, Robert J.; DiSalvo, Francis J.; Brus, Louis E.; Dunbar, Kim R.; Gorman, Christopher B.; Haile, Sossina M.; Interrante, Leonard V.; Musfeldt, Janice L.; Navrotsky, Alexandra; Nuzzo, Ralph G.; Pickett, Warren E.; Wilkinson, Angus P.; Ahn, Channing; Allen, James W.; Burns, Peter C.; Ceder, Gerdrand; Chidsey, Christopher E. D.; Clegg, William; Coronado, Eugenio; Dai, Hongjie; Deem, Michael W.; Dunn, Bruce S.; Galli, Giulia; Jacobson, Allan J.; Kanatzidis, Mercouri; Lin, Wenbin; Manthiram, Arumugam; Mrksich, Milan; Norris, David; Nozik, Arthur J.; Peng, Xiaogang; Rawn, Claudia; Rolison, Debra; Singh, David J.; Toby, Brian H.; Tolbert, Sarah; Wiesner, Ulrich B.; Woodward, Patrick M.; Yang, PeidongProgress in Solid State Chemistry (2002), 30 (1-2), 1-101CODEN: PSSTAW; ISSN:0079-6786. (Elsevier Science Ltd.)A review. A long-established area of scientific excellence in Europe, solid state chem. has emerged in the US in the past two decades as a field experiencing rapid growth and development. At its core, it is an interdisciplinary melding of chem., physics, engineering, and materials science, as it focuses on the design, synthesis and structural characterization of new chem. compds. and characterization of their phys. properties. As a consequence of this inherently interdisciplinary character, the solid state chem. community is highly open to the influx of new ideas and directions. The inclusionary character of the field's culture has been a significant factor in its continuing growth and vitality. This report presents an elaboration of discussions held during an NSF-sponsored workshop on Future Directions in Solid State Chem., held on the UC Davis Campus in Oct. 2001. That workshop was the second of a series of workshops planned in this topical area. The first, held at NSF headquarters in Arlington, Virginia, in Jan. of 1998, was designed to address the core of the field, describing how it has developed in the US and worldwide in the past decade, and how the members of the community saw the central thrusts of research and education in solid state chem. proceeding in the next several years. A report was published on that workshop (J.M. Honig, chair, "Proceedings of the Workshop on the Present Status and Future Developments of Solid State Chem. and Materials", Arlington, VA, Jan. 15-16, 1998) describing the state of the field and recommendations for future development of the core discipline. In the spirit of continuing to expand the scope of the solid state chem. community into new areas of scientific inquiry, the workshop elaborated in this document was designed to address the interfaces between our field and fields where we thought there would be significant opportunity for the development of new scientific advancements through increased interaction. The 7 topic areas, described in detail in this report, ranged from those with established ties to solid state chem. such as Earth and planetary sciences, and energy storage and conversion, to those such as condensed matter physics, where the connections are in their infancy, to biol., where the opportunities for connections are largely unexplored. Exciting ties to materials chem. were explored in discussions on mol. materials and nanoscale science, and a session on the importance of improving the ties between solid state chemists and experts in characterization at national exptl. facilities was included. The full report elaborates these ideas extensively.
- 43Aspuru-Guzik, A.; Persson, K. Materials Acceleration Platform: Accelerating Advanced Energy Materials Discovery by Integrating High-Throughput Methods and Artificial Intelligence; Mission Innovation: Innovation Challenge 6; Canadian Institute for Advanced Research, 2018.Google ScholarThere is no corresponding record for this reference.
- 44Fabbri, E.; Schmidt, T. J. Oxygen Evolution Reaction─The Enigma in Water Electrolysis. ACS Catal. 2018, 8, 9765– 9774, DOI: 10.1021/acscatal.8b02712Google Scholar44Oxygen Evolution Reaction-The Enigma in Water ElectrolysisFabbri, Emiliana; Schmidt, Thomas J.ACS Catalysis (2018), 8 (10), 9765-9774CODEN: ACCACS; ISSN:2155-5435. (American Chemical Society)A review is given. We aim at increasing the awareness among the scientific community devoted to progresses in water electrolyzers of the very recent development made in the fundamental understanding of the OER, particularly focusing on the increasing consciousness that several processes actually underpin the evolution of O from a metal oxide catalyst. Traditionally, the OER mechanism on metal oxides has been derived from that on metal catalysts, where the main parameter governing the reaction overpotential is the binding strength of O (or oxygenated species/intermediates) on the catalyst surface following the Sabatier principle: the best catalyst in terms of displaying the min. overpotential binds O on its surface neither too strongly nor too weakly.
- 45Coley, C. W.; Eyke, N. S.; Jensen, K. F. Autonomous Discovery in the Chemical Sciences Part II: Outlook. Angew. Chem., Int. Ed. 2020, 59 (52), 23414– 23436, DOI: 10.1002/anie.201909989Google Scholar45Autonomous Discovery in the Chemical Sciences Part II: OutlookColey, Connor W.; Eyke, Natalie S.; Jensen, Klavs F.Angewandte Chemie, International Edition (2020), 59 (52), 23414-23436CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)This two-part Review examines how automation has contributed to different aspects of discovery in the chem. sciences. In this second part, we reflect on a selection of exemplary studies. It is increasingly important to articulate what the role of automation and computation has been in the scientific process and how that has or has not accelerated discovery. One can argue that even the best automated systems have yet to "discover" despite being incredibly useful as lab. assistants. We must carefully consider how they have been and can be applied to future problems of chem. discovery in order to effectively design and interact with future autonomous platforms. The majority of this Review defines a large set of open research directions, including improving our ability to work with complex data, build empirical models, automate both phys. and computational expts. for validation, select expts., and evaluate whether we are making progress towards the ultimate goal of autonomous discovery. Addressing these practical and methodol. challenges will greatly advance the extent to which autonomous systems can make meaningful discoveries.
- 46Molga, K.; Szymkuć, S.; Grzybowski, B. A. Chemist Ex Machina: Advanced Synthesis Planning by Computers. Acc. Chem. Res. 2021, 54, 1094– 1106, DOI: 10.1021/acs.accounts.0c00714Google Scholar46Chemist Ex Machina: Advanced Synthesis Planning by ComputersMolga, Karol; Szymkuc, Sara; Grzybowski, Bartosz A.Accounts of Chemical Research (2021), 54 (5), 1094-1106CODEN: ACHRE4; ISSN:0001-4842. (American Chemical Society)Teaching computers to plan multistep syntheses of arbitrary target mols.-including natural products-has been one of the oldest challenges in chem., dating back to the 1960s. This Account recapitulates two decades of our group's work on the software platform called Chematica, which very recently achieved this long-sought objective and has been shown capable of planning synthetic routes to complex natural products, several of which were validated in the lab. For the machine to plan syntheses at an expert level, it must know the rules describing chem. reactions and use these rules to expand and search the networks of synthetic options. The rules must be of high quality: They must delineate accurately the scope of admissible substituents, capture all relevant stereochem. information, detect potential reactivity conflicts, and protection requirements. They should yield only those synthons that are chem. stable and energetically allowed (e.g., not too strained) and should be able to extrapolate beyond examples already published in the literature. In parallel, the network-search algorithms must be able to assign meaningful scores to the sets of synthons they encounter, make judicious choices which of the network's branches to expand, and when to withdraw from unpromising ones. They must be able to strategize over multiple steps to resolve intermittent reactivity conflicts, exchange functional groups, or overcome local maxima of mol. complexity. Meeting all these requirements makes the problem of computer-driven retrosynthesis very multifaceted, combining expert and AI approaches further supplemented by quantum-mech. and mol.-mechanics calcns. Development of Chematica has been a very long and gradual process because all these components are needed. Any shortcuts-for example, reliance on only expert or only data-based approaches-yield chem. na.ovrddot.ive and often erroneous syntheses, esp. for complex targets. On the bright side, once all the requisite algorithms are implemented-as they now are-they not only streamline conventional synthetic planning but also enable completely new modalities that would challenge any human chemist, for example, synthesis with multiple constraints imposed simultaneously or library-wide syntheses in which the machine constructs "global plans" leading to multiple targets and benefiting from the use of common intermediates. These types of analyses will have profound impact on the practice of chem. industry, designing more economical, more green, and less hazardous pathways.
- 47Shim, E.; Kammeraad, J. A.; Xu, Z.; Tewari, A.; Cernak, T.; Zimmerman, P. M. Predicting Reaction Conditions from Limited Data through Active Transfer Learning. Chem. Sci. 2022, 13, 6655– 6668, DOI: 10.1039/D1SC06932BGoogle Scholar47Predicting reaction conditions from limited data through active transfer learningShim, Eunjae; Kammeraad, Joshua A.; Xu, Ziping; Tewari, Ambuj; Cernak, Tim; Zimmerman, Paul M.Chemical Science (2022), 13 (22), 6655-6668CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Transfer and active learning have the potential to accelerate the development of new chem. reactions, using prior data and new expts. to inform models that adapt to the target area of interest. This article shows how specifically tuned machine learning models, based on random forest classifiers, can expand the applicability of Pd-catalyzed cross-coupling reactions to types of nucleophiles unknown to the model. First, model transfer is shown to be effective when reaction mechanisms and substrates are closely related, even when models are trained on relatively small nos. of data points. Then, a model simplification scheme is tested and found to provide comparative predictivity on reactions of new nucleophiles that include unseen reagent combinations. Lastly, for a challenging target where model transfer only provides a modest benefit over random selection, an active transfer learning strategy is introduced to improve model predictions. Simple models, composed of a small no. of decision trees with limited depths, are crucial for securing generalizability, interpretability, and performance of active transfer learning.
- 48Grzybowski, B. A.; Badowski, T.; Molga, K.; Szymkuć, S. Network Search Algorithms and Scoring Functions for Advanced-Level Computerized Synthesis Planning. WIREs Comput. Mol. Sci., e1630; DOI: 10.1002/wcms.1630 .Google ScholarThere is no corresponding record for this reference.
- 49Shahriari, B.; Swersky, K.; Wang, Z.; Adams, R. P.; de Freitas, N. Taking the Human Out of the Loop: A Review of Bayesian Optimization. Proc. IEEE 2016, 104, 148– 175, DOI: 10.1109/JPROC.2015.2494218Google ScholarThere is no corresponding record for this reference.
- 50Hickman, R. J.; Aldeghi, M.; Häse, F.; Aspuru-Guzik, A. Bayesian Optimization with Known Experimental and Design Constraints for Chemistry Applications. arXiv , 2022, 2203.17241.Google ScholarThere is no corresponding record for this reference.
- 51Blaženović, I.; Kind, T.; Torbašinović, H.; Obrenović, S.; Mehta, S. S.; Tsugawa, H.; Wermuth, T.; Schauer, N.; Jahn, M.; Biedendieck, R.; Jahn, D.; Fiehn, O. Comprehensive Comparison of in Silico MS/MS Fragmentation Tools of the CASMI Contest: Database Boosting Is Needed to Achieve 93% Accuracy. J. Cheminformatics 2017, 9, 32, DOI: 10.1186/s13321-017-0219-xGoogle Scholar51Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracyBlazenovic Ivana; Jahn Martina; Biedendieck Rebekka; Jahn Dieter; Blazenovic Ivana; Schauer Nicolas; Blazenovic Ivana; Kind Tobias; Mehta Sajjan S; Wermuth Tobias; Fiehn Oliver; Torbasinovic Hrvoje; Obrenovic Slobodan; Tsugawa Hiroshi; Fiehn OliverJournal of cheminformatics (2017), 9 (1), 32 ISSN:1758-2946.In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to address this discrepancy is to use in silico fragmentation software to identify unknown compounds by comparing and ranking theoretical MS/MS fragmentations from target structures to experimental tandem mass spectra (MS/MS). We compared the performance of four publicly available in silico fragmentation algorithms (MetFragCL, CFM-ID, MAGMa+ and MS-FINDER) that participated in the 2016 CASMI challenge. We found that optimizing the use of metadata, weighting factors and the manner of combining different tools eventually defined the ultimate outcomes of each method. We comprehensively analysed how outcomes of different tools could be combined and reached a final success rate of 93% for the training data, and 87% for the challenge data, using a combination of MAGMa+, CFM-ID and compound importance information along with MS/MS matching. Matching MS/MS spectra against the MS/MS libraries without using any in silico tool yielded 60% correct hits, showing that the use of in silico methods is still important.
- 52De Vijlder, T.; Valkenborg, D.; Lemière, F.; Romijn, E. P.; Laukens, K.; Cuyckens, F. A Tutorial in Small Molecule Identification via Electrospray Ionization-mass Spectrometry: The Practical Art of Structural Elucidation. Mass Spectrom. Rev. 2018, 37, 607– 629, DOI: 10.1002/mas.21551Google Scholar52A tutorial in small molecule identification via electrospray ionization-mass spectrometry: The practical art of structural elucidationDe Vijlder, Thomas; Valkenborg, Dirk; Lemiere, Filip; Romijn, Edwin P.; Laukens, Kris; Cuyckens, FilipMass Spectrometry Reviews (2018), 37 (5), 607-629CODEN: MSRVD3; ISSN:0277-7037. (John Wiley & Sons, Inc.)A review. The identification of unknown mols. has been one of the cornerstone applications of mass spectrometry for decades. Most of what is discussed in this tutorial also applies to other atm. pressure ionization methods like atm. pressure chem./photoionization. We focus primarily on the fundamental steps of MS-based structural elucidation of individual unknown compds., rather than describing strategies for large-scale identification in complex samples. We critically discuss topics like the detection of protonated and deprotonated ions ([M + H]+ and [M-H]-) as well as other adduct ions, the detn. of the mol. formula, and provide some basic rules on the interpretation of product ion spectra. Our tutorial focuses primarily on the fundamental steps of MS-based structural elucidation of individual unknown compds. (eg, contaminants in chem. prodn., pharmacol. alteration of drugs), rather than describing strategies for large-scale identification in complex samples. This tutorial also discusses strategies to obtain useful orthogonal information (UV/Vis, H/D exchange, chem. derivatization, etc) and offers an overview of the different informatics tools and approaches that can be used for structural elucidation of small mols. It is primarily intended for beginning mass spectrometrists and researchers from other mass spectrometry sub-disciplines that want to get acquainted with structural elucidation are interested in some practical tips and tricks.
- 53Cook, A.; Johnson, A. P.; Law, J.; Mirzazadeh, M.; Ravitz, O.; Simon, A. Computer-Aided Synthesis Design: 40 Years On. WIREs Comput. Mol. Sci. 2012, 2, 79– 107, DOI: 10.1002/wcms.61Google Scholar53Computer-aided synthesis design. 40 years onCook, Anthony; Johnson, A. Peter; Law, James; Mirzazadeh, Mahdi; Ravitz, Orr; Simon, AnikoWiley Interdisciplinary Reviews: Computational Molecular Science (2012), 2 (1), 79-107CODEN: WIRCAH; ISSN:1759-0884. (Wiley-Blackwell)A review. The discipline of retrosynthetic anal. is now just over 40 years old. From the earliest day, attempts were made to incorporate this approach into computer programs to test the extent in which chem. perception and synthetic thinking could be formalized. Despite pioneering research efforts, computer-aided synthetic anal. failed to achieve widespread routine use by chemists, which can be attributed in part to the difficulty of building the required high-quality retrosynthetic transform databases required for credible analyses. However, with the advent over the past 25 years of large comprehensive reaction databases, work on successfully automating the construction of reliable and comprehensive reaction rule databases is promising to revitalize research in this field. This review compares and contrasts the diverse approaches taken by selected programs in both the design and implementation of mol. feature perception and reaction rule representation, and the concepts of synthetic strategy selection, representation, and execution were reviewed. In particular, the current work on automating the construction of reliable and comprehensive synthetic rule sets from available reaction databases in newer programs such as ARChem were discussed. The authors argued that the progress achieved in this aspect paves the way to a deeper exploration of computer approaches to applying strategy and control in the synthesis problem.
- 54Rappoport, D.; Aspuru-Guzik, A. Predicting Feasible Organic Reaction Pathways Using Heuristically Aided Quantum Chemistry. J. Chem. Theory Comput. 2019, 15, 4099– 4112, DOI: 10.1021/acs.jctc.9b00126Google Scholar54Predicting Feasible Organic Reaction Pathways Using Heuristically Aided Quantum ChemistryRappoport, Dmitrij; Aspuru-Guzik, AlanJournal of Chemical Theory and Computation (2019), 15 (7), 4099-4112CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Studying org. reaction mechanisms using quantum chem. methods requires from the researcher an extensive knowledge of both org. chem. and first-principles computation. The need for empirical knowledge arises because any reasonably complete exploration of the potential energy surfaces (PES) of org. reactions is computationally prohibitive. The authors have previously introduced the heuristically-aided quantum chem. (HAQC) approach to modeling complex chem. reactions, which abstrs. the empirical knowledge in terms of chem. heuristics-simple rules guiding the PES exploration-and combines them with structure optimizations using quantum chem. methods. The HAQC approach makes use of heuristic kinetic criteria for selecting reaction paths that are not only plausible, i.e., consistent with the empirical rules of org. reactivity, but also feasible under the reaction conditions. The authors develop heuristic kinetic feasibility criteria, which correctly predict feasible reaction pathways for a wide range of simple polar (substitutions, addns., and eliminations) and pericyclic org. reactions (cyclizations, sigmatropic shifts, and cycloaddns.). In contrast to knowledge-based reaction mechanism prediction methods, the same kinetic heuristics are successful in classifying reaction pathways as feasible or infeasible across this diverse set of reaction mechanisms. The energy profiles of HAQC and their potential applications in machine learning of chem. reactivity are discussed.
- 55Rappoport, D. Reaction Networks and the Metric Structure of Chemical Space(s). J. Phys. Chem. A 2019, 123, 2610– 2620, DOI: 10.1021/acs.jpca.9b00519Google Scholar55Reaction Networks and the Metric Structure of Chemical Space(s)Rappoport, DmitrijJournal of Physical Chemistry A (2019), 123 (13), 2610-2620CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)In this paper, we develop a formal definition of chem. space as a discrete metric space of mols. and analyze its properties. To this end, we utilize the shortest path metric on reaction networks to define a distance function between mols. of the same stoichiometry (no. of atoms). The distance between mols. with different stoichiometries is formalized by making use of the partial ordering of stoichiometries with respect to inclusion. Calcns. of fractal dimension on metric spaces for individual stoichiometries show that they have low intrinsic dimensionality, about an order of magnitude less than the dimension of the underlying reactive potential energy surface. Our findings suggest that efficient search strategies on chem. space can be designed that take advantage of its metric structure.
- 56Wołos, A.; Roszak, R.; Żądło-Dobrowolska, A.; Beker, W.; Mikulak-Klucznik, B.; Spólnik, G.; Dygas, M.; Szymkuć, S.; Grzybowski, B. A. Synthetic Connectivity, Emergence, and Self-Regeneration in the Network of Prebiotic Chemistry. Science 2020, 369, eaaw1955, DOI: 10.1126/science.aaw1955Google ScholarThere is no corresponding record for this reference.
- 57Arya, A.; Ray, J.; Sharma, S.; Simbron, R. C.; Lozano, A.; Smith, H. B.; Andersen, J. L.; Chen, H.; Meringer, M.; Cleaves, H. J. An Open Source Computational Workflow for the Discovery of Autocatalytic Networks in Abiotic Reactions. Chem. Sci. 2022, 13, 4838– 4853, DOI: 10.1039/D2SC00256FGoogle Scholar57An open source computational workflow for the discovery of autocatalytic networks in abiotic reactionsArya, Aayush; Ray, Jessica; Sharma, Siddhant; Cruz Simbron, Romulo; Lozano, Alejandro; Smith, Harrison B.; Andersen, Jakob Lykke; Chen, Huan; Meringer, Markus; Cleaves II, Henderson JamesChemical Science (2022), 13 (17), 4838-4853CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)A central question in origins of life research is how non-entailed chem. processes, which simply dissipate chem. energy because they can do so due to immediate reaction kinetics and thermodn, enabled the origin of highly-entailed ones, in which concatenated kinetically and thermodynamically favorable processes enhanced some processes over others. Some degree of mol. complexity likely had to be supplied by environmental processes to produce entailed self-replicating processes. The origin of entailment, therefore, must connect to fundamental chem. that builds mol. complexity. We present here an open-source chemoinformatic workflow to model abiol. chem. to discover such entailment. This pipeline automates generation of chem. reaction networks and their anal. to discover novel compds. and autocatalytic processes. We demonstrate this pipelines capabilities against a well-studied model system by vetting it against expremental data. This workflow can enable rapid identification of products of complex chemistries and their underlying synthetic relationships to help identify autocatalysis, and potentially self-organization, in such systems. The algorithms used in this study are open-source and reconfigurable by other user-developed workflows.
- 58Allen, F.; Pon, A.; Wilson, M.; Greiner, R.; Wishart, D. CFM-ID: A Web Server for Annotation, Spectrum Prediction and Metabolite Identification from Tandem Mass Spectra. Nucleic Acids Res. 2014, 42, W94– W99, DOI: 10.1093/nar/gku436Google Scholar58CFM-ID: a web server for annotation, spectrum prediction and metabolite identification from tandem mass spectraAllen, Felicity; Pon, Allison; Wilson, Michael; Greiner, Russ; Wishart, DavidNucleic Acids Research (2014), 42 (W1), W94-W99CODEN: NARHAD; ISSN:0305-1048. (Oxford University Press)CFM-ID is a web server supporting three tasks assocd. with the interpretation of tandem mass spectra (MS/MS) for the purpose of automated metabolite identification: annotation of the peaks in a spectrum for a known chem. structure; prediction of spectra for a given chem. structure and putative metabolite identification-a predicted ranking of possible candidate structures for a target spectrum. The algorithms used for these tasks are based on Competitive Fragmentation Modeling (CFM), a recently introduced probabilistic generative model for the MS/MS fragmentation process that uses machine learning techniques to learn its parameters from data. These algorithms have been extensively tested on multiple datasets and have been shown to out-perform existing methods such as MetFrag and FingerId. This web server provides a simple interface for using these algorithms and a graphical display of the resulting annotations, spectra and structures. CFM-ID is made freely available at http://cfmid.wishartlab.com.
- 59Djoumbou-Feunang, Y.; Pon, A.; Karu, N.; Zheng, J.; Li, C.; Arndt, D.; Gautam, M.; Allen, F.; Wishart, D. S. CFM-ID 3.0: Significantly Improved ESI-MS/MS Prediction and Compound Identification. Metabolites 2019, 9, 72, DOI: 10.3390/metabo9040072Google Scholar59CFM-ID 3.0: significantly improved ESI-MS/MS prediction and compound identificationDjoumbou-Feunang, Yannick; Pon, Allison; Karu, Naama; Zheng, Jiamin; Li, Carin; Arndt, Dav; Gautam, Maheswor; Allen, Felicity; Wishart, Dav S.Metabolites (2019), 9 (4), 72CODEN: METALU; ISSN:2218-1989. (MDPI AG)Metabolite identification for untargeted metabolomics is often hampered by the lack of exptl. collected ref. spectra from tandem mass spectrometry (MS/MS). To circumvent this problem, Competitive Fragmentation Modeling-ID (CFM-ID) was developed to accurately predict electrospray ionization-MS/MS (ESI-MS/MS) spectra from chem. structures and to aid in compd. identification via MS/MS spectral matching. While earlier versions of CFM-ID performed very well, CFM-ID's performance for predicting the MS/MS spectra of certain classes of compds., including many lipids, was quite poor. Furthermore, CFM-ID's compd. identification capabilities were limited because it did not use exptl. available MS/MS spectra nor did it exploit metadata in its spectral matching algorithm. Here, we describe significant improvements to CFM-ID's performance and speed. These include (1) the implementation of a rule-based fragmentation approach for lipid MS/MS spectral prediction, which greatly improves the speed and accuracy of CFM-ID; (2) the inclusion of exptl. MS/MS spectra and other metadata to enhance CFM-ID's compd. identification abilities; (3) the development of new scoring functions that improves CFM-ID's accuracy by 21.1%; and (4) the implementation of a chem. classification algorithm that correctly classifies unknown chems. (based on their MS/MS spectra) in >80% of the cases. This improved version called CFM-ID 3.0 is freely available as a web server.
- 60Ji, H.; Deng, H.; Lu, H.; Zhang, Z. Predicting a Molecular Fingerprint from an Electron Ionization Mass Spectrum with Deep Neural Networks. Anal. Chem. 2020, 92, 8649– 8653, DOI: 10.1021/acs.analchem.0c01450Google Scholar60Predicting a Molecular Fingerprint from an Electron Ionization Mass Spectrum with Deep Neural NetworksJi, Hongchao; Deng, Hanzi; Lu, Hongmei; Zhang, ZhiminAnalytical Chemistry (Washington, DC, United States) (2020), 92 (13), 8649-8653CODEN: ANCHAM; ISSN:0003-2700. (American Chemical Society)Electron ionization-mass spectrometry (EI-MS) hyphenated to gas chromatog. (GC) is the workhorse for analyzing volatile compds. in complex samples. The spectral matching method can only identify compds. within the spectral database. In response, we present a deep-learning-based approach (DeepEI) for structure elucidation of an unknown compd. with its EI-MS spectrum. DeepEI employs deep neural networks to predict mol. fingerprints from an EI-MS spectrum and searches the mol. structure database with the predicted fingerprints. We evaluated DeepEI with MassBank spectra, and the results indicate DeepEI is an effective identification method. In addn., DeepEI can work cooperatively with database spectral matching and NEIMS (fingerprint to spectrum method) to improve identification accuracy.
- 61Xue, J.; Guijas, C.; Benton, H. P.; Warth, B.; Siuzdak, G. METLIN MS 2 Molecular Standards Database: A Broad Chemical and Biological Resource. Nat. Methods 2020, 17, 953– 954, DOI: 10.1038/s41592-020-0942-5Google Scholar61METLIN MS2 molecular standards database: a broad chemical and biological resourceXue, Jingchuan; Guijas, Carlos; Benton, H. Paul; Warth, Benedikt; Siuzdak, GaryNature Methods (2020), 17 (10), 953-954CODEN: NMAEA3; ISSN:1548-7091. (Nature Research)There is no expanded citation for this reference.
- 62Horai, H.; Arita, M.; Kanaya, S.; Nihei, Y.; Ikeda, T.; Suwa, K.; Ojima, Y.; Tanaka, K.; Tanaka, S.; Aoshima, K.; Oda, Y.; Kakazu, Y.; Kusano, M.; Tohge, T.; Matsuda, F.; Sawada, Y.; Hirai, M. Y.; Nakanishi, H.; Ikeda, K.; Akimoto, N.; Maoka, T.; Takahashi, H.; Ara, T.; Sakurai, N.; Suzuki, H.; Shibata, D.; Neumann, S.; Iida, T.; Tanaka, K.; Funatsu, K.; Matsuura, F.; Soga, T.; Taguchi, R.; Saito, K.; Nishioka, T. MassBank: A Public Repository for Sharing Mass Spectral Data for Life Sciences. J. Mass Spectrom. 2010, 45, 703– 714, DOI: 10.1002/jms.1777Google Scholar62MassBank: a public repository for sharing mass spectral data for life sciencesHorai, Hisayuki; Arita, Masanori; Kanaya, Shigehiko; Nihei, Yoshito; Ikeda, Tasuku; Suwa, Kazuhiro; Ojima, Yuya; Tanaka, Kenichi; Tanaka, Satoshi; Aoshima, Ken; Oda, Yoshiya; Kakazu, Yuji; Kusano, Miyako; Tohge, Takayuki; Matsuda, Fumio; Sawada, Yuji; Hirai, Masami Yokota; Nakanishi, Hiroki; Ikeda, Kazutaka; Akimoto, Naoshige; Maoka, Takashi; Takahashi, Hiroki; Ara, Takeshi; Sakurai, Nozomu; Suzuki, Hideyuki; Shibata, Daisuke; Neumann, Steffen; Iida, Takashi; Tanaka, Ken; Funatsu, Kimito; Matsuura, Fumito; Soga, Tomoyoshi; Taguchi, Ryo; Saito, Kazuki; Nishioka, TakaakiJournal of Mass Spectrometry (2010), 45 (7), 703-714CODEN: JMSPFJ; ISSN:1076-5174. (John Wiley & Sons Ltd.)MassBank is the first public repository of mass spectra of small chem. compds. for life sciences (<3000 Da). The database contains 605 electron-ionization mass spectrometry(EI-MS), 137 fast atom bombardment MS and 9276 electrospray ionization (ESI)-MSn data of 2337 authentic compds. of metabolites, 11 545 EI-MS and 834 other-MS data of 10 286 volatile natural and synthetic compds., and 3045 ESI-MS2 data of 679 synthetic drugs contributed by 16 research groups (Jan. 2010). ESI-MS2 data were analyzed under nonstandardized, independent exptl. conditions. MassBank is a distributed database. Each research group provides data from its own MassBank data servers distributed on the Internet. MassBank users can access either all of the MassBank data or a subset of the data by specifying one or more exptl. conditions. In a spectral search to retrieve mass spectra similar to a query mass spectrum, the similarity score is calcd. by a weighted cosine correlation in which weighting exponents on peak intensity and the mass-to-charge ratio are optimized to the ESI-MS2 data. MassBank also provides a merged spectrum for each compd. prepd. by merging the analyzed ESI-MS2 data on an identical compd. under different collision-induced dissocn. conditions. Data merging has significantly improved the precision of the identification of a chem. compd. by 21-23% at a similarity score of 0.6. Thus, MassBank is useful for the identification of chem. compds. and the publication of exptl. data.
- 63NIST. NIST 20 Tandem Mass Spectral Libraries; https://chemdata.nist.gov/dokuwiki/doku.php?id=chemdata:msms (accessed 2022-07-19).Google ScholarThere is no corresponding record for this reference.
- 64Sans, V.; Porwol, L.; Dragone, V.; Cronin, L. A Self Optimizing Synthetic Organic Reactor System Using Real-Time in-Line NMR Spectroscopy. Chem. Sci. 2015, 6, 1258– 1264, DOI: 10.1039/C4SC03075CGoogle Scholar64A self optimizing synthetic organic reactor system using real-time in-line NMR spectroscopySans, Victor; Porwol, Luzian; Dragone, Vincenza; Cronin, LeroyChemical Science (2015), 6 (2), 1258-1264CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)A configurable platform for synthetic chem. incorporating an in-line bench-top NMR capable of monitoring and controlling org. reactions in real-time is discussed. The platform is controlled by a modular LabView software control system for hardware, NMR, data anal., and feedback optimization. Using this platform, real-time advanced structural characterization of reaction mixts., including 19F, 13C, DEPT, 2-dimensional NMR spectroscopy (COSY, HSQC, 19F-COSY), are reported for the first time. The potential of this technique was demonstrated by optimizing a catalytic org. reaction in real-time, showing its applicability to self-optimizing systems using criteria such as stereo-selectivity, multi-nuclear measurements, or 2-dimensional correlations.
- 65Granda, J. M.; Donina, L.; Dragone, V.; Long, D.-L.; Cronin, L. Controlling an Organic Synthesis Robot with Machine Learning to Search for New Reactivity. Nature 2018, 559, 377– 381, DOI: 10.1038/s41586-018-0307-8Google Scholar65Controlling an organic synthesis robot with machine learning to search for new reactivityGranda, Jaroslaw M.; Donina, Liva; Dragone, Vincenza; Long, De-Liang; Cronin, LeroyNature (London, United Kingdom) (2018), 559 (7714), 377-381CODEN: NATUAS; ISSN:0028-0836. (Nature Research)The discovery of chem. reactions is an inherently unpredictable and time-consuming process. An attractive alternative is to predict reactivity, although relevant approaches, such as computer-aided reaction design, are still in their infancy. Reaction prediction based on high-level quantum chem. methods is complex, even for simple mols. Although machine learning is powerful for data anal., its applications in chem. are still being developed. Inspired by strategies based on chemists' intuition, we propose that a reaction system controlled by a machine learning algorithm may be able to explore the space of chem. reactions quickly, esp. if trained by an expert. Here we present an org. synthesis robot that can perform chem. reactions and anal. faster than they can be performed manually, as well as predict the reactivity of possible reagent combinations after conducting a small no. of expts., thus effectively navigating chem. reaction space. By using machine learning for decision making, enabled by binary encoding of the chem. inputs, the reactions can be assessed in real time using NMR and IR spectroscopy. The machine learning system was able to predict the reactivity of about 1,000 reaction combinations with accuracy greater than 80 per cent after considering the outcomes of slightly over 10 per cent of the dataset. This approach was also used to calc. the reactivity of published datasets. Further, by using real-time data from our robot, these predictions were followed up manually by a chemist, leading to the discovery of four reactions.
- 66Maschmeyer, T.; Prieto, P. L.; Grunert, S.; Hein, J. E. Exploration of Continuous-Flow Benchtop NMR Acquisition Parameters and Considerations for Reaction Monitoring. Magn. Reson. Chem. 2020, 58, 1234– 1248, DOI: 10.1002/mrc.5094Google Scholar66Exploration of continuous-flow benchtop NMR acquisition parameters and considerations for reaction monitoringMaschmeyer, Tristan; Prieto, Paloma L.; Grunert, Shad; Hein, Jason E.Magnetic Resonance in Chemistry (2020), 58 (12), 1234-1248CODEN: MRCHEG; ISSN:0749-1581. (John Wiley & Sons Ltd.)This study focused on fundamental data acquisition parameter selection for a benchtop NMR (NMR) system with continuous flow, applicable for reaction monitoring. The effect of flow rate on the mixing behaviors within a flow cell was obsd., along with an exponential decay relationship between flow rate and the apparent spin-lattice relaxation time (T1*) of benzaldehyde. We also monitored sensitivity (as detd. by signal-to-noise ratios; SNRs) under various flow rates, analyte concns., and temps. of the analyte flask. Results suggest that a max. SNR can be achieved with low to medium flow rates and higher analyte concns. This was consistent with data collected with parameters that promote either slow or fast data acquisition. We further consider the effect of these conditions on the analyte's residence time, T1*, and magnetic field inhomogeneity that is a product of continuous flow. Altogether, our results demonstrate how fundamental acquisition parameters can be manipulated to achieve optimal data acquisition in continuous-flow NMR systems.
- 67Chatterjee, S.; Guidi, M.; Seeberger, P. H.; Gilmore, K. Automated Radial Synthesis of Organic Molecules. Nature 2020, 579, 379– 384, DOI: 10.1038/s41586-020-2083-5Google Scholar67Automated radial synthesis of organic moleculesChatterjee, Sourav; Guidi, Mara; Seeberger, Peter H.; Gilmore, KerryNature (London, United Kingdom) (2020), 579 (7799), 379-384CODEN: NATUAS; ISSN:0028-0836. (Nature Research)Abstr.: Automated synthesis platforms accelerate and simplify the prepn. of mols. by removing the phys. barriers to org. synthesis. This provides unrestricted access to biopolymers and small mols. via reproducible and directly comparable chem. processes. Current automated multistep syntheses rely on either iterative1-4 or linear processes5-9, and require compromises in terms of versatility and the use of equipment. Here we report an approach towards the automated synthesis of small mols., based on a series of continuous flow modules that are radially arranged around a central switching station. Using this approach, concise vols. can be exposed to any reaction conditions required for a desired transformation. Sequential, non-simultaneous reactions can be combined to perform multistep processes, enabling the use of variable flow rates, reuse of reactors under different conditions, and the storage of intermediates. This fully automated instrument is capable of both linear and convergent syntheses and does not require manual reconfiguration between different processes. The capabilities of this approach are demonstrated by performing optimizations and multistep syntheses of targets, varying concns. via inline dilns., exploring several strategies for the multistep synthesis of the anticonvulsant drug rufinamide10, synthesizing eighteen compds. of two deriv. libraries that are prepd. using different reaction pathways and chemistries, and using the same reagents to perform metallaphotoredox carbon-nitrogen cross-couplings11 in a photochem. module-all without instrument reconfiguration.
- 68Bahr, M. N.; Damon, D. B.; Yates, S. D.; Chin, A. S.; Christopher, J. D.; Cromer, S.; Perrotto, N.; Quiroz, J.; Rosso, V. Collaborative Evaluation of Commercially Available Automated Powder Dispensing Platforms for High-Throughput Experimentation in Pharmaceutical Applications. Org. Process Res. Dev. 2018, 22, 1500– 1508, DOI: 10.1021/acs.oprd.8b00259Google Scholar68Collaborative Evaluation of Commercially Available Automated Powder Dispensing Platforms for High-Throughput Experimentation in Pharmaceutical ApplicationsBahr, Matthew N.; Damon, David B.; Yates, Simon D.; Chin, Alexander S.; Christopher, J. David; Cromer, Samuel; Perrotto, Nicholas; Quiroz, Jorge; Rosso, VictorOrganic Process Research & Development (2018), 22 (11), 1500-1508CODEN: OPRDFK; ISSN:1083-6160. (American Chemical Society)Many workflows in Pharmaceutical R&D involve the manipulation of defined amts. of powders. Automated powder dispensing platforms are currently available; however, these existing technologies do not meet the requirements for every high-throughput experimentation powder dispensing application. A Working Group (WG) composed of pharmaceutical researchers within the Enabling Technologies Consortium (ETC) evaluated automated platforms com. available from three manufacturers using an objective, systematic testing protocol. This paper describes the selection of powders and testing conditions used in this evaluation, and it assesses the impact that the powders, testing conditions, equipment environment, and other factors had on the performance of the selected platforms.
- 69Bahr, M. N.; Morris, M. A.; Tu, N. P.; Nandkeolyar, A. Recent Advances in High-Throughput Automated Powder Dispensing Platforms for Pharmaceutical Applications. Org. Process Res. Dev. 2020, 24, 2752, DOI: 10.1021/acs.oprd.0c00411Google Scholar69Recent Advances in High-Throughput Automated Powder Dispensing Platforms for Pharmaceutical ApplicationsBahr, Matthew N.; Morris, Mark A.; Tu, Noah P.; Nandkeolyar, AakankschitOrganic Process Research & Development (2020), 24 (11), 2752-2761CODEN: OPRDFK; ISSN:1083-6160. (American Chemical Society)A wide array of pharmaceutical research studies involve dispensing a variety of powders such as active ingredients, intermediates, catalysts, and formulation excipients. Automated powder dispensing platforms are increasingly relied upon to perform the mundane task of filling vials in multi-well plates for high-throughput experimentation workflows. A small group of pharmaceutical scientists collaborated to evaluate recent advances in com. available automation platforms from two instrument manufacturers using previously reported objective and systematic testing protocols. This manuscript details the testing conditions used for the evaluation and the results obtained and assesses the impact that the powder characteristics had on the performance of the selected platforms through statistical anal.
- 70Tu, N. P.; Dombrowski, A. W.; Goshu, G. M.; Vasudevan, A.; Djuric, S. W.; Wang, Y. High-Throughput Reaction Screening with Nanomoles of Solid Reagents Coated on Glass Beads. Angew. Chem., Int. Ed. 2019, 58, 7987– 7991, DOI: 10.1002/anie.201900536Google Scholar70High-Throughput Reaction Screening with Nanomoles of Solid Reagents Coated on Glass BeadsTu, Noah P.; Dombrowski, Amanda W.; Goshu, Gashaw M.; Vasudevan, Anil; Djuric, Stevan W.; Wang, YingAngewandte Chemie, International Edition (2019), 58 (24), 7987-7991CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)Technologies that enable rapid screening of diverse reaction conditions are of crit. importance to methodol. development and reaction optimization, esp. when mols. of high complexity and scarcity are involved. The lack of a general solid dispensing method for chem. reagents on micro- and nanomole scale prevents the full use of reaction screening technologies. The authors herein report the development of a technol. in which glass beads coated with solid chem. reagents (ChemBeads) enable the delivery of nanomole quantities of solid chem. reagents efficiently. By exploring the concept of preferred screening sets, the flexibility and generality of this technol. for high-throughput reaction screening was validated.
- 71Shiri, P.; Lai, V.; Zepel, T.; Griffin, D.; Reifman, J.; Clark, S.; Grunert, S.; Yunker, L. P. E.; Steiner, S.; Situ, H.; Yang, F.; Prieto, P. L.; Hein, J. E. Automated Solubility Screening Platform Using Computer Vision. iScience 2021, 24, 102176, DOI: 10.1016/j.isci.2021.102176Google Scholar71Automated solubility screening platform using computer visionShiri, Parisa; Lai, Veronica; Zepel, Tara; Griffin, Daniel; Reifman, Jonathan; Clark, Sean; Grunert, Shad; Yunker, Lars P. E.; Steiner, Sebastian; Situ, Henry; Yang, Fan; Prieto, Paloma L.; Hein, Jason E.iScience (2021), 24 (3), 102176CODEN: ISCICE; ISSN:2589-0042. (Elsevier B.V.)Soly. screening is an essential, routine process that is often labor intensive. Robotic platforms have been developed to automate some aspects of the manual labor involved. However, many of the existing systems rely on traditional analytic techniques such as high-performance liq. chromatog., which require pre-calibration for each compd. and can be resource consuming. In addn., automation is not typically end-to-end, requiring user intervention to move vials, establish anal. methods for each compd. and interpret the raw data. We developed a closed-loop, flexible robotic system with integrated solid and liq. dosing capabilities that relies on computer vision and iterative feedback to successfully measure caffeine soly. in multiple solvents. After initial researcher input (<2 min), the system ran autonomously, screening five different solvent systems (20-80 min each). The resulting soly. values matched those obtained using traditional manual techniques.
- 72J3016C: Taxonomy and Definitions for Terms Related to Driving Automation Systems for On-Road Motor Vehicles ; 2021; https://www.sae.org/standards/content/j3016_202104.Google ScholarThere is no corresponding record for this reference.
- 73Ruiz-Castillo, P.; Buchwald, S. L. Applications of Palladium-Catalyzed C–N Cross-Coupling Reactions. Chem. Rev. 2016, 116, 12564– 12649, DOI: 10.1021/acs.chemrev.6b00512Google Scholar73Applications of Palladium-Catalyzed C-N Cross-Coupling ReactionsRuiz-Castillo, Paula; Buchwald, Stephen L.Chemical Reviews (Washington, DC, United States) (2016), 116 (19), 12564-12649CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)A review. Pd-catalyzed cross-coupling reactions that form C-N bonds have become useful methods to synthesize anilines and aniline derivs., an important class of compds. throughout chem. research. A key factor in the widespread adoption of these methods has been the continued development of reliable and versatile catalysts that function under operationally simple, user-friendly conditions. This review provides an overview of Pd-catalyzed N-arylation reactions found in both basic and applied chem. research from 2008 to the present. Selected examples of C-N cross-coupling reactions between nine classes of nitrogen-based coupling partners and (pseudo)aryl halides are described for the synthesis of heterocycles, medicinally relevant compds., natural products, org. materials, and catalysts.
- 74Chinchilla, R.; Nájera, C. The Sonogashira Reaction: A Booming Methodology in Synthetic Organic Chemistry. Chem. Rev. 2007, 107, 874– 922, DOI: 10.1021/cr050992xGoogle Scholar74The Sonogashira reaction: a booming methodology in synthetic organic chemistryChinchilla, Rafael; Najera, CarmenChemical Reviews (Washington, DC, United States) (2007), 107 (3), 874-922CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)A review. The palladium-catalyzed sp2-sp coupling reaction between aryl or alkenyl halides or triflates and terminal alkynes, with or without the presence of copper(I) cocatalyst, was reviewed. The mechanism of Sonogashira reaction was discussed, and the catalysts and reaction conditions were explored as well. In addn., the application of Sonogashira cross-coupling reaction was also introduced.
- 75Breugst, M.; Reissig, H.-U. The Huisgen Reaction: Milestones of the 1,3-Dipolar Cycloaddition. Angew. Chem., Int. Ed. 2020, 59, 12293– 12307, DOI: 10.1002/anie.202003115Google Scholar75The Huisgen Reaction: Milestones of the 1,3-Dipolar CycloadditionBreugst, Martin; Reissig, Hans-UlrichAngewandte Chemie, International Edition (2020), 59 (30), 12293-12307CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)A review. The concept of 1,3-dipolar cycloaddns. was presented by Rolf Huisgen 60 years ago. Previously unknown reactive intermediates, for example azomethine ylides, were introduced to org. chem. and the (3+2) cycloaddns. of 1,3-dipoles to multiple-bond systems (Huisgen reaction) developed into one of the most versatile synthetic methods in heterocyclic chem. In this Review, the authors present the history of this research area, highlight important older reports, and describe the evolution and further development of the concept. The most important mechanistic and synthetic results are discussed. Quantum-mech. calcns. support the concerted mechanism always favored by R. Huisgen; however, in extreme cases intermediates may be involved. The impact of 1,3-dipolar cycloaddns. on the click chem. concept of K. B. Sharpless will also be discussed.
- 76Molga, K.; Szymkuć, S.; Gołębiowska, P.; Popik, O.; Dittwald, P.; Moskal, M.; Roszak, R.; Mlynarski, J.; Grzybowski, B. A. A Computer Algorithm to Discover Iterative Sequences of Organic Reactions. Nat. Synth. 2022, 1, 49– 58, DOI: 10.1038/s44160-021-00010-3Google ScholarThere is no corresponding record for this reference.
- 77Eppel, S.; Xu, H.; Bismuth, M.; Aspuru-Guzik, A. Computer Vision for Recognition of Materials and Vessels in Chemistry Lab Settings and the Vector-LabPics Data Set. ACS Cent. Sci. 2020, 6, 1743– 1752, DOI: 10.1021/acscentsci.0c00460Google Scholar77Computer Vision for Recognition of Materials and Vessels in Chemistry Lab Settings and the Vector-LabPics Data SetEppel, Sagi; Xu, Haoping; Bismuth, Mor; Aspuru-Guzik, AlanACS Central Science (2020), 6 (10), 1743-1752CODEN: ACSCII; ISSN:2374-7951. (American Chemical Society)This work presents a machine learning approach for the computer vision-based recognition of materials inside vessels in the chem. lab and other settings. In addn., we release a data set assocd. with the training of the model for further model development. The task to learn is finding the region, boundaries, and category for each material phase and vessel in an image. Handling materials inside mostly transparent containers is the main activity performed by human and robotic chemists in the lab. Visual recognition of vessels and their contents is essential for performing this task. Modern machine-vision methods learn recognition tasks by using data sets contg. a large no. of annotated images. This work presents the Vector-LabPics data set, which consists of 2187 images of materials within mostly transparent vessels in a chem. lab and other general settings. The images are annotated for both the vessels and the individual material phases inside them, and each instance is assigned one or more classes (liq., solid, foam, suspension, powder, ...). The fill level, labels, corks, and parts of the vessel are also annotated. Several convolutional nets for semantic and instance segmentation were trained on this data set. The trained neural networks achieved good accuracy in detecting and segmenting vessels and material phases, and in classifying liqs. and solids, but relatively low accuracy in segmenting multiphase systems such as phase-sepg. liqs. A computer vision system for recognition materials and vessels in the chem. lab. The system is based on the new LabPics image data set and convolutional neural nets for image segmentation.
- 78Steiner, S.; Wolf, J.; Glatzel, S.; Andreou, A.; Granda, J. M.; Keenan, G.; Hinkley, T.; Aragon-Camarasa, G.; Kitson, P. J.; Angelone, D.; Cronin, L. Organic Synthesis in a Modular Robotic System Driven by a Chemical Programming Language. Science 2019, 363, 1, DOI: 10.1126/science.aav2211Google ScholarThere is no corresponding record for this reference.
- 79Mehr, S. H. M.; Craven, M.; Leonov, A. I.; Keenan, G.; Cronin, L. A Universal System for Digitization and Automatic Execution of the Chemical Synthesis Literature. Science 2020, 370, 101– 108, DOI: 10.1126/science.abc2986Google Scholar79A universal system for digitization and automatic execution of the chemical synthesis literatureMehr, S. Hessam M.; Craven, Matthew; Leonov, Artem I.; Keenan, Graham; Cronin, LeroyScience (Washington, DC, United States) (2020), 370 (6512), 101-108CODEN: SCIEAS; ISSN:1095-9203. (American Association for the Advancement of Science)Robotic systems for chem. synthesis are growing in popularity but can be difficult to run and maintain because of the lack of a std. operating system or capacity for direct access to the literature through natural language processing. Here we show an extendable chem. execution architecture that can be populated by automatically reading the literature, leading to a universal autonomous workflow. The robotic synthesis code can be cor. in natural language without any programming knowledge and, because of the std., is hardware independent. This chem. code can then be combined with a graph describing the hardware modules and compiled into platform-specific, low-level robotic instructions for execution. We showcase automated syntheses of 12 compds. from the literature, including the analgesic lidocaine, the Dess-Martin periodinane oxidn. reagent, and the fluorinating agent AlkylFluor.
- 80Acceleration Consortium; https://acceleration.utoronto.ca (accessed 2021-06-10).Google ScholarThere is no corresponding record for this reference.
Cited By
Smart citations by scite.ai include citation statements extracted from the full text of the citing article. The number of the statements may be higher than the number of citations provided by ACS Publications if one paper cites another multiple times or lower if scite has not yet processed some of the citing articles.
This article is cited by 134 publications.
- Haisen Zhou, Guangqi Wu, Zuo Zhang, Zhen Zhu, Tanfeng Zhao, Qiankang Zhou, Haolin Chen, Zhenzhong Zhao, Yuanhao Dai, Xiaodong Jing, Wei Ji, Xuehai Yan, Lijiang Yang, Yiqin Gao, Huan Wang, Hua Lu. Data-Driven Design of Random Heteropolypeptides as Synthetic Polyclonal Antibodies. Journal of the American Chemical Society 2025, 147
(24)
, 21077-21088. https://doi.org/10.1021/jacs.5c06240
- Rebekah Duke, Chih-Hsuan Yang, Baskar Ganapathysubramanian, Chad Risko. Evaluating Molecular Similarity Measures: Do Similarity Measures Reflect Electronic Structure Properties?. Journal of Chemical Information and Modeling 2025, 65
(9)
, 4311-4319. https://doi.org/10.1021/acs.jcim.5c00175
- Justin P. Edaugal, Difan Zhang, Dupeng Liu, Vassiliki-Alexandra Glezakou, Ning Sun. Solvent Screening for Separation Processes Using Machine Learning and High-Throughput Technologies. Chem & Bio Engineering 2025, 2
(4)
, 210-228. https://doi.org/10.1021/cbe.4c00170
- Paulius Baronas, Justas Lekavičius, Maciej Majdecki, Jacob Lynge Elholm, Karolis Kazlauskas, Przemysław Gaweł, Kasper Moth-Poulsen. Automated Research Platform for Development of Triplet–Triplet Annihilation Photon Upconversion Systems. ACS Central Science 2025, 11
(3)
, 413-421. https://doi.org/10.1021/acscentsci.4c02059
- Yiming Zhao, Yongjia Zhao, Jian Wang, Zhuo Wang. Artificial Intelligence Meets Laboratory Automation in Discovery and Synthesis of Metal–Organic Frameworks: A Review. Industrial & Engineering Chemistry Research 2025, 64
(9)
, 4637-4668. https://doi.org/10.1021/acs.iecr.4c04636
- Elham Foadian, Sheryl Sanchez, Sergei V. Kalinin, Mahshid Ahmadi. From Sunlight to Solutions: Closing the Loop on Halide Perovskites. ACS Materials Au 2025, 5
(1)
, 11-23. https://doi.org/10.1021/acsmaterialsau.4c00096
- Junkil Park, Honghui Kim, Yeonghun Kang, Yunsung Lim, Jihan Kim. From Data to Discovery: Recent Trends of Machine Learning in Metal–Organic Frameworks. JACS Au 2024, 4
(10)
, 3727-3743. https://doi.org/10.1021/jacsau.4c00618
- Boris N. Slautin, Yongtao Liu, Hiroshi Funakubo, Rama K. Vasudevan, Maxim Ziatdinov, Sergei V. Kalinin. Bayesian Conavigation: Dynamic Designing of the Material Digital Twins via Active Learning. ACS Nano 2024, 18
(36)
, 24898-24908. https://doi.org/10.1021/acsnano.4c05368
- Gary Tom, Stefan P. Schmid, Sterling G. Baird, Yang Cao, Kourosh Darvish, Han Hao, Stanley Lo, Sergio Pablo-García, Ella M. Rajaonson, Marta Skreta, Naruki Yoshikawa, Samantha Corapi, Gun Deniz Akkoc, Felix Strieth-Kalthoff, Martin Seifrid, Alán Aspuru-Guzik. Self-Driving Laboratories for Chemistry and Materials Science. Chemical Reviews 2024, 124
(16)
, 9633-9732. https://doi.org/10.1021/acs.chemrev.4c00055
- Manuel Carrer, Henrique Musseli Cezar, Sigbjørn Løland Bore, Morten Ledum, Michele Cascella. Learning Force Field Parameters from Differentiable Particle-Field Molecular Dynamics. Journal of Chemical Information and Modeling 2024, 64
(14)
, 5510-5520. https://doi.org/10.1021/acs.jcim.4c00564
- Wenlong Yao, Xin-Shan Ye. Donor Preactivation-Based Glycan Assembly: from Manual to Automated Synthesis. Accounts of Chemical Research 2024, 57
(11)
, 1577-1594. https://doi.org/10.1021/acs.accounts.4c00072
- Aravind Senthil Vel, Kouakou E. Konan, Daniel Cortés-Borda, François-Xavier Felpin. Enhancing Optimization of Mixed Variables on a Robotic Flow Platform: Integrating Statistical Filtering with Nelder–Mead and Bayesian Methods. Organic Process Research & Development 2024, 28
(5)
, 1597-1606. https://doi.org/10.1021/acs.oprd.3c00238
- Difan Zhang, Brett Smith, Haiyi Wu, Manh-Thuong Nguyen, Roger Rousseau, Vassiliki-Alexandra Glezakou. Data Analytics for Catalysis Predictions: Are We Ready Yet?. ACS Catalysis 2024, 14
(10)
, 8073-8086. https://doi.org/10.1021/acscatal.3c05285
- Jiyun Zhang, Jens A. Hauch, Christoph J. Brabec. Toward Self-Driven Autonomous Material and Device Acceleration Platforms (AMADAP) for Emerging Photovoltaics Technologies. Accounts of Chemical Research 2024, 57
(9)
, 1434-1445. https://doi.org/10.1021/acs.accounts.4c00095
- Anish Rao, Marek Grzelczak. Revisiting El-Sayed Synthesis: Bayesian Optimization for Revealing New Insights during the Growth of Gold Nanorods. Chemistry of Materials 2024, 36
(5)
, 2577-2587. https://doi.org/10.1021/acs.chemmater.4c00271
- Alexey Kopyl, Yuki Yew, Jian Wern Ong, Thomas Hiscox, Calem Young, Murat Muradoglu, Tuck Wah Ng. Automated Liquid Handler from a 3D Printer. Journal of Chemical Education 2024, 101
(2)
, 640-646. https://doi.org/10.1021/acs.jchemed.3c00855
- Kenta Kawashima, Raúl A. Márquez, Lettie A. Smith, Rinish Reddy Vaidyula, Omar A. Carrasco-Jaim, Ziqing Wang, Yoon Jun Son, Chi L. Cao, C. Buddie Mullins. A Review of Transition Metal Boride, Carbide, Pnictide, and Chalcogenide Water Oxidation Electrocatalysts. Chemical Reviews 2023, 123
(23)
, 12795-13208. https://doi.org/10.1021/acs.chemrev.3c00005
- Laura Pascazio, Simon Rihm, Ali Naseri, Sebastian Mosbach, Jethro Akroyd, Markus Kraft. Chemical Species Ontology for Data Integration and Knowledge Discovery. Journal of Chemical Information and Modeling 2023, 63
(21)
, 6569-6586. https://doi.org/10.1021/acs.jcim.3c00820
- Joshua Schrier, Alexander J. Norquist, Tonio Buonassisi, Jakoah Brgoch. In Pursuit of the Exceptional: Research Directions for Machine Learning in Chemical and Materials Science. Journal of the American Chemical Society 2023, 145
(40)
, 21699-21716. https://doi.org/10.1021/jacs.3c04783
- Dylan M. Anstine, Olexandr Isayev. Generative Models as an Emerging Paradigm in the Chemical Sciences. Journal of the American Chemical Society 2023, 145
(16)
, 8736-8750. https://doi.org/10.1021/jacs.2c13467
- Aikaterini Vriza, Henry Chan, Jie Xu. Self-Driving Laboratory for Polymer Electronics. Chemistry of Materials 2023, 35
(8)
, 3046-3056. https://doi.org/10.1021/acs.chemmater.2c03593
- Baosen Zhang, Ariana Frkonja-Kuczin, Zhong-Hui Duan, Aliaksei Boika. Workshop on Computer Vision for Bioanalytical Chemists: Classification and Detection of Amoebae Using Optical Microscopy Image Analysis with Machine Learning. Journal of Chemical Education 2023, 100
(2)
, 539-545. https://doi.org/10.1021/acs.jchemed.2c00631
- Keiji Fushimi, Yusuke Nakai, Akiko Nishi, Ryo Suzuki, Masahiro Ikegami, Risa Nimura, Taichi Tomono, Ryota Hidese, Hisashi Yasueda, Yusuke Tagawa, Tomohisa Hasunuma. Development of the autonomous lab system to support biotechnology research. Scientific Reports 2025, 15
(1)
https://doi.org/10.1038/s41598-025-89069-y
- Richard B. Canty, Jeffrey A. Bennett, Keith A. Brown, Tonio Buonassisi, Sergei V. Kalinin, John R. Kitchin, Benji Maruyama, Robert G. Moore, Joshua Schrier, Martin Seifrid, Shijing Sun, Tejs Vegge, Milad Abolhasani. Science acceleration and accessibility with self-driving labs. Nature Communications 2025, 16
(1)
https://doi.org/10.1038/s41467-025-59231-1
- Yusei Ito, Yasuo Takeichi, Hideitsu Hino, Kanta Ono. Optimal spectroscopic measurement design: Bayesian framework for rational data acquisition. Machine Learning: Science and Technology 2025, 6
(2)
, 025037. https://doi.org/10.1088/2632-2153/add0f6
- Austin M. Mroz, Annabel R. Basford, Friedrich Hastedt, Isuru Shavindra Jayasekera, Irea Mosquera-Lois, Ruby Sedgwick, Pedro J. Ballester, Joshua D. Bocarsly, Ehecatl Antonio del Río Chanona, Matthew L. Evans, Jarvist M. Frost, Alex M. Ganose, Rebecca L. Greenaway, King Kuok (Mimi) Hii, Yingzhen Li, Ruth Misener, Aron Walsh, Dandan Zhang, Kim E. Jelfs. Cross-disciplinary perspectives on the potential for artificial intelligence across chemistry. Chemical Society Reviews 2025, 54
(11)
, 5433-5469. https://doi.org/10.1039/D5CS00146C
- Erik Hermann. Self-driving labs: The new frontier for GenAI-driven marketing research. Business Horizons 2025, 2 https://doi.org/10.1016/j.bushor.2025.06.001
- Michael A. Pence, Gavin Hazen, Joaquín Rodríguez-López. The emergence of automation in electrochemistry. Current Opinion in Electrochemistry 2025, 51 , 101679. https://doi.org/10.1016/j.coelec.2025.101679
- Jincheng Xu, Linjiang Chen, Jun Jiang. Data-intelligent-driven exploration of robotic chemist systems. SCIENTIA SINICA Chimica 2025, 55
(6)
, 1606-1622. https://doi.org/10.1360/SSC-2025-0093
- Leiyun Zhong, Yiming Xu, Xinghai Li, Peihao Cheng, Shengyang Tao. Production and evaluation of high-throughput reaction data from an automated chemical synthesis platform. Science China Chemistry 2025, 25 https://doi.org/10.1007/s11426-025-2676-x
- Muhammad Awais Maqbool, Shunya Okamoto, Takayuki Shibata, Tuhin Subhra Santra, Moeto Nagai. Self-regulating pen-needle-based micronozzle for printing array of nanoliter droplets under fluorinated liquid. Instrumentation Science & Technology 2025, 53
(3)
, 332-350. https://doi.org/10.1080/10739149.2024.2390907
- Lukas Nolte, Sven Tomforde. A Helping Hand: A Survey About AI-Driven Experimental Design for Accelerating Scientific Research. Applied Sciences 2025, 15
(9)
, 5208. https://doi.org/10.3390/app15095208
- Guo‐Jin Cao. Advancements in Machine Learning Predicting Activation and Gibbs Free Energies in Chemical Reactions. International Journal of Quantum Chemistry 2025, 125
(7)
https://doi.org/10.1002/qua.70036
- Dian-Zhao Lin, Kai-Jui Pan, Yuyin Li, Charles B. Musgrave III, Lingyu Zhang, Krish N. Jayarapu, Tianchen Li, Jasmine Vy Tran, William A. Goddard, Zhengtang Luo, Yayuan Liu. A high-throughput experimentation platform for data-driven discovery in electrochemistry. Science Advances 2025, 11
(14)
https://doi.org/10.1126/sciadv.adu4391
- Woojun Jung, Insung Hwang, Keuntae Cho. Research Trend Analysis in the Field of Self-Driving Labs Using Network Analysis and Topic Modeling. Systems 2025, 13
(4)
, 253. https://doi.org/10.3390/systems13040253
- Panna Zsoldos, Ádám Wolf, Károly Széll, Péter Galambos. Towards robotic Laboratory Automation Plug & Play: LAPP reference implementation with the TIAGo mobile manipulator. SLAS Technology 2025, 31 , 100251. https://doi.org/10.1016/j.slast.2025.100251
- . Autonomous Laboratory Empowered by
AI
and Robotics. 2025, 85-112. https://doi.org/10.1002/9783527848836.ch4
- Sergio Pablo-García, Ángel García, Gun Deniz Akkoc, Malcolm Sim, Yang Cao, Maxine Somers, Chance Hattrick, Naruki Yoshikawa, Dominik Dworschak, Han Hao, Alán Aspuru-Guzik. An affordable platform for automated synthesis and electrochemical characterization. Device 2025, 3
(2)
, 100567. https://doi.org/10.1016/j.device.2024.100567
- Christopher Price, Ichiro Takeuchi, James Rondinelli, Wei Chen, Christopher Hinkle. AI-Accelerated Electronic Materials Discovery and Development. Computer 2025, 58
(2)
, 98-104. https://doi.org/10.1109/MC.2024.3515392
- Junru Ren, Nis Fisker-Bødker, Ronja Güldenring, Jin Hyun Chang, Tejs Vegge, Ole Ravn, Lazaros Nalpantidis. Empowering Lab Education: Integrating a Vision-Based Monitoring System with Small-Scale Self-Driving Experiment Platforms. 2025, 1155-1162. https://doi.org/10.1109/SII59315.2025.10870899
- Austin H. Cheng, Cher Tian Ser, Marta Skreta, Andrés Guzmán-Cordero, Luca Thiede, Andreas Burger, Abdulrahman Aldossary, Shi Xuan Leong, Sergio Pablo-García, Felix Strieth-Kalthoff, Alán Aspuru-Guzik. Spiers Memorial Lecture: How to do impactful research in artificial intelligence for chemistry and materials science. Faraday Discussions 2025, 256 , 10-60. https://doi.org/10.1039/D4FD00153B
- Yu-Zhe Shi, Qiao Xu, Fanxu Meng, Lecheng Ruan, Qining Wang. Abstract Hardware Grounding Towards the Automated Design of Automation Systems. 2025, 117-132. https://doi.org/10.1007/978-981-96-0780-8_9
- Zhen Cao, Laura Falivene, Albert Poater, Bholanath Maity, Ziyung Zhang, Gentoku Takasao, Sadeed Bin Sayed, Andrea Petta, Giovanni Talarico, Romina Oliva, Luigi Cavallo. COBRA web application to benchmark linear regression models for catalyst optimization with few-entry datasets. Cell Reports Physical Science 2025, 6
(1)
, 102348. https://doi.org/10.1016/j.xcrp.2024.102348
- Pablo Quijano Velasco, Kedar Hippalgaonkar, Balamurugan Ramalingam. Emerging trends in the optimization of organic synthesis through high-throughput tools and machine learning. Beilstein Journal of Organic Chemistry 2025, 21 , 10-38. https://doi.org/10.3762/bjoc.21.3
- Sayan Doloi, Maloy Das, Yujia Li, Zen Han Cho, Xingchi Xiao, John V. Hanna, Matthew Osvaldo, Leonard Ng Wei Tat. Democratizing self-driving labs: advances in low-cost 3D printing for laboratory automation. Digital Discovery 2025, 3 https://doi.org/10.1039/D4DD00411F
- Juliana Cárdenas, Benjamin Katryniok, Marcia Araque-Marin, Wei-Hsin Hsu, Peter H. Seeberger, Jose Danglad-Flores, Alvaro Orjuela. Continuous epoxidation of used cooking oils using an automated slug-flow millireactor. Chemical Engineering Journal 2025, 506 , 159907. https://doi.org/10.1016/j.cej.2025.159907
- Jinpeng Li, Chuxuan Ding, Daobin Liu, Linjiang Chen, Jun Jiang. Autonomous laboratories in China: an embodied intelligence-driven platform to accelerate chemical discovery. Digital Discovery 2025, 27 https://doi.org/10.1039/D5DD00072F
- Ryosuke Sasaki, Mikito Fujinami, Hiromi Nakai. Application of object detection and action recognition toward automated recognition of chemical experiments. Digital Discovery 2024, 3
(12)
, 2458-2464. https://doi.org/10.1039/D4DD00015C
- Valentine P. Ananikov. Top 20 influential AI-based technologies in chemistry. Artificial Intelligence Chemistry 2024, 2
(2)
, 100075. https://doi.org/10.1016/j.aichem.2024.100075
- Jonathan Aubuchon Ouimet, Faraj Al-Badani, Xinhong Liu, Laurianne Lair, Zachary W. Muetzel, Alexander W. Dowling, William A. Phillip. Automated membrane characterization: In-situ monitoring of the permeate and retentate solutions using a 3D printed permeate probe device. Journal of Membrane Science Letters 2024, 4
(2)
, 100087. https://doi.org/10.1016/j.memlet.2024.100087
- Hongyuan Sheng, Jingwen Sun, Oliver Rodríguez, Benjamin B. Hoar, Weitong Zhang, Danlei Xiang, Tianhua Tang, Avijit Hazra, Daniel S. Min, Abigail G. Doyle, Matthew S. Sigman, Cyrille Costentin, Quanquan Gu, Joaquín Rodríguez-López, Chong Liu. Autonomous closed-loop mechanistic investigation of molecular electrochemistry via automation. Nature Communications 2024, 15
(1)
https://doi.org/10.1038/s41467-024-47210-x
- Hyuk Jun Yoo, Kwan-Young Lee, Donghun Kim, Sang Soo Han. OCTOPUS: operation control system for task optimization and job parallelization via a user-optimal scheduler. Nature Communications 2024, 15
(1)
https://doi.org/10.1038/s41467-024-54067-7
- Christopher Karpovich, Elton Pan, Elsa A. Olivetti. Deep reinforcement learning for inverse inorganic materials design. npj Computational Materials 2024, 10
(1)
https://doi.org/10.1038/s41524-024-01474-5
- Renan Gonçalves Leonel da Silva. The advancement of artificial intelligence in biomedical research and health innovation: challenges and opportunities in emerging economies. Globalization and Health 2024, 20
(1)
https://doi.org/10.1186/s12992-024-01049-5
- Tianwei Dai, Sriram Vijayakrishnan, Filip T. Szczypiński, Jean-François Ayme, Ehsan Simaei, Thomas Fellowes, Rob Clowes, Lyubomir Kotopanov, Caitlin E. Shields, Zhengxue Zhou, John W. Ward, Andrew I. Cooper. Autonomous mobile robots for exploratory synthetic chemistry. Nature 2024, 635
(8040)
, 890-897. https://doi.org/10.1038/s41586-024-08173-7
- Haochen Pan, Ryan Chard, Sicheng Zhou, Alok Kamatar, Rafael Vescovi, Valérie Hayot-Sasson, André Bauer, Maxime Gonthier, Kyle Chard, Ian Foster. Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific Computing. 2024, 496-507. https://doi.org/10.1109/SCW63240.2024.00071
- Alexandre A. Schoepfer, Jan Weinreich, Ruben Laplaza, Jerome Waser, Clemence Corminboeuf. Cost-informed Bayesian reaction optimization. Digital Discovery 2024, 3
(11)
, 2289-2297. https://doi.org/10.1039/D4DD00225C
- Nicholas Dallaire, Nicholas T. Boileau, Ian Myers, Samantha Brixi, May Ourabi, Ewenike Raluchukwu, Rosemary Cranston, Halynne R. Lamontagne, Benjamin King, Bahar Ronnasi, Owen A. Melville, Joseph G. Manion, Benoît H. Lessard. High Throughput Characterization of Organic Thin Film Transistors. Advanced Materials 2024, 36
(44)
https://doi.org/10.1002/adma.202406105
- Aniket S. Mule, Kevin Tran, Ashton M. Aleman, Yamile E. Cornejo‐Carrillo, Gaurav A. Kamat, Michaela Burke Stevens, Thomas F. Jaramillo. Advancing Insights into Electrochemical Pre‐Treatments of Supported Nanoparticle Electrocatalysts by Combining a Design of Experiments Strategy with In Situ Characterization. Advanced Energy Materials 2024, 14
(41)
https://doi.org/10.1002/aenm.202401939
- Ryosuke Sasaki, Mikito Fujinami, Hiromi Nakai. Application and integration of computer vision technologies for automated recognition and recording of chemical experiments. Bulletin of the Chemical Society of Japan 2024, 97
(11)
https://doi.org/10.1093/bulcsj/uoae110
- Jan Obořil, Christian P. Haas, Maximilian Lübbesmeyer, Rachel Nicholls, Thorsten Gressling, Klavs F. Jensen, Giulio Volpin, Julius Hillenbrand. Automated processing of chromatograms: a comprehensive python package with a GUI for intelligent peak identification and deconvolution in chemical reaction analysis. Digital Discovery 2024, 3
(10)
, 2041-2051. https://doi.org/10.1039/D4DD00214H
- Derek van Tilborg, Francesca Grisoni. Traversing chemical space with active deep learning for low-data drug discovery. Nature Computational Science 2024, 4
(10)
, 786-796. https://doi.org/10.1038/s43588-024-00697-2
- Nour El Sabbagh, Margherita Bazzoni, Yuliia Horbenko, Aurélie Bernard, Daniel Cortés-Borda, Patrick Giraudeau, François-Xavier Felpin, Jean-Nicolas Dumez. Autonomous reaction self-optimization using in-line high-field NMR spectroscopy. Reaction Chemistry & Engineering 2024, 9
(10)
, 2599-2609. https://doi.org/10.1039/D4RE00270A
- Anees Al-Najjar, Nageswara S. V. Rao, Craig A. Bridges, Sheng Dai, Alex Walters. Autonomous Electrochemistry Platform with Real-Time Normality Testing of Voltammetry Measurements Using ML. 2024, 1-10. https://doi.org/10.1109/e-Science62913.2024.10678672
- Yangguan Chen, Longhan Zhang, Zhehong Ai, Yifan Long, Ji Qi, Pengxiao Bao, Jing Jiang. Robot-assisted optimized array design for accurate multi-component gas quantification. Chemical Engineering Journal 2024, 496 , 154225. https://doi.org/10.1016/j.cej.2024.154225
- Adam J. Gormley. Machine learning in drug delivery. Journal of Controlled Release 2024, 373 , 23-30. https://doi.org/10.1016/j.jconrel.2024.06.045
- Malcolm Sim, Mohammad Ghazi Vakili, Felix Strieth-Kalthoff, Han Hao, Riley J. Hickman, Santiago Miret, Sergio Pablo-García, Alán Aspuru-Guzik. ChemOS 2.0: An orchestration architecture for chemical self-driving laboratories. Matter 2024, 7
(9)
, 2959-2977. https://doi.org/10.1016/j.matt.2024.04.022
- Tomomi Sato, Koji Masuda, Chikako Sano, Keiji Matsumoto, Hidetoshi Numata, Seiji Munetoh, Toshihiro Kasama, Ryo Miyake. Democratizing Microreactor Technology for Accelerated Discoveries in Chemistry and Materials Research. Micromachines 2024, 15
(9)
, 1064. https://doi.org/10.3390/mi15091064
- Gabriella Pizzuto, Hetong Wang, Hatem Fakhruldeen, Bei Peng, Kevin S. Luck, Andrew I. Cooper. Accelerating Laboratory Automation Through Robot Skill Learning For Sample Scraping*. 2024, 2103-2110. https://doi.org/10.1109/CASE59546.2024.10711291
- Boris N. Slautin, Utkarsh Pratiush, Ilia N. Ivanov, Yongtao Liu, Rohit Pant, Xiaohang Zhang, Ichiro Takeuchi, Maxim A. Ziatdinov, Sergei V. Kalinin. Co-orchestration of multiple instruments to uncover structure–property relationships in combinatorial libraries. Digital Discovery 2024, 3
(8)
, 1602-1611. https://doi.org/10.1039/D4DD00109E
- Xu‐Dan Yang, Bo Gong, Wei Chen, Jie‐Jie Chen, Chen Qian, Rui Lu, Yuan Min, Ting Jiang, Liang Li, Han‐Qing Yu. In Situ Quantitative Monitoring of Adsorption from Aqueous Phase by UV–vis Spectroscopy: Implication for Understanding of Heterogeneous Processes. Advanced Science 2024, 11
(32)
https://doi.org/10.1002/advs.202402732
- Jiyun Zhang, Jianchang Wu, Anastasia Barabash, Tian Du, Shudi Qiu, Vincent M. Le Corre, Yicheng Zhao, Kaicheng Zhang, Frederik Schmitt, Zijian Peng, Jingjing Tian, Chaohui Li, Chao Liu, Thomas Heumueller, Larry Lüer, Jens A. Hauch, Christoph J. Brabec. Precise control of process parameters for >23% efficiency perovskite solar cells in ambient air using an automated device acceleration platform. Energy & Environmental Science 2024, 17
(15)
, 5490-5499. https://doi.org/10.1039/D4EE01432D
- Maëva Vallet, Daniel Cortés-Borda, François-Xavier Felpin. Expanding analytical horizons: 3D HPLC calibration surfaces for micromole scale self-optimizing flow reactors. Reaction Chemistry & Engineering 2024, 9
(8)
, 2262-2269. https://doi.org/10.1039/D3RE00559C
- Shuanglei Gong. Transition from machine intelligence to knowledge intelligence: A multi-agent simulation approach to technology transfer. Journal of Intelligent Systems 2024, 33
(1)
https://doi.org/10.1515/jisys-2023-0320
- Alexander W. Dowling. Artificial Intelligence and Machine Learning for Sustainable Molecular-to-Systems Engineering. 2024, 22-31. https://doi.org/10.69997/sct.114705
- Elijah N. Kissman, Max B. Sosa, Douglas C. Millar, Edward J. Koleski, Kershanthen Thevasundaram, Michelle C. Y. Chang. Expanding chemistry through in vitro and in vivo biocatalysis. Nature 2024, 631
(8019)
, 37-48. https://doi.org/10.1038/s41586-024-07506-w
- Oliver Bayley, Elia Savino, Aidan Slattery, Timothy Noël. Autonomous chemistry: Navigating self-driving labs in chemical and material sciences. Matter 2024, 7
(7)
, 2382-2398. https://doi.org/10.1016/j.matt.2024.06.003
- Zainab Saeed Wattoo, Petal Vitis, Ruizhe Zhu, Noah Depner, Ivory Zhang, Jason Hein, Arpan Gujarati, Margo Seltzer. RABIT, a Robot Arm Bug Intervention Tool for Self-Driving Labs. 2024, 353-361. https://doi.org/10.1109/DSN58291.2024.00043
- Felix Strieth-Kalthoff, Han Hao, Vandana Rathore, Joshua Derasp, Théophile Gaudin, Nicholas H. Angello, Martin Seifrid, Ekaterina Trushina, Mason Guy, Junliang Liu, Xun Tang, Masashi Mamada, Wesley Wang, Tuul Tsagaantsooj, Cyrille Lavigne, Robert Pollice, Tony C. Wu, Kazuhiro Hotta, Leticia Bodo, Shangyu Li, Mohammad Haddadnia, Agnieszka Wołos, Rafał Roszak, Cher Tian Ser, Carlota Bozal-Ginesta, Riley J. Hickman, Jenya Vestfrid, Andrés Aguilar-Granda, Elena L. Klimareva, Ralph C. Sigerson, Wenduan Hou, Daniel Gahler, Slawomir Lach, Adrian Warzybok, Oleg Borodin, Simon Rohrbach, Benjamin Sanchez-Lengeling, Chihaya Adachi, Bartosz A. Grzybowski, Leroy Cronin, Jason E. Hein, Martin D. Burke, Alán Aspuru-Guzik. Delocalized, asynchronous, closed-loop discovery of organic laser emitters. Science 2024, 384
(6697)
https://doi.org/10.1126/science.adk9227
- Stanley Lo, Sterling G. Baird, Joshua Schrier, Ben Blaiszik, Nessa Carson, Ian Foster, Andrés Aguilar-Granda, Sergei V. Kalinin, Benji Maruyama, Maria Politi, Helen Tran, Taylor D. Sparks, Alán Aspuru-Guzik. Review of low-cost self-driving laboratories in chemistry and materials science: the “frugal twin” concept. Digital Discovery 2024, 3
(5)
, 842-868. https://doi.org/10.1039/D3DD00223C
- Fuzhan Rahmanian, Stefan Fuchs, Bojing Zhang, Maximilian Fichtner, Helge Sören Stein. Autonomous millimeter scale high throughput battery research system. Digital Discovery 2024, 3
(5)
, 883-895. https://doi.org/10.1039/D3DD00257H
- Xiang Huang, Shenghong Ju. Tutorial: AI-assisted exploration and active design of polymers with high intrinsic thermal conductivity. Journal of Applied Physics 2024, 135
(17)
https://doi.org/10.1063/5.0201522
- Frank Tristram, Nicole Jung, Patrick Hodapp, Rasmus R. Schröder, Christof Wöll, Stefan Bräse. The Impact of Digitalized Data Management on Materials Systems Workflows. Advanced Functional Materials 2024, 34
(20)
https://doi.org/10.1002/adfm.202303615
- Boris N. Slautin, Yongtao Liu, Hiroshi Funakubo, Sergei V. Kalinin. Unraveling the impact of initial choices and in-loop interventions on learning dynamics in autonomous scanning probe microscopy. Journal of Applied Physics 2024, 135
(15)
https://doi.org/10.1063/5.0198316
- Yongtao Liu, Kevin Roccapriore, Marti Checa, Sai Mani Valleti, Jan‐Chi Yang, Stephen Jesse, Rama K. Vasudevan. AEcroscopy: A Software–Hardware Framework Empowering Microscopy Toward Automated and Autonomous Experimentation. Small Methods 2024, 2 https://doi.org/10.1002/smtd.202301740
- Chris Beeler, Sriram Ganapathi Subramanian, Kyle Sprague, Mark Baula, Nouha Chatti, Amanuel Dawit, Xinkai Li, Nicholas Paquin, Mitchell Shahen, Zihan Yang, Colin Bellinger, Mark Crowley, Isaac Tamblyn. ChemGymRL: A customizable interactive framework for reinforcement learning for digital chemistry. Digital Discovery 2024, 3
(4)
, 742-758. https://doi.org/10.1039/D3DD00183K
- Holland Hysmith, Elham Foadian, Shakti P. Padhy, Sergei V. Kalinin, Rob G. Moore, Olga S. Ovchinnikova, Mahshid Ahmadi. The future of self-driving laboratories: from human in the loop interactive AI to gamification. Digital Discovery 2024, 3
(4)
, 621-636. https://doi.org/10.1039/D4DD00040D
- Daniele Padula. A Computational Perspective on the Reactivity of
π
‐spacers in Self‐Immolative Elimination Reactions. Chemistry – An Asian Journal 2024, 19
(7)
https://doi.org/10.1002/asia.202400010
- J. Elliott Fowler, Matthew A. Kottwitz, Nat Trask, Rémi Dingreville. Beyond Combinatorial Materials Science: The 100 Prisoners Problem. Integrating Materials and Manufacturing Innovation 2024, 13
(1)
, 83-91. https://doi.org/10.1007/s40192-023-00330-6
- Jae Bem You. Surface nanodroplets as platforms for small scale chemical engineering. Chemical Engineering Journal 2024, 483 , 149252. https://doi.org/10.1016/j.cej.2024.149252
- Simon D. Rihm, Jiaru Bai, Aleksandar Kondinski, Sebastian Mosbach, Jethro Akroyd, Markus Kraft. Transforming research laboratories with connected digital twins. Nexus 2024, 1
(1)
, 100004. https://doi.org/10.1016/j.ynexs.2024.100004
- Rebekah Duke, Siamak Mahmoudi, Aman Preet Kaur, Vinayak Bhat, Ian C. Dingle, Nathan C. Stumme, Scott K. Shaw, David Eaton, Asmund Vego, Chad Risko. ExpFlow: a graphical user interface for automated reproducible electrochemistry. Digital Discovery 2024, 3
(1)
, 163-172. https://doi.org/10.1039/D3DD00156C
- Seoin Back, Alán Aspuru-Guzik, Michele Ceriotti, Ganna Gryn'ova, Bartosz Grzybowski, Geun Ho Gu, Jason Hein, Kedar Hippalgaonkar, Rodrigo Hormázabal, Yousung Jung, Seonah Kim, Woo Youn Kim, Seyed Mohamad Moosavi, Juhwan Noh, Changyoung Park, Joshua Schrier, Philippe Schwaller, Koji Tsuda, Tejs Vegge, O. Anatole von Lilienfeld, Aron Walsh. Accelerated chemical science with AI. Digital Discovery 2024, 3
(1)
, 23-33. https://doi.org/10.1039/D3DD00213F
- Haoke Qiu, Lunyang Liu, Xuepeng Qiu, Xuemin Dai, Xiangling Ji, Zhao-Yan Sun. PolyNC: a natural and chemical language model for the prediction of unified polymer properties. Chemical Science 2024, 15
(2)
, 534-544. https://doi.org/10.1039/D3SC05079C
- Andrew Wang, Alán Aspuru-Guzik, Geoffrey Ozin. Challenges and opportunities for AI in synthetic solid-state inorganic chemistry. Matter 2024, 7
(1)
, 5-8. https://doi.org/10.1016/j.matt.2023.11.016
- Rama El-khawaldeh, Jason E. Hein. Balancing act: when to flex and when to stay fixed. Trends in Chemistry 2024, 6
(1)
, 1-4. https://doi.org/10.1016/j.trechm.2023.10.008
- Arto Klami, Theo Damoulas, Ola Engkvist, Patrick Rinke, Samuel Kaski. Virtual laboratories: transforming research with AI. Data-Centric Engineering 2024, 5 https://doi.org/10.1017/dce.2024.15
- Renan Gonçalves Leonel da Silva. The role of autonomous experimentation in biomedical sciences and health innovation: Challenges and opportunities in emerging economies. SSRN Electronic Journal 2024, 14 https://doi.org/10.2139/ssrn.4672439
- Lung-Yi Chen, Yi-Pei Li. Machine learning-guided strategies for reaction conditions design and optimization. Beilstein Journal of Organic Chemistry 2024, 20 , 2476-2492. https://doi.org/10.3762/bjoc.20.212
- Juan Liu, Rihana Burciaga, Selene Tang, Shichao Ding, Hongshun Ran, Wen Zhao, Guangtao Wang, Zhengkai Zhuang, Liangyiqun Xie, Zhaoyuan Lyu, Yuehe Lin, Annie Du, Aihua Yuan, Jiaju Fu, Bing Song, Jiawei Zhu, Zhenyu Sun, Xin Jin, Zheng-Yang Huo, Bo Shen, Meikun Shen, Yue Cao, Yang Zhou, Yujing Jiang, Dongdong Zhu, Meng Sun, Xuanhao Wu, Chu Qin, Zhifeng Jiang, Onder Metin, Charitha J. Thambiliyagodage, Jing-Jing Lv, Qing Li, Hao Wu, Zhiliang Wu, Jason Chun-Ho Lam, Guandao Gao, Chao Li, Mingchuan Luo, Yi Jiang, Xue Wang, Junrui Li, Min Liu, Richen Lin, Hongqiang Ren, Buxing Han, Yaxuan Jing, Wenlei Zhu. Heterogeneous catalysis for the environment. The Innovation Materials 2024, 2
(<![CDATA[3]]>)
, 100090. https://doi.org/10.59717/j.xinn-mater.2024.100090
Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.
Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.
The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.
Recommended Articles
Abstract
Figure 1
Figure 1. Diagram of the design–make–test–analyze cycle in our self-driving laboratory, showing the process for the development of new organic semiconductor laser materials.
Figure 2
Figure 2. Integration of ChemOS and its most important algorithms into the process or material optimization workflow.
Figure 3
Figure 3. Top: Photo of the Chemspeed deck. The inset shows the top of the ISYNTH with one of the drawers (vertical row of wells) highlighted. Bottom: (Right) Icons for liquid dispensing, solid dispensing, and solid-phase extraction actions. (Left) Diagram of the iSMcc process along with icons indicating where different capabilities are used. Cross-coupling (C): X-Ar-BMIDA (1 equiv), Ar–B(OH)2 (3 equiv), Pd-XPhos G2 (5 mol %), K3PO4 (2 equiv), THF, 16 h, 65 °C. Purification (P): precipitation from hexanes/THF 3:1. Deprotection (D): aqueous NaOH (1 M), 20 min, room temperature.
Figure 4
Figure 4. Schematic diagram of our analysis, purification, and optical characterization setup. The gray box is a schematic diagram of how a specific HPLC fraction is selected for further evaluation and how its properties are measured. Absorption measurements are carried out in the “absorption” flow cell. Photoluminescence (PL), PL quantum yield (PLQY), and photodegradation rate measurements are carried out in the “emission” flow cell. PL lifetime is measured in the “PL lifetime” flow cell. Gray polygons represent valves with the number of ports corresponding to the number of sides, and arrows represent the directions of sample transport.
Figure 5
Figure 5. Autonomous robotic workflows can accelerate the discovery of solid-state inorganic materials using proxy experiments. These can then be used in conjunction with more accurate full experiments to perform multifidelity optimization of the inorganic materials.
References
This article references 80 other publications.
- 1Roch, L. M.; Häse, F.; Kreisbeck, C.; Tamayo-Mendoza, T.; Yunker, L. P. E.; Hein, J. E.; Aspuru-Guzik, A. ChemOS: An Orchestration Software to Democratize Autonomous Discovery. PLoS One 2020, 15, e0229862, DOI: 10.1371/journal.pone.02298621ChemOS: An orchestration software to democratize autonomous discoveryRoch, Loic M.; Hase, Florian; Kreisbeck, Christoph; Tamayo-Mendoza, Teresa; Yunker, Lars P. E.; Hein, Jason E.; Aspuru-Guzik, AlanPLoS One (2020), 15 (4), e0229862CODEN: POLNCL; ISSN:1932-6203. (Public Library of Science)The current Edisonian approach to discovery requires up to two decades of fundamental and applied research for materials technologies to reach the market. Such a slow and capital-intensive turnaround calls for disruptive strategies to expedite innovation. Self-driving labs. have the potential to provide the means to revolutionize experimentation by empowering automation with artificial intelligence to enable autonomous discovery. However, the lack of adequate software solns. significantly impedes the development of self-driving labs. In this paper, we make progress towards addressing this challenge, and we propose and develop an implementation of ChemOS; a portable, modular and versatile software package which supplies the structured layers necessary for the deployment and operation of self-driving labs. ChemOS facilitates the integration of automated equipment, and it enables remote control of automated labs. ChemOS can operate at various degrees of autonomy; from fully unsupervised experimentation to actively including inputs and feedbacks from researchers into the experimentation loop. The flexibility of ChemOS provides a broad range of functionality as demonstrated on five applications, which were executed on different automated equipment, highlighting various aspects of the software package.
- 2Langner, S.; Häse, F.; Perea, J. D.; Stubhan, T.; Hauch, J.; Roch, L. M.; Heumueller, T.; Aspuru-Guzik, A.; Brabec, C. J. Beyond Ternary OPV: High-Throughput Experimentation and Self-Driving Laboratories Optimize Multicomponent Systems. Adv. Mater. 2020, 32, 1907801, DOI: 10.1002/adma.2019078012Beyond Ternary OPV: High-Throughput Experimentation and Self-Driving Laboratories Optimize Multicomponent SystemsLangner, Stefan; Haese, Florian; Perea, Jose Dario; Stubhan, Tobias; Hauch, Jens; Roch, Loic M.; Heumueller, Thomas; Aspuru-Guzik, Alan; Brabec, Christoph J.Advanced Materials (Weinheim, Germany) (2020), 32 (14), 1907801CODEN: ADVMEW; ISSN:0935-9648. (Wiley-VCH Verlag GmbH & Co. KGaA)Fundamental advances to increase the efficiency as well as stability of org. photovoltaics (OPVs) are achieved by designing ternary blends, which represents a clear trend toward multicomponent active layer blends. The development of high-throughput and autonomous experimentation methods is reported for the effective optimization of multicomponent polymer blends for OPVs. A method for automated film formation enabling the fabrication of ≤ 6048 films per day is introduced. Equipping this automated experimentation platform with a Bayesian optimization, a self-driving lab. is constructed that autonomously evaluates measurements to design and execute the next expts. To demonstrate the potential of these methods, a 4D parameter space of quaternary OPV blends is mapped and optimized for photostability. While with conventional approaches, roughly 100 mg of material would be necessary, the robot-based platform can screen 2000 combinations with < 10 mg, and machine-learning-enabled autonomous experimentation identifies stable compns. with < 1 mg.
- 3Christensen, M.; Yunker, L. P. E.; Adedeji, F.; Häse, F.; Roch, L. M.; Gensch, T.; dos Passos Gomes, G.; Zepel, T.; Sigman, M. S.; Aspuru-Guzik, A.; Hein, J. E. Data-Science Driven Autonomous Process Optimization. Commun. Chem. 2021, 4, 112, DOI: 10.1038/s42004-021-00550-x3Data-science driven autonomous process optimizationChristensen Melodie; Yunker Lars P E; Zepel Tara; Hein Jason E; Christensen Melodie; Adedeji Folarin; Hase Florian; Roch Loic M; Aspuru-Guzik Alan; Hase Florian; Roch Loic M; Dos Passos Gomes Gabriel; Aspuru-Guzik Alan; Hase Florian; Roch Loic M; Dos Passos Gomes Gabriel; Aspuru-Guzik Alan; Hase Florian; Dos Passos Gomes Gabriel; Aspuru-Guzik Alan; Hase Florian; Roch Loic M; Gensch Tobias; Sigman Matthew S; Aspuru-Guzik AlanCommunications chemistry (2021), 4 (1), 112 ISSN:.Autonomous process optimization involves the human intervention-free exploration of a range process parameters to improve responses such as product yield and selectivity. Utilizing off-the-shelf components, we develop a closed-loop system for carrying out parallel autonomous process optimization experiments in batch. Upon implementation of our system in the optimization of a stereoselective Suzuki-Miyaura coupling, we find that the definition of a set of meaningful, broad, and unbiased process parameters is the most critical aspect of successful optimization. Importantly, we discern that phosphine ligand, a categorical parameter, is vital to determination of the reaction outcome. To date, categorical parameter selection has relied on chemical intuition, potentially introducing bias into the experimental design. In seeking a systematic method for selecting a diverse set of phosphine ligands, we develop a strategy that leverages computed molecular feature clustering. The resulting optimization uncovers conditions to selectively access the desired product isomer in high yield.
- 4Seifrid, M.; Hickman, R. J.; Aguilar-Granda, A.; Lavigne, C.; Vestfrid, J.; Wu, T. C.; Gaudin, T.; Hopkins, E. J.; Aspuru-Guzik, A. Routescore: Punching the Ticket to More Efficient Materials Development. ACS Cent. Sci. 2022, 8, 122– 131, DOI: 10.1021/acscentsci.1c010024Routescore: Punching the Ticket to More Efficient Materials DevelopmentSeifrid, Martin; Hickman, Riley J.; Aguilar-Granda, Andres; Lavigne, Cyrille; Vestfrid, Jenya; Wu, Tony C.; Gaudin, Theophile; Hopkins, Emily J.; Aspuru-Guzik, AlanACS Central Science (2022), 8 (1), 122-131CODEN: ACSCII; ISSN:2374-7951. (American Chemical Society)Self-driving labs., in the form of automated experimentation platforms guided by machine learning algorithms have emerged as a potential soln. to the need for accelerated science. While new tools for automated anal. and characterization are being developed at a steady rate, automated synthesis remains the bottleneck in the chem. space accessible to self-driving labs. Combining automated and manual synthesis efforts immediately significantly expands the explorable chem. space. To effectively direct the different capabilities of automated (higher throughput and less labor) and manual synthesis (greater chem. versatility), we describe a protocol, the RouteScore, that quantifies the cost of combined synthetic routes. In this work, the RouteScore is used to det. the most efficient synthetic route to a well-known pharmaceutical (structure-oriented optimization), and to simulate a self-driving lab that finds the most easily synthesizable org. laser mol. with specific photophys. properties from a space of ∼3500 possible mols. (property-oriented optimization). These two examples demonstrate the power and flexibility of our approach in mixed synthetic planning and optimization, and esp. in down-selecting promising candidates from a large chem. space via a priori estn. of the synthetic costs.
- 5Häse, F.; Roch, L. M.; Aspuru-Guzik, A. Next-Generation Experimentation with Self-Driving Laboratories. Trends Chem. 2019, 1, 282– 291, DOI: 10.1016/j.trechm.2019.02.0075Next-Generation Experimentation with Self-Driving LaboratoriesHase, Florian; Roch, Loic M.; Aspuru-Guzik, AlanTrends in Chemistry (2019), 1 (3), 282-291CODEN: TCRHBQ; ISSN:2589-5974. (Cell Press)A review. The ever-growing demand for advanced functional materials requires disruption of conventional approaches to experimentation and acceleration of the discovery process. State-of-the-art approaches to scientific discovery are inherently slow, capital intensive, and have arguably reached a plateau. Significant advances are possible when rethinking and redesigning the traditional experimentation process. Self-driving labs. promise to substantially accelerate the discovery process by augmenting automated experimentation platforms with artificial intelligence (AI). AI methods actively search for promising exptl. procedures by hypothesizing about their outcomes based on previous expts. This feedback loop is crucial to reduce the no. of expts. needed for discovery. Supplying automated platforms with AI enables self-driving labs. to fully embrace the vision of autonomous experimentation.
- 6MacLeod, B. P.; Parlane, F. G. L.; Rupnow, C. C.; Dettelbach, K. E.; Elliott, M. S.; Morrissey, T. D.; Haley, T. H.; Proskurin, O.; Rooney, M. B.; Taherimakhsousi, N.; Dvorak, D. J.; Chiu, H. N.; Waizenegger, C. E. B.; Ocean, K.; Mokhtari, M.; Berlinguette, C. P. A Self-Driving Laboratory Advances the Pareto Front for Material Properties. Nat. Commun. 2022, 13, 995, DOI: 10.1038/s41467-022-28580-66A self-driving laboratory advances the Pareto front for material propertiesMacLeod, Benjamin P.; Parlane, Fraser G. L.; Rupnow, Connor C.; Dettelbach, Kevan E.; Elliott, Michael S.; Morrissey, Thomas D.; Haley, Ted H.; Proskurin, Oleksii; Rooney, Michael B.; Taherimakhsousi, Nina; Dvorak, David J.; Chiu, Hsi N.; Waizenegger, Christopher E. B.; Ocean, Karry; Mokhtari, Mehrdad; Berlinguette, Curtis P.Nature Communications (2022), 13 (1), 995CODEN: NCAOBW; ISSN:2041-1723. (Nature Portfolio)Useful materials must satisfy multiple objectives, where the optimization of one objective is often at the expense of another. The Pareto front reports the optimal trade-offs between these conflicting objectives. Here we use a self-driving lab., Ada, to define the Pareto front of conductivities and processing temps. for palladium films formed by combustion synthesis. Ada discovers new synthesis conditions that yield metallic films at lower processing temps. (below 200°C) relative to the prior art for this technique (250°C). This temp. difference makes possible the coating of different commodity plastic materials (e.g., Nafion, polyethersulfone). These combustion synthesis conditions enable us to to spray coat uniform palladium films with moderate cond. (1.1 x 105 S m-1) at 191°C. Spray coating at 226°C yields films with conductivities (2.0 x 106 S m-1) comparable to those of sputtered films (2.0 to 5.8 x 106 S m-1). This work shows how a self-driving laboratoy can discover materials that provide optimal trade-offs between conflicting objectives.
- 7Rooney, M. B.; MacLeod, B. P.; Oldford, R.; Thompson, Z. J.; White, K. L.; Tungjunyatham, J.; Stankiewicz, B. J.; Berlinguette, C. P. A Self-Driving Laboratory Designed to Accelerate the Discovery of Adhesive Materials. Digit. Discovery 2022, DOI: 10.1039/D2DD00029FThere is no corresponding record for this reference.
- 8Tao, H.; Wu, T.; Kheiri, S.; Aldeghi, M.; Aspuru-Guzik, A.; Kumacheva, E. Self-Driving Platform for Metal Nanoparticle Synthesis: Combining Microfluidics and Machine Learning. Adv. Funct. Mater. 2021, 31, 2106725, DOI: 10.1002/adfm.2021067258Self-Driving Platform for Metal Nanoparticle Synthesis: Combining Microfluidics and Machine LearningTao, Huachen; Wu, Tianyi; Kheiri, Sina; Aldeghi, Matteo; Aspuru-Guzik, Alan; Kumacheva, EugeniaAdvanced Functional Materials (2021), 31 (51), 2106725CODEN: AFMDC6; ISSN:1616-301X. (Wiley-VCH Verlag GmbH & Co. KGaA)Many applications of inorg. nanoparticles (NPs), including photocatalysis, photovoltaics, chem. and biochem. sensing, and theranostics, are governed by NP optical properties. Exploration and identification of reaction conditions for the synthesis of NPs with targeted spectroscopic characteristics is a time-, labor-, and resource-intensive task, as it involves the optimization of multiple interdependent reaction conditions. Integration of machine learning (ML) and microfluidics (MF) offers accelerated identification and optimization of reaction conditions for NP synthesis. Here, an autonomous ML-driven, oscillatory MF platform for the synthesis of NPs is reported. The platform utilized multiple recipes and reaction times for the synthesis of NPs with different dimensions, conducted spectroscopic NP characterization, and employed ML approaches to analyze multiple yet prioritized spectroscopic NP characteristics, and identified reaction conditions for the synthesis of NPs with targeted optical properties. The platform is also used to develop an understanding of the relationship between reaction conditions and NP properties. This study shows the strong potential of ML-driven oscillatory MF platforms in materials science and paves the way for automated NP development.
- 9Gao, W.; Raghavan, P.; Coley, C. W. Autonomous Platforms for Data-Driven Organic Synthesis. Nat. Commun. 2022, 13, 1075, DOI: 10.1038/s41467-022-28736-49Autonomous platforms for data-driven organic synthesisGao, Wenhao; Raghavan, Priyanka; Coley, Connor W.Nature Communications (2022), 13 (1), 1075CODEN: NCAOBW; ISSN:2041-1723. (Nature Portfolio)Achieving autonomous multi-step synthesis of novel mol. structures in chem. discovery processes is a goal shared by many researchers. In this Comment, we discuss key considerations of what an ideal platform may look like and the apparent state of the art. While most hardware challenges can be overcome with clever engineering, other challenges will require advances in both algorithms and data curation.
- 10Gupta, A.; Ong, Y.; Feng, L. Insights on Transfer Optimization: Because Experience Is the Best Teacher. IEEE Trans. Emerg. Top. Comput. Intell. 2018, 2, 51– 64, DOI: 10.1109/TETCI.2017.2769104There is no corresponding record for this reference.
- 11Shi, Y.; Prieto, P. L.; Zepel, T.; Grunert, S.; Hein, J. E. Automated Experimentation Powers Data Science in Chemistry. Acc. Chem. Res. 2021, 54, 546– 555, DOI: 10.1021/acs.accounts.0c0073611Automated Experimentation Powers Data Science in ChemistryShi, Yao; Prieto, Paloma L.; Zepel, Tara; Grunert, Shad; Hein, Jason E.Accounts of Chemical Research (2021), 54 (3), 546-555CODEN: ACHRE4; ISSN:0001-4842. (American Chemical Society)A review. Data science has revolutionized chem. research and continues to break down barriers with new interdisciplinary studies. The introduction of computational models and machine learning (ML) algorithms in combination with automation and traditional exptl. techniques has enabled scientific advancement across nearly every discipline of chem., from materials discovery, to process optimization, to synthesis planning. However, predictive tools powered by data science are only as good as their data sets and, currently, many of the data sets used to train models suffer from several limitations, including being sparse, limited in scope and requiring human curation. Likewise, computational data faces limitations in terms of accurate modeling of nonideal systems and can suffer from low translation fidelity from simulation to real conditions. The lack of diverse data and the need to be able to test it exptl. reduces both the accuracy and scope of the predictive models derived from data science. This Account contextualizes the need for more complex and diverse exptl. data and highlights how the seamless integration of robotics, machine learning, and data-rich monitoring techniques can be used to access it with minimal human labor. We propose three broad categories of data in chem.: data on fundamental properties, data on reaction outcomes, and data on reaction mechanics. We highlight flexible, automated platforms that can be deployed to acquire and leverage these data. The first platform combines solid- and liq.-dosing modules with computer vision to automate soly. screening, thereby gathering fundamental data that are necessary for almost every exptl. design. Using computer vision offers the addnl. benefit of creating a visual record, which can be referenced and used to further interrogate and gain insight on the data collected. The second platform iteratively tests reaction variables proposed by a ML algorithm in a closed-loop fashion. Exptl. data related to reaction outcomes are fed back into the algorithm to drive the discovery and optimization of new materials and chem. processes. The third platform uses automated process anal. technol. to gather real-time data related to reaction kinetics. This system allows the researcher to directly interrogate the reaction mechanisms in granular detail to det. exactly how and why a reaction proceeds, thereby enabling reaction optimization and deployment.
- 12Strieth-Kalthoff, F.; Sandfort, F.; Kühnemund, M.; Schäfer, F. R.; Kuchen, H.; Glorius, F. Machine Learning for Chemical Reactivity: The Importance of Failed Experiments. Angew. Chem., Int. Ed. 2022, 61, e202204647, DOI: 10.1002/anie.20220464712Machine Learning for Chemical Reactivity: The Importance of Failed ExperimentsStrieth-Kalthoff, Felix; Sandfort, Frederik; Kuehnemund, Marius; Schafer, Felix R.; Kuchen, Herbert; Glorius, FrankAngewandte Chemie, International Edition (2022), 61 (29), e202204647CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)A review. Assessing the outcomes of chem. reactions in a quant. fashion has been a cornerstone across all synthetic disciplines. Classically approached through empirical optimization, data-driven modeling bears an enormous potential to streamline this process. However, such predictive models require significant quantities of high-quality data, the availability of which is limited: Main reasons for this include exptl. errors and, importantly, human biases regarding expt. selection and result reporting. In a series of case studies, we investigate the impact of these biases for drawing general conclusions from chem. reaction data, revealing the utmost importance of "neg." examples. Eventually, case studies into data expansion approaches showcase directions to circumvent these limitations-and demonstrate perspectives towards a long-term data quality enhancement in chem.
- 13Beker, W.; Roszak, R.; Wołos, A.; Angello, N. H.; Rathore, V.; Burke, M. D.; Grzybowski, B. A. Machine Learning May Sometimes Simply Capture Literature Popularity Trends: A Case Study of Heterocyclic Suzuki–Miyaura Coupling. J. Am. Chem. Soc. 2022, 144, 4819– 4827, DOI: 10.1021/jacs.1c1200513Machine Learning May Sometimes Simply Capture Literature Popularity Trends: A Case Study of Heterocyclic Suzuki-Miyaura CouplingBeker, Wiktor; Roszak, Rafal; Wolos, Agnieszka; Angello, Nicholas H.; Rathore, Vandana; Burke, Martin D.; Grzybowski, Bartosz A.Journal of the American Chemical Society (2022), 144 (11), 4819-4827CODEN: JACSAT; ISSN:0002-7863. (American Chemical Society)A review. Applications of machine learning (ML) to synthetic chem. rely on the assumption that large nos. of literature-reported examples should enable construction of accurate and predictive models of chem. reactivity. This paper demonstrates that abundance of carefully curated literature data may be insufficient for this purpose. Using an example of Suzuki-Miyaura coupling with heterocyclic building blocks-and a carefully selected database of >10,000 literature examples-this article shows that ML models cannot offer any meaningful predictions of optimum reaction conditions, even if the search space is restricted to only solvents and bases. This result holds irresp. of the ML model applied (from simple feed-forward to state-of-the-art graph convolution neural networks) or the representation to describe the reaction partners (various fingerprints, chem. descriptors, latent representations, etc.). In all cases, the ML methods fail to perform significantly better than naive assignments based on the sheer frequency of certain reaction conditions reported in the literature. These unsatisfactory results likely reflect subjective preferences of various chemists to use certain protocols and, other biasing factors as mundane as availability of certain solvents/reagents, and/or a lack of neg. data. These findings highlight the likely importance of systematically generating reliable and standardized data sets for algorithm training.
- 14Wilbraham, L.; Mehr, S. H. M.; Cronin, L. Digitizing Chemistry Using the Chemical Processing Unit: From Synthesis to Discovery. Acc. Chem. Res. 2021, 54, 253– 262, DOI: 10.1021/acs.accounts.0c0067414Digitizing Chemistry Using the Chemical Processing Unit: From Synthesis to DiscoveryWilbraham, Liam; Mehr, S. Hessam M.; Cronin, LeroyAccounts of Chemical Research (2021), 54 (2), 253-262CODEN: ACHRE4; ISSN:0001-4842. (American Chemical Society)The digitization of chem. is not simply about using machine learning or artificial intelligence systems to process chem. data, or about the development of ever more capable automation hardware; instead, it is the creation of a hard link between an abstracted process ontol. of chem. and bespoke hardware for performing reactions or exploring reactivity. Chem. digitization is therefore about the unambiguous development of an architecture, a chem. state machine, that uses this ontol. to connect precise instruction sets to hardware that performs chem. transformations. This approach enables a universal std. for describing chem. procedures via a chem. programming language and facilitates unambiguous dissemination of these procedures. We predict that this std. will revolutionize the ability of chemists to collaborate, increase reproducibility and safety, as we all as optimize for cost and efficiency. Most importantly, the digitization of chem. will dramatically reduce the labor needed to make new compds. and broaden accessible chem. space. In recent years, the developments of automation in chem. have gone beyond flow chem. alone, with many bespoke workflows being developed not only for automating chem. synthesis but also for materials, nanomaterials, and formulation prodn. Indeed, the leap from fixed-configuration synthesis machines like peptide, nucleic acid, or dedicated cross-coupling engines is important for developing a truly universal approach to "dial-a-mol.". In this case, a key conceptual leap is the use of a batch system that can encode the chem. reagents, solvent, and products as packets which can be moved around the system, and a graph-based approach for the description of hardware modules that allows the compilation of chem. code that runs on, in principle, any hardware. Further, the integration of sensor systems for monitoring and controlling the state of the chem. synthesis machine, as well as high resoln. spectroscopic tools, is vital if these systems are to facilitate closed-loop autonomous expts. Systems that not only make mols. and materials, but also optimize their function, and use algorithms to assist with the development of new synthetic pathways and process optimization are also possible. Here, we discuss how the digitization of chem. is happening, building on the plethora of technol. developments in hardware and software. Importantly, digital-chem. robot systems need to integrate feedback from simple sensors, e.g., cond. or temp., as well as online analytics in order to navigate process space autonomously. This will open the door to accessing known mols. (synthesis), exploring whether known compds./reactions are possible under new conditions (optimization), and searching chem. space for unknown and unexpected new mols., reactions, and modes of reactivity (discovery). We will also discuss the role of chem. knowledge and how this can be used to challenge bias, as well as define and expand synthetically accessible chem. space using programmable robotic chem. state machines.
- 15Rohrbach, S.; Šiaučiulis, M.; Chisholm, G.; Pirvan, P.-A.; Saleeb, M.; Mehr, S. H. M.; Trushina, E.; Leonov, A. I.; Keenan, G.; Khan, A.; Hammer, A.; Cronin, L. Digitization and Validation of a Chemical Synthesis Literature Database in the ChemPU. Science 2022, 377, 172– 180, DOI: 10.1126/science.abo005815Digitization and validation of a chemical synthesis literature database in the ChemPURohrbach, Simon; Siauciulis, Mindaugas; Chisholm, Greig; Pirvan, Petrisor-Alin; Saleeb, Michael; Mehr, S. Hessam M.; Trushina, Ekaterina; Leonov, Artem I.; Keenan, Graham; Khan, Aamir; Hammer, Alexander; Cronin, LeroyScience (Washington, DC, United States) (2022), 377 (6602), 172-180CODEN: SCIEAS; ISSN:1095-9203. (American Association for the Advancement of Science)Despite huge potential, automation of synthetic chem. has only made incremental progress over the past few decades. We present an automatically executable chem. reaction database of 100 mols. representative of the range of reactions found in contemporary org. synthesis. These reactions include transition metal-catalyzed coupling reactions, heterocycle formations, functional group interconversions, and multicomponent reactions. The chem. reaction codes or χDLs for the reactions have been stored in a database for version control, validation, collaboration, and data mining. Of these syntheses, more than 50 entries from the database have been downloaded and robotically run in seven modular ChemPU's with yields and purities comparable to those achieved by an expert chemist. We also demonstrate the automatic purifn. of a range of compds. using a chromatog. module seamlessly coupled to the platform and programmed with the same language.
- 16Bubliauskas, A.; Blair, D. J.; Powell-Davies, H.; Kitson, P. J.; Burke, M. D.; Cronin, L. Digitizing Chemical Synthesis in 3D Printed Reactionware. Angew. Chem., Int. Ed. 2022, 61, e202116108, DOI: 10.1002/anie.20211610816Digitizing Chemical Synthesis in 3D Printed ReactionwareBubliauskas, Andrius; Blair, Daniel J.; Powell-Davies, Henry; Kitson, Philip J.; Burke, Martin D.; Cronin, Leroy; AcknowAngewandte Chemie, International Edition (2022), 61 (24), e202116108CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)Chem. digitization requires an unambiguous link between expts. and the code used to generate the exptl. conditions and outcomes, yet this process is not standardized, limiting the portability of any chem. code. What is needed is a universal approach to aid this process using a well-defined std. that is composed of syntheses that are employed in modular hardware. Herein a new approach is presented to the digitization of org. synthesis that combines process chem. principles with 3D printed reactionware. This approach outlines the process for transforming unit operations into digitized hardware and well-defined instructions that ensure effective synthesis. To demonstrate this, the process is outlined for digitizing 3 MIDA boronate building blocks, an ester hydrolysis, a Wittig olefination, a Suzuki-Miyaura coupling reaction, and synthesis of the drug sulfanilamide.
- 17Nikolaev, P.; Hooper, D.; Webber, F.; Rao, R.; Decker, K.; Krein, M.; Poleski, J.; Barto, R.; Maruyama, B. Autonomy in Materials Research: A Case Study in Carbon Nanotube Growth. Npj Comput. Mater. 2016, 2, 16031, DOI: 10.1038/npjcompumats.2016.31There is no corresponding record for this reference.
- 18Roch, L. M.; Häse, F.; Kreisbeck, C.; Tamayo-Mendoza, T.; Yunker, L. P. E.; Hein, J. E.; Aspuru-Guzik, A. ChemOS: Orchestrating Autonomous Experimentation. Sci. Robot. 2018, 3 (19), 1, DOI: 10.1126/scirobotics.aat5559There is no corresponding record for this reference.
- 19Rahmanian, F.; Flowers, J.; Guevarra, D.; Richter, M.; Fichtner, M.; Donnely, P.; Gregoire, J. M.; Stein, H. S. Enabling Modular Autonomous Feedback-Loops in Materials Science through Hierarchical Experimental Laboratory Automation and Orchestration. Adv. Mater. Interfaces 2022, 9, 2101987, DOI: 10.1002/admi.202101987There is no corresponding record for this reference.
- 20Gaudin, T.; Benlolo, I.; Cui, Z. Y.; Hickmann, R.; Tamblyn, I.; Aspuru-Guzik, A. Molar. In Zenodo ; 2022; https://zenodo.org/record/6809290.There is no corresponding record for this reference.
- 21McKinney, W. Data Structures for Statistical Computing in Python. In Proc. of the 9th Python in Science Conference , Austin, Texas, 2010; pp 56– 61.There is no corresponding record for this reference.
- 22Reback, J.; McKinney, W.; Jbrockmendel; ; Van den Bossche, J.; Roeschke, M.; Augspurger, T.; Hawkins, S.; Cloud, P.; Gfyoung; ; Sinhrks; ; Hoefler, P.; Klein, A.; Petersen, T.; Tratner, J.; She, C.; Ayd, W.; Naveh, S.; Darbyshire, J. H. M.; Shadrach, R.; Garcia, M.; Schendel, J.; Hayden, A.; Saxton, D.; Gorelli, M. E.; Li, F.; Wörtwein, T.; Zeitlin, M.; Jancauskas, V.; McMaster, A.; Li, T. Pandas-Dev/Pandas: Pandas 1.4.3. In Zenodo , 2022; https://zenodo.org/record/3509134.There is no corresponding record for this reference.
- 23Häse, F.; Roch, L. M.; Kreisbeck, C.; Aspuru-Guzik, A. Phoenics: A Bayesian Optimizer for Chemistry. ACS Cent. Sci. 2018, 4, 1134– 1145, DOI: 10.1021/acscentsci.8b0030723Phoenics: A Bayesian Optimizer for ChemistryHase, Florian; Roch, Loic M.; Kreisbeck, Christoph; Aspuru-Guzik, AlanACS Central Science (2018), 4 (9), 1134-1145CODEN: ACSCII; ISSN:2374-7951. (American Chemical Society)We report Phoenics, a probabilistic global optimization algorithm identifying the set of conditions of an exptl. or computational procedure which satisfies desired targets. Phoenics combines ideas from Bayesian optimization with concepts from Bayesian kernel d. estn. As such, Phoenics allows to tackle typical optimization problems in chem. for which objective evaluations are limited, due to either budgeted resources or time-consuming evaluations of the conditions, including experimentation or enduring computations. Phoenics proposes new conditions based on all previous observations, avoiding, thus, redundant evaluations to locate the optimal conditions. It enables an efficient parallel search based on intuitive sampling strategies implicitly biasing toward exploration or exploitation of the search space. Our benchmarks indicate that Phoenics is less sensitive to the response surface than already established optimization algorithms. We showcase the applicability of Phoenics on the Oregonator, a complex case-study describing a nonlinear chem. reaction network. Despite the large search space, Phoenics quickly identifies the conditions which yield the desired target dynamic behavior. Overall, we recommend Phoenics for rapid optimization of unknown expensive-to-evaluate objective functions, such as experimentation or long-lasting computations.
- 24Häse, F.; Roch, L. M.; Aspuru-Guzik, A. Chimera: Enabling Hierarchy Based Multi-Objective Optimization for Self-Driving Laboratories. Chem. Sci. 2018, 9, 7642– 7655, DOI: 10.1039/C8SC02239A24Chimera: enabling hierarchy based multi-objective optimization for self-driving laboratoriesHase, Florian; Roch, Loic M.; Aspuru-Guzik, AlanChemical Science (2018), 9 (39), 7642-7655CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Finding the ideal conditions satisfying multiple pre-defined targets simultaneously is a challenging decision-making process, which impacts science, engineering, and economics. Addnl. complexity arises for tasks involving experimentation or expensive computations, as the no. of evaluated conditions must be kept low. We propose Chimera as a general purpose achievement scalarizing function for multi-target optimization where evaluations are the limiting factor. Chimera combines concepts of a priori scalarizing with lexicog. approaches and is applicable to any set of n unknown objectives. Importantly, it does not require detailed prior knowledge about individual objectives. The performance of Chimera is demonstrated on several well-established analytic multi-objective benchmark sets using different single-objective optimization algorithms. We further illustrate the applicability and performance of Chimera with two practical examples: (i) the auto-calibration of a virtual robotic sampling sequence for direct-injection, and (ii) the inverse-design of a four-pigment excitonic system for an efficient energy transport. The results indicate that Chimera enables a wide class of optimization algorithms to rapidly find ideal conditions. Addnl., the presented applications highlight the interpretability of Chimera to corroborate design choices for tailoring system parameters.
- 25Häse, F.; Aldeghi, M.; Hickman, R. J.; Roch, L. M.; Aspuru-Guzik, A. Gryffin: An Algorithm for Bayesian Optimization of Categorical Variables Informed by Expert Knowledge. Appl. Phys. Rev. 2021, 8, 031406, DOI: 10.1063/5.004816425GRYFFIN: An algorithm for Bayesian optimization of categorical variables informed by expert knowledgeHase, Florian; Aldeghi, Matteo; Hickman, Riley J.; Roch, Loic M.; Aspuru-Guzik, AlanApplied Physics Reviews (2021), 8 (3), 031406CODEN: APRPG5; ISSN:1931-9401. (American Institute of Physics)A review. Designing functional mols. and advanced materials requires complex design choices: tuning continuous process parameters such as temps. or flow rates, while simultaneously selecting catalysts or solvents. To date, the development of data-driven expt. planning strategies for autonomous experimentation has largely focused on continuous process parameters, despite the urge to devise efficient strategies for the selection of categorical variables. Here, we introduce GRYFFIN, a general-purpose optimization framework for the autonomous selection of categorical variables driven by expert knowledge. GRYFFIN augments Bayesian optimization based on kernel d. estn. with smooth approxns. to categorical distributions. Leveraging domain knowledge in the form of physicochem. descriptors, GRYFFIN can significantly accelerate the search for promising mols. and materials. GRYFFIN can further highlight relevant correlations between the provided descriptors to inspire phys. insights and foster scientific intuition. In addn. to comprehensive benchmarks, we demonstrate the capabilities and performance of GRYFFIN on three examples in materials science and chem.: (i) the discovery of non-fullerene acceptors for org. solar cells, (ii) the design of hybrid org.-inorg. perovskites for light-harvesting, and (iii) the identification of ligands and process parameters for Suzuki-Miyaura reactions. Our results suggest that GRYFFIN, in its simplest form, is competitive with state-of-the-art categorical optimization algorithms. However, when leveraging domain knowledge provided via descriptors, GRYFFIN outperforms other approaches while simultaneously refining this domain knowledge to promote scientific understanding. (c) 2021 American Institute of Physics.
- 26Hickman, R. J.; Häse, F.; Roch, L. M.; Aspuru-Guzik, A. Gemini: Dynamic Bias Correction for Autonomous Experimentation and Molecular Simulation. arXiv , 2021, 2103.03391.There is no corresponding record for this reference.
- 27Aldeghi, M.; Häse, F.; Hickman, R. J.; Tamblyn, I.; Aspuru-Guzik, A. Golem: An Algorithm for Robust Experiment and Process Optimization. arXiv , 2021, 2103.03716.There is no corresponding record for this reference.
- 28MacLeod, B. P.; Parlane, F. G. L.; Morrissey, T. D.; Häse, F.; Roch, L. M.; Dettelbach, K. E.; Moreira, R.; Yunker, L. P. E.; Rooney, M. B.; Deeth, J. R.; Lai, V.; Ng, G. J.; Situ, H.; Zhang, R. H.; Elliott, M. S.; Haley, T. H.; Dvorak, D. J.; Aspuru-Guzik, A.; Hein, J. E.; Berlinguette, C. P. Self-Driving Laboratory for Accelerated Discovery of Thin-Film Materials. Sci. Adv. 2020, 6, eaaz8867, DOI: 10.1126/sciadv.aaz8867There is no corresponding record for this reference.
- 29Glasnov, T. N.; Kappe, C. O. The Microwave-to-Flow Paradigm: Translating High-Temperature Batch Microwave Chemistry to Scalable Continuous-Flow Processes. Chem. – Eur. J. 2011, 17, 11956– 11968, DOI: 10.1002/chem.20110206529The Microwave-to-Flow Paradigm: Translating High-Temperature Batch Microwave Chemistry to Scalable Continuous-Flow ProcessesGlasnov, Toma N.; Kappe, C. OliverChemistry - A European Journal (2011), 17 (43), 11956-11968CODEN: CEUJED; ISSN:0947-6539. (Wiley-VCH Verlag GmbH & Co. KGaA)The popularity of dedicated microwave reactors in many academic and industrial labs. has produced a plethora of synthetic protocols that are based on this enabling technol. In the majority of examples, transformations that require several hours when performed using conventional heating under reflux conditions reach completion in a few minutes or even seconds in sealed-vessel, autoclave-type, microwave reactors. However, one severe drawback of microwave chem. is the difficulty in scaling this technol. to a prodn.-scale level. This concept article demonstrates that this limitation can be overcome by translating batch microwave chem. to scalable continuous-flow processes. For this purpose, conventionally heated micro- or mesofluidic flow devices fitted with a back-pressure regulator are employed, in which the high temps. and pressures attainable in a sealed-vessel microwave chem. batch expt. can be mimicked.
- 30Plutschack, M. B.; Pieber, B.; Gilmore, K.; Seeberger, P. H. The Hitchhiker’s Guide to Flow Chemistry. Chem. Rev. 2017, 117, 11796– 11893, DOI: 10.1021/acs.chemrev.7b0018330The Hitchhiker's Guide to Flow ChemistryPlutschack, Matthew B.; Pieber, Bartholomaeus; Gilmore, Kerry; Seeberger, Peter H.Chemical Reviews (Washington, DC, United States) (2017), 117 (18), 11796-11893CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)Flow chem. involves the use of channels or tubing to conduct a reaction in a continuous stream rather than in a flask. Flow equipment provides chemists with unique control over reaction parameters, enhancing reactivity or in some cases enabling new reactions. This relatively young technol. has received a remarkable amt. of attention in the past decade with many reports on what can be done in flow. Until recently, however, the question, "Should we do this in flow" has merely been an afterthought. This review introduces readers to the basic principles and fundamentals of flow chem. and critically discusses recent flow chem. accounts.
- 31Bianchi, P.; Williams, J. D.; Kappe, C. O. Oscillatory Flow Reactors for Synthetic Chemistry Applications. J. Flow Chem. 2020, 10, 475– 490, DOI: 10.1007/s41981-020-00105-631Oscillatory flow reactors for synthetic chemistry applicationsBianchi, Pauline; Williams, Jason D.; Kappe, C. OliverJournal of Flow Chemistry (2020), 10 (3), 475-490CODEN: JFCOBJ; ISSN:2063-0212. (Akademiai Kiado)Abstr.: Oscillatory flow reactors (OFRs) superimpose an oscillatory flow to the net movement through a flow reactor. OFRs have been engineered to enable improved mixing, excellent heat- and mass transfer and good plug flow character under a broad range of operating conditions. Such features render these reactors appealing, since they are suitable for reactions that require long residence times, improved mass transfer (such as in biphasic liq.-liq. systems) or to homogeneously suspend solid particles. Various OFR configurations, offering specific features, have been developed over the past two decades, with significant progress still being made. This review outlines the principles and recent advances in OFR technol. and overviews the synthetic applications of OFRs for liq.-liq. and solid-liq. biphasic systems.
- 32Christensen, M.; Yunker, L. P. E.; Shiri, P.; Zepel, T.; Prieto, P. L.; Grunert, S.; Bork, F.; Hein, J. E. Automation Isn’t Automatic. Chem. Sci. 2021, 12, 15473, DOI: 10.1039/D1SC04588A32Automation isn't automaticChristensen, Melodie; Yunker, Lars P. E.; Shiri, Parisa; Zepel, Tara; Prieto, Paloma L.; Grunert, Shad; Bork, Finn; Hein, Jason E.Chemical Science (2021), 12 (47), 15473-15490CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Automation has become an increasingly popular tool for synthetic chemists over the past decade. Recent advances in robotics and computer science have led to the emergence of automated systems that execute common lab. procedures including parallel synthesis, reaction discovery, reaction optimization, time course studies, and crystn. development. While such systems offer many potential benefits, their implementation is rarely automatic due to the highly specialized nature of synthetic procedures. Each reaction category requires careful execution of a particular sequence of steps, the specifics of which change with different conditions and chem. systems. Careful assessment of these crit. procedural requirements and identification of the tools suitable for effective exptl. execution are key to developing effective automation workflows. Even then, it is often difficult to get all the components of an automated system integrated and operational. Data flows and specialized equipment present yet another level of challenge. Unfortunately, the pain points and process of implementing automated systems are often not shared or remain buried deep in the SI. This perspective provides an overview of the current state of automation of synthetic chem. at the benchtop scale with a particular emphasis on core considerations and the ensuing challenges of deploying a system. Importantly, we aim to reframe automation as decidedly not automatic but rather an iterative process that involves a series of careful decisions (both human and computational) and const. adjustment.
- 33Gillis, E. P.; Burke, M. D. Multistep Synthesis of Complex Boronic Acids from Simple MIDA Boronates. J. Am. Chem. Soc. 2008, 130, 14084– 14085, DOI: 10.1021/ja806375933Multistep Synthesis of Complex Boronic Acids from Simple MIDA BoronatesGillis, Eric P.; Burke, Martin D.Journal of the American Chemical Society (2008), 130 (43), 14084-14085CODEN: JACSAT; ISSN:0002-7863. (American Chemical Society)Due to its sensitivity to most synthetic reagents, it is typically necessary to introduce the boronic acid functional group just prior to its use. Overcoming this important limitation, the authors herein report that air- and chromatog. stable N-methyliminodiacetic acid (MIDA) boronates are compatible with a wide range of common reagents which enables the multistep synthesis of complex boronic acid building blocks from simple B-contg. starting materials. X-ray and variable temp. NMR studies link the unique stability of MIDA boronates to a kinetic inaccessibility of the potentially reactive B p-orbital and/or N lone pair. These findings were collectively harnessed to achieve a short and modular total synthesis of (+)-crocacin C via the iterative cross-coupling of a structurally complex, MIDA-protected haloboronic acid building block.
- 34Li, J.; Ballmer, S. G.; Gillis, E. P.; Fujii, S.; Schmidt, M. J.; Palazzolo, A. M. E.; Lehmann, J. W.; Morehouse, G. F.; Burke, M. D. Synthesis of Many Different Types of Organic Small Molecules Using One Automated Process. Science 2015, 347, 1221– 1226, DOI: 10.1126/science.aaa541434Synthesis of many different types of organic small molecules using one automated processLi, Junqi; Ballmer, Steven G.; Gillis, Eric P.; Fujii, Seiko; Schmidt, Michael J.; Palazzolo, Andrea M. E.; Lehmann, Jonathan W.; Morehouse, Greg F.; Burke, Martin D.Science (Washington, DC, United States) (2015), 347 (6227), 1221-1226CODEN: SCIEAS; ISSN:0036-8075. (American Association for the Advancement of Science)A wide variety of human-prepd. and natural product small mols. such as all-trans-retinal I, the phosphodiesterase inhibitor II, oblongolide III, and the secodaphnane core compd. IV were prepd. using an automated process. Alkyl, alkenyl, and aryl N-methyliminodiacetate-protected boronic acid esters (MIDA-boronates) were sepd. on silica gel from other mols. lacking MIDA-boronate moieties; the MIDA-boronates were retained on silica gel when methanol/diethyl ether was used as the eluent and were eluted when THF was used as the eluent. MIDA-boronates thus acted as phase tags which can be activated selectively to undergo Suzuki coupling reactions. Iterative coupling reactions of MIDA-boronates using solid-phase catch-and-release chromatog. purifn. in combination with stereoselective cyclization reactions allowed a variety of small mols. contg. polyene, biaryl, macrocyclic, and fused polycyclic structures to be prepd. using a multipurpose flow reactor (synthesizer).
- 35Anthony, J. E.; Heeney, M.; Ong, B. S. Synthetic Aspects of Organic Semiconductors. MRS Bull. 2008, 33, 698– 705, DOI: 10.1557/mrs2008.14235Synthetic aspects of organic semiconductorsAnthony, John E.; Heeney, Martin; Ong, Beng S.MRS Bulletin (2008), 33 (7), 698-705CODEN: MRSBEA; ISSN:0883-7694. (Materials Research Society)A review. This article discusses the importance of the choice of synthetic methodol. in the purity, and therefore performance, of both small-mol. and polymeric org. semiconductors. We discuss common methodologies used in the prepn. of org. semiconductors, paying particular attention to the impurities and byproducts that can arise during these synthetic approaches and how they can have an impact on semiconductor performance.
- 36Kuehne, A. J. C.; Gather, M. C. Organic Lasers: Recent Developments on Materials, Device Geometries, and Fabrication Techniques. Chem. Rev. 2016, 116, 12823– 12864, DOI: 10.1021/acs.chemrev.6b0017236Organic Lasers: Recent Developments on Materials, Device Geometries, and Fabrication TechniquesKuehne, Alexander J. C.; Gather, Malte C.Chemical Reviews (Washington, DC, United States) (2016), 116 (21), 12823-12864CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)Org. dyes have been used as gain medium for lasers since the 1960s, long before the advent of today's org. electronic devices. Org. gain materials are highly attractive for lasing due to their chem. tunability and large stimulated emission cross section. While the traditional dye laser has been largely replaced by solid-state lasers, a no. of new and miniaturized org. lasers have emerged that hold great potential for lab-on-chip applications, biointegration, low-cost sensing and related areas, which benefit from the unique properties of org. gain materials. On the fundamental level, these include high exciton binding energy, low refractive index (compared to inorg. semiconductors), and ease of spectral and chem. tuning. On a technol. level, mech. flexibility and compatibility with simple processing techniques such as printing, roll-to-roll, self-assembly, and soft-lithog. are most relevant. Here, the authors provide a comprehensive review of the developments in the field over the past decade, discussing recent advances in org. gain materials, which are today often based on solid-state org. semiconductors, as well as optical feedback structures, and device fabrication. Recent efforts toward continuous wave operation and elec. pumping of solid-state org. lasers are reviewed, and new device concepts and emerging applications are summarized.
- 37Wu, T. C.; Granda, A. A.; Hotta, K.; Yazdani, S. A.; Pollice, R.; Vestfrid, J.; Hao, H.; Lavigne, C.; Seifrid, M.; Angello, N.; Bencheikh, F.; Hein, J. E.; Burke, M.; Adachi, C.; Aspuru-Guzik, A. A Materials Acceleration Platform for Organic Laser Discovery. ChemRxiv , 2022; DOI: 10.26434/chemrxiv-2022-9zm65 .There is no corresponding record for this reference.
- 38Yunker, L. P. E.; Donnecke, S.; Ting, M.; Yeung, D.; McIndoe, J. S. PythoMS: A Python Framework To Simplify and Assist in the Processing and Interpretation of Mass Spectrometric Data. J. Chem. Inf. Model. 2019, 59, 1295– 1300, DOI: 10.1021/acs.jcim.9b0005538PythoMS: A Python Framework To Simplify and Assist in the Processing and Interpretation of Mass Spectrometric DataYunker, Lars P. E.; Donnecke, Sofia; Ting, Michelle; Yeung, Darien; McIndoe, J. ScottJournal of Chemical Information and Modeling (2019), 59 (4), 1295-1300CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Mass spectrometric data are copious and generate a processing burden that is best dealt with programmatically. PythoMS is a collection of tools based on the Python programming language that assist researchers in creating figures and video output that is informative, clear, and visually compelling. The PythoMS framework introduces a library of classes and a variety of scripts that quickly perform time-consuming tasks: making proprietary output readable; binning intensity vs. time data to simulate longer scan times (and hence reduce noise); calcg. theor. isotope patterns and overlaying them in histogram form on exptl. data (an approach that works even for overlapping signals); rendering videos that enable zooming into the baseline of intensity vs time plots (useful to make sense of data collected over a large dynamic range) or that depict the evolution of different species in a time-lapse format; calcg. aggregates; and providing a quick first-pass at identifying fragments in MS/MS spectra. PythoMS is a living project that will continue to evolve as addnl. scripts are developed and deployed.
- 39Burger, B.; Maffettone, P. M.; Gusev, V. V.; Aitchison, C. M.; Bai, Y.; Wang, X.; Li, X.; Alston, B. M.; Li, B.; Clowes, R.; Rankin, N.; Harris, B.; Sprick, R. S.; Cooper, A. I. A Mobile Robotic Chemist. Nature 2020, 583, 237– 241, DOI: 10.1038/s41586-020-2442-239A mobile robotic chemistBurger, Benjamin; Maffettone, Phillip M.; Gusev, Vladimir V.; Aitchison, Catherine M.; Bai, Yang; Wang, Xiaoyan; Li, Xiaobo; Alston, Ben M.; Li, Buyi; Clowes, Rob; Rankin, Nicola; Harris, Brandon; Sprick, Reiner Sebastian; Cooper, Andrew I.Nature (London, United Kingdom) (2020), 583 (7815), 237-241CODEN: NATUAS; ISSN:0028-0836. (Nature Research)Abstr.: Technologies such as batteries, biomaterials and heterogeneous catalysts have functions that are defined by mixts. of mol. and mesoscale components. As yet, this multi-length-scale complexity cannot be fully captured by atomistic simulations, and the design of such materials from first principles is still rare1-5. Likewise, exptl. complexity scales exponentially with the no. of variables, restricting most searches to narrow areas of materials space. Robots can assist in exptl. searches6-14 but their widespread adoption in materials research is challenging because of the diversity of sample types, operations, instruments and measurements required. Here we use a mobile robot to search for improved photocatalysts for hydrogen prodn. from water15. The robot operated autonomously over eight days, performing 688 expts. within a ten-variable exptl. space, driven by a batched Bayesian search algorithm16-18. This autonomous search identified photocatalyst mixts. that were six times more active than the initial formulations, selecting beneficial components and deselecting neg. ones. Our strategy uses a dexterous19,20 free-roaming robot21-24, automating the researcher rather than the instruments. This modular approach could be deployed in conventional labs. for a range of research problems beyond photocatalysis.
- 40Crosby, G. A.; Demas, J. N. Measurement of Photoluminescence Quantum Yields. Review. J. Phys. Chem. 1971, 75, 991– 1024, DOI: 10.1021/j100678a00140Measurement of photoluminescence quantum yields. ReviewCrosby, Glenn A.; Demas, James N.Journal of Physical Chemistry (1971), 75 (8), 991-1024CODEN: JPCHAX; ISSN:0022-3654.A review is given with 147 refs. Methods and apparatus for measuring photoluminescence quantum yields, standards, apparatus calibration, and data corrections are described.
- 41O’Connor, D. V. O.; Phillips, D. Time-Correlated Single Photon Counting; Academic Press: London, 1984.There is no corresponding record for this reference.
- 42Cava, R. J.; DiSalvo, F. J.; Brus, L. E.; Dunbar, K. R.; Gorman, C. B.; Haile, S. M.; Interrante, L. V.; Musfeldt, J. L.; Navrotsky, A.; Nuzzo, R. G.; Pickett, W. E.; Wilkinson, A. P.; Ahn, C.; Allen, J. W.; Burns, P. C.; Ceder, G.; Chidsey, C. E. D.; Clegg, W.; Coronado, E.; Dai, H.; Deem, M. W.; Dunn, B. S.; Galli, G.; Jacobson, A. J.; Kanatzidis, M.; Lin, W.; Manthiram, A.; Mrksich, M.; Norris, D.; Nozik, A. J.; Peng, X.; Rawn, C.; Rolison, D.; Singh, D. J.; Toby, B. H.; Tolbert, S.; Wiesner, U. B.; Woodward, P. M.; Yang, P. Future Directions in Solid State Chemistry: Report of the NSF-Sponsored Workshop. Prog. Solid State Chem. 2002, 30, 1– 101, DOI: 10.1016/S0079-6786(02)00010-942Future directions in solid state chemistry: report of the NSF-sponsored workshopCava, Robert J.; DiSalvo, Francis J.; Brus, Louis E.; Dunbar, Kim R.; Gorman, Christopher B.; Haile, Sossina M.; Interrante, Leonard V.; Musfeldt, Janice L.; Navrotsky, Alexandra; Nuzzo, Ralph G.; Pickett, Warren E.; Wilkinson, Angus P.; Ahn, Channing; Allen, James W.; Burns, Peter C.; Ceder, Gerdrand; Chidsey, Christopher E. D.; Clegg, William; Coronado, Eugenio; Dai, Hongjie; Deem, Michael W.; Dunn, Bruce S.; Galli, Giulia; Jacobson, Allan J.; Kanatzidis, Mercouri; Lin, Wenbin; Manthiram, Arumugam; Mrksich, Milan; Norris, David; Nozik, Arthur J.; Peng, Xiaogang; Rawn, Claudia; Rolison, Debra; Singh, David J.; Toby, Brian H.; Tolbert, Sarah; Wiesner, Ulrich B.; Woodward, Patrick M.; Yang, PeidongProgress in Solid State Chemistry (2002), 30 (1-2), 1-101CODEN: PSSTAW; ISSN:0079-6786. (Elsevier Science Ltd.)A review. A long-established area of scientific excellence in Europe, solid state chem. has emerged in the US in the past two decades as a field experiencing rapid growth and development. At its core, it is an interdisciplinary melding of chem., physics, engineering, and materials science, as it focuses on the design, synthesis and structural characterization of new chem. compds. and characterization of their phys. properties. As a consequence of this inherently interdisciplinary character, the solid state chem. community is highly open to the influx of new ideas and directions. The inclusionary character of the field's culture has been a significant factor in its continuing growth and vitality. This report presents an elaboration of discussions held during an NSF-sponsored workshop on Future Directions in Solid State Chem., held on the UC Davis Campus in Oct. 2001. That workshop was the second of a series of workshops planned in this topical area. The first, held at NSF headquarters in Arlington, Virginia, in Jan. of 1998, was designed to address the core of the field, describing how it has developed in the US and worldwide in the past decade, and how the members of the community saw the central thrusts of research and education in solid state chem. proceeding in the next several years. A report was published on that workshop (J.M. Honig, chair, "Proceedings of the Workshop on the Present Status and Future Developments of Solid State Chem. and Materials", Arlington, VA, Jan. 15-16, 1998) describing the state of the field and recommendations for future development of the core discipline. In the spirit of continuing to expand the scope of the solid state chem. community into new areas of scientific inquiry, the workshop elaborated in this document was designed to address the interfaces between our field and fields where we thought there would be significant opportunity for the development of new scientific advancements through increased interaction. The 7 topic areas, described in detail in this report, ranged from those with established ties to solid state chem. such as Earth and planetary sciences, and energy storage and conversion, to those such as condensed matter physics, where the connections are in their infancy, to biol., where the opportunities for connections are largely unexplored. Exciting ties to materials chem. were explored in discussions on mol. materials and nanoscale science, and a session on the importance of improving the ties between solid state chemists and experts in characterization at national exptl. facilities was included. The full report elaborates these ideas extensively.
- 43Aspuru-Guzik, A.; Persson, K. Materials Acceleration Platform: Accelerating Advanced Energy Materials Discovery by Integrating High-Throughput Methods and Artificial Intelligence; Mission Innovation: Innovation Challenge 6; Canadian Institute for Advanced Research, 2018.There is no corresponding record for this reference.
- 44Fabbri, E.; Schmidt, T. J. Oxygen Evolution Reaction─The Enigma in Water Electrolysis. ACS Catal. 2018, 8, 9765– 9774, DOI: 10.1021/acscatal.8b0271244Oxygen Evolution Reaction-The Enigma in Water ElectrolysisFabbri, Emiliana; Schmidt, Thomas J.ACS Catalysis (2018), 8 (10), 9765-9774CODEN: ACCACS; ISSN:2155-5435. (American Chemical Society)A review is given. We aim at increasing the awareness among the scientific community devoted to progresses in water electrolyzers of the very recent development made in the fundamental understanding of the OER, particularly focusing on the increasing consciousness that several processes actually underpin the evolution of O from a metal oxide catalyst. Traditionally, the OER mechanism on metal oxides has been derived from that on metal catalysts, where the main parameter governing the reaction overpotential is the binding strength of O (or oxygenated species/intermediates) on the catalyst surface following the Sabatier principle: the best catalyst in terms of displaying the min. overpotential binds O on its surface neither too strongly nor too weakly.
- 45Coley, C. W.; Eyke, N. S.; Jensen, K. F. Autonomous Discovery in the Chemical Sciences Part II: Outlook. Angew. Chem., Int. Ed. 2020, 59 (52), 23414– 23436, DOI: 10.1002/anie.20190998945Autonomous Discovery in the Chemical Sciences Part II: OutlookColey, Connor W.; Eyke, Natalie S.; Jensen, Klavs F.Angewandte Chemie, International Edition (2020), 59 (52), 23414-23436CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)This two-part Review examines how automation has contributed to different aspects of discovery in the chem. sciences. In this second part, we reflect on a selection of exemplary studies. It is increasingly important to articulate what the role of automation and computation has been in the scientific process and how that has or has not accelerated discovery. One can argue that even the best automated systems have yet to "discover" despite being incredibly useful as lab. assistants. We must carefully consider how they have been and can be applied to future problems of chem. discovery in order to effectively design and interact with future autonomous platforms. The majority of this Review defines a large set of open research directions, including improving our ability to work with complex data, build empirical models, automate both phys. and computational expts. for validation, select expts., and evaluate whether we are making progress towards the ultimate goal of autonomous discovery. Addressing these practical and methodol. challenges will greatly advance the extent to which autonomous systems can make meaningful discoveries.
- 46Molga, K.; Szymkuć, S.; Grzybowski, B. A. Chemist Ex Machina: Advanced Synthesis Planning by Computers. Acc. Chem. Res. 2021, 54, 1094– 1106, DOI: 10.1021/acs.accounts.0c0071446Chemist Ex Machina: Advanced Synthesis Planning by ComputersMolga, Karol; Szymkuc, Sara; Grzybowski, Bartosz A.Accounts of Chemical Research (2021), 54 (5), 1094-1106CODEN: ACHRE4; ISSN:0001-4842. (American Chemical Society)Teaching computers to plan multistep syntheses of arbitrary target mols.-including natural products-has been one of the oldest challenges in chem., dating back to the 1960s. This Account recapitulates two decades of our group's work on the software platform called Chematica, which very recently achieved this long-sought objective and has been shown capable of planning synthetic routes to complex natural products, several of which were validated in the lab. For the machine to plan syntheses at an expert level, it must know the rules describing chem. reactions and use these rules to expand and search the networks of synthetic options. The rules must be of high quality: They must delineate accurately the scope of admissible substituents, capture all relevant stereochem. information, detect potential reactivity conflicts, and protection requirements. They should yield only those synthons that are chem. stable and energetically allowed (e.g., not too strained) and should be able to extrapolate beyond examples already published in the literature. In parallel, the network-search algorithms must be able to assign meaningful scores to the sets of synthons they encounter, make judicious choices which of the network's branches to expand, and when to withdraw from unpromising ones. They must be able to strategize over multiple steps to resolve intermittent reactivity conflicts, exchange functional groups, or overcome local maxima of mol. complexity. Meeting all these requirements makes the problem of computer-driven retrosynthesis very multifaceted, combining expert and AI approaches further supplemented by quantum-mech. and mol.-mechanics calcns. Development of Chematica has been a very long and gradual process because all these components are needed. Any shortcuts-for example, reliance on only expert or only data-based approaches-yield chem. na.ovrddot.ive and often erroneous syntheses, esp. for complex targets. On the bright side, once all the requisite algorithms are implemented-as they now are-they not only streamline conventional synthetic planning but also enable completely new modalities that would challenge any human chemist, for example, synthesis with multiple constraints imposed simultaneously or library-wide syntheses in which the machine constructs "global plans" leading to multiple targets and benefiting from the use of common intermediates. These types of analyses will have profound impact on the practice of chem. industry, designing more economical, more green, and less hazardous pathways.
- 47Shim, E.; Kammeraad, J. A.; Xu, Z.; Tewari, A.; Cernak, T.; Zimmerman, P. M. Predicting Reaction Conditions from Limited Data through Active Transfer Learning. Chem. Sci. 2022, 13, 6655– 6668, DOI: 10.1039/D1SC06932B47Predicting reaction conditions from limited data through active transfer learningShim, Eunjae; Kammeraad, Joshua A.; Xu, Ziping; Tewari, Ambuj; Cernak, Tim; Zimmerman, Paul M.Chemical Science (2022), 13 (22), 6655-6668CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Transfer and active learning have the potential to accelerate the development of new chem. reactions, using prior data and new expts. to inform models that adapt to the target area of interest. This article shows how specifically tuned machine learning models, based on random forest classifiers, can expand the applicability of Pd-catalyzed cross-coupling reactions to types of nucleophiles unknown to the model. First, model transfer is shown to be effective when reaction mechanisms and substrates are closely related, even when models are trained on relatively small nos. of data points. Then, a model simplification scheme is tested and found to provide comparative predictivity on reactions of new nucleophiles that include unseen reagent combinations. Lastly, for a challenging target where model transfer only provides a modest benefit over random selection, an active transfer learning strategy is introduced to improve model predictions. Simple models, composed of a small no. of decision trees with limited depths, are crucial for securing generalizability, interpretability, and performance of active transfer learning.
- 48Grzybowski, B. A.; Badowski, T.; Molga, K.; Szymkuć, S. Network Search Algorithms and Scoring Functions for Advanced-Level Computerized Synthesis Planning. WIREs Comput. Mol. Sci., e1630; DOI: 10.1002/wcms.1630 .There is no corresponding record for this reference.
- 49Shahriari, B.; Swersky, K.; Wang, Z.; Adams, R. P.; de Freitas, N. Taking the Human Out of the Loop: A Review of Bayesian Optimization. Proc. IEEE 2016, 104, 148– 175, DOI: 10.1109/JPROC.2015.2494218There is no corresponding record for this reference.
- 50Hickman, R. J.; Aldeghi, M.; Häse, F.; Aspuru-Guzik, A. Bayesian Optimization with Known Experimental and Design Constraints for Chemistry Applications. arXiv , 2022, 2203.17241.There is no corresponding record for this reference.
- 51Blaženović, I.; Kind, T.; Torbašinović, H.; Obrenović, S.; Mehta, S. S.; Tsugawa, H.; Wermuth, T.; Schauer, N.; Jahn, M.; Biedendieck, R.; Jahn, D.; Fiehn, O. Comprehensive Comparison of in Silico MS/MS Fragmentation Tools of the CASMI Contest: Database Boosting Is Needed to Achieve 93% Accuracy. J. Cheminformatics 2017, 9, 32, DOI: 10.1186/s13321-017-0219-x51Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracyBlazenovic Ivana; Jahn Martina; Biedendieck Rebekka; Jahn Dieter; Blazenovic Ivana; Schauer Nicolas; Blazenovic Ivana; Kind Tobias; Mehta Sajjan S; Wermuth Tobias; Fiehn Oliver; Torbasinovic Hrvoje; Obrenovic Slobodan; Tsugawa Hiroshi; Fiehn OliverJournal of cheminformatics (2017), 9 (1), 32 ISSN:1758-2946.In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to address this discrepancy is to use in silico fragmentation software to identify unknown compounds by comparing and ranking theoretical MS/MS fragmentations from target structures to experimental tandem mass spectra (MS/MS). We compared the performance of four publicly available in silico fragmentation algorithms (MetFragCL, CFM-ID, MAGMa+ and MS-FINDER) that participated in the 2016 CASMI challenge. We found that optimizing the use of metadata, weighting factors and the manner of combining different tools eventually defined the ultimate outcomes of each method. We comprehensively analysed how outcomes of different tools could be combined and reached a final success rate of 93% for the training data, and 87% for the challenge data, using a combination of MAGMa+, CFM-ID and compound importance information along with MS/MS matching. Matching MS/MS spectra against the MS/MS libraries without using any in silico tool yielded 60% correct hits, showing that the use of in silico methods is still important.
- 52De Vijlder, T.; Valkenborg, D.; Lemière, F.; Romijn, E. P.; Laukens, K.; Cuyckens, F. A Tutorial in Small Molecule Identification via Electrospray Ionization-mass Spectrometry: The Practical Art of Structural Elucidation. Mass Spectrom. Rev. 2018, 37, 607– 629, DOI: 10.1002/mas.2155152A tutorial in small molecule identification via electrospray ionization-mass spectrometry: The practical art of structural elucidationDe Vijlder, Thomas; Valkenborg, Dirk; Lemiere, Filip; Romijn, Edwin P.; Laukens, Kris; Cuyckens, FilipMass Spectrometry Reviews (2018), 37 (5), 607-629CODEN: MSRVD3; ISSN:0277-7037. (John Wiley & Sons, Inc.)A review. The identification of unknown mols. has been one of the cornerstone applications of mass spectrometry for decades. Most of what is discussed in this tutorial also applies to other atm. pressure ionization methods like atm. pressure chem./photoionization. We focus primarily on the fundamental steps of MS-based structural elucidation of individual unknown compds., rather than describing strategies for large-scale identification in complex samples. We critically discuss topics like the detection of protonated and deprotonated ions ([M + H]+ and [M-H]-) as well as other adduct ions, the detn. of the mol. formula, and provide some basic rules on the interpretation of product ion spectra. Our tutorial focuses primarily on the fundamental steps of MS-based structural elucidation of individual unknown compds. (eg, contaminants in chem. prodn., pharmacol. alteration of drugs), rather than describing strategies for large-scale identification in complex samples. This tutorial also discusses strategies to obtain useful orthogonal information (UV/Vis, H/D exchange, chem. derivatization, etc) and offers an overview of the different informatics tools and approaches that can be used for structural elucidation of small mols. It is primarily intended for beginning mass spectrometrists and researchers from other mass spectrometry sub-disciplines that want to get acquainted with structural elucidation are interested in some practical tips and tricks.
- 53Cook, A.; Johnson, A. P.; Law, J.; Mirzazadeh, M.; Ravitz, O.; Simon, A. Computer-Aided Synthesis Design: 40 Years On. WIREs Comput. Mol. Sci. 2012, 2, 79– 107, DOI: 10.1002/wcms.6153Computer-aided synthesis design. 40 years onCook, Anthony; Johnson, A. Peter; Law, James; Mirzazadeh, Mahdi; Ravitz, Orr; Simon, AnikoWiley Interdisciplinary Reviews: Computational Molecular Science (2012), 2 (1), 79-107CODEN: WIRCAH; ISSN:1759-0884. (Wiley-Blackwell)A review. The discipline of retrosynthetic anal. is now just over 40 years old. From the earliest day, attempts were made to incorporate this approach into computer programs to test the extent in which chem. perception and synthetic thinking could be formalized. Despite pioneering research efforts, computer-aided synthetic anal. failed to achieve widespread routine use by chemists, which can be attributed in part to the difficulty of building the required high-quality retrosynthetic transform databases required for credible analyses. However, with the advent over the past 25 years of large comprehensive reaction databases, work on successfully automating the construction of reliable and comprehensive reaction rule databases is promising to revitalize research in this field. This review compares and contrasts the diverse approaches taken by selected programs in both the design and implementation of mol. feature perception and reaction rule representation, and the concepts of synthetic strategy selection, representation, and execution were reviewed. In particular, the current work on automating the construction of reliable and comprehensive synthetic rule sets from available reaction databases in newer programs such as ARChem were discussed. The authors argued that the progress achieved in this aspect paves the way to a deeper exploration of computer approaches to applying strategy and control in the synthesis problem.
- 54Rappoport, D.; Aspuru-Guzik, A. Predicting Feasible Organic Reaction Pathways Using Heuristically Aided Quantum Chemistry. J. Chem. Theory Comput. 2019, 15, 4099– 4112, DOI: 10.1021/acs.jctc.9b0012654Predicting Feasible Organic Reaction Pathways Using Heuristically Aided Quantum ChemistryRappoport, Dmitrij; Aspuru-Guzik, AlanJournal of Chemical Theory and Computation (2019), 15 (7), 4099-4112CODEN: JCTCCE; ISSN:1549-9618. (American Chemical Society)Studying org. reaction mechanisms using quantum chem. methods requires from the researcher an extensive knowledge of both org. chem. and first-principles computation. The need for empirical knowledge arises because any reasonably complete exploration of the potential energy surfaces (PES) of org. reactions is computationally prohibitive. The authors have previously introduced the heuristically-aided quantum chem. (HAQC) approach to modeling complex chem. reactions, which abstrs. the empirical knowledge in terms of chem. heuristics-simple rules guiding the PES exploration-and combines them with structure optimizations using quantum chem. methods. The HAQC approach makes use of heuristic kinetic criteria for selecting reaction paths that are not only plausible, i.e., consistent with the empirical rules of org. reactivity, but also feasible under the reaction conditions. The authors develop heuristic kinetic feasibility criteria, which correctly predict feasible reaction pathways for a wide range of simple polar (substitutions, addns., and eliminations) and pericyclic org. reactions (cyclizations, sigmatropic shifts, and cycloaddns.). In contrast to knowledge-based reaction mechanism prediction methods, the same kinetic heuristics are successful in classifying reaction pathways as feasible or infeasible across this diverse set of reaction mechanisms. The energy profiles of HAQC and their potential applications in machine learning of chem. reactivity are discussed.
- 55Rappoport, D. Reaction Networks and the Metric Structure of Chemical Space(s). J. Phys. Chem. A 2019, 123, 2610– 2620, DOI: 10.1021/acs.jpca.9b0051955Reaction Networks and the Metric Structure of Chemical Space(s)Rappoport, DmitrijJournal of Physical Chemistry A (2019), 123 (13), 2610-2620CODEN: JPCAFH; ISSN:1089-5639. (American Chemical Society)In this paper, we develop a formal definition of chem. space as a discrete metric space of mols. and analyze its properties. To this end, we utilize the shortest path metric on reaction networks to define a distance function between mols. of the same stoichiometry (no. of atoms). The distance between mols. with different stoichiometries is formalized by making use of the partial ordering of stoichiometries with respect to inclusion. Calcns. of fractal dimension on metric spaces for individual stoichiometries show that they have low intrinsic dimensionality, about an order of magnitude less than the dimension of the underlying reactive potential energy surface. Our findings suggest that efficient search strategies on chem. space can be designed that take advantage of its metric structure.
- 56Wołos, A.; Roszak, R.; Żądło-Dobrowolska, A.; Beker, W.; Mikulak-Klucznik, B.; Spólnik, G.; Dygas, M.; Szymkuć, S.; Grzybowski, B. A. Synthetic Connectivity, Emergence, and Self-Regeneration in the Network of Prebiotic Chemistry. Science 2020, 369, eaaw1955, DOI: 10.1126/science.aaw1955There is no corresponding record for this reference.
- 57Arya, A.; Ray, J.; Sharma, S.; Simbron, R. C.; Lozano, A.; Smith, H. B.; Andersen, J. L.; Chen, H.; Meringer, M.; Cleaves, H. J. An Open Source Computational Workflow for the Discovery of Autocatalytic Networks in Abiotic Reactions. Chem. Sci. 2022, 13, 4838– 4853, DOI: 10.1039/D2SC00256F57An open source computational workflow for the discovery of autocatalytic networks in abiotic reactionsArya, Aayush; Ray, Jessica; Sharma, Siddhant; Cruz Simbron, Romulo; Lozano, Alejandro; Smith, Harrison B.; Andersen, Jakob Lykke; Chen, Huan; Meringer, Markus; Cleaves II, Henderson JamesChemical Science (2022), 13 (17), 4838-4853CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)A central question in origins of life research is how non-entailed chem. processes, which simply dissipate chem. energy because they can do so due to immediate reaction kinetics and thermodn, enabled the origin of highly-entailed ones, in which concatenated kinetically and thermodynamically favorable processes enhanced some processes over others. Some degree of mol. complexity likely had to be supplied by environmental processes to produce entailed self-replicating processes. The origin of entailment, therefore, must connect to fundamental chem. that builds mol. complexity. We present here an open-source chemoinformatic workflow to model abiol. chem. to discover such entailment. This pipeline automates generation of chem. reaction networks and their anal. to discover novel compds. and autocatalytic processes. We demonstrate this pipelines capabilities against a well-studied model system by vetting it against expremental data. This workflow can enable rapid identification of products of complex chemistries and their underlying synthetic relationships to help identify autocatalysis, and potentially self-organization, in such systems. The algorithms used in this study are open-source and reconfigurable by other user-developed workflows.
- 58Allen, F.; Pon, A.; Wilson, M.; Greiner, R.; Wishart, D. CFM-ID: A Web Server for Annotation, Spectrum Prediction and Metabolite Identification from Tandem Mass Spectra. Nucleic Acids Res. 2014, 42, W94– W99, DOI: 10.1093/nar/gku43658CFM-ID: a web server for annotation, spectrum prediction and metabolite identification from tandem mass spectraAllen, Felicity; Pon, Allison; Wilson, Michael; Greiner, Russ; Wishart, DavidNucleic Acids Research (2014), 42 (W1), W94-W99CODEN: NARHAD; ISSN:0305-1048. (Oxford University Press)CFM-ID is a web server supporting three tasks assocd. with the interpretation of tandem mass spectra (MS/MS) for the purpose of automated metabolite identification: annotation of the peaks in a spectrum for a known chem. structure; prediction of spectra for a given chem. structure and putative metabolite identification-a predicted ranking of possible candidate structures for a target spectrum. The algorithms used for these tasks are based on Competitive Fragmentation Modeling (CFM), a recently introduced probabilistic generative model for the MS/MS fragmentation process that uses machine learning techniques to learn its parameters from data. These algorithms have been extensively tested on multiple datasets and have been shown to out-perform existing methods such as MetFrag and FingerId. This web server provides a simple interface for using these algorithms and a graphical display of the resulting annotations, spectra and structures. CFM-ID is made freely available at http://cfmid.wishartlab.com.
- 59Djoumbou-Feunang, Y.; Pon, A.; Karu, N.; Zheng, J.; Li, C.; Arndt, D.; Gautam, M.; Allen, F.; Wishart, D. S. CFM-ID 3.0: Significantly Improved ESI-MS/MS Prediction and Compound Identification. Metabolites 2019, 9, 72, DOI: 10.3390/metabo904007259CFM-ID 3.0: significantly improved ESI-MS/MS prediction and compound identificationDjoumbou-Feunang, Yannick; Pon, Allison; Karu, Naama; Zheng, Jiamin; Li, Carin; Arndt, Dav; Gautam, Maheswor; Allen, Felicity; Wishart, Dav S.Metabolites (2019), 9 (4), 72CODEN: METALU; ISSN:2218-1989. (MDPI AG)Metabolite identification for untargeted metabolomics is often hampered by the lack of exptl. collected ref. spectra from tandem mass spectrometry (MS/MS). To circumvent this problem, Competitive Fragmentation Modeling-ID (CFM-ID) was developed to accurately predict electrospray ionization-MS/MS (ESI-MS/MS) spectra from chem. structures and to aid in compd. identification via MS/MS spectral matching. While earlier versions of CFM-ID performed very well, CFM-ID's performance for predicting the MS/MS spectra of certain classes of compds., including many lipids, was quite poor. Furthermore, CFM-ID's compd. identification capabilities were limited because it did not use exptl. available MS/MS spectra nor did it exploit metadata in its spectral matching algorithm. Here, we describe significant improvements to CFM-ID's performance and speed. These include (1) the implementation of a rule-based fragmentation approach for lipid MS/MS spectral prediction, which greatly improves the speed and accuracy of CFM-ID; (2) the inclusion of exptl. MS/MS spectra and other metadata to enhance CFM-ID's compd. identification abilities; (3) the development of new scoring functions that improves CFM-ID's accuracy by 21.1%; and (4) the implementation of a chem. classification algorithm that correctly classifies unknown chems. (based on their MS/MS spectra) in >80% of the cases. This improved version called CFM-ID 3.0 is freely available as a web server.
- 60Ji, H.; Deng, H.; Lu, H.; Zhang, Z. Predicting a Molecular Fingerprint from an Electron Ionization Mass Spectrum with Deep Neural Networks. Anal. Chem. 2020, 92, 8649– 8653, DOI: 10.1021/acs.analchem.0c0145060Predicting a Molecular Fingerprint from an Electron Ionization Mass Spectrum with Deep Neural NetworksJi, Hongchao; Deng, Hanzi; Lu, Hongmei; Zhang, ZhiminAnalytical Chemistry (Washington, DC, United States) (2020), 92 (13), 8649-8653CODEN: ANCHAM; ISSN:0003-2700. (American Chemical Society)Electron ionization-mass spectrometry (EI-MS) hyphenated to gas chromatog. (GC) is the workhorse for analyzing volatile compds. in complex samples. The spectral matching method can only identify compds. within the spectral database. In response, we present a deep-learning-based approach (DeepEI) for structure elucidation of an unknown compd. with its EI-MS spectrum. DeepEI employs deep neural networks to predict mol. fingerprints from an EI-MS spectrum and searches the mol. structure database with the predicted fingerprints. We evaluated DeepEI with MassBank spectra, and the results indicate DeepEI is an effective identification method. In addn., DeepEI can work cooperatively with database spectral matching and NEIMS (fingerprint to spectrum method) to improve identification accuracy.
- 61Xue, J.; Guijas, C.; Benton, H. P.; Warth, B.; Siuzdak, G. METLIN MS 2 Molecular Standards Database: A Broad Chemical and Biological Resource. Nat. Methods 2020, 17, 953– 954, DOI: 10.1038/s41592-020-0942-561METLIN MS2 molecular standards database: a broad chemical and biological resourceXue, Jingchuan; Guijas, Carlos; Benton, H. Paul; Warth, Benedikt; Siuzdak, GaryNature Methods (2020), 17 (10), 953-954CODEN: NMAEA3; ISSN:1548-7091. (Nature Research)There is no expanded citation for this reference.
- 62Horai, H.; Arita, M.; Kanaya, S.; Nihei, Y.; Ikeda, T.; Suwa, K.; Ojima, Y.; Tanaka, K.; Tanaka, S.; Aoshima, K.; Oda, Y.; Kakazu, Y.; Kusano, M.; Tohge, T.; Matsuda, F.; Sawada, Y.; Hirai, M. Y.; Nakanishi, H.; Ikeda, K.; Akimoto, N.; Maoka, T.; Takahashi, H.; Ara, T.; Sakurai, N.; Suzuki, H.; Shibata, D.; Neumann, S.; Iida, T.; Tanaka, K.; Funatsu, K.; Matsuura, F.; Soga, T.; Taguchi, R.; Saito, K.; Nishioka, T. MassBank: A Public Repository for Sharing Mass Spectral Data for Life Sciences. J. Mass Spectrom. 2010, 45, 703– 714, DOI: 10.1002/jms.177762MassBank: a public repository for sharing mass spectral data for life sciencesHorai, Hisayuki; Arita, Masanori; Kanaya, Shigehiko; Nihei, Yoshito; Ikeda, Tasuku; Suwa, Kazuhiro; Ojima, Yuya; Tanaka, Kenichi; Tanaka, Satoshi; Aoshima, Ken; Oda, Yoshiya; Kakazu, Yuji; Kusano, Miyako; Tohge, Takayuki; Matsuda, Fumio; Sawada, Yuji; Hirai, Masami Yokota; Nakanishi, Hiroki; Ikeda, Kazutaka; Akimoto, Naoshige; Maoka, Takashi; Takahashi, Hiroki; Ara, Takeshi; Sakurai, Nozomu; Suzuki, Hideyuki; Shibata, Daisuke; Neumann, Steffen; Iida, Takashi; Tanaka, Ken; Funatsu, Kimito; Matsuura, Fumito; Soga, Tomoyoshi; Taguchi, Ryo; Saito, Kazuki; Nishioka, TakaakiJournal of Mass Spectrometry (2010), 45 (7), 703-714CODEN: JMSPFJ; ISSN:1076-5174. (John Wiley & Sons Ltd.)MassBank is the first public repository of mass spectra of small chem. compds. for life sciences (<3000 Da). The database contains 605 electron-ionization mass spectrometry(EI-MS), 137 fast atom bombardment MS and 9276 electrospray ionization (ESI)-MSn data of 2337 authentic compds. of metabolites, 11 545 EI-MS and 834 other-MS data of 10 286 volatile natural and synthetic compds., and 3045 ESI-MS2 data of 679 synthetic drugs contributed by 16 research groups (Jan. 2010). ESI-MS2 data were analyzed under nonstandardized, independent exptl. conditions. MassBank is a distributed database. Each research group provides data from its own MassBank data servers distributed on the Internet. MassBank users can access either all of the MassBank data or a subset of the data by specifying one or more exptl. conditions. In a spectral search to retrieve mass spectra similar to a query mass spectrum, the similarity score is calcd. by a weighted cosine correlation in which weighting exponents on peak intensity and the mass-to-charge ratio are optimized to the ESI-MS2 data. MassBank also provides a merged spectrum for each compd. prepd. by merging the analyzed ESI-MS2 data on an identical compd. under different collision-induced dissocn. conditions. Data merging has significantly improved the precision of the identification of a chem. compd. by 21-23% at a similarity score of 0.6. Thus, MassBank is useful for the identification of chem. compds. and the publication of exptl. data.
- 63NIST. NIST 20 Tandem Mass Spectral Libraries; https://chemdata.nist.gov/dokuwiki/doku.php?id=chemdata:msms (accessed 2022-07-19).There is no corresponding record for this reference.
- 64Sans, V.; Porwol, L.; Dragone, V.; Cronin, L. A Self Optimizing Synthetic Organic Reactor System Using Real-Time in-Line NMR Spectroscopy. Chem. Sci. 2015, 6, 1258– 1264, DOI: 10.1039/C4SC03075C64A self optimizing synthetic organic reactor system using real-time in-line NMR spectroscopySans, Victor; Porwol, Luzian; Dragone, Vincenza; Cronin, LeroyChemical Science (2015), 6 (2), 1258-1264CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)A configurable platform for synthetic chem. incorporating an in-line bench-top NMR capable of monitoring and controlling org. reactions in real-time is discussed. The platform is controlled by a modular LabView software control system for hardware, NMR, data anal., and feedback optimization. Using this platform, real-time advanced structural characterization of reaction mixts., including 19F, 13C, DEPT, 2-dimensional NMR spectroscopy (COSY, HSQC, 19F-COSY), are reported for the first time. The potential of this technique was demonstrated by optimizing a catalytic org. reaction in real-time, showing its applicability to self-optimizing systems using criteria such as stereo-selectivity, multi-nuclear measurements, or 2-dimensional correlations.
- 65Granda, J. M.; Donina, L.; Dragone, V.; Long, D.-L.; Cronin, L. Controlling an Organic Synthesis Robot with Machine Learning to Search for New Reactivity. Nature 2018, 559, 377– 381, DOI: 10.1038/s41586-018-0307-865Controlling an organic synthesis robot with machine learning to search for new reactivityGranda, Jaroslaw M.; Donina, Liva; Dragone, Vincenza; Long, De-Liang; Cronin, LeroyNature (London, United Kingdom) (2018), 559 (7714), 377-381CODEN: NATUAS; ISSN:0028-0836. (Nature Research)The discovery of chem. reactions is an inherently unpredictable and time-consuming process. An attractive alternative is to predict reactivity, although relevant approaches, such as computer-aided reaction design, are still in their infancy. Reaction prediction based on high-level quantum chem. methods is complex, even for simple mols. Although machine learning is powerful for data anal., its applications in chem. are still being developed. Inspired by strategies based on chemists' intuition, we propose that a reaction system controlled by a machine learning algorithm may be able to explore the space of chem. reactions quickly, esp. if trained by an expert. Here we present an org. synthesis robot that can perform chem. reactions and anal. faster than they can be performed manually, as well as predict the reactivity of possible reagent combinations after conducting a small no. of expts., thus effectively navigating chem. reaction space. By using machine learning for decision making, enabled by binary encoding of the chem. inputs, the reactions can be assessed in real time using NMR and IR spectroscopy. The machine learning system was able to predict the reactivity of about 1,000 reaction combinations with accuracy greater than 80 per cent after considering the outcomes of slightly over 10 per cent of the dataset. This approach was also used to calc. the reactivity of published datasets. Further, by using real-time data from our robot, these predictions were followed up manually by a chemist, leading to the discovery of four reactions.
- 66Maschmeyer, T.; Prieto, P. L.; Grunert, S.; Hein, J. E. Exploration of Continuous-Flow Benchtop NMR Acquisition Parameters and Considerations for Reaction Monitoring. Magn. Reson. Chem. 2020, 58, 1234– 1248, DOI: 10.1002/mrc.509466Exploration of continuous-flow benchtop NMR acquisition parameters and considerations for reaction monitoringMaschmeyer, Tristan; Prieto, Paloma L.; Grunert, Shad; Hein, Jason E.Magnetic Resonance in Chemistry (2020), 58 (12), 1234-1248CODEN: MRCHEG; ISSN:0749-1581. (John Wiley & Sons Ltd.)This study focused on fundamental data acquisition parameter selection for a benchtop NMR (NMR) system with continuous flow, applicable for reaction monitoring. The effect of flow rate on the mixing behaviors within a flow cell was obsd., along with an exponential decay relationship between flow rate and the apparent spin-lattice relaxation time (T1*) of benzaldehyde. We also monitored sensitivity (as detd. by signal-to-noise ratios; SNRs) under various flow rates, analyte concns., and temps. of the analyte flask. Results suggest that a max. SNR can be achieved with low to medium flow rates and higher analyte concns. This was consistent with data collected with parameters that promote either slow or fast data acquisition. We further consider the effect of these conditions on the analyte's residence time, T1*, and magnetic field inhomogeneity that is a product of continuous flow. Altogether, our results demonstrate how fundamental acquisition parameters can be manipulated to achieve optimal data acquisition in continuous-flow NMR systems.
- 67Chatterjee, S.; Guidi, M.; Seeberger, P. H.; Gilmore, K. Automated Radial Synthesis of Organic Molecules. Nature 2020, 579, 379– 384, DOI: 10.1038/s41586-020-2083-567Automated radial synthesis of organic moleculesChatterjee, Sourav; Guidi, Mara; Seeberger, Peter H.; Gilmore, KerryNature (London, United Kingdom) (2020), 579 (7799), 379-384CODEN: NATUAS; ISSN:0028-0836. (Nature Research)Abstr.: Automated synthesis platforms accelerate and simplify the prepn. of mols. by removing the phys. barriers to org. synthesis. This provides unrestricted access to biopolymers and small mols. via reproducible and directly comparable chem. processes. Current automated multistep syntheses rely on either iterative1-4 or linear processes5-9, and require compromises in terms of versatility and the use of equipment. Here we report an approach towards the automated synthesis of small mols., based on a series of continuous flow modules that are radially arranged around a central switching station. Using this approach, concise vols. can be exposed to any reaction conditions required for a desired transformation. Sequential, non-simultaneous reactions can be combined to perform multistep processes, enabling the use of variable flow rates, reuse of reactors under different conditions, and the storage of intermediates. This fully automated instrument is capable of both linear and convergent syntheses and does not require manual reconfiguration between different processes. The capabilities of this approach are demonstrated by performing optimizations and multistep syntheses of targets, varying concns. via inline dilns., exploring several strategies for the multistep synthesis of the anticonvulsant drug rufinamide10, synthesizing eighteen compds. of two deriv. libraries that are prepd. using different reaction pathways and chemistries, and using the same reagents to perform metallaphotoredox carbon-nitrogen cross-couplings11 in a photochem. module-all without instrument reconfiguration.
- 68Bahr, M. N.; Damon, D. B.; Yates, S. D.; Chin, A. S.; Christopher, J. D.; Cromer, S.; Perrotto, N.; Quiroz, J.; Rosso, V. Collaborative Evaluation of Commercially Available Automated Powder Dispensing Platforms for High-Throughput Experimentation in Pharmaceutical Applications. Org. Process Res. Dev. 2018, 22, 1500– 1508, DOI: 10.1021/acs.oprd.8b0025968Collaborative Evaluation of Commercially Available Automated Powder Dispensing Platforms for High-Throughput Experimentation in Pharmaceutical ApplicationsBahr, Matthew N.; Damon, David B.; Yates, Simon D.; Chin, Alexander S.; Christopher, J. David; Cromer, Samuel; Perrotto, Nicholas; Quiroz, Jorge; Rosso, VictorOrganic Process Research & Development (2018), 22 (11), 1500-1508CODEN: OPRDFK; ISSN:1083-6160. (American Chemical Society)Many workflows in Pharmaceutical R&D involve the manipulation of defined amts. of powders. Automated powder dispensing platforms are currently available; however, these existing technologies do not meet the requirements for every high-throughput experimentation powder dispensing application. A Working Group (WG) composed of pharmaceutical researchers within the Enabling Technologies Consortium (ETC) evaluated automated platforms com. available from three manufacturers using an objective, systematic testing protocol. This paper describes the selection of powders and testing conditions used in this evaluation, and it assesses the impact that the powders, testing conditions, equipment environment, and other factors had on the performance of the selected platforms.
- 69Bahr, M. N.; Morris, M. A.; Tu, N. P.; Nandkeolyar, A. Recent Advances in High-Throughput Automated Powder Dispensing Platforms for Pharmaceutical Applications. Org. Process Res. Dev. 2020, 24, 2752, DOI: 10.1021/acs.oprd.0c0041169Recent Advances in High-Throughput Automated Powder Dispensing Platforms for Pharmaceutical ApplicationsBahr, Matthew N.; Morris, Mark A.; Tu, Noah P.; Nandkeolyar, AakankschitOrganic Process Research & Development (2020), 24 (11), 2752-2761CODEN: OPRDFK; ISSN:1083-6160. (American Chemical Society)A wide array of pharmaceutical research studies involve dispensing a variety of powders such as active ingredients, intermediates, catalysts, and formulation excipients. Automated powder dispensing platforms are increasingly relied upon to perform the mundane task of filling vials in multi-well plates for high-throughput experimentation workflows. A small group of pharmaceutical scientists collaborated to evaluate recent advances in com. available automation platforms from two instrument manufacturers using previously reported objective and systematic testing protocols. This manuscript details the testing conditions used for the evaluation and the results obtained and assesses the impact that the powder characteristics had on the performance of the selected platforms through statistical anal.
- 70Tu, N. P.; Dombrowski, A. W.; Goshu, G. M.; Vasudevan, A.; Djuric, S. W.; Wang, Y. High-Throughput Reaction Screening with Nanomoles of Solid Reagents Coated on Glass Beads. Angew. Chem., Int. Ed. 2019, 58, 7987– 7991, DOI: 10.1002/anie.20190053670High-Throughput Reaction Screening with Nanomoles of Solid Reagents Coated on Glass BeadsTu, Noah P.; Dombrowski, Amanda W.; Goshu, Gashaw M.; Vasudevan, Anil; Djuric, Stevan W.; Wang, YingAngewandte Chemie, International Edition (2019), 58 (24), 7987-7991CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)Technologies that enable rapid screening of diverse reaction conditions are of crit. importance to methodol. development and reaction optimization, esp. when mols. of high complexity and scarcity are involved. The lack of a general solid dispensing method for chem. reagents on micro- and nanomole scale prevents the full use of reaction screening technologies. The authors herein report the development of a technol. in which glass beads coated with solid chem. reagents (ChemBeads) enable the delivery of nanomole quantities of solid chem. reagents efficiently. By exploring the concept of preferred screening sets, the flexibility and generality of this technol. for high-throughput reaction screening was validated.
- 71Shiri, P.; Lai, V.; Zepel, T.; Griffin, D.; Reifman, J.; Clark, S.; Grunert, S.; Yunker, L. P. E.; Steiner, S.; Situ, H.; Yang, F.; Prieto, P. L.; Hein, J. E. Automated Solubility Screening Platform Using Computer Vision. iScience 2021, 24, 102176, DOI: 10.1016/j.isci.2021.10217671Automated solubility screening platform using computer visionShiri, Parisa; Lai, Veronica; Zepel, Tara; Griffin, Daniel; Reifman, Jonathan; Clark, Sean; Grunert, Shad; Yunker, Lars P. E.; Steiner, Sebastian; Situ, Henry; Yang, Fan; Prieto, Paloma L.; Hein, Jason E.iScience (2021), 24 (3), 102176CODEN: ISCICE; ISSN:2589-0042. (Elsevier B.V.)Soly. screening is an essential, routine process that is often labor intensive. Robotic platforms have been developed to automate some aspects of the manual labor involved. However, many of the existing systems rely on traditional analytic techniques such as high-performance liq. chromatog., which require pre-calibration for each compd. and can be resource consuming. In addn., automation is not typically end-to-end, requiring user intervention to move vials, establish anal. methods for each compd. and interpret the raw data. We developed a closed-loop, flexible robotic system with integrated solid and liq. dosing capabilities that relies on computer vision and iterative feedback to successfully measure caffeine soly. in multiple solvents. After initial researcher input (<2 min), the system ran autonomously, screening five different solvent systems (20-80 min each). The resulting soly. values matched those obtained using traditional manual techniques.
- 72J3016C: Taxonomy and Definitions for Terms Related to Driving Automation Systems for On-Road Motor Vehicles ; 2021; https://www.sae.org/standards/content/j3016_202104.There is no corresponding record for this reference.
- 73Ruiz-Castillo, P.; Buchwald, S. L. Applications of Palladium-Catalyzed C–N Cross-Coupling Reactions. Chem. Rev. 2016, 116, 12564– 12649, DOI: 10.1021/acs.chemrev.6b0051273Applications of Palladium-Catalyzed C-N Cross-Coupling ReactionsRuiz-Castillo, Paula; Buchwald, Stephen L.Chemical Reviews (Washington, DC, United States) (2016), 116 (19), 12564-12649CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)A review. Pd-catalyzed cross-coupling reactions that form C-N bonds have become useful methods to synthesize anilines and aniline derivs., an important class of compds. throughout chem. research. A key factor in the widespread adoption of these methods has been the continued development of reliable and versatile catalysts that function under operationally simple, user-friendly conditions. This review provides an overview of Pd-catalyzed N-arylation reactions found in both basic and applied chem. research from 2008 to the present. Selected examples of C-N cross-coupling reactions between nine classes of nitrogen-based coupling partners and (pseudo)aryl halides are described for the synthesis of heterocycles, medicinally relevant compds., natural products, org. materials, and catalysts.
- 74Chinchilla, R.; Nájera, C. The Sonogashira Reaction: A Booming Methodology in Synthetic Organic Chemistry. Chem. Rev. 2007, 107, 874– 922, DOI: 10.1021/cr050992x74The Sonogashira reaction: a booming methodology in synthetic organic chemistryChinchilla, Rafael; Najera, CarmenChemical Reviews (Washington, DC, United States) (2007), 107 (3), 874-922CODEN: CHREAY; ISSN:0009-2665. (American Chemical Society)A review. The palladium-catalyzed sp2-sp coupling reaction between aryl or alkenyl halides or triflates and terminal alkynes, with or without the presence of copper(I) cocatalyst, was reviewed. The mechanism of Sonogashira reaction was discussed, and the catalysts and reaction conditions were explored as well. In addn., the application of Sonogashira cross-coupling reaction was also introduced.
- 75Breugst, M.; Reissig, H.-U. The Huisgen Reaction: Milestones of the 1,3-Dipolar Cycloaddition. Angew. Chem., Int. Ed. 2020, 59, 12293– 12307, DOI: 10.1002/anie.20200311575The Huisgen Reaction: Milestones of the 1,3-Dipolar CycloadditionBreugst, Martin; Reissig, Hans-UlrichAngewandte Chemie, International Edition (2020), 59 (30), 12293-12307CODEN: ACIEF5; ISSN:1433-7851. (Wiley-VCH Verlag GmbH & Co. KGaA)A review. The concept of 1,3-dipolar cycloaddns. was presented by Rolf Huisgen 60 years ago. Previously unknown reactive intermediates, for example azomethine ylides, were introduced to org. chem. and the (3+2) cycloaddns. of 1,3-dipoles to multiple-bond systems (Huisgen reaction) developed into one of the most versatile synthetic methods in heterocyclic chem. In this Review, the authors present the history of this research area, highlight important older reports, and describe the evolution and further development of the concept. The most important mechanistic and synthetic results are discussed. Quantum-mech. calcns. support the concerted mechanism always favored by R. Huisgen; however, in extreme cases intermediates may be involved. The impact of 1,3-dipolar cycloaddns. on the click chem. concept of K. B. Sharpless will also be discussed.
- 76Molga, K.; Szymkuć, S.; Gołębiowska, P.; Popik, O.; Dittwald, P.; Moskal, M.; Roszak, R.; Mlynarski, J.; Grzybowski, B. A. A Computer Algorithm to Discover Iterative Sequences of Organic Reactions. Nat. Synth. 2022, 1, 49– 58, DOI: 10.1038/s44160-021-00010-3There is no corresponding record for this reference.
- 77Eppel, S.; Xu, H.; Bismuth, M.; Aspuru-Guzik, A. Computer Vision for Recognition of Materials and Vessels in Chemistry Lab Settings and the Vector-LabPics Data Set. ACS Cent. Sci. 2020, 6, 1743– 1752, DOI: 10.1021/acscentsci.0c0046077Computer Vision for Recognition of Materials and Vessels in Chemistry Lab Settings and the Vector-LabPics Data SetEppel, Sagi; Xu, Haoping; Bismuth, Mor; Aspuru-Guzik, AlanACS Central Science (2020), 6 (10), 1743-1752CODEN: ACSCII; ISSN:2374-7951. (American Chemical Society)This work presents a machine learning approach for the computer vision-based recognition of materials inside vessels in the chem. lab and other settings. In addn., we release a data set assocd. with the training of the model for further model development. The task to learn is finding the region, boundaries, and category for each material phase and vessel in an image. Handling materials inside mostly transparent containers is the main activity performed by human and robotic chemists in the lab. Visual recognition of vessels and their contents is essential for performing this task. Modern machine-vision methods learn recognition tasks by using data sets contg. a large no. of annotated images. This work presents the Vector-LabPics data set, which consists of 2187 images of materials within mostly transparent vessels in a chem. lab and other general settings. The images are annotated for both the vessels and the individual material phases inside them, and each instance is assigned one or more classes (liq., solid, foam, suspension, powder, ...). The fill level, labels, corks, and parts of the vessel are also annotated. Several convolutional nets for semantic and instance segmentation were trained on this data set. The trained neural networks achieved good accuracy in detecting and segmenting vessels and material phases, and in classifying liqs. and solids, but relatively low accuracy in segmenting multiphase systems such as phase-sepg. liqs. A computer vision system for recognition materials and vessels in the chem. lab. The system is based on the new LabPics image data set and convolutional neural nets for image segmentation.
- 78Steiner, S.; Wolf, J.; Glatzel, S.; Andreou, A.; Granda, J. M.; Keenan, G.; Hinkley, T.; Aragon-Camarasa, G.; Kitson, P. J.; Angelone, D.; Cronin, L. Organic Synthesis in a Modular Robotic System Driven by a Chemical Programming Language. Science 2019, 363, 1, DOI: 10.1126/science.aav2211There is no corresponding record for this reference.
- 79Mehr, S. H. M.; Craven, M.; Leonov, A. I.; Keenan, G.; Cronin, L. A Universal System for Digitization and Automatic Execution of the Chemical Synthesis Literature. Science 2020, 370, 101– 108, DOI: 10.1126/science.abc298679A universal system for digitization and automatic execution of the chemical synthesis literatureMehr, S. Hessam M.; Craven, Matthew; Leonov, Artem I.; Keenan, Graham; Cronin, LeroyScience (Washington, DC, United States) (2020), 370 (6512), 101-108CODEN: SCIEAS; ISSN:1095-9203. (American Association for the Advancement of Science)Robotic systems for chem. synthesis are growing in popularity but can be difficult to run and maintain because of the lack of a std. operating system or capacity for direct access to the literature through natural language processing. Here we show an extendable chem. execution architecture that can be populated by automatically reading the literature, leading to a universal autonomous workflow. The robotic synthesis code can be cor. in natural language without any programming knowledge and, because of the std., is hardware independent. This chem. code can then be combined with a graph describing the hardware modules and compiled into platform-specific, low-level robotic instructions for execution. We showcase automated syntheses of 12 compds. from the literature, including the analgesic lidocaine, the Dess-Martin periodinane oxidn. reagent, and the fluorinating agent AlkylFluor.
- 80Acceleration Consortium; https://acceleration.utoronto.ca (accessed 2021-06-10).There is no corresponding record for this reference.