Elements of RNA Design
- Paul D. CarlsonPaul D. CarlsonRobert F. Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, New York 14853, United StatesCenter for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United StatesMore by Paul D. Carlson
- and
- Julius B. Lucks*Julius B. Lucks*Phone: 1.847.467.2943. Fax: 1.847.491.3728. E-mail: [email protected]Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United StatesDepartment of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, United StatesMore by Julius B. Lucks
SPECIAL ISSUE
This article is part of the
The versatility of naturally occurring RNAs that perform central cellular functions has generated intense interest in designing engineered RNAs to control these processes. (1,2) Mechanistic studies of natural RNA systems that sense and respond to changes in temperature, small molecule or metabolite concentration, and the expression of other RNA and protein species in the cell have been combined with advances in computational nucleic acid design to enable the de novo design of synthetic RNA regulators to control gene expression. While progress has been remarkable, (2−4) challenges remain. Here we outline important progress and emerging technologies for state-of-the-art RNA design and challenges which must be addressed to enable the next generation of synthetic RNA devices that can be used in applications including biomanufacturing, therapeutics, and diagnostics (Figure 1).
Figure 1

Figure 1. Overview of the emerging design–test–learn cycle for RNA engineering. Natural RNA systems inspire the in silico design of synthetic versions with novel or optimized performance. Important considerations for design are kinetic vs thermodynamic folding regimes, how to include ligand and protein interactions, and whether to design at the 2D or 3D level. Functional testing and structural testing with high-throughput RNA structure chemical probing measurements can then be used to characterize the designed RNAs and troubleshoot unintended folding or expression patterns. These design and testing schemes in turn teach us about new principles for increasing the complexity and utility of synthetic RNA-based regulators. RT, reverse transcription.
A primary consideration in RNA design is the RNA folding regime most appropriate for a desired function. Two principle RNA folding regimes are most important, depending on which point in its lifecycle the RNA performs its function. Kinetic design focuses on the out-of-equilibrium folding processes that occur during RNA synthesis, where interactions and refolding events must occur during the fast process of transcription. Examples of RNAs that need kinetic design include riboswitches that make transcriptional regulatory decisions as a function of intracellular ligand concentration. On the other hand, thermodynamic design focuses on RNA functions that can occur over longer time scales, and is typically appropriate when interactions occur post-transcriptionally and are driven by free energy differences between possible RNA structures and interactions.
Most work in RNA synthetic biology to date has focused on thermodynamic design, primarily because we know the most about equilibrium RNA folding. One example is the toehold switch, a computationally designed RNA system that controls translation in response to the binding of a designed RNA trigger. Toehold switches allow tight control of translation and have enabled the most complex RNA-based synthetic gene regulation to date that can implement the in vivo evaluation of a 12-input logic function. (4) Kinetic RNA folding pathways on the other hand are generally more difficult to computationally predict, and therefore, designing in the kinetic regime has remained a challenge for RNA synthetic biology. If this challenge can be overcome, however, the kinetic RNA folding regime has several potential advantages, including the ability to regulate transcription with designed RNAs that would enable RNA-only genetic circuits. For example, de novo-designed transcriptional activators called small transcription activating RNAs (STARs) have been used to create genetic networks that display sophisticated temporal patterns of gene expression, including an incoherent feed forward loop governed by the STAR-mediated control of guide RNA synthesis for downstream CRISPR interference. (3) Both of these examples are evidence that there has been strong progress in RNA synthetic biology, with computationally designed STARs and toehold switches achieving dynamic ranges exceeding those of many protein-based systems. (3,4)
Another element to consider is the design of RNA interactions with small molecules, ions, and proteins. These interactions allow natural RNA regulators to serve as biosensors of cellular state. However, ligand–RNA and protein–RNA interactions are difficult to model in silico, as design algorithms cannot currently capture the many noncanonical structural contexts often needed to create specific binding interactions. Recently, progress was made in the computational design of synthetic riboswitches by forcing the fold of the aptamer domain and accounting for the additional energy of stabilization from ligand binding, representing an important step forward in effectively modeling these complex RNA–ligand interactions in synthetic systems. (2) However, substantial work remains to enable the completely de novo design of such interactions, and will require advancements such as the integration of RNA and protein design algorithms, as well as the further development of strategies to abstract the complexity of RNA–ligand binding to generate simplified design motifs.
Dimensionality is also an important element of RNA design. Most in silico design methods have been limited to two-dimensional (2D) design and typically do not allow for the consideration of the three-dimensional (3D) tertiary interactions that can be critical for proper RNA function. Designing at the level of secondary structure is often a suitable abstraction to achieve successful designs, particularly when RNA function is based on the conditional formation of base pair interactions. (3,4) However, designing more complex regulation strategies will require improved folding models to account for the three-dimensional nature of RNA folding. Recently, there has been progress in bridging the gap between de novo 2D and 3D design, where structural motifs taken from solved crystal structures were used as building blocks to design complex 3D structures including linkers for synthetic tethered ribosomes. (5) This work provides a glimpse of the staggering breadth of novel functions that de novo 3D design could enable.
Following design, functional and structural characterization form the backbone of the testing platform for RNA engineering. Functional testing links expression of a designed RNA network to a measurable cellular output such as gene expression. However, it can be difficult to explain discrepancies between the designed structure of an RNA regulatory network and an undesired functional output. A unique tool for RNA-based design is the broad suite of high-throughput chemical probing technologies that have been developed to characterize RNA structures in a range of folding regimes including in vitro, in vivo, and cotranscriptionally folded. Chemical probes covalently modify an RNA of interest in a structure-dependent manner, with more flexible regions being more readily modified. The positions of these modifications are then detected by reverse transcription and high throughput sequencing. The distribution of modifications is then used to calculate a reactivity at each position, with higher reactivities corresponding to more unstructured positions. These experiments can capture structural features beyond simple base-pairing, such as protein or ligand binding, base stacking, and tertiary interactions. Since these features can all be difficult or impossible to predict with current RNA folding models, chemical probing experiments are an important bridge between in silico design and in vivo function, accelerating the design–test–learn cycle. In a recent example of this integrated approach, parallel functional and structural testing of the pT181 RNA transcriptional repressor revealed a new design principle related to the structural flexibility of an RNA–RNA interaction domain, enabling the forward engineering of new repressor variants. (1) A question at the forefront of RNA synthetic biology is how to best integrate high-throughput RNA structural information within design algorithms.
While much has been learned in recent years, several challenges remain for the creation of complex RNA networks. First, the burden imposed by synthetic circuitry can be deleterious to cellular fitness. Synthetic RNAs are typically overexpressed relative to endogenous RNAs in order to minimize diffusion limitations and maximize regulator performance. (3,4) For small networks, this overexpression does not have a substantial impact on cellular fitness, but larger networks containing multiple layers of regulation or many individual species can lead to drastically reduced cellular fitness. Strategies to obviate the need for overexpression, such as the development of synthetic organelle-like vesicles to increase the local concentration of interacting species, will be necessary to minimize cellular burden for complex genetic circuitry. Context effects, that is, the spatial ordering of individual functional elements along an RNA transcript, can also impede the functional performance of complex RNA regulatory networks. Elements that behave as expected in isolation may interact with adjacent functional elements when expressed as part of a larger network; these long-range interactions are difficult to predict a priori and currently require a trial-and-error approach to test which organizational schemes perform best. Isolation strategies, including the placement of cleavable RNA elements between functional domains, (3) can help reduce these effects, although improved structural design principles are still needed. Finally, limited portability between organisms remains a fundamental roadblock. Theoretically, RNA folding could be robust to different cellular environments, which is an intriguing potential advantage of RNA regulation. However, in practice, RNA regulators are generally optimized to function in a single host and rely on cellular machinery—such as RNA polymerases (3) and ribosomes (4)—for proper function. Transfer of regulatory mechanisms to nonmodel organisms has therefore been difficult, limiting the widespread adoption of these regulation strategies. Decoupling these strategies from organism-specific features of cellular factors could facilitate the efficient transfer of RNA regulators to new hosts.
Here we have outlined several elements of RNA design that have fueled significant progress in RNA synthetic biology. The second edition of these elements will likely include deeper insights that distill the nuances of the structure–function relationships of RNA into principles of RNA design. On the basis of the rapid and impressive progress in RNA synthetic biology thus far, we anticipate designed RNA systems to play an increasingly important role in biotechnologies and to become powerful synthetic tools for unraveling the function of natural RNAs.
Acknowledgments
The authors thank Cameron Glasscock and Molly Evans for helpful comments in preparing this manuscript.
References
This article references 5 other publications.
- 1Takahashi, M. K., Watters, K. E., Gasper, P. M., Abbott, T. R., Carlson, P. D., Chen, A. A., and Lucks, J. B. (2016) Using in-cell SHAPE-Seq and simulations to probe structure–function design principles of RNA transcriptional regulators. RNA 22, 920– 933, DOI: 10.1261/rna.054916.115
- 2Espah Borujeni, A., Mishler, D. M., Wang, J., Huso, W., and Salis, H. M. (2016) Automated physics-based design of synthetic riboswitches from diverse RNA aptamers. Nucleic Acids Res. 44, 1– 13, DOI: 10.1093/nar/gkv1289[Crossref], [PubMed], [CAS], Google Scholar2https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC28vms1ahtw%253D%253D&md5=51655b5890cbbc6f9e6d7696c915ff77Automated physics-based design of synthetic riboswitches from diverse RNA aptamersEspah Borujeni Amin; Huso Walker; Mishler Dennis M; Wang Jingzhi; Salis Howard MNucleic acids research (2016), 44 (1), 1-13 ISSN:.Riboswitches are shape-changing regulatory RNAs that bind chemicals and regulate gene expression, directly coupling sensing to cellular actuation. However, it remains unclear how their sequence controls the physics of riboswitch switching and activation, particularly when changing the ligand-binding aptamer domain. We report the development of a statistical thermodynamic model that predicts the sequence-structure-function relationship for translation-regulating riboswitches that activate gene expression, characterized inside cells and within cell-free transcription-translation assays. Using the model, we carried out automated computational design of 62 synthetic riboswitches that used six different RNA aptamers to sense diverse chemicals (theophylline, tetramethylrosamine, fluoride, dopamine, thyroxine, 2,4-dinitrotoluene) and activated gene expression by up to 383-fold. The model explains how aptamer structure, ligand affinity, switching free energy and macromolecular crowding collectively control riboswitch activation. Our model-based approach for engineering riboswitches quantitatively confirms several physical mechanisms governing ligand-induced RNA shape-change and enables the development of cell-free and bacterial sensors for diverse applications.
- 3Chappell, J., Westbrook, A., Verosloff, M., and Lucks, J. B. (2017) Computational design of small transcription activating RNAs for versatile and dynamic gene regulation. Nat. Commun. 8, 1051, DOI: 10.1038/s41467-017-01082-6[Crossref], [PubMed], [CAS], Google Scholar3https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC1M7is1eqsw%253D%253D&md5=1c76725cdaaace499e331199ee36bd5cComputational design of small transcription activating RNAs for versatile and dynamic gene regulationChappell James; Lucks Julius B; Westbrook Alexandra; Verosloff Matthew; Lucks Julius BNature communications (2017), 8 (1), 1051 ISSN:.A longstanding goal of synthetic biology has been the programmable control of cellular functions. Central to this is the creation of versatile regulatory toolsets that allow for programmable control of gene expression. Of the many regulatory molecules available, RNA regulators offer the intriguing possibility of de novo design-allowing for the bottom-up molecular-level design of genetic control systems. Here we present a computational design approach for the creation of a bacterial regulator called Small Transcription Activating RNAs (STARs) and create a library of high-performing and orthogonal STARs that achieve up to ~ 9000-fold gene activation. We demonstrate the versatility of these STARs-from acting synergistically with existing constitutive and inducible regulators, to reprogramming cellular phenotypes and controlling multigene metabolic pathway expression. Finally, we combine these new STARs with themselves and CRISPRi transcriptional repressors to deliver new types of RNA-based genetic circuitry that allow for sophisticated and temporal control of gene expression.
- 4Green, A. A., Kim, J., Ma, D., Silver, P. A., Collins, J. J., and Yin, P. (2017) Complex cellular logic computation using ribocomputing devices. Nature 548, 117– 121, DOI: 10.1038/nature23271[Crossref], [PubMed], [CAS], Google Scholar4https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXht1ars73K&md5=6f9fc931b688dcef1a36f9d6ae172a50Complex cellular logic computation using ribocomputing devicesGreen, Alexander A.; Kim, Jongmin; Ma, Duo; Silver, Pamela A.; Collins, James J.; Yin, PengNature (London, United Kingdom) (2017), 548 (7665), 117-121CODEN: NATUAS; ISSN:0028-0836. (Nature Publishing Group)Synthetic biol. aims to develop engineering-driven approaches to the programming of cellular functions that could yield transformative technologies. Synthetic gene circuits that combine DNA, protein, and RNA components demonstrated a range of functions such as bistability, oscillation, feedback, and logic capabilities. However, it remains challenging to scale up these circuits owing to the limited no. of designable, orthogonal, high-performance parts, the empirical and often tedious compn. rules, and the requirements for substantial resources for encoding and operation. Here, the authors report a strategy for constructing RNA-only nanodevices to evaluate complex logic in living cells. The authors' ribocomputing systems are composed of de-novo-designed parts and operate through predictable and designable base-pairing rules, allowing the effective in silico design of computing devices with prescribed configurations and functions in complex cellular environments. These devices operate at the post-transcriptional level and use an extended RNA transcript to colocalize all circuit sensing, computation, signal transduction, and output elements in the same self-assembled mol. complex, which reduces diffusion-mediated signal losses, lowers metabolic cost, and improves circuit reliability. Ribocomputing devices in Escherichia coli can evaluate two-input logic with a dynamic range up to 900-fold and scale them to four-input AND, six-input OR, and a complex 12-input expression (A1 AND A2 AND NOT A1*) OR (B1 AND B2 AND NOT B2*) OR (C1 AND C2) OR (D1 AND D2) OR (E1 AND E2). Successful operation of ribocomputing devices based on programmable RNA interactions suggests that systems employing the same design principles could be implemented in other host organisms or in extracellular settings.
- 5Yesselman, J. D., Eiler, D., Carlson, E. D., Ooms, A. N., Kladwang, W., Shi, X., Costantino, D. A., Herschlag, D., Jewett, M. C., Kieft, J. S., and Das, R. Computational Design of Asymmetric Three-dimensional RNA Structures and Machines. 2017. bioRxiv. https://doi.org/10.1101/223479.Google ScholarThere is no corresponding record for this reference.
Cited By
This article is cited by 4 publications.
- M. Verosloff, J. Chappell, K. L. Perry, J. R. Thompson, J. B. Lucks. PLANT-Dx: A Molecular Diagnostic for Point-of-Use Detection of Plant Pathogens. ACS Synthetic Biology 2019, 8 (4) , 902-905. https://doi.org/10.1021/acssynbio.8b00526
- R. Kent, N. Dixon. Systematic Evaluation of Genetic and Environmental Factors Affecting Performance of Translational Riboswitches. ACS Synthetic Biology 2019, 8 (4) , 884-901. https://doi.org/10.1021/acssynbio.9b00017
- Johan O. L. Andreasson, Michael R. Gotrik, Michelle J. Wu, Hannah K. Wayment-Steele, Wipapat Kladwang, Fernando Portela, Roger Wellington-Oguri, , Rhiju Das, William J. Greenleaf. Crowdsourced RNA design discovers diverse, reversible, efficient, self-contained molecular switches. Proceedings of the National Academy of Sciences 2022, 119 (18) https://doi.org/10.1073/pnas.2112979119
- Courtney E. Szyjka, Eric J. Strobel. Cotranscriptional RNA Chemical Probing. 2022, 291-330. https://doi.org/10.1007/978-1-0716-2421-0_17
Abstract
Figure 1
Figure 1. Overview of the emerging design–test–learn cycle for RNA engineering. Natural RNA systems inspire the in silico design of synthetic versions with novel or optimized performance. Important considerations for design are kinetic vs thermodynamic folding regimes, how to include ligand and protein interactions, and whether to design at the 2D or 3D level. Functional testing and structural testing with high-throughput RNA structure chemical probing measurements can then be used to characterize the designed RNAs and troubleshoot unintended folding or expression patterns. These design and testing schemes in turn teach us about new principles for increasing the complexity and utility of synthetic RNA-based regulators. RT, reverse transcription.
References
ARTICLE SECTIONSThis article references 5 other publications.
- 1Takahashi, M. K., Watters, K. E., Gasper, P. M., Abbott, T. R., Carlson, P. D., Chen, A. A., and Lucks, J. B. (2016) Using in-cell SHAPE-Seq and simulations to probe structure–function design principles of RNA transcriptional regulators. RNA 22, 920– 933, DOI: 10.1261/rna.054916.115
- 2Espah Borujeni, A., Mishler, D. M., Wang, J., Huso, W., and Salis, H. M. (2016) Automated physics-based design of synthetic riboswitches from diverse RNA aptamers. Nucleic Acids Res. 44, 1– 13, DOI: 10.1093/nar/gkv1289[Crossref], [PubMed], [CAS], Google Scholar2https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC28vms1ahtw%253D%253D&md5=51655b5890cbbc6f9e6d7696c915ff77Automated physics-based design of synthetic riboswitches from diverse RNA aptamersEspah Borujeni Amin; Huso Walker; Mishler Dennis M; Wang Jingzhi; Salis Howard MNucleic acids research (2016), 44 (1), 1-13 ISSN:.Riboswitches are shape-changing regulatory RNAs that bind chemicals and regulate gene expression, directly coupling sensing to cellular actuation. However, it remains unclear how their sequence controls the physics of riboswitch switching and activation, particularly when changing the ligand-binding aptamer domain. We report the development of a statistical thermodynamic model that predicts the sequence-structure-function relationship for translation-regulating riboswitches that activate gene expression, characterized inside cells and within cell-free transcription-translation assays. Using the model, we carried out automated computational design of 62 synthetic riboswitches that used six different RNA aptamers to sense diverse chemicals (theophylline, tetramethylrosamine, fluoride, dopamine, thyroxine, 2,4-dinitrotoluene) and activated gene expression by up to 383-fold. The model explains how aptamer structure, ligand affinity, switching free energy and macromolecular crowding collectively control riboswitch activation. Our model-based approach for engineering riboswitches quantitatively confirms several physical mechanisms governing ligand-induced RNA shape-change and enables the development of cell-free and bacterial sensors for diverse applications.
- 3Chappell, J., Westbrook, A., Verosloff, M., and Lucks, J. B. (2017) Computational design of small transcription activating RNAs for versatile and dynamic gene regulation. Nat. Commun. 8, 1051, DOI: 10.1038/s41467-017-01082-6[Crossref], [PubMed], [CAS], Google Scholar3https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BC1M7is1eqsw%253D%253D&md5=1c76725cdaaace499e331199ee36bd5cComputational design of small transcription activating RNAs for versatile and dynamic gene regulationChappell James; Lucks Julius B; Westbrook Alexandra; Verosloff Matthew; Lucks Julius BNature communications (2017), 8 (1), 1051 ISSN:.A longstanding goal of synthetic biology has been the programmable control of cellular functions. Central to this is the creation of versatile regulatory toolsets that allow for programmable control of gene expression. Of the many regulatory molecules available, RNA regulators offer the intriguing possibility of de novo design-allowing for the bottom-up molecular-level design of genetic control systems. Here we present a computational design approach for the creation of a bacterial regulator called Small Transcription Activating RNAs (STARs) and create a library of high-performing and orthogonal STARs that achieve up to ~ 9000-fold gene activation. We demonstrate the versatility of these STARs-from acting synergistically with existing constitutive and inducible regulators, to reprogramming cellular phenotypes and controlling multigene metabolic pathway expression. Finally, we combine these new STARs with themselves and CRISPRi transcriptional repressors to deliver new types of RNA-based genetic circuitry that allow for sophisticated and temporal control of gene expression.
- 4Green, A. A., Kim, J., Ma, D., Silver, P. A., Collins, J. J., and Yin, P. (2017) Complex cellular logic computation using ribocomputing devices. Nature 548, 117– 121, DOI: 10.1038/nature23271[Crossref], [PubMed], [CAS], Google Scholar4https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXht1ars73K&md5=6f9fc931b688dcef1a36f9d6ae172a50Complex cellular logic computation using ribocomputing devicesGreen, Alexander A.; Kim, Jongmin; Ma, Duo; Silver, Pamela A.; Collins, James J.; Yin, PengNature (London, United Kingdom) (2017), 548 (7665), 117-121CODEN: NATUAS; ISSN:0028-0836. (Nature Publishing Group)Synthetic biol. aims to develop engineering-driven approaches to the programming of cellular functions that could yield transformative technologies. Synthetic gene circuits that combine DNA, protein, and RNA components demonstrated a range of functions such as bistability, oscillation, feedback, and logic capabilities. However, it remains challenging to scale up these circuits owing to the limited no. of designable, orthogonal, high-performance parts, the empirical and often tedious compn. rules, and the requirements for substantial resources for encoding and operation. Here, the authors report a strategy for constructing RNA-only nanodevices to evaluate complex logic in living cells. The authors' ribocomputing systems are composed of de-novo-designed parts and operate through predictable and designable base-pairing rules, allowing the effective in silico design of computing devices with prescribed configurations and functions in complex cellular environments. These devices operate at the post-transcriptional level and use an extended RNA transcript to colocalize all circuit sensing, computation, signal transduction, and output elements in the same self-assembled mol. complex, which reduces diffusion-mediated signal losses, lowers metabolic cost, and improves circuit reliability. Ribocomputing devices in Escherichia coli can evaluate two-input logic with a dynamic range up to 900-fold and scale them to four-input AND, six-input OR, and a complex 12-input expression (A1 AND A2 AND NOT A1*) OR (B1 AND B2 AND NOT B2*) OR (C1 AND C2) OR (D1 AND D2) OR (E1 AND E2). Successful operation of ribocomputing devices based on programmable RNA interactions suggests that systems employing the same design principles could be implemented in other host organisms or in extracellular settings.
- 5Yesselman, J. D., Eiler, D., Carlson, E. D., Ooms, A. N., Kladwang, W., Shi, X., Costantino, D. A., Herschlag, D., Jewett, M. C., Kieft, J. S., and Das, R. Computational Design of Asymmetric Three-dimensional RNA Structures and Machines. 2017. bioRxiv. https://doi.org/10.1101/223479.Google ScholarThere is no corresponding record for this reference.