Sunday, May 17, 2026

How experiments and AI are finally beginning to reveal the hidden architecture of RNA

  

How experiments and AI are finally beginning to reveal the hidden architecture of RNA
Graphical Abstract


RNA Structure Is Entering Its Integration Era

RNA is not just a messenger. It is a negotiator, a sensor, a scaffold, a switch, a catalyst, and sometimes the quiet organizer of entire molecular conversations inside the cell.

For decades, biology treated RNA mainly as a carrier of genetic information: DNA made the instructions, RNA delivered them, and proteins did the work. That older view now feels almost naïve. Many RNAs do not merely transmit information. They perform it. They fold into shapes, bind proteins, sense metabolites, control gene expression, regulate translation, catalyze reactions, and help determine whether a cell responds, adapts, survives, or fails.

The catch is simple: RNA function depends heavily on RNA structure.

And RNA structure is difficult.

A recent Nature Biotechnology Perspective, Integrated experimental and AI innovations for RNA structure determination,” captures where the field is now heading. Published in 2026, the article argues that RNA structure determination is being reshaped by two forces moving together: improved experimental technologies and rapidly advancing artificial intelligence-based modeling tools.  

The important word is together.

Experiments alone are powerful but limited. AI alone is fast but not always biologically grounded. The real breakthrough is emerging where the two meet.

RNA Is a Shape-Shifting Molecule

RNA structure is not a decorative detail. It is often the mechanism.

A messenger RNA may contain structural elements that influence stability or translation. A riboswitch changes shape when it binds a metabolite. A viral RNA genome can use folded regions to regulate replication or translation. A long noncoding RNA may act through structural modules that organize protein complexes. A guide RNA must adopt the correct form to work efficiently with CRISPR-associated proteins.

In each case, sequence matters, but sequence is not enough.

The same RNA sequence can fold into different conformations depending on magnesium ions, proteins, temperature, chemical modifications, ligand binding, cellular crowding, or stress conditions. RNA is also highly charged and flexible, which makes it harder to capture structurally than many more rigid biomolecules.

This is why RNA structure determination is so demanding. Researchers are not just trying to find a single beautiful 3D model. They are trying to understand a moving structural ensemble.

That movement is not noise.

It is biology.

Why Classical Experimental Methods Still Matter

Before AI can predict RNA structure confidently, experimental data must continue to define what is real.

Classical structural biology has given RNA researchers several powerful methods. X-ray crystallography can reveal atomic detail, but many RNAs resist crystallization because they are flexible and heterogeneous. NMR spectroscopy can capture local structure and dynamics, especially for smaller RNAs, but it becomes harder with large and complex molecules. Cryo-electron microscopy, or cryo-EM, has become increasingly important for large RNA-containing complexes and is now pushing into RNA-only structures with improved scaffolding and modeling approaches. The Perspective highlights multiple recent cryo-EM-related advances for RNA structure determination, including scaffold-enabled and high-resolution approaches. (Nature)

Cryo-EM has changed the mood of structural biology. Molecules that were once too large, too flexible, or too difficult to crystallize can now be visualized in new ways. For RNA, this is especially important because many functional RNAs do not behave like rigid objects. They fold, flex, bind, release, and remodel.

Still, cryo-EM is not magic. Small RNAs remain difficult. Flexible regions may blur. Model building can be challenging. Interpreting density maps requires skill, prior knowledge, and increasingly, computational assistance.

That is where AI begins to enter the story.

AI Is Not Replacing Experiments. It Is Extending Them

AI-based structure prediction has transformed protein biology, and RNA researchers are now asking whether RNA can have a similar moment. The answer is complicated.

Protein structure prediction benefited from massive structural datasets and strong evolutionary signals. RNA has less experimentally solved 3D structural data, fewer large training datasets, and more troublesome conformational behavior. RNA also depends strongly on noncanonical base pairs, ion-mediated interactions, electrostatics, and alternative folding states.

So RNA cannot simply borrow protein AI and call the problem solved.

But the field is moving fast.

Recent RNA modeling tools are using geometric deep learning, language-model-inspired strategies, transformer networks, end-to-end learning, and hybrid physics-plus-machine-learning methods. The Nature Biotechnology Perspective lists several recent computational advances, including trRosettaRNA, RhoFold, NuFold, RoseTTAFoldNA, AlphaFold 3, and other RNA or biomolecular modeling frameworks. (Nature)

These tools are not all doing the same thing. Some predict RNA tertiary structures from sequence. Some work better with secondary structure inputs. Some model protein–RNA complexes. Some help rank predicted structures. Some assist with cryo-EM map interpretation. Together, they show a field in rapid expansion.

The key lesson is this: AI is becoming useful not because it replaces structural biology, but because it can generate hypotheses faster than experiments alone.

A predicted structure can guide mutagenesis.
A predicted fold can help interpret chemical probing data.
A predicted RNA–protein interface can direct biochemical validation.
A predicted conformer can suggest what state a molecule might adopt under specific conditions.

The model is not the answer. It is the beginning of a better experiment.

The Power of Integrative RNA Structure Determination

The strongest future for RNA structure biology is not “experiment versus computation.” It is integrative modeling.

In an integrative workflow, researchers may combine:

RNA sequence
secondary structure prediction
chemical probing data
evolutionary conservation
cryo-EM density maps
NMR constraints
known RNA motifs
deep learning models
physics-based refinement
molecular dynamics simulations

Each method contributes a different kind of truth.

Chemical probing can reveal which nucleotides are exposed or constrained. Cryo-EM can provide global architecture. AI can propose plausible folds. Evolutionary information can identify conserved interactions. Physics-based refinement can improve stereochemistry. Functional assays can test whether the proposed structure actually matters.

This integration is especially valuable because RNA is dynamic. A single method may capture one state, while another reveals a different one. A computational model may predict a possible conformer, while experimental probing can test whether that conformer exists under biological conditions.

For RNA, structure determination must increasingly become ensemble-aware.

The question is not only:

What does this RNA look like?

It is also:

What shapes can this RNA adopt, and which shape controls function?

Cryo-EM and AI Are Becoming Partners

One of the most exciting developments is the pairing of cryo-EM with AI-assisted model building.

Cryo-EM can generate density maps, but interpreting those maps for RNA is not always straightforward. RNA bases, sugar-phosphate backbones, modified nucleotides, and flexible regions can be difficult to assign correctly, especially when resolution varies across the molecule.

AI tools can help automate parts of this process. The Perspective cites recent methods such as CryoREAD for de novo nucleic acid modeling from cryo-EM maps and other automated approaches for detecting and modeling DNA/RNA in cryo-EM density. (Nature)

This matters because scalability has been a bottleneck.

If every RNA structure requires slow, highly specialized manual interpretation, progress remains limited. If AI-assisted pipelines can accelerate model building while maintaining accuracy, RNA structural biology becomes more accessible to more laboratories.

That could change the field.

Not every lab can spend years solving one RNA structure. But more labs may be able to combine probing, cryo-EM, prediction, and validation into a practical workflow.

Why This Matters for Biotechnology and Medicine

RNA structure is not only an academic puzzle. It is becoming a design problem.

RNA therapeutics are expanding rapidly: mRNA vaccines, siRNAs, antisense oligonucleotides, guide RNAs, aptamers, ribozymes, circular RNAs, self-amplifying RNAs, and RNA-based sensors all depend on folding behavior. A therapeutic RNA may fail not because its sequence is wrong, but because its structure is unstable, inaccessible, immunogenic, or incompatible with delivery.

For RNA drug discovery, structure matters even more.

Small molecules do not bind “RNA” in the abstract. They bind pockets, grooves, motifs, exposed bases, dynamic states, and structured regions. If we cannot see or predict those structural features, rational RNA-targeted drug design remains limited.

Better RNA structure determination could help researchers:

design more stable therapeutic RNAs
optimize guide RNA folding
identify druggable RNA motifs
predict RNA–protein interactions
engineer riboswitches and aptamers
design RNA nanostructures
understand viral RNA regulatory elements
improve RNA delivery and expression systems

This is where RNA structure becomes translational.

The future of RNA medicine will not depend only on writing better sequences. It will depend on designing molecules that fold into the right structures at the right time in the right cellular environment.

The Remaining Challenges Are Serious

There is reason for excitement, but not for hype.

RNA structure prediction still faces major limitations. Experimental RNA 3D structure datasets remain smaller than protein datasets. Many RNAs are flexible and context-dependent. Chemical modifications can alter folding. Cellular environments are difficult to reproduce. Protein binding can remodel RNA architecture. Magnesium and other ions can determine whether a predicted fold is realistic.

AI models may also produce confident-looking structures that are wrong.

That danger is not trivial. A beautiful model can mislead if it is not tested. RNA structural biology cannot afford blind trust in prediction tools, especially when the results are used for therapeutic design or mechanistic claims.

The correct attitude is not skepticism for its own sake.

It is disciplined validation.

Predictions should be challenged by experiments. Experiments should be interpreted with computational help. Models should be treated as hypotheses, not final truth.

The Field Is Moving Toward Structure-Guided RNA Biology

The deepest shift is conceptual.

RNA biology is moving from sequence annotation toward structure-guided understanding. Researchers increasingly want to know not only where an RNA is expressed, but how it folds, what it binds, how it changes shape, and whether that shape can be engineered.

This shift will influence many fields: virology, cancer biology, neuroscience, plant biology, synthetic biology, diagnostics, and therapeutic design.

For RNA researchers, the opportunity is enormous.

We can now imagine a future where an RNA sequence is analyzed not only for open reading frames, motifs, or target sites, but also for structural states, ligandable pockets, conformational switches, protein-binding surfaces, and engineering potential.

That future will not come from AI alone.

It will come from the integration of experimental structure determination, biochemical probing, deep learning, physical modeling, and functional validation.

RNA is a social molecule inside the cell because it does not act alone. It interacts, responds, recruits, folds, and refolds.

Perhaps the tools we use to study it must behave the same way.

Not one method.
Not one model.
Not one structure.

A network of evidence.

That is where RNA structure determination is going.

And that is why this field now feels less like a technical subdiscipline and more like one of the central engines of modern RNA biotechnology.


References / Sources

  1. Wang, W., Su, B., Peng, Z., & Yang, J. “Integrated experimental and AI innovations for RNA structure determination.” Nature Biotechnology, 44, 205–214, 2026. Published 5 January 2026. DOI: 10.1038/s41587-025-02974-5. (Nature)

Tuesday, May 12, 2026

Why AlphaFold transformed protein biology, while RNA structure prediction remains one of biology’s most stubborn frontiers

 

RNA Function Follows Form — But RNA Refuses to Sit Still

At a virtual conference in November 2020, structural biology changed.

The winner of the CASP14 protein-structure-prediction challenge was announced: AlphaFold, developed by Google DeepMind. The result was not merely better than previous tools. It was dramatically better. AlphaFold showed that artificial intelligence could predict many protein structures with near-experimental accuracy, solving a problem researchers had been chasing for decades.

Protein biology had its revolution.

RNA biology is still waiting for its equivalent moment.

That is the central tension behind Diana Kwon’s Nature Technology Feature, “RNA function follows form – why is it so hard to predict?” The article captures a frustrating truth: RNA is biologically essential, structurally fascinating, and increasingly important for medicine — but predicting its shape remains far harder than many people expected. 

RNA Is Not Just a Messenger

For decades, RNA was introduced in textbooks as a middleman: DNA stores genetic information, RNA carries the message, and proteins do the real work.

That explanation is now painfully incomplete.

RNA can regulate genes, catalyze reactions, guide protein complexes, control splicing, sense metabolites, organize cellular machinery, and influence disease. Ribozymes, riboswitches, long noncoding RNAs, microRNAs, guide RNAs, viral RNAs, circular RNAs, and therapeutic RNAs all remind us that RNA is not passive.

RNA does things.

But RNA does those things because it folds.

Its biological function depends on stems, loops, bulges, pseudoknots, junctions, long-range contacts, base stacking, ion coordination, and interactions with proteins or small molecules. In RNA biology, structure is not decoration. Structure is often the mechanism.

That is why the phrase “function follows form” matters.

An RNA molecule’s sequence tells us what it could become. Its structure tells us what it is actually capable of doing.

Why Proteins Were Easier for AI

AlphaFold’s success in protein structure prediction was built on several advantages.

Proteins have been studied structurally for a long time. Thousands of high-quality protein structures were already available in the Protein Data Bank. Protein sequences also carry rich evolutionary information: if two residues change together across evolution, they may physically interact in the folded protein. AlphaFold and related tools learned from these patterns at massive scale.

RNA does not offer the same easy path.

There are far fewer experimentally solved RNA structures than protein structures. Many RNAs are small, flexible, chemically sensitive, and structurally heterogeneous. RNA often depends on magnesium ions, cellular proteins, modifications, ligand binding, and environmental conditions to fold correctly. A sequence may not point to one stable structure. It may point to a shifting ensemble.

That makes RNA a much harder target for machine learning.

A protein often behaves like a molecule trying to reach a stable folded state.

RNA often behaves like a molecule negotiating among several states.

RNA’s Flexibility Is the Problem — and the Biology

The biggest mistake is to think RNA structure prediction is simply protein structure prediction with different building blocks.

RNA has its own grammar.

It has only four standard bases, but those bases can form canonical and noncanonical interactions. Its phosphate backbone is highly charged. Its folding can depend strongly on ions. It can form alternative secondary structures. It can switch conformations when binding a metabolite or protein. It can expose or hide regulatory regions depending on context.

This is not just a computational nuisance.

RNA flexibility is often exactly how RNA works.

A riboswitch must change shape to regulate gene expression. A viral RNA may remodel itself during infection. A guide RNA must fold into a form compatible with its protein partner. An mRNA may contain structural elements that affect translation, degradation, or immune recognition.

So the goal is not always to predict one “correct” RNA structure.

The goal may be to predict a population of possible structures, then understand which one matters under a specific biological condition.

That is much harder.

AlphaFold 3 Helps, But It Does Not End the RNA Problem

AlphaFold 3 expanded the modeling landscape by predicting biomolecular complexes involving proteins, DNA, RNA, small molecules, ions, and modified residues. That is a major advance because biology rarely happens molecule by molecule in isolation. Cells are crowded with interacting systems. 

But RNA structure prediction is still not solved.

AlphaFold 3 can model RNA-containing complexes, but RNA-only folding and RNA conformational ensembles remain difficult. Many RNA structures depend on experimental constraints, secondary structure priors, or specialized RNA-focused modeling approaches. The RNA field is therefore not simply waiting for one universal model to solve everything.

It is building a different toolkit.

The New RNA AI Toolkit

Several AI-based methods are now pushing RNA structure prediction forward.

Tools such as RhoFold+ use RNA language models and deep learning to predict RNA 3D structures from sequence. RhoFold+ was trained with large-scale RNA sequence information and designed to address the data scarcity that limits RNA modeling. 

Other methods, including trRosettaRNA and trRosettaRNA2, use RNA-specific structural logic, secondary structure information, and deep learning to improve 3D prediction. Recent work on trRosettaRNA2 emphasizes the value of secondary-structure-aware modeling and conformer prediction — a crucial feature because RNA often exists in multiple structural states rather than one fixed architecture. 

These methods suggest an important principle:

RNA prediction will probably not be solved by sequence alone.

It will require secondary structure priors, evolutionary signals, experimental probing data, cryo-EM maps, chemical constraints, molecular simulations, and AI models working together.

Experiments Still Matter

The rise of AI does not make RNA experiments obsolete.

It makes them more important.

Chemical probing methods such as SHAPE and DMS-based approaches can reveal which nucleotides are flexible, paired, exposed, or protected. Cryo-electron microscopy can capture larger RNA-containing complexes. NMR can reveal dynamics and local structure. X-ray crystallography can still provide atomic detail when crystals are available.

Each method sees a different part of the RNA story.

AI can propose models quickly. Experiments can test whether those models are real.

This is where RNA structure biology is heading: not toward purely computational prediction, but toward integrative structure determination.

A model becomes more trustworthy when it agrees with probing data, mutational analysis, cryo-EM density, biochemical function, and evolutionary conservation.

For RNA, evidence must converge.

Why This Matters for Medicine

RNA structure prediction is not just a technical problem for structural biologists. It is becoming central to biotechnology and medicine.

mRNA vaccines, siRNA drugs, antisense oligonucleotides, CRISPR guide RNAs, aptamers, ribozymes, circular RNAs, and self-amplifying RNAs all depend on folding behavior. A therapeutic RNA may fail because it folds incorrectly, exposes the wrong region, activates unwanted immune sensors, degrades too quickly, or binds inefficiently.

RNA-targeted small-molecule drugs are another major frontier. For a drug to bind RNA selectively, the RNA must present a recognizable structural pocket or motif. Without structural knowledge, RNA drug discovery becomes guesswork.

Better RNA structure prediction could help researchers design more stable RNAs, improve guide RNA performance, identify druggable RNA motifs, engineer riboswitches, and understand viral RNA elements.

In short, RNA structure prediction is not only about seeing molecules.

It is about designing them.

The Real Lesson from AlphaFold

AlphaFold changed protein biology because it made high-quality structural models widely accessible. It did not eliminate experiments, but it changed where experiments begin.

RNA needs a similar shift.

But the RNA version of that revolution may look different. It may not be a single model that predicts one final structure from sequence. It may be a network of tools that predicts secondary structures, tertiary folds, alternative conformers, RNA–protein complexes, RNA–ligand interactions, and experimentally testable structural hypotheses.

RNA does not sit still.

So RNA structure prediction must become comfortable with motion.

The next breakthrough will not simply tell us, “Here is the RNA structure.”

It will tell us:

Here are the structures this RNA can adopt. Here is when they appear. Here is what they bind. Here is how they regulate biology. Here is how we can redesign them.

That is the future RNA biology is moving toward.

Protein structure prediction had its AlphaFold moment.

RNA’s moment may be harder, slower, and messier.

But it may also be more interesting — because RNA is not merely a molecule with a shape.

It is a molecule with possibilities.


References / Sources

  1. Kwon, D. “RNA function follows form – why is it so hard to predict?” Nature, 2025. (Nature)

  2. Jumper, J. et al. “Highly accurate protein structure prediction with AlphaFold.” Nature, 2021. (Nature)

  3. Abramson, J. et al. “Accurate structure prediction of biomolecular interactions with AlphaFold 3.” Nature, 2024. (Nature)

  4. Shen, T. et al. “Accurate RNA 3D structure prediction using a language model-based deep learning approach.” Nature Methods, 2024. (Nature)

  5. Wang, W. et al. “The trRosettaRNA server for RNA structure prediction.” Nature Protocols, 2026. (Yang Lab)

  6. CASP16 RNA structure prediction assessment and Yang-Server/trRosettaRNA2 reporting. (Wiley Online Library)

Sunday, May 10, 2026

RNA Folding Is Not Just Shape: The Principles That Make RNA Predictable

 

RNA is often introduced as DNA's messenger, a disposable copy of genetic instructions. That picture is far too small. RNA can switch genes on and off, guide enzymes to genomic targets, catalyze reactions, scaffold protein assemblies, sense metabolites, and carry vaccine instructions into cells. It does these jobs not only through its sequence, but through the structures that sequence folds into.

That makes RNA folding one of biology's most useful prediction problems. If we can predict how an RNA molecule folds, we can begin to predict how it behaves. If we can design a sequence that folds into a chosen structure, we can build RNA tools for medicine, diagnostics, synthetic biology, and nanotechnology. The challenge is that RNA is not a rigid object. It is a restless molecule moving across an energy landscape, with useful structures competing against near-misses.

The Basic Rule: Pairing Creates Structure, But Energy Chooses The Fold

RNA is built from four bases: A, U, G, and C. The familiar base-pairing rules, A with U and G with C, allow a single RNA strand to fold back on itself. Stems form where complementary regions pair. Loops, bulges, internal loops, and junctions form where pairing is interrupted.

But the final fold is not chosen by base-pairing alone. It is chosen by the balance of free energy across all possible structures. A predicted "minimum free energy" structure is the one a model estimates to be most stable. Stacked base pairs usually stabilize RNA. Large loops, unstable junctions, weak stems, or awkward local motifs can destabilize it. Magnesium ions, temperature, proteins, ligands, chemical modifications, and the cellular environment can all shift the balance.

So the first principle is simple but powerful: RNA folding is competitive. The target fold must be more favorable than the alternative folds the same sequence can make.

The Second Rule: Local Motifs Can Make Or Break A Design

The paper attached to this prompt, Anderson-Lee et al.'s "Principles for Predicting RNA Secondary Structure Design Difficulty," focused on inverse folding: given a desired RNA secondary structure, can we find a sequence that folds into it? The study drew on Eterna, a citizen-science RNA design platform, where tens of thousands of players and multiple algorithms tested what makes RNA designs easy or difficult.

Their results show why folding prediction is also a design problem. Some target structures are easy to specify on paper but hard to realize in a real sequence. Short stems are a classic example. A two-base-pair stem may look harmless in a diagram, but it offers only a small number of stable sequence choices. If many short stems appear in the same design, the sequence often needs repeated mini-patterns, and repeated patterns can mispair with one another.

Bulges and internal loops create another problem. They interrupt stacking interactions, weakening the stem and making nearby alternative folds more competitive. Multiloops, where several stems meet, require careful tuning of closing base pairs and nearby loop energies. Zigzag-like arrangements of opposing bulges are especially difficult: they can make an otherwise straightforward RNA hard for algorithms to design.

This leads to a practical design rule from the Eterna community: the "principle of least elements." The fewer destabilizing or difficult motifs a target structure contains, the more likely it is to be designable.

The Third Rule: Symmetry Is Beautiful, But Dangerous

Human designers like symmetry. RNA often does not.

In RNA design, repeated stems, repeated loops, and exact visual symmetry can be traps. Repetition narrows the usable sequence space and increases the chance that one part of the molecule will pair with the wrong partner. A symmetric diagram may invite misfolded alternatives that are nearly as stable as, or more stable than, the intended fold.

This is one reason natural RNAs often show broken symmetry. They may contain repeated domains, but the repeated parts are not usually exact copies at the secondary-structure level. Small asymmetries can help prevent incorrect pairing while preserving the broader biological function.

For real-world design, this is a quiet but important lesson: do not confuse structural elegance with molecular reliability. A slightly irregular RNA may be easier to make, easier to predict, and more robust in cells.

The Fourth Rule: Prediction Needs Ensembles, Not Just One Fold

Many beginner explanations of RNA folding focus on one predicted structure. In real biology, that is rarely enough. RNA molecules occupy ensembles: collections of structures with different probabilities. Some RNAs need one dominant structure. Others need to switch between states, as riboswitches do when they bind metabolites. Still others need to keep a region unpaired so a protein, ribosome, guide RNA, or reverse transcriptase can access it.

That means useful prediction asks several questions:

-          What is the most likely fold?

-          What alternative folds are close in energy?

-          Which nucleotides are likely to be paired or unpaired?

-          How often does the molecule expose a functional site?

-          How stable is the RNA against chemical degradation?

-          How does the fold change when proteins, ligands, ions, or modifications are present?

High-throughput experiments have become essential here. Chemical probing methods such as SHAPE and DMS can measure which nucleotides are flexible or accessible across thousands of RNA molecules. These datasets can reveal where thermodynamic models succeed, where they fail, and how machine-learning models can improve prediction.

Why This Matters For Biological Applications

RNA folding prediction is not an academic exercise. It affects whether RNA technologies work outside a diagram.

In gene silencing, siRNAs and shRNAs must present the right guide strand and avoid structures that block loading into cellular machinery. In CRISPR genome editing, guide RNAs must preserve the scaffold structures needed for Cas protein binding while keeping the targeting region accessible. In riboswitch and biosensor engineering, the RNA must change structure reliably when it binds a molecule. In RNA nanotechnology, repeated tiles, junctions, and short stems must assemble without generating unwanted mispaired products.

For mRNA therapeutics and vaccines, folding affects translation, immune recognition, and degradation. RNA is chemically fragile; unpaired and flexible regions can be more vulnerable to hydrolysis. Models that predict local structure and degradation patterns can help design mRNAs that last longer while still being translated efficiently.

The most promising real-world strategy is therefore not "predict the perfect fold once." It is an iterative loop:

  1. Choose a target function.
  2. Propose structures that obey known designability rules.
  3. Use computational tools to predict folds, ensembles, accessibility, and degradation risk.
  4. Test many candidates experimentally.
  5. Feed the results back into improved models.

This is already happening. Eterna-derived work has used community-designed RNA datasets to benchmark and improve folding packages. OpenVaccine-style efforts have combined RNA design and machine learning competitions to predict RNA degradation. The future of RNA engineering will likely come from this blend of physical modeling, high-throughput measurement, human intuition, and machine learning. 

The principles governing RNA folding are not just chemical rules; they are design rules. Stable stems help. Awkward loops, short repeated stems, dense difficult motifs, and exact symmetry can hurt. The best RNA designs respect the whole folding landscape, not just the desired final picture.

That is why RNA prediction is becoming so valuable for biology. It lets scientists ask, before entering the lab, whether a proposed RNA is likely to fold, switch, expose, bind, silence, guide, translate, or survive as intended. The more accurately we can answer those questions, the more RNA becomes a programmable material for living systems.

Sources

Anderson-Lee, J. et al. "Principles for Predicting RNA Secondary Structure Design Difficulty." Journal of Molecular Biology 428, 748-757 (2016). https://doi.org/10.1016/j.jmb.2015.11.013

Wayment-Steele, H. K. et al. "RNA secondary structure packages evaluated and improved by high-throughput experiments." Nature Methods 19, 1234-1242 (2022). https://doi.org/10.1038/s41592-022-01605-0

Wayment-Steele, H. K. et al. "Deep learning models for predicting RNA degradation via dual crowdsourcing." Nature Machine Intelligence 4, 1174-1184 (2022). https://doi.org/10.1038/s42256-022-00571-8

Monday, May 04, 2026

RNA Structure Is Becoming the Next Big Frontier in Biology

 How experiments, AI, and integrative modeling are changing the way we understand RNA folding, function, and therapeutics

trRosettaRNA: automated prediction of RNA 3D structure with transformer network
RNA Structure


For a long time, RNA was treated as biology’s messenger: DNA made the plan, RNA carried the instructions, and proteins did the real work.

That view is now far too small.

RNA is not just a passive courier of genetic information. It can fold into complex structures, catalyze biochemical reactions, regulate gene expression, control splicing, bind proteins, sense metabolites, and participate in disease processes. In many cases, RNA’s function depends not only on its sequence but on its shape.

That is why RNA structure has become one of the most important frontiers in molecular biology.

A review published in Nature Methods titled “Advances and opportunities in RNA structure experimental determination and computational modeling” captures this shift clearly: if we want to understand what RNA does, we must understand how it folds. The authors summarize modern experimental and computational tools for RNA secondary and tertiary structure analysis and argue that the growing volume of RNA structural data is opening the door to more powerful integrative modeling approaches. (Nature)

The message is simple but profound:

RNA biology is becoming structural biology.

Why RNA Structure Matters

RNA structure begins with base pairing.

Adenine pairs with uracil. Guanine pairs with cytosine. Sometimes guanine pairs with uracil. These interactions create stems, loops, bulges, junctions, pseudoknots, and long-range contacts. Together, they form what we call RNA secondary structure.

But RNA does not stop there.

Secondary structure folds into three-dimensional structure, where distant regions of the molecule come together in space. This 3D architecture determines whether an RNA can bind a protein, recognize a small molecule, act as an enzyme, regulate translation, or switch between active and inactive states.

This is where RNA becomes difficult.

Unlike many proteins, RNA is often highly flexible. It may not have one fixed structure. Instead, the same RNA molecule can exist as a population of conformers—different structural states that shift depending on ions, temperature, binding partners, chemical modifications, or cellular context.

That flexibility is not a technical nuisance. It is often the biology itself.

A riboswitch functions because it changes shape. Viral RNA genomes rely on structural elements for replication and translation. Long noncoding RNAs may act through modular structural domains. Therapeutic RNAs must fold correctly to remain stable and functional.

So, asking “What is the RNA sequence?” is no longer enough.

The better question is:

What structures can this RNA form, and which of those structures matter?

The Experimental Toolbox Is Expanding

RNA structure determination has traditionally relied on powerful but demanding experimental methods.

X-ray crystallography can provide high-resolution structures, but RNA molecules are often hard to crystallize. Their flexibility, charge, and conformational heterogeneity make crystal formation difficult.

Nuclear magnetic resonance spectroscopy, or NMR, is useful for studying RNA dynamics and smaller RNA structures, but it becomes more challenging as RNA size and complexity increase.

Cryo-electron microscopy, or cryo-EM, has become increasingly valuable for larger RNA-containing complexes, especially ribonucleoproteins. It allows researchers to visualize molecular assemblies that were once extremely difficult to capture structurally.

Alongside these classical approaches, high-throughput structure-probing methods have transformed the field. Techniques based on chemical probing, such as SHAPE and DMS-based approaches, can reveal which nucleotides are flexible, paired, exposed, or protected. These methods do not always give a full atomic structure, but they provide crucial experimental constraints for modeling RNA folding.

This is one of the most important changes in the field.

RNA structure determination is no longer limited to solving one purified molecule at a time. Researchers can now collect structural information across many RNAs, under different conditions, sometimes even inside cells.

That matters because RNA structure is context-dependent.

An RNA may fold differently in vitro and in vivo. It may change structure when bound to proteins. It may respond to stress, metabolites, ions, or chemical modifications. Experimental probing helps reveal these dynamic states rather than forcing RNA into a single static model.

Computational Modeling Is Catching Up

For decades, RNA secondary structure prediction relied heavily on thermodynamic models. These approaches estimate the most favorable base-pairing arrangement based on energy rules. They remain useful, especially for shorter RNAs and well-behaved structures.

But RNA folding is not always clean.

Long-range interactions, pseudoknots, tertiary contacts, alternative conformers, and cellular constraints can make structure prediction difficult. That is where computational modeling has had to evolve.

Modern RNA modeling now combines several layers of information:

Sequence conservation
Thermodynamic folding
Experimental probing data
Known structural motifs
Machine learning
Deep learning
3D geometry prediction
Molecular dynamics simulations

The strongest approaches are often integrative. They do not rely on only one source of information. Instead, they combine experimental constraints with computational prediction to generate more biologically realistic models.

This is exactly where the field is moving.

A major theme of the Nature Methods review is that experimental data and computational modeling are becoming increasingly connected. More experimental structural data improves computational tools. Better computational tools help interpret experimental data. Together, they create a feedback loop that can accelerate RNA structure discovery. 

AI Is Changing the RNA Structure Landscape

Protein structure prediction had its AlphaFold moment. RNA structure prediction is still more difficult, but it is rapidly advancing.

Recent AI-based methods are beginning to predict RNA secondary structure, tertiary structure, RNA–protein interactions, RNA–small molecule binding, and conformational states. These tools are helped by the rise of RNA language models, deep learning architectures, and larger biological datasets.

A 2023 review in Briefings in Bioinformatics highlighted how machine learning and deep learning are being applied to RNA secondary structure prediction, RNA aptamers, RNA–protein interactions, and RNA drug discovery. 

More recently, models such as trRosettaRNA2 have shown how secondary structure priors can guide RNA 3D structure and conformer prediction. The 2026 Nature Machine Intelligence article describing trRosettaRNA2 emphasizes that RNA 3D prediction remains difficult because experimental RNA structure data are scarce and RNA molecules are flexible, but structure-aware deep learning can improve prediction performance. 

This is important because RNA models cannot simply copy protein-prediction logic.

RNA has its own grammar.

Its folding depends heavily on base pairing, electrostatics, magnesium ions, noncanonical interactions, and alternative conformations. AI tools that understand or incorporate these features may be better suited for RNA than generic structure predictors.

Why This Matters for RNA Therapeutics

RNA structure is not just an academic problem.

It is becoming central to biomedical research.

mRNA vaccines, siRNAs, antisense oligonucleotides, guide RNAs, ribozymes, aptamers, circular RNAs, and self-amplifying RNAs all depend on RNA behavior. Stability, translation efficiency, immune recognition, intracellular delivery, target binding, and degradation can all be influenced by structure.

For therapeutic RNA design, structure prediction can help answer practical questions:

Will this RNA fold into the intended form?
Will it expose the right binding region?
Will it hide important motifs?
Will it form unwanted secondary structures?
Will it remain stable long enough to function?
Will chemical modifications disrupt or improve folding?

RNA structure also matters for RNA-targeted small-molecule drugs.

For many years, drug discovery focused mainly on proteins. RNA was often considered too flexible, too charged, or too difficult to target selectively. That view is changing. RNA can form structured pockets, motifs, and conformational states that small molecules may recognize.

A 2026 Nature Biotechnology article notes that RNA has emerged as an attractive target for drug discovery because its structures can be modulated by small molecules to influence processes such as splicing, translation, RNA–protein interactions, noncoding RNA processing, and RNA virus replication. 

This is where RNA structure becomes directly translational.

If we can map RNA structures accurately, we may be able to design drugs that bind RNA with the same strategic precision once reserved for proteins.

The Challenge: RNA Is Not Still

The biggest challenge is that RNA is dynamic.

Many methods still try to report one dominant structure. But biological RNA may exist as an ensemble. The “minor” conformer may be the functional one. A transient structure may control splicing. A rare state may expose a druggable pocket. A ligand may shift the entire folding landscape.

This creates a major challenge for both experiments and AI.

The future of RNA structure biology will not be only about predicting one beautiful model. It will be about predicting structural ensembles.

That means researchers will need tools that can capture alternative folds, kinetic traps, folding intermediates, ligand-bound states, protein-bound states, and cellular context.

This is also why experimental validation remains essential.

AI can generate models. Chemical probing can test accessibility. Cryo-EM can visualize large complexes. NMR can capture dynamics. Mutational analysis can test function. No single method is enough.

The best RNA structural biology will be hybrid.

The Future: Structure-Guided RNA Biology

The next phase of RNA research will likely be defined by integration.

Experimental probing will feed computational models. AI will generate structural hypotheses. High-throughput datasets will refine prediction tools. Molecular simulations will explore conformational flexibility. Drug discovery platforms will search for ligandable RNA motifs. Synthetic biologists will design RNA switches, sensors, scaffolds, and therapeutic constructs with structural logic built in from the beginning.

For plant biology, virology, cancer biology, neuroscience, and RNA therapeutics, this shift is significant.

RNA structure is no longer a specialist detail.

It is becoming a design principle.

We are entering a period where researchers will not only ask what an RNA sequence encodes, but what it folds into, how it moves, what it binds, and how that structure can be engineered.

That is the real opportunity.

The future of RNA biology will belong not only to those who can read RNA sequences, but to those who can understand RNA shapes.

And increasingly, those shapes will be discovered by experiments and predicted by machines working together.


References / Sources

  1. Zhang, J., Fei, Y., Sun, L., & Zhang, Q. C. “Advances and opportunities in RNA structure experimental determination and computational modeling.” Nature Methods, 19, 1193–1207, 2022.  

  2. PubMed entry for Zhang et al., “Advances and opportunities in RNA structure experimental determination and computational modeling.”  

  3. Sato, K. “Recent trends in RNA informatics: a review of machine learning and deep learning for RNA secondary structure, aptamers and drug discovery.” Briefings in Bioinformatics, 2023. 

  4. Wang, W., Peng, Z., & Yang, J. “Predicting RNA 3D structure and conformers using a pre-trained secondary structure model and structure-aware attention.” Nature Machine Intelligence, 2026.  

  5. Fei, Y. et al. “Predicting small molecule–RNA interactions without experimentally resolved RNA structures.” Nature Biotechnology, 2026. 

Thursday, April 30, 2026

RNA Finally Gets a Smarter Folding Map: Why trRosettaRNA2 Matters

A new AI model shows how secondary structure knowledge can help predict RNA 3D shapes, conformers, and possibly the next generation of RNA-based biotechnology.

RNA Finally Gets a Smarter Folding Map: Why trRosettaRNA2 Matters
Overall architecture of trRosettaRNA.


RNA has always been more than a genetic messenger.

It folds. It bends. It hides. It exposes. It switches shape in response to ions, proteins, metabolites, temperature, and cellular context. In many cases, RNA does not behave like a rigid molecule with one “correct” form. It behaves more like a restless biological object, moving through different structural states while still carrying information.

That is why predicting RNA structure has remained one of the hardest problems in modern structural biology.

Proteins had their AlphaFold moment. RNA has been waiting for its own.

A recent Nature Machine Intelligence article by Wenkai Wang, Zhenling Peng, and Jianyi Yang, published on 21 April 2026, introduces trRosettaRNA2, a deep learning approach designed to predict RNA 3D structures and conformers by combining secondary structure priors with structure-aware attention. 

And the important part is not simply that the model predicts RNA structure.

The important part is how it does it.

Why RNA 3D Structure Is So Difficult

RNA structure prediction is difficult because RNA does not fold like a simple string becoming a single fixed object. Its bases pair locally and distantly. Stems, loops, bulges, junctions, pseudoknots, and long-range contacts all shape its final architecture.

Even worse, RNA is flexible.

The same RNA molecule can exist in multiple conformers, meaning different 3D spatial arrangements of the same sequence. These conformers are not experimental noise. They are often central to RNA function. Riboswitches, ribozymes, regulatory RNAs, viral RNA elements, and therapeutic RNA designs may all depend on structural switching.

The authors of the new study frame RNA 3D structure and conformer prediction as a “grand challenge” because experimental RNA 3D datasets remain limited, RNA molecules are intrinsically flexible, and both experimental and computational approaches still face major constraints. (Nature)

This is the gap trRosettaRNA2 tries to address.

The Smart Move: Start With Secondary Structure

Most RNA biologists already think about RNA in layers.

First, there is the sequence: A, U, G, C.

Then comes secondary structure: which bases pair with which bases.

Then comes tertiary structure: how the whole molecule folds in 3D space.

trRosettaRNA2 uses this hierarchy intelligently. Instead of trying to jump blindly from sequence to 3D structure, it incorporates an auxiliary secondary structure prior module, pre-trained on extensive secondary structure data. This module generates base-pairing priors and also works as an independent RNA secondary structure prediction method called trRNA2-SS. According to the article, trRNA2-SS achieves state-of-the-art secondary structure prediction performance. 

That matters because RNA secondary structure is not a decorative intermediate. It is often the skeleton of RNA architecture.

A weak secondary structure prediction can distort the entire 3D model. A better secondary structure prediction gives the model a more biologically realistic starting point.

This is the central insight: for RNA, 2D information is not old-fashioned. It is powerful prior knowledge.

What “SS-Aware Attention” Means

The model also uses secondary-structure-aware attention to generate RNA 3D structures and conformers. In simpler terms, the AI does not treat every nucleotide relationship as equally abstract. It uses base-pairing information to guide how the model attends to different parts of the RNA molecule.

This is important because attention mechanisms are good at learning relationships, but RNA has known structural logic. Stems, loops, and base-pairing patterns are not random. By allowing the model to “see” secondary structure priors, trRosettaRNA2 gives deep learning a biochemical hint.

The result is a model that is both data-driven and structure-aware.

That balance is valuable. Purely experimental RNA structure determination can be slow and technically difficult. Purely computational prediction can drift into unrealistic folds. trRosettaRNA2 tries to sit between those worlds by learning from data while respecting RNA folding biology.

Why the CASP16 Result Got Attention

One of the strongest claims in the paper is that Yang-Server, based on trRosettaRNA2, was the top automated server for RNA structure prediction in the CASP16 blind test, surpassing AlphaFold 3 for this RNA structure prediction setting.

That statement should be read carefully.

It does not mean AlphaFold 3 is weak. AlphaFold 3 remains a major achievement in biomolecular modeling. But it does suggest something important: RNA may need RNA-specific strategies. A model designed around protein logic, even a very powerful one, may not always capture the peculiar folding behavior of RNA as effectively as a model that explicitly uses RNA secondary structure priors.

That is the lesson here.

RNA is not simply “protein structure prediction, but with nucleotides.”

It has its own grammar.

Conformers May Be the Bigger Story

The most exciting part of this work may not be static 3D prediction. It may be conformer exploration.

The authors report that trRosettaRNA2 can explore RNA conformers and successfully capture structural heterogeneity in ribonuclease P RNA without requiring experimental data. 

That is a major conceptual shift.

Many biological RNAs do not function by sitting still. They function by moving between states. A riboswitch may expose or hide a regulatory element. A viral RNA may adopt alternative folds during replication. A guide RNA may change shape when bound to a protein complex. A therapeutic RNA may lose function if it folds into an unintended structure.

So, predicting a single “best” RNA structure may be insufficient.

For RNA biology, the better question is often: What structural states can this molecule realistically occupy?

trRosettaRNA2 moves the field closer to that question.

Why This Matters for RNA Biotechnology

The implications are broad.

For RNA therapeutics, better structure prediction could help design more stable, functional RNA molecules. For ribozymes and aptamers, it could help identify folds that support catalysis or ligand binding. For synthetic biology, it could improve the design of RNA switches, sensors, and regulatory circuits. For RNA-targeted drug discovery, it could help reveal structural pockets or conformational states that are invisible from sequence alone.

For plant biotechnology and RNA silencing, the implications are more selective but still interesting. Not every dsRNA, siRNA, or circRNA design requires full 3D modeling. Many practical silencing designs still depend on target accessibility, sequence specificity, off-target filtering, processing efficiency, and delivery chemistry.

But as RNA designs become more complex—circular RNAs, structured RNA scaffolds, multi-hairpin constructs, aptamer-guided RNAs, self-amplifying RNAs, or RNA nanostructures—3D prediction becomes more relevant.

At that point, sequence alone is not enough.

A designed RNA may look perfect on paper and still fold into the wrong shape.

The Tool Is Also Available

The trRosettaRNA2 source code is available through the YangLab-SDU GitHub repository, and the pipeline includes secondary structure prediction, 3D structure prediction using an end-to-end neural network, and optional 3D folding by energy minimization. 

The repository notes that users can provide an MSA as input, run the prediction pipeline, and optionally supply custom secondary structures in formats such as dot-bracket, bpseq, ct, or base-pairing probability matrices. 

That flexibility is important. It means researchers are not locked into only one kind of input. If they have experimentally supported or manually curated secondary structure information, they may be able to use it to guide 3D prediction.

The Necessary Caution

This is not a reason to abandon experiments.

Predicted RNA structures still need validation, especially when used for mechanistic claims, therapeutic design, or high-stakes molecular engineering. RNA folding depends on context: ions, temperature, crowding, proteins, modifications, ligand binding, and cellular environment can all affect structure.

A computational model can narrow the search space.

It cannot replace biochemical judgment.

But that is still a meaningful advance. In RNA biology, narrowing the search space is often the difference between guessing blindly and designing intelligently.

A Real Step Toward RNA’s AlphaFold Moment

The phrase “AlphaFold moment” is used often, sometimes too loosely. RNA structure prediction is not simply waiting for one model to solve everything. RNA is dynamic, chemically sensitive, and experimentally underrepresented compared with proteins.

Still, trRosettaRNA2 feels like a serious step forward because it respects something RNA biologists already know:

RNA structure begins with base pairing, but it does not end there.

By combining pre-trained secondary structure prediction, SS-aware attention, end-to-end 3D modeling, and conformer exploration, trRosettaRNA2 gives researchers a more practical route into RNA structural space.

The future of RNA design will not belong only to those who can write sequences.

It will belong to those who can predict what those sequences become.


References / Sources

  1. Wang, W., Peng, Z., & Yang, J. “Predicting RNA 3D structure and conformers using a pre-trained secondary structure model and structure-aware attention.” Nature Machine Intelligence, published 21 April 2026. DOI: 10.1038/s42256-026-01223-x. 

  2. YangLab-SDU. trRosettaRNA2 GitHub repository. Source code and usage notes for the trRosettaRNA2 RNA structure prediction pipeline. 

  3. Yang Lab. trRosettaRNA server for RNA structure prediction. Web server description, inputs, outputs, benchmark notes, and publication updates.

  4. Wang, W., Peng, Z., & Yang, J. Preprint version: “Predicting RNA 3D structure and conformers using a pre-trained secondary structure model and structure-aware attention.” bioRxiv, 2025. 

  5. Wang, W. et al. “trRosettaRNA: automated prediction of RNA 3D structure with transformer network.” Nature Communications, 2023. Listed by Yang Lab as the earlier trRosettaRNA method reference.

Wednesday, April 29, 2026

RNA Is Not a Floppy String: Why We Need Better Mental Models of RNA Structure

 

RNA function depends on form, but the way we draw and talk about RNA often hides what the molecule is really doing.

For years, RNA was introduced as biology’s messenger.

DNA held the instructions. RNA carried the message. Proteins did the serious work.

That simple story was useful for teaching, but it also created a problem. It made RNA sound temporary, passive, and structurally uninteresting — almost like a disposable transcript moving from nucleus to ribosome.

That picture is no longer acceptable.

RNA is now central to some of the most exciting areas in biology and medicine: mRNA vaccines, siRNA drugs, CRISPR guide RNAs, viral genomes, riboswitches, long noncoding RNAs, circular RNAs, RNA sensors, RNA therapeutics, and RNA-targeted small-molecule drugs. The COVID-19 pandemic reminded the world that RNA can be both a disease agent and a therapeutic solution. At the same time, new discoveries continue to show that RNA performs diverse functions in healthy and diseased cells. 

But one misconception still lingers.

Many people still imagine RNA as a floppy, single-stranded line.

That cartoon is convenient. It is also misleading.

RNA is not naturally structureless. RNA is not simply waiting to become important after a protein binds it. RNA is an inherently interactive molecule. It folds, stacks, compacts, switches, remodels, and forms local and long-range contacts. Its function depends not only on the sequence, but also on the shape.

A 2022 PNAS Perspective by Quentin Vicens and Jeffrey S. Kieft, “Thoughts on how to think (and talk) about RNA structure,” makes exactly this point. The authors argue that many foundational features of RNA structure are misunderstood and that the community needs better mental models for how RNA behaves. Their goal is not just semantic. Better thinking about RNA structure leads to better mechanistic models, better experiments, and better interpretation of data. 

The Cartoon Problem

Scientific cartoons shape scientific thinking.

DNA is usually drawn as a double helix. Proteins are drawn as folded globular objects. RNA is often drawn as a wavy line.

The visual message is obvious: DNA has structure, proteins have structure, and RNA is just a loose strand.

But RNA does not become structured only in special cases. Even regions drawn as “loops” or “bulges” may have defined three-dimensional organization. Bases can stack. Noncanonical pairs can form. Loops can orient stems. Junctions can organize larger folds. What looks empty in a secondary structure diagram may be structurally meaningful in 3D. Vicens and Kieft emphasize that repeated cartoon representations can unintentionally reinforce the false idea that RNA is generally limp or unstructured.  

This matters because assumptions become experimental blind spots.

If we assume an mRNA is mostly unstructured, we may overlook structural elements that affect translation, stability, localization, immune recognition, or protein binding. If we treat unpaired regions as meaningless, we may miss tertiary contacts or regulatory motifs. If we assume RNA structure is static, we may miss the conformational switching that actually drives function.

RNA is not a line.

RNA is a structural landscape.

Observation 1: Base Stacking Is a Major Driver of RNA Structure

When people think about RNA structure, they usually think first about Watson–Crick base pairing: A pairs with U, G pairs with C.

That is important, but it is not the whole story.

One of the key points from Vicens and Kieft is that base stacking is fundamental to RNA structure. RNA bases are aromatic rings, and they tend to stack against each other in water. This stacking helps create helical conformations even before we think about classical base-pairing rules. The authors note that understanding RNA structure requires understanding stacking, not only hydrogen bonding.  

This is a useful correction.

RNA does not need perfect Watson–Crick stems to have structure. Even “single-stranded” regions can be locally organized through stacking and other interactions. That means the phrase “single-stranded RNA” should not be mentally translated as “structureless RNA.”

Single-stranded does not mean floppy.

Observation 2: Structured Does Not Mean Static

Another common mistake is to treat structure and flexibility as opposites.

If RNA is structured, we imagine it as fixed.
If RNA is flexible, we imagine it as disordered.

RNA often lives between those extremes.

An RNA molecule may be structured but dynamic. It may populate multiple conformations. It may shift between states depending on temperature, ions, proteins, metabolites, chemical modifications, or mutations. A riboswitch, for example, functions precisely because it changes structure in response to ligand binding. Viral RNAs may remodel during infection. mRNAs may expose or hide regions depending on cellular context.

Vicens and Kieft describe RNA as having a conformational landscape that can change with environment, binding partners, or sequence changes.  

This is a critical point for RNA biology.

The question is not always, “What is the RNA structure?”

Sometimes the better question is:

Which structures can this RNA adopt, and under what conditions?

Observation 3: RNA Is Often Compact

The floppy-strand cartoon also implies extension.

But many RNAs are compact. Bases stack. Local structures form. Long-distance interactions bring distant regions together. Natural RNAs often fold back on themselves, and their 5′ and 3′ ends may be closer in space than a stretched cartoon would suggest.  

This matters for how we think about RNA accessibility.

A target sequence may look exposed in a linear sequence map but buried in a folded structure. A guide RNA target site may be sequence-compatible but structurally inaccessible. A chemical probe may react differently depending on local folding. A protein may need to remodel RNA before binding or translating it.

In other words, RNA accessibility is not just a sequence problem.

It is a structure problem.

Observation 4: Watson–Crick Pairing Is Important, But Not Everything

Watson–Crick base pairing is central to RNA secondary structure, but it can become overemphasized.

Many RNA diagrams show stems as meaningful and loops as empty. That creates a false hierarchy: paired regions look structured, unpaired regions look unstructured.

But folded RNA 3D structures often depend on non-Watson–Crick interactions, base triples, A-minor motifs, ribose zippers, stacking networks, backbone contacts, ion-mediated interactions, and other tertiary features. Vicens and Kieft argue that non-Watson–Crick interactions are often underappreciated, even though they are crucial for stabilizing functional RNA conformations. 

This is especially important for students and researchers who rely heavily on secondary structure prediction.

A dot-bracket structure can be useful. It can show likely base-pairing patterns. But it does not fully describe RNA structure. It does not capture all noncanonical interactions. It does not show 3D packing. It does not reveal all dynamics.

Secondary structure is a map.

It is not the territory.

Observation 5: “Unpaired” Does Not Mean “Unimportant”

The word “unpaired” is dangerous.

It sounds like absence.

But unpaired nucleotides can be essential. They may form tertiary contacts, create recognition surfaces, participate in ligand binding, control folding pathways, or allow conformational switching. Some unpaired bases are exposed because they need to interact. Others are tucked into structured motifs that are invisible in simple diagrams.

This is why RNA structural interpretation must go beyond counting paired and unpaired bases.

A loop may be a binding pocket.
A bulge may bend a helix.
A junction may organize an entire domain.
An exposed base may be a regulatory sensor.

RNA often hides function in places that look “unstructured” on paper.

Observation 6: RNA Structure Must Be Treated as Evidence, Not Decoration

RNA structure should not be added to a model at the end as a pretty figure.

It should guide experimental design from the beginning.

If an RNA region is predicted to regulate translation, test whether disrupting the structure changes translation. If a loop is proposed to bind a protein, mutate the loop without destroying the whole fold. If a long-range contact is suspected, design compensatory mutations. If a chemical probing signal changes under stress, ask whether the structural change is causal or merely correlated.

This is where better mental models become practical.

Good RNA structure thinking improves primer design, mutagenesis, reporter assays, chemical probing, RNA therapeutics, guide RNA engineering, aptamer selection, viral RNA studies, and RNA-targeted drug discovery.

It also prevents overinterpretation.

A predicted structure is not proof. A probing signal is not automatically a mechanism. A base-pairing model is not a complete 3D structure. A beautiful diagram is not biological truth.

RNA structure must be tested.

Why This Matters Now

RNA research is expanding quickly because RNA is now central to biotechnology and medicine.

mRNA vaccines showed the world that RNA can be engineered as a therapeutic platform. siRNAs and antisense oligonucleotides are already clinically important. CRISPR guide RNAs depend on correct folding and protein interaction. Circular RNAs, self-amplifying RNAs, ribozymes, aptamers, and RNA nanostructures are being explored for future applications.

In every case, structure matters.

An RNA therapeutic may fail because it folds incorrectly.
A guide RNA may underperform because its scaffold is disrupted.
An mRNA may degrade faster because structural elements expose vulnerable regions.
A small molecule may bind only one conformational state of an RNA.
A viral RNA element may regulate replication through a structure that sequence analysis alone misses.

This is why the old image of RNA as a floppy strand is not harmless.

It limits imagination.

The Future: Make RNA Structure Easier to Use

One of the most important suggestions from the Vicens and Kieft Perspective is that validated RNA structural information should become easier for all researchers to access and use.

That means better databases, better annotations, better visualization tools, better education, and better integration of experimental and computational evidence. RNA structure information should not remain trapped inside specialist papers or difficult software pipelines. It should become part of everyday biological reasoning.

A molecular biologist studying gene regulation should be able to ask structural questions.
A virologist studying RNA genomes should be able to evaluate RNA folds.
A therapeutic designer should be able to assess structural accessibility.
A student should learn that RNA is not naturally shapeless.

The field needs more than better algorithms.

It needs better habits of thought.

A Better Way to Think About RNA

RNA is not simply a sequence.
RNA is not merely a messenger.
RNA is not a limp strand unless proven otherwise.

RNA is an inherently structured, dynamic, compact, interaction-rich molecule whose function depends on form.

Sometimes that form is stable.
Sometimes it shifts.
Sometimes the important structure is local.
Sometimes it is long-range.
Sometimes the key functional feature is not a Watson–Crick stem but an exposed base, a stacked loop, a transient contact, or an alternative conformer.

To understand RNA, we must stop drawing it as biology’s loose thread.

RNA is closer to a molecular instrument: folded, tuned, responsive, and capable of changing its performance depending on context.

The next generation of RNA biology will depend not only on sequencing more RNAs, but on learning how to see them properly.

And that begins with a better mental picture.


References / Sources

  1. Vicens, Q., & Kieft, J. S. “Thoughts on how to think (and talk) about RNA structure.” Proceedings of the National Academy of Sciences, 119(17), e2112677119, 2022. (PNAS)


Sunday, April 26, 2026

Self-Replicating RNA Motifs, RNAi Signals, and What Actually Matters for Plant dsRNA Design

Self-Replicating RNA Motifs, RNAi Signals, and What Actually Matters for Plant dsRNA Design
Source: https://thernablog.blogspot.com/


RNA design has a way of sounding more mysterious than it really is. People hear that some RNA molecules can replicate themselves, and the next question comes fast: if self-replicating RNA carries special motifs that let it amplify inside cells, could those same motifs be borrowed to make a dsRNA trigger stronger? Could they drive bigger RNAi responses? Could they make spray-induced gene silencing more potent in plants?

It is a smart question. It is also where a lot of confusion begins.

The short answer is that self-replicating RNA motifs and RNAi-relevant motifs are not the same thing. They live in neighboring parts of RNA biology, but they were built for different jobs. One helps an RNA molecule get copied by replication machinery. The other helps a dsRNA molecule get chopped into the right small RNAs and guide efficient silencing of a chosen target.

That distinction matters in plant biotechnology, especially now that dsRNA sprays, SIGS, and RNA-based crop protection are moving from concept toward practical deployment. If the goal is gene silencing, the most important features are usually not exotic replication signals. They are the simple, stubborn design features that determine whether a target window will yield abundant, specific, useful siRNAs.

Why self-replicating RNA sounds like an RNAi shortcut

The appeal is obvious. A self-replicating RNA does more than deliver instructions once. In viral replicons and self-amplifying RNA systems, the RNA contains cis-acting elements and replicase functions that allow the molecule to copy itself inside the cell without producing a full infectious virus. In principle, more copies should mean more RNA substrate. More substrate sounds like more silencing. That is the seduction of the idea.

But mechanistically, that leap is too fast. Replication motifs are not generic “high-RNAi” motifs. They are recognition elements for the replication system. They tell the replicase where to bind, where to start, and how to manage RNA synthesis. In alphavirus-derived self-amplifying systems, for example, conserved terminal sequences, structured untranslated regions, and subgenomic promoter elements are central to amplification. Those are instructions for copying, not instructions for Dicer optimization.

What counts as a self-replicating RNA motif?

Self-replicating RNAs usually carry a small set of recurring cis-acting features. At the ends of the molecule, there are often conserved 5′ and 3′ sequence elements, terminal stem-loops, and other structured regions that help the replication complex recognize the RNA as a template. Some systems also depend on internal cis-replication elements, long-range RNA–RNA interactions, or pseudoknot-like structures that stabilize the architecture needed for replication.

In alphavirus-style self-amplifying RNA, another major feature is the subgenomic promoter. That sequence allows abundant expression of a downstream payload after replication starts. It is one of the reasons self-amplifying RNA can produce more biological output from a relatively small amount of input RNA.

All of that is real. All of it is important. None of it means those motifs are automatically useful as direct enhancers of RNAi.

Why RNAi plays by different rules

RNA interference cares about something else. For a dsRNA trigger to work well, it needs to become a productive substrate for Dicer-like processing and generate small interfering RNAs that are both abundant and specific. In plants, DCL4 and DCL2 are especially important for producing 21- and 22-nucleotide siRNAs in antiviral and post-transcriptional silencing contexts, while DCL3 is more associated with 24-nucleotide siRNAs and transcriptional silencing pathways.

So the logic of a good dsRNA design window is not: Does this look like a viral replication template? The logic is: Will this region produce many useful 21- to 22-nt siRNAs, with low off-target complementarity, acceptable sequence complexity, and a strong chance of hitting the intended transcript cleanly?

That is why good RNAi design usually begins with the target gene, not with replication motifs. A region can be a beautiful replication template and still be a mediocre silencing trigger. It can also be a poor replication template and an excellent dsRNA target window. These are separate design spaces.

Where the two worlds actually overlap

There is one place where self-replicating RNA and RNAi do meaningfully intersect: replication often generates double-stranded RNA intermediates. In plants, those dsRNA intermediates are strong triggers of RNA silencing. That is why viral infection produces abundant virus-derived siRNAs, and why viral replication is so tightly entangled with the plant silencing machinery.

This is the key nuance. A self-replicating system can increase RNAi-related output, but usually because it generates more dsRNA substrate during replication, not because the replication motifs themselves are magical silencing enhancers. The benefit is indirect. The system works as a whole, and the silencing pathway responds to the RNA products that system creates.

That difference matters for design. If you are engineering a full replicon or a plant viral vector, replication motifs matter a great deal because without them the amplification system collapses. But if you are designing a conventional dsRNA trigger for exogenous delivery or SIGS, simply grafting replication motifs onto the ends of your dsRNA is not expected to produce a better silencing molecule by itself.

What this means for plant dsRNA and SIGS design

For plant spray-induced gene silencing, the priorities are much more practical than dramatic. The most useful dsRNA region is usually a target-derived coding window that can generate many distinct, effective siRNAs while minimizing off-target matches elsewhere in the plant or in non-target organisms. Regions with heavy repetition, low complexity, awkward composition, or other obvious sequence liabilities are usually poor choices.

This is why SIGS design is not mainly an exercise in importing motifs from self-replicating RNA. It is an exercise in choosing the right target region. The design question is not, Which viral element can I add? It is, Which segment of this gene is most likely to yield strong and specific silencing?

That may sound less glamorous, but it is usually the correct answer.

There is also a warning buried in the virology literature. Many plant viruses encode viral suppressors of RNA silencing. These proteins evolved to weaken precisely the pathway that RNAi-based crop protection wants to exploit. So not every self-replicating RNA architecture is automatically helpful in a silencing context. A vector can replicate beautifully and still be the wrong system if it carries suppressor functions that undermine the host silencing response.

The practical bottom line

If the goal is strong RNAi in plants, start with the target transcript. Choose windows that are likely to produce abundant 21- and 22-nt siRNAs. Screen hard for off-targets. Avoid repeats and low-complexity regions. Think about sequence quality before you think about clever architecture.

Bring self-replicating RNA motifs into the conversation only when you are deliberately building a true replicon-based system whose job is to amplify RNA inside the host. In that case, those motifs are essential because they enable the system to replicate. But even then, they do not replace the need for good target selection. They help make more RNA. They do not automatically make any RNA a better silencing trigger.

That distinction is easy to blur because both worlds use RNA structure, both can involve double-stranded intermediates, and both can produce powerful biological effects. But in practice, they answer different design questions. One asks how to copy RNA efficiently. The other asks how to silence a gene efficiently.

And in plant dsRNA and SIGS work, knowing which question you are actually trying to solve is half the battle.

References

1.    Vallet T, Vignuzzi M. Self-Amplifying RNA: Advantages and Challenges of a Versatile Platform for Vaccine Development. Viruses. 2025;17(4):566.https://doi.org/10.3390/v17040566

Schmidt C, Muralinath M, Schulzke JD, et al. Self-Amplifying RNA Vaccine Candidates: Alternative Platforms for mRNA Vaccine Development. Pathogens. 2023;12(1):138. https://doi.org/10.3390/pathogens12010138

Vaucheret H, Voinnet O. The plant siRNA landscape. The Plant Cell. 2024;36(2):246-275. https://doi.org/10.1093/plcell/koad253

Wekesa C, Kiprotich K, Muoma J, et al. Small RNAs as systemic signals in plant defense: mechanisms, challenges, and future directions. Molecular Biology Reports. 2026;53:623. https://doi.org/10.1007/s11033-026-11817-8

Chen C, Imran M, Feng X, Shen X, et al. Spray-induced gene silencing for crop protection: recent advances and emerging trends. Frontiers in Plant Science. 2025;16:1527944. https://doi.org/10.3389/fpls.2025.1527944

Roth BM, Pruss GJ, Vance VB. Plant viral suppressors of RNA silencing. Virus Research. 2004;102(1):97-108. https://doi.org/10.1016/j.virusres.2004.01.020

Verchot-Lubicz J, Carr JP. Viral Suppressors of Gene Silencing. In: Mahy BWJ, Van Regenmortel MHV, eds. Encyclopedia of Virology. 3rd ed. 2008:325-332. https://doi.org/10.1016/B978-012374410-4.00718-4

Blevins T, Rajeswaran R, Aregger M, et al. Four plant Dicers mediate viral small RNA biogenesis and DNA virus induced silencing. Nucleic Acids Research. 2006;34(21):6233-6246. https://doi.org/10.1093/nar/gkl886

Thursday, April 23, 2026

Self-Replicating RNA Molecules: Why They Matter for Food, Agriculture, and Medicine

 

Self-Replicating RNA Molecules: Why They Matter for Food, Agriculture, and Medicine
Source

For years, RNA was treated mainly as a messenger—an important but temporary molecule that carried instructions from DNA to the protein-making machinery of the cell. That view has changed dramatically. A newer class of RNA technologies, often called self-replicating RNA or self-amplifying RNA, does something much more ambitious: once inside a cell, the RNA can direct the production of its own replication machinery and generate additional RNA copies. In practical terms, that means one small dose of RNA can produce a much larger biological effect than conventional messenger RNA.

At the center of this technology is the idea of an RNA replicon. Most self-replicating RNA systems are inspired by positive-sense RNA viruses. Scientists keep the parts of the viral genome that are needed for RNA replication, but remove the genes required to make infectious viral particles. What remains is a noninfectious RNA system that can enter a cell, make replication proteins, amplify itself, and drive high-level expression of a chosen payload—such as a vaccine antigen, therapeutic protein, or experimental trait.

What makes self-replicating RNA different?

Conventional mRNA gives the cell one set of instructions and depends entirely on the amount of RNA delivered. Self-replicating RNA adds a built-in amplification step. Once the replication machinery is produced, the RNA can make more copies of itself or of a subgenomic payload, leading to stronger and sometimes more sustained expression from a lower starting dose.

That dose-sparing property is one reason the field has attracted so much attention. It can improve manufacturing efficiency, reduce material requirements, and potentially make large-scale deployment more practical when speed and cost matter. But the technology is not simple. These RNAs still require careful sequence engineering, appropriate delivery systems, and a balance between amplification, stability, and innate immune activation.

Why it matters in medicine

Medicine is where self-replicating RNA has advanced furthest toward real-world use. The most visible application is vaccination. Self-amplifying RNA vaccines are designed to generate strong antigen expression while using lower RNA doses than standard mRNA vaccines. That has obvious implications for manufacturing, stockpiling, and rapid response during outbreaks.

An important milestone has already been reached. The European Medicines Agency lists Kostaive (zapomeran) as a COVID-19 vaccine containing a self-amplifying mRNA molecule. According to the EMA, this RNA contains instructions both for the viral antigen and for a replicase that makes additional RNA copies inside host cells. That regulatory milestone helped move self-replicating RNA from a promising concept into an approved medical platform.

Beyond infectious disease vaccines, self-replicating RNA is being explored for cancer immunotherapy, in vivo protein replacement, and other transient therapeutic applications. The attraction is clear: strong biological output without permanent integration into the genome. In that sense, the platform occupies a useful middle ground between short-lived conventional mRNA and more durable but more complex DNA or viral vector systems.

Even so, challenges remain. Delivery vehicles such as lipid nanoparticles are still central. Reactogenicity, replicase-related innate immune responses, RNA stability, and manufacturing consistency can all affect performance. The promise is real, but the platform still requires careful tuning.

Why it matters in agriculture

Agriculture may prove just as important as medicine. Plant biotechnology has long shown how replication-enabled RNA systems can be used to drive strong transient expression. Plant viral replicons and geminiviral replicon systems have been used to boost gene expression in plants, especially in Nicotiana benthamiana, without creating stable transgenic lines.

That matters first for crop research. Replication-enabled RNA vectors allow rapid testing of genes, pathways, resistance traits, and synthetic constructs. Instead of waiting months for stable transformation, researchers can move quickly from construct design to expression to phenotype.

It also matters for molecular farming. Plants can be used as temporary bioreactors to produce antibodies, enzymes, vaccine antigens, and other valuable proteins. Replicon-based systems improve expression efficiency and help make plant-based biomanufacturing more productive. Recent work using a Bean Yellow Dwarf Virus replicon system to express nattokinase in Nicotiana benthamiana is one example of how these systems can support scalable protein production.

Self-replicating RNA concepts are also relevant to crop protection. They may support transient delivery of protective RNAs, antiviral effectors, or genome-editing components in plant tissues. In the future, they could help create faster, more programmable responses to plant disease and stress without necessarily depending on permanent genomic change.

Why it matters for food systems

The food relevance of self-replicating RNA is broader than it first appears. One major area is animal health. Better RNA vaccines for livestock, poultry, and aquaculture could reduce disease losses, improve productivity, and strengthen food security. A dose-sparing RNA platform is especially attractive when rapid deployment and large population coverage are important.

Another area is food-related biomanufacturing. Replicon-based expression systems in plants can be used to produce enzymes, functional proteins, nutraceutical candidates, or health-related biomolecules relevant to food processing and nutrition. In that sense, self-replicating RNA is not only a therapeutic platform; it is also a manufacturing strategy.

More broadly, flexible RNA platforms may help food systems respond more quickly to emerging biological threats. The same feature that makes self-amplifying RNA attractive in human medicine—rapid redesign with relatively small starting material—also matters in agricultural biosecurity and veterinary preparedness.

Why scientists are excited

The scientific excitement around self-replicating RNA is not just about novelty. It is about leverage. These molecules combine molecular biology, delivery science, synthetic biology, and manufacturing logic in a single programmable platform. They allow transient biology to become much more potent.

There is also a deeper shift in how biology is being engineered. A self-replicating RNA is not merely a passive instruction set. It is a dynamic molecular program that can enter a cell, execute a sequence of events, and amplify its own effect. That is why the technology feels important across so many sectors.

The constraints that still matter

Every powerful platform comes with tradeoffs. For self-replicating RNA, the recurrent issues include delivery efficiency, innate immune sensing, payload size limits, sequence stability, and process consistency during manufacturing.

In plant and agricultural systems, additional concerns include host range, environmental behavior, containment, regulatory acceptance, and public trust. In food applications, cost and consumer perception may be just as influential as technical performance. The future of the field will depend not only on what these molecules can do in principle, but on how reliably and responsibly they can be deployed.

The bigger picture

Self-replicating RNA molecules represent one of the clearest signs that modern biology is becoming more programmable, modular, and cross-sectoral. In medicine, they support lower-dose vaccines and potentially more efficient transient therapeutics. In agriculture, they enable faster expression systems, new crop protection strategies, and stronger molecular farming platforms. In food systems, they may improve animal health, support decentralized production of valuable biomolecules, and strengthen resilience against biological threats.

That is why self-replicating RNA matters. It is not just another RNA format. It is a way of making biological instructions do more with less—and that principle has consequences far beyond any single vaccine, crop, or product.


 

Selected References

Vallet T, Vignuzzi M. Self-Amplifying RNA: Advantages and Challenges of a Versatile Platform for Vaccine Development. Viruses. 2025;17(4):566. doi:10.3390/v17040566. PMCID: PMC12031284.

European Medicines Agency (EMA). Kostaive. European Commission–authorized product information and EPAR overview. Updated 2025. Available from the EMA website.

Wang K, et al. Harnessing Transient Expression Systems with Plant Viral Vectors for the Production of Biopharmaceuticals in Nicotiana benthamiana. 2025. PMCID: PMC12193647.

Wang K, et al. Transient expression of full-length and mature nattokinase in Nicotiana benthamiana using a modified Bean Yellow Dwarf Virus replicon system. Frontiers in Plant Science. 2025.

Kim NS, et al. Efficient production of functional cholera toxin B subunit in Nicotiana benthamiana using geminiviral replicon systems. Frontiers in Bioengineering and Biotechnology. 2025.

Silva-Pilipich N, et al. A Second Revolution of mRNA Vaccines against COVID-19. Vaccines. 2024;12(3):318.