Monday, June 29, 2026

Evolution of Plant microRNA Gene Families: Birth, Expansion, and Functional Diversification of Small RNA Regulators

 

Evolution of Plant microRNA Gene Families: Birth, Expansion, and Functional Diversification

How plant MIRNA genes arise, duplicate, diversify, co-evolve with targets, and sometimes disappear.

Introduction

Plant development, adaptation, and genome regulation depend not only on protein-coding genes but also on small regulatory RNAs. Among the most important of these are microRNAs, or miRNAs: short, non-coding RNAs, usually around 20-24 nucleotides long, that guide Argonaute-containing silencing complexes to complementary target transcripts. In plants, this targeting is often highly sequence-specific and frequently results in transcript cleavage, although translational repression and other regulatory outcomes also occur.

The genes that produce miRNAs are known as MIRNA genes. They are usually transcribed into primary transcripts that fold into stem-loop structures. These precursors are processed mainly by DICER-LIKE1 and associated proteins to release mature miRNA duplexes. One strand is loaded into an Argonaute protein and directs repression of target mRNAs. Because plant miRNAs often regulate transcription factors, hormone-response genes, nutrient-homeostasis genes, and disease-resistance genes, small changes in MIRNA gene copy number, sequence, expression, or targeting can have large developmental and evolutionary consequences.

The evolution of plant microRNA gene families is therefore a story of regulatory innovation. Some miRNA families are ancient and deeply conserved across land plants. Others are recently born, restricted to a species, genus, or family, and may disappear before becoming functionally embedded. Plant MIRNA evolution is not a linear march from simple to complex. It is a dynamic cycle of birth, duplication, divergence, selection, and loss.

What Is a Plant microRNA Gene Family?

A plant microRNA gene family is usually defined by similarity among mature miRNA sequences and, often, similarity in target recognition. Multiple MIRNA loci may produce identical or nearly identical mature miRNAs. These loci can be dispersed across the genome, arranged in tandem clusters, retained after whole-genome duplication, or generated independently through local rearrangements.

For example, conserved families such as miR156/157, miR160, miR164, miR165/166, miR167, miR169, miR172, miR319, and miR396 regulate major developmental transcription-factor families, including SPL, ARF, NAC, HD-ZIP III, AP2, TCP, and GRF genes. These modules are central to phase transition, organ polarity, leaf development, root architecture, flowering, and stress responses. Their conservation suggests that once a MIRNA-target module becomes integrated into a core regulatory network, it can be maintained for hundreds of millions of years.

At the same time, plant genomes contain many lineage-specific MIRNA genes. These younger loci may have weak expression, less precise processing, unstable precursor structures, or narrow tissue-specific activity. Some are evolutionary experiments. A few become useful regulators. Many are lost.

Birth of New MIRNA Genes

One of the best-supported mechanisms for the origin of new plant MIRNA genes is inverted duplication of target-gene sequences. In this model, a fragment of a protein-coding gene is duplicated and inserted in an inverted orientation near a related sequence. The resulting genomic region can form a hairpin RNA. Initially, such a hairpin may behave more like a source of small interfering RNAs, producing heterogeneous small RNAs. Over time, mutations may refine the foldback structure, improve processing precision, and favor production of a dominant mature miRNA.

This mechanism is especially elegant because it explains how a new miRNA can immediately possess complementarity to a biologically relevant target. If the MIRNA precursor arose from a duplicated fragment of its future target gene, the mature miRNA may already recognize that target or related paralogs. The newly born MIRNA gene therefore begins with a plausible regulatory connection rather than needing to find one entirely by chance.

However, birth is not enough. A young MIRNA locus must pass several evolutionary filters. It must be transcribed in the right place. Its precursor must be processed accurately. The mature miRNA must be loaded into the correct Argonaute complex. Its target interaction must provide a selective benefit or at least avoid harmful misregulation. Only then can a young MIRNA gene move from genomic accident to functional regulator.

Duplication and Expansion of MIRNA Families

Once a MIRNA gene becomes useful, duplication can expand its regulatory influence. Plant genomes are shaped by tandem duplication, segmental duplication, transposable-element activity, and repeated rounds of whole-genome duplication. MIRNA genes are not exempt from these forces.

Tandem duplication can create multiple MIRNA copies in close genomic proximity. Segmental duplication can move related MIRNA loci into different chromosomal contexts. Whole-genome duplication can duplicate both MIRNA genes and their target genes at the same time, creating new opportunities for dosage balance, subfunctionalization, and regulatory divergence.

Expansion of a MIRNA family can increase dosage. More MIRNA copies may produce more mature miRNA, strengthening repression of target transcripts. But copy-number expansion can also enable expression divergence. One MIRNA copy may remain active in leaves, another in roots, another during reproductive development, and another during stress. Even if mature miRNA sequences remain identical, promoter divergence can create new spatial and temporal regulation.

This is one reason plant MIRNA gene families are often functionally more complex than their small size suggests. A mature miRNA sequence may be conserved, but the genomic loci that produce it can differ in expression pattern, precursor structure, processing efficiency, and evolutionary age.

Conservation and Ancient Regulatory Modules

The most deeply conserved plant miRNA families tend to regulate transcription factors or regulatory proteins. This is not accidental. Transcription factors sit near the top of developmental control systems. A single miRNA targeting a transcription-factor family can coordinate entire developmental programs.

The miR156-SPL module is a classic example. miR156 is associated with juvenile-to-adult phase transition, flowering, architecture, and stress-related traits. Its target SPL transcription factors control broad developmental outputs. The miR172-AP2 module also contributes to phase transition and flowering. The miR165/166-HD-ZIP III module regulates adaxial-abaxial polarity, vascular patterning, and meristem function. The miR160 and miR167 families regulate auxin-response factors, connecting miRNA evolution to hormone signaling.

These ancient families are usually under strong purifying selection. Their mature sequences are highly conserved because even small sequence changes could alter target recognition. Their target sites are also conserved because disruption may disturb essential developmental programs. In such cases, the MIRNA gene and its target become an evolutionary unit: each constrains the other.

Rapid Turnover of Young MIRNA Genes

In contrast to ancient conserved families, young plant MIRNA genes show rapid birth and death. Many species-specific MIRNA candidates are found in small-RNA datasets, but not all represent stable evolutionary innovations. Some may be weakly expressed hairpins, degradation products, siRNA-like loci, or recently formed precursors that have not yet acquired canonical features.

Young MIRNA genes often show several features: limited phylogenetic conservation, lower expression, less precise processing, weaker evidence of Argonaute loading, and uncertain target repression. Over evolutionary time, most are lost. A small fraction gradually acquire stronger precursor structure, more accurate processing, more consistent mature miRNA accumulation, and biologically meaningful targets.

This creates a layered MIRNA repertoire. At the bottom are newly formed, unstable, or weakly functional hairpin loci. In the middle are lineage-specific MIRNA genes with emerging regulatory roles. At the top are deeply conserved MIRNA families embedded in core plant biology.

Whole-Genome Duplication and MIRNA Family Evolution

Whole-genome duplication is a major force in plant evolution. Many angiosperm lineages have experienced one or more genome duplication events. After such events, most duplicated genes are eventually lost, but some are retained because they provide dosage balance, developmental flexibility, or raw material for innovation.

MIRNA genes can be retained after whole-genome duplication, but their fate depends on both the MIRNA locus and its target network. If a MIRNA and its target genes are duplicated together, the regulatory relationship may be preserved. Alternatively, one MIRNA copy may be lost while target duplicates diverge. In other cases, retained MIRNA duplicates may acquire different expression domains or subtly different mature sequences.

The consequences can be significant. A duplicated MIRNA family member may regulate one subset of target paralogs, while another copy regulates a different subset. This can help duplicated protein-coding genes escape identical regulation and develop new functions. Thus, MIRNA evolution after whole-genome duplication contributes not only to small-RNA diversity but also to the rewiring of gene regulatory networks.

Target Co-evolution

Plant miRNA evolution cannot be understood by looking only at MIRNA genes. The target genes evolve too. A miRNA target site may be conserved, lost, duplicated, or modified. Target-site changes can weaken regulation, create new regulation, or shift a transcript from one miRNA family to another.

This co-evolution is especially visible in duplicated gene families. If a transcription-factor family expands, some paralogs may retain the ancestral miRNA target site while others lose it. The result is regulatory partitioning. One group remains under miRNA control; another escapes repression and may evolve a new expression pattern.

Defense-related NBS-LRR genes provide a striking example of miRNA-target co-evolution. These genes often occur in large, rapidly evolving clusters. Because overexpression of immune receptors can be costly, miRNAs that target conserved motifs in duplicated NBS-LRR transcripts may help control immune-gene dosage. In some cases, the same type of target gene expansion that creates regulatory problems may also generate the inverted-repeat structures from which new miRNAs arise. The target family therefore helps produce its own regulator.

Functional Diversification Within MIRNA Families

After duplication, MIRNA family members can diversify in several ways.

First, they can diverge in expression. Two MIRNA loci producing the same mature sequence may be active in different tissues, developmental stages, or environmental conditions.

Second, they can diverge in precursor structure. Changes in the stem-loop can affect DCL1 processing accuracy, mature miRNA abundance, or production of alternative small RNAs from the same precursor.

Third, mature miRNA sequences can diverge. Even one or two nucleotide substitutions, especially in target-recognition regions, can shift target specificity.

Fourth, regulatory context can change. A MIRNA locus may acquire new promoter elements, become responsive to stress, or be integrated into hormone signaling.

Through these processes, MIRNA family members can undergo subfunctionalization, where ancestral functions are partitioned among duplicates, or neofunctionalization, where one copy acquires a new role.

Loss of MIRNA Genes

Loss is as important as gain. MIRNA genes may be deleted, silenced, structurally degraded, or rendered nonfunctional by mutations that disrupt processing or expression. Target sites can also be lost, making the MIRNA irrelevant even if the MIRNA gene remains.

Loss may occur because a young MIRNA never provided a selective advantage. It may also occur because regulation becomes harmful under new ecological or developmental conditions. In some cases, loss of a MIRNA or target site may release a gene from repression and contribute to phenotypic diversification.

This turnover explains why the MIRNA complement differs strongly among plant species. Conserved families provide a stable regulatory backbone, while lineage-specific families reflect recent evolutionary experimentation.

Evolutionary Significance

The evolution of plant microRNA gene families reveals how genomes build regulatory complexity without needing entirely new proteins. A short RNA sequence, if produced accurately and expressed in the right context, can regulate many transcripts. This makes miRNAs powerful tools for coordinating gene families, buffering expression noise, and fine-tuning developmental transitions.

Plant MIRNA evolution also shows that regulatory networks are modular. A miRNA and its target family can form a portable regulatory unit. Once established, such a module can be duplicated, modified, lost, or redeployed. This modularity helps plants adapt to new body plans, reproductive strategies, stress environments, and pathogen pressures.

Future Directions

Several questions remain central to the field. How many lineage-specific MIRNA annotations are truly functional? What structural features determine whether a young hairpin becomes a canonical MIRNA gene? How often do new MIRNA genes arise from target-gene fragments, transposable elements, or random hairpins? How do MIRNA duplicates partition expression after whole-genome duplication? And how does target-site evolution contribute to crop domestication and adaptation?

Long-read transcriptomics, improved small-RNA sequencing, degradome analysis, Argonaute immunoprecipitation, comparative genomics, and genome editing are now making these questions more tractable. In crops, understanding MIRNA family evolution may help researchers manipulate architecture, flowering time, stress tolerance, nutrient use, and immunity with greater precision.

Conclusion

Plant microRNA gene families evolve through a balance of conservation and experimentation. Ancient families such as miR156, miR160, miR165/166, miR167, miR172, and miR396 form deeply conserved regulatory circuits that control development and physiology. Younger MIRNA genes arise continuously through inverted duplication, local rearrangement, duplication, and genome-scale events. Most are lost, but a few become integrated into functional networks.

The evolution of plant MIRNA families is therefore not merely the history of small RNA genes. It is the history of how plants refine gene regulation, absorb genome duplication, control expanding gene families, and generate developmental and adaptive diversity from short RNA sequences.

Selected References

Evolution of plant microRNA gene families. Cell Research, 2007. https://www.nature.com/articles/7310113

Evolution of plant microRNAs and their targets. Trends in Plant Science, 2008. https://doi.org/10.1016/j.tplants.2008.03.009

Origins and Evolution of MicroRNA Genes in Plant Species. Genome Biology and Evolution, 2012. https://pmc.ncbi.nlm.nih.gov/articles/PMC3318440/

The evolution of microRNAs in plants. Current Opinion in Plant Biology, 2017. https://pmc.ncbi.nlm.nih.gov/articles/PMC5342909/

MicroRNA Gene Evolution in Arabidopsis lyrata and Arabidopsis thaliana. The Plant Cell, 2010. https://pmc.ncbi.nlm.nih.gov/articles/PMC2879733/

Conservation and evolution of miRNA regulatory programs in plant development. Current Opinion in Plant Biology, 2007. https://pmc.ncbi.nlm.nih.gov/articles/PMC2080797/

De novo origination of MIRNAs through generation of short inverted repeats in target genes. RNA Biology, 2019. https://pmc.ncbi.nlm.nih.gov/articles/PMC6546375/

No comments: