Evolution of Plant microRNA Gene Families: Birth, Expansion,
and Functional Diversification
How plant MIRNA genes arise, duplicate, diversify, co-evolve with targets, and sometimes disappear.
Introduction
Plant development, adaptation, and genome regulation depend
not only on protein-coding genes but also on small regulatory RNAs. Among the
most important of these are microRNAs, or miRNAs: short, non-coding RNAs,
usually around 20-24 nucleotides long, that guide Argonaute-containing
silencing complexes to complementary target transcripts. In plants, this
targeting is often highly sequence-specific and frequently results in
transcript cleavage, although translational repression and other regulatory
outcomes also occur.
The genes that produce miRNAs are known as MIRNA genes. They
are usually transcribed into primary transcripts that fold into stem-loop
structures. These precursors are processed mainly by DICER-LIKE1 and associated
proteins to release mature miRNA duplexes. One strand is loaded into an
Argonaute protein and directs repression of target mRNAs. Because plant miRNAs
often regulate transcription factors, hormone-response genes,
nutrient-homeostasis genes, and disease-resistance genes, small changes in MIRNA
gene copy number, sequence, expression, or targeting can have large
developmental and evolutionary consequences.
The evolution of plant microRNA gene families is therefore a
story of regulatory innovation. Some miRNA families are ancient and deeply conserved
across land plants. Others are recently born, restricted to a species, genus,
or family, and may disappear before becoming functionally embedded. Plant MIRNA
evolution is not a linear march from simple to complex. It is a dynamic cycle
of birth, duplication, divergence, selection, and loss.
What Is a Plant microRNA Gene Family?
A plant microRNA gene family is usually defined by
similarity among mature miRNA sequences and, often, similarity in target
recognition. Multiple MIRNA loci may produce identical or nearly identical
mature miRNAs. These loci can be dispersed across the genome, arranged in
tandem clusters, retained after whole-genome duplication, or generated
independently through local rearrangements.
For example, conserved families such as miR156/157, miR160,
miR164, miR165/166, miR167, miR169, miR172, miR319, and miR396 regulate major
developmental transcription-factor families, including SPL, ARF, NAC, HD-ZIP
III, AP2, TCP, and GRF genes. These modules are central to phase transition,
organ polarity, leaf development, root architecture, flowering, and stress
responses. Their conservation suggests that once a MIRNA-target module becomes
integrated into a core regulatory network, it can be maintained for hundreds of
millions of years.
At the same time, plant genomes contain many
lineage-specific MIRNA genes. These younger loci may have weak expression, less
precise processing, unstable precursor structures, or narrow tissue-specific
activity. Some are evolutionary experiments. A few become useful regulators.
Many are lost.
Birth of New MIRNA Genes
One of the best-supported mechanisms for the origin of new
plant MIRNA genes is inverted duplication of target-gene sequences. In this
model, a fragment of a protein-coding gene is duplicated and inserted in an
inverted orientation near a related sequence. The resulting genomic region can
form a hairpin RNA. Initially, such a hairpin may behave more like a source of
small interfering RNAs, producing heterogeneous small RNAs. Over time,
mutations may refine the foldback structure, improve processing precision, and
favor production of a dominant mature miRNA.
This mechanism is especially elegant because it explains how
a new miRNA can immediately possess complementarity to a biologically relevant
target. If the MIRNA precursor arose from a duplicated fragment of its future
target gene, the mature miRNA may already recognize that target or related
paralogs. The newly born MIRNA gene therefore begins with a plausible
regulatory connection rather than needing to find one entirely by chance.
However, birth is not enough. A young MIRNA locus must pass
several evolutionary filters. It must be transcribed in the right place. Its
precursor must be processed accurately. The mature miRNA must be loaded into
the correct Argonaute complex. Its target interaction must provide a selective
benefit or at least avoid harmful misregulation. Only then can a young MIRNA
gene move from genomic accident to functional regulator.
Duplication and Expansion of MIRNA Families
Once a MIRNA gene becomes useful, duplication can expand its
regulatory influence. Plant genomes are shaped by tandem duplication, segmental
duplication, transposable-element activity, and repeated rounds of whole-genome
duplication. MIRNA genes are not exempt from these forces.
Tandem duplication can create multiple MIRNA copies in close
genomic proximity. Segmental duplication can move related MIRNA loci into
different chromosomal contexts. Whole-genome duplication can duplicate both
MIRNA genes and their target genes at the same time, creating new opportunities
for dosage balance, subfunctionalization, and regulatory divergence.
Expansion of a MIRNA family can increase dosage. More MIRNA
copies may produce more mature miRNA, strengthening repression of target transcripts.
But copy-number expansion can also enable expression divergence. One MIRNA copy
may remain active in leaves, another in roots, another during reproductive
development, and another during stress. Even if mature miRNA sequences remain
identical, promoter divergence can create new spatial and temporal regulation.
This is one reason plant MIRNA gene families are often
functionally more complex than their small size suggests. A mature miRNA
sequence may be conserved, but the genomic loci that produce it can differ in
expression pattern, precursor structure, processing efficiency, and
evolutionary age.
Conservation and Ancient Regulatory Modules
The most deeply conserved plant miRNA families tend to
regulate transcription factors or regulatory proteins. This is not accidental.
Transcription factors sit near the top of developmental control systems. A
single miRNA targeting a transcription-factor family can coordinate entire
developmental programs.
The miR156-SPL module is a classic example. miR156 is
associated with juvenile-to-adult phase transition, flowering, architecture,
and stress-related traits. Its target SPL transcription factors control broad
developmental outputs. The miR172-AP2 module also contributes to phase
transition and flowering. The miR165/166-HD-ZIP III module regulates
adaxial-abaxial polarity, vascular patterning, and meristem function. The
miR160 and miR167 families regulate auxin-response factors, connecting miRNA
evolution to hormone signaling.
These ancient families are usually under strong purifying
selection. Their mature sequences are highly conserved because even small
sequence changes could alter target recognition. Their target sites are also
conserved because disruption may disturb essential developmental programs. In
such cases, the MIRNA gene and its target become an evolutionary unit: each
constrains the other.
Rapid Turnover of Young MIRNA Genes
In contrast to ancient conserved families, young plant MIRNA
genes show rapid birth and death. Many species-specific MIRNA candidates are
found in small-RNA datasets, but not all represent stable evolutionary
innovations. Some may be weakly expressed hairpins, degradation products,
siRNA-like loci, or recently formed precursors that have not yet acquired
canonical features.
Young MIRNA genes often show several features: limited
phylogenetic conservation, lower expression, less precise processing, weaker
evidence of Argonaute loading, and uncertain target repression. Over
evolutionary time, most are lost. A small fraction gradually acquire stronger
precursor structure, more accurate processing, more consistent mature miRNA
accumulation, and biologically meaningful targets.
This creates a layered MIRNA repertoire. At the bottom are
newly formed, unstable, or weakly functional hairpin loci. In the middle are
lineage-specific MIRNA genes with emerging regulatory roles. At the top are
deeply conserved MIRNA families embedded in core plant biology.
Whole-Genome Duplication and MIRNA Family Evolution
Whole-genome duplication is a major force in plant
evolution. Many angiosperm lineages have experienced one or more genome
duplication events. After such events, most duplicated genes are eventually
lost, but some are retained because they provide dosage balance, developmental
flexibility, or raw material for innovation.
MIRNA genes can be retained after whole-genome duplication,
but their fate depends on both the MIRNA locus and its target network. If a
MIRNA and its target genes are duplicated together, the regulatory relationship
may be preserved. Alternatively, one MIRNA copy may be lost while target
duplicates diverge. In other cases, retained MIRNA duplicates may acquire
different expression domains or subtly different mature sequences.
The consequences can be significant. A duplicated MIRNA
family member may regulate one subset of target paralogs, while another copy
regulates a different subset. This can help duplicated protein-coding genes
escape identical regulation and develop new functions. Thus, MIRNA evolution
after whole-genome duplication contributes not only to small-RNA diversity but
also to the rewiring of gene regulatory networks.
Target Co-evolution
Plant miRNA evolution cannot be understood by looking only
at MIRNA genes. The target genes evolve too. A miRNA target site may be
conserved, lost, duplicated, or modified. Target-site changes can weaken
regulation, create new regulation, or shift a transcript from one miRNA family
to another.
This co-evolution is especially visible in duplicated gene
families. If a transcription-factor family expands, some paralogs may retain
the ancestral miRNA target site while others lose it. The result is regulatory
partitioning. One group remains under miRNA control; another escapes repression
and may evolve a new expression pattern.
Defense-related NBS-LRR genes provide a striking example of
miRNA-target co-evolution. These genes often occur in large, rapidly evolving
clusters. Because overexpression of immune receptors can be costly, miRNAs that
target conserved motifs in duplicated NBS-LRR transcripts may help control
immune-gene dosage. In some cases, the same type of target gene expansion that
creates regulatory problems may also generate the inverted-repeat structures
from which new miRNAs arise. The target family therefore helps produce its own
regulator.
Functional Diversification Within MIRNA Families
After duplication, MIRNA family members can diversify in several ways.
First, they can diverge in expression. Two MIRNA loci
producing the same mature sequence may be active in different tissues,
developmental stages, or environmental conditions.
Second, they can diverge in precursor structure. Changes in
the stem-loop can affect DCL1 processing accuracy, mature miRNA abundance, or
production of alternative small RNAs from the same precursor.
Third, mature miRNA sequences can diverge. Even one or two
nucleotide substitutions, especially in target-recognition regions, can shift
target specificity.
Fourth, regulatory context can change. A MIRNA locus may
acquire new promoter elements, become responsive to stress, or be integrated
into hormone signaling.
Through these processes, MIRNA family members can undergo
subfunctionalization, where ancestral functions are partitioned among
duplicates, or neofunctionalization, where one copy acquires a new role.
Loss of MIRNA Genes
Loss is as important as gain. MIRNA genes may be deleted,
silenced, structurally degraded, or rendered nonfunctional by mutations that
disrupt processing or expression. Target sites can also be lost, making the
MIRNA irrelevant even if the MIRNA gene remains.
Loss may occur because a young MIRNA never provided a
selective advantage. It may also occur because regulation becomes harmful under
new ecological or developmental conditions. In some cases, loss of a MIRNA or
target site may release a gene from repression and contribute to phenotypic
diversification.
This turnover explains why the MIRNA complement differs
strongly among plant species. Conserved families provide a stable regulatory
backbone, while lineage-specific families reflect recent evolutionary experimentation.
Evolutionary Significance
The evolution of plant microRNA gene families reveals how
genomes build regulatory complexity without needing entirely new proteins. A
short RNA sequence, if produced accurately and expressed in the right context,
can regulate many transcripts. This makes miRNAs powerful tools for
coordinating gene families, buffering expression noise, and fine-tuning
developmental transitions.
Plant MIRNA evolution also shows that regulatory networks
are modular. A miRNA and its target family can form a portable regulatory unit.
Once established, such a module can be duplicated, modified, lost, or
redeployed. This modularity helps plants adapt to new body plans, reproductive
strategies, stress environments, and pathogen pressures.
Future Directions
Several questions remain central to the field. How many
lineage-specific MIRNA annotations are truly functional? What structural
features determine whether a young hairpin becomes a canonical MIRNA gene? How
often do new MIRNA genes arise from target-gene fragments, transposable
elements, or random hairpins? How do MIRNA duplicates partition expression
after whole-genome duplication? And how does target-site evolution contribute
to crop domestication and adaptation?
Long-read transcriptomics, improved small-RNA sequencing,
degradome analysis, Argonaute immunoprecipitation, comparative genomics, and
genome editing are now making these questions more tractable. In crops,
understanding MIRNA family evolution may help researchers manipulate architecture,
flowering time, stress tolerance, nutrient use, and immunity with greater
precision.
Conclusion
Plant microRNA gene families evolve through a balance of
conservation and experimentation. Ancient families such as miR156, miR160,
miR165/166, miR167, miR172, and miR396 form deeply conserved regulatory
circuits that control development and physiology. Younger MIRNA genes arise
continuously through inverted duplication, local rearrangement, duplication,
and genome-scale events. Most are lost, but a few become integrated into
functional networks.
The evolution of plant MIRNA families is therefore not
merely the history of small RNA genes. It is the history of how plants refine
gene regulation, absorb genome duplication, control expanding gene families, and
generate developmental and adaptive diversity from short RNA sequences.
Selected References
Evolution of plant microRNA gene families. Cell Research, 2007. https://www.nature.com/articles/7310113
Evolution of plant microRNAs and their targets. Trends in
Plant Science, 2008. https://doi.org/10.1016/j.tplants.2008.03.009
Origins and Evolution of MicroRNA Genes in Plant Species.
Genome Biology and Evolution, 2012.
https://pmc.ncbi.nlm.nih.gov/articles/PMC3318440/
The evolution of microRNAs in plants. Current Opinion in
Plant Biology, 2017. https://pmc.ncbi.nlm.nih.gov/articles/PMC5342909/
MicroRNA Gene Evolution in Arabidopsis lyrata and
Arabidopsis thaliana. The Plant Cell, 2010.
https://pmc.ncbi.nlm.nih.gov/articles/PMC2879733/
Conservation and evolution of miRNA regulatory programs in
plant development. Current Opinion in Plant Biology, 2007.
https://pmc.ncbi.nlm.nih.gov/articles/PMC2080797/
De novo origination of MIRNAs through generation of short
inverted repeats in target genes. RNA Biology, 2019.
https://pmc.ncbi.nlm.nih.gov/articles/PMC6546375/
No comments:
Post a Comment