Mol. Cells 2016; 39(5): 375-381
Published online April 27, 2016
https://doi.org/10.14348/molcells.2016.0013
© The Korean Society for Molecular and Cellular Biology
Correspondence to : *Correspondence: chi13@korea.ac.kr
MicroRNAs (miRNAs) are small non-coding RNAs (∼22 nucleotides) regulating gene expression at the post-transcriptional level. By directing the RNA-induced silencing complex (RISC) to bind specific target mRNAs, miRNA can repress target genes and affect various biological phenotypes. Functional miRNA target recognition is known to majorly attribute specificity to consecutive pairing with seed region (position 2?8) of miRNA. Recent advances in a transcriptome-wide method of mapping miRNA binding sites (Ago HITS-CLIP) elucidated that a large portion of miRNA-target interactions
Keywords argonaute, CLIP, microRNA, non-canonical targets
microRNAs (miRNAs) are single stranded non-coding RNA molecules of ∼22 nucleotides (nt) that regulate gene expression via post-transcriptional and/or translational repression (Ambros, 2004). Primary miRNA (pri-miRNAs) are transcribed in the nucleus by RNA polymerase II or III, where ∼70 nt stem-loop miRNA precursors (pre-miRNAs) are subsequently excised by the microprocessor complex containing the RNase III enzyme Drosha, and exported to the cytoplasm via Exportin-5 (Kim et al., 2009). Dicer, another RNase III enzyme, further processes pre-miRNAs to produce mature miRNAs, the final product being a ∼22 base-pair duplex with 2 nt-long 3′ overhangs (He and Hannon, 2004). Then, one strand of the mature miRNA is loaded onto Argonaute (Ago, also known as Eif2c), a core protein of the RNA-induced silencing complex (RISC). miRNA forms base pairs with a target mRNA as a guide for Ago binding and to direct the specificity of the RISC effector, decreasing target mRNA levels and/or its translation (Fabian et al., 2010), where mRNA destabilization is the dominant mechanism (Eichhorn et al., 2014; Guo et al., 2010).
miRNAs are abundant in the mammalian genome (more than 2000 human miRNAs are currently reported in miRBase) (Kozomara and Griffiths-Jones, 2014) and their regulatory role is essential, affecting various biological phenomena (Kim, 2005; Sim et al., 2014). Supporting evidence derives from the fact that a lethal phenotype during early development was observed in Dicer1-null (Bernstein et al., 2003) or Ago2-null (Liu et al., 2004) mice, and various biological defects were also reported after losses of individual miRNAs (Park et al., 2010). In addition, alterations of miRNA regulation are related to many diseases such as neurological disorders (Hebert and De Strooper, 2009), various types of cancer (Croce, 2009), and cardiovascular diseases (Olson, 2014). Importantly, all of these defects were ultimately caused by a dysregulation in target gene expression. Therefore, identification of miRNA targets is the key for understanding miRNA function. However, the limitation here is our ability to delineate a general principle for identification of specific RNA targets upon which miRNAs act. The problem stems from the observation that most of miRNA target sites have partial complementarity (Ambros, 2004).
In contrast to plants, a near-perfect base pairing of miRNA to its target is rare in animals, making it a challenge to predict the target sites (Bartel, 2009). However, initial prediction attempts provided evidence that local short stretches (≥ 6 nt) of consecutive base-pairing significantly contribute to target recognition (John et al., 2004; Krek et al., 2005; Lewis et al., 2003; Stark et al., 2003). Conceptually termed as the “nucleus,” the short consecutive matches could initiate a miRNA-target duplex, followed by the propagation of partial annealing that may further stabilize miRNA-target hybridization (Filipowicz, 2005; Rajewsky, 2006). Intriguingly, nuclei were further found to be typically located in the 5′ end region of miRNAs called the “seed”, enabling the prediction of miRNA target sites (Lewis et al., 2003).
Functional miRNA-target interactions are known to majorly require as few as 6-nt matches within the seed region (position 2-8, Fig. 1A) (Bartel, 2009). There are possible 6-mers (positions 1?6, 2?7, and 3?8), 7-mers (positions 2?8 and 1?7), and 8-mer (position 1?8) matches in the seed. Otherwise, a 6-mer match to position 3?8 is called an “offset 6-mer seed” because of its position and a marginal effect on repression (Friedman et al., 2009). Such canonical seed sites were initially known by early biological studies (Lee et al., 1993; Poy et al., 2004; Wightman et al., 1993), which were further validated by microarray experiments that detected enrichment of seed matches in miRNA-dependent transcripts showing repression (Grimson, 2007; Lim, 2005), and also by bioinformatics analyses, which found widespread conservation of seed sites in 3′ untranslated regions (3′ UTRs) in multi-genome sequences (Lewis et al., 2005; Xie et al., 2005). Seed-pairing rules have been informative in prediction and analysis of canonical seed sites, often in combination with evolutionary conservation (Friedman et al., 2009; Lewis et al., 2005), secondary structure (Long et al., 2007), or neighboring context information (Grimson, 2007). However, since a 6 nt match presents on average every ∼4,000 nt, likely to be occurred often by chance, such strategies still suffer from both false-positive (∼40?66%) and false-negative predictions (∼50?70%) (Mourelatos, 2008) even in the usage of microarray or proteomic approaches (Baek et al., 2008; Selbach et al., 2008). Furthermore, seed-pairing rules cannot identify non-canonical target sites, which have been reported as functional (Brodersen and Voinnet, 2009).
Since seed-pairing rules are widely adopted, there has been an unintentional bias to study only the canonical seed matches, overlooking the non-canonical targets. Nevertheless, several biological studies have functionally validated that perfectly matched miRNA seeds are neither necessary nor sufficient for all functional miRNA-target interactions (Brodersen and Voinnet, 2009). For example, supplementary components in near-perfect sites compensate for imperfect seed matches and are functional for target cleavage (miR-196 for
When seed-pairing rules were applied, putative miRNA targets from microarrays that showed miRNA-dependent repression were often demonstrated to have high false-negatives, implicating prevalent usage of non-canonical target sites. In lieu of this, microarray analysis of miR-24-transfected K562 cells found that several miR-24 targets are repressed through non-canonical sites, named “seedless” recognition elements (Fig. 1B) (Lal et al., 2009). In addition, “centered sites”, comprising 11?12 consecutive base-pairing to the center of miRNA, were also identified by the analysis of microarray data where neither perfect seed nor 3′ compensatory pairing was observed (Fig. 1B) (Shin et al., 2010). However, the limitation of such studies is that, lacking information on precise binding sites, they are unable to distinguish between direct and indirect miRNA targets.
Uncertainty in direct miRNA target sites necessitates the development of experimental methods capable of recovering miRNAs physically associated with their targets (Easow et al., 2007). Initially, biochemical isolation of miRNA-mRNA complexes via Ago protein-specific immunoprecipitation was attempted in order to purify mRNAs bound by Ago-miRNA (Easow et al., 2007; Hammell, 2008; Hendrickson et al., 2008; Karginov, 2007). However, the integrity of the approach was questioned because of the possible high background caused by nonspecific RNA-protein interactions, especially mediated by
Ago HITS-CLIP was the first to offer a general means of mapping precise miRNA target sites and has been widely applied to cultured cells (Haecker et al., 2012; Hafner et al., 2010; Kim et al., 2015; Kishore et al., 2011; Leung et al.; Loeb et al., 2012; Riley et al., 2012a; Xue et al., 2013), tissues (Boudreau et al., 2014; Chi et al., 2009; Kameswaran et al., 2014), and even to a whole organism (
Non-canonical miRNA-target sites called “nucleation bulges” were identified by analyzing Ago HITS-CLIP “orphan clusters” (Chi et al., 2012). Initially, G-bulge sites for miR-124 were found to be abundant in the mouse brain, where the target sites matched to the seed (positions 2?7) contained a bulged-out G nucleotide corresponding to position between 5 and 6 of the miRNA (miR-124 for
Application of the pivot pairing rule successfully decoded the non-canonical nucleation bulge sites, comprising ≥15% of all Ago-miRNA-mRNA interactions in the mouse brain (Chi et al., 2012). Nucleation bulge sites were also observed in Ago HITS-CLIP analyses performed in the human brain (Boudreau et al., 2014) and several cell lines (Hafner et al., 2010). In addition, their sequences are evolutionally conserved (Chi et al., 2012). The pivot pairing rule improved both quality and quantity of miRNA target sites in their identification (Stefani and Slack, 2012) since it can serve as a general rule that can be incorporated in any computational analysis (Kim et al., 2013).
In addition to nucleation bulges, “seed-like motifs” that contain mismatches in seed pairing were found by examining differential Ago HITS-CLIP binding sites in miR-155 deficient T cells (miR-155 for
To explain the pivot pairing rule, a hypothetical phase named “transitional nucleation state” was proposed (Fig. 2) (Chi et al., 2012). Combining the concepts of a “nucleus” (Filipowicz, 2005; Rajewsky, 2006; Tomari and Zamore, 2005) and findings from structural studies for recognition mechanisms of Ago silencing complexes - nucleation, propagation and cleavage of target RNAs (Schirle et al., 2014; Wang et al., 2009), transitional nucleation is defined as a transient miRNA-target duplex with a 5-base-paired nucleation (position 2?6) (Fig. 2B). If the transitional nucleation becomes sufficiently stable to form, this state may be further transformed into a bulge formation where the originally matched pivot nucleotide in position 6 becomes bulged-out and subsequently extended to hybridization towards the 3′ end of the miRNA (further than position 6, Fig. 2) (Chi et al., 2012). This model is also well supported by several structural studies of Ago (Elkayam et al., 2012; Nakanishi et al., 2012; Schirle and MacRae, 2012) where nucleotides poised for transitional nucleation (position 2?6) are particularly prearranged becoming A-form helical structures, which are susceptible for base pairing. Intriguingly, such A-form-like helical geometry is disrupted after a pivot (between position 6 and 7) formed by a kink resulting from the insertion of the amino acid isoleucine (I365) from the human Ago2 protein (Elkayam et al., 2012; Schirle and MacRae, 2012). Thus, in theory, any target site pairing to the seed region (either a seed match or a nucleation bulge) requires a shift of this nuclear helix to overcome the kink. In support of this model, single-molecule analysis showed such stepwise processes whereby Ago2 initially scans for target sites using a small region (position 2?4) (Chandradoss et al., 2015) and subsequently mediates a rapid and stable binding to the seed region of a miRNA (Jo et al., 2015; Salomon et al., 2015), serving as a proofreading procedure for target recognition (Yao et al., 2015).
miRNA is reshaped by loading onto Ago, being divided into several functional domains-the anchor, seed, central, 3′ supplementary, and tail regions (Fig. 3) (Salomon et al., 2015; Schirle et al., 2014; Wee et al., 2012). Importantly, the seed region has two prearranged continuous base stacking configurations (positions 2?6 and 7?9) caused by kinks at nucleotides 6?7 and 9?10 (Fig. 3A) (Elkayam et al., 2012; Schirle and MacRae, 2012; Schirle et al., 2014). Therefore, transitional nucleation starts pairing through helix 2?6 and subsequently propagates to helices 7?9, overcoming the kink at 6?7 for cases of 5′ dominant interactions (Fig. 3B), such as the seed (Bartel, 2009) or nucleation bulge sites (Chi et al., 2012). The opposite may also happen for central dominant interactions (Shin et al., 2010) - the interaction could be initiated by paring through helix 7?9 along with the central region (positions 10?12) and further expand up to the 3′ supplementary region (positions 13?16) (Fig. 3C). In this case, crossing the barrier of the kink at position 9?10 may be required. For “seed-like motifs”, where seed sites contain mismatches, deletions, or wobble pairings (Grosswendt et al., 2014; Helwak et al., 2013; Loeb et al., 2012; Moore et al., 2015), transitional nucleation may require 3′ compensatory interactions (Fig. 3D), which could be a general determinant of additional specificity for Ago binding as shown by CLEAR-CLIP (Moore et al., 2015).
Biological systems initially generate marginally effective non-canonical regulations when existing biological mechanisms require alternative strategies to compensate for what major canonical pathways have been unable to accomplish. Followed by this notion, majority of non-canonical miRNA-target sites were shown to mediate gene repression at a modest level (Chi et al., 2012; Helwak et al., 2013; Lal et al., 2009; Loeb et al., 2012; Moore et al., 2015) only except for centered sites (Shin et al., 2010), which can trigger slicing activity of Ago but only exist as few in whole transcriptome. However, such modest repression, shown by which the most of non-canonical target sites including nucleation bulges for miRNAs, was often observed as insignificant in large-scale gene expression analyses (Agarwal et al., 2015). These were possibly because the marginal repression was confounded by the issue of cellular heterogeneity, variability derived from secondary effects of target repression, and differences in sensitivity and threshold used in the analyses, or non-canonical sites identified by Ago CLIP based methods could be the true interaction but may not be always functional as the consequence of transient bindings for searching targets (Chandradoss et al., 2015) or as requiring combinatorial occurrences of target sites (Krek et al., 2005). In fact, evaluation of target repression at individual single cell level (Moore et al., 2015) and gene expression analyses in combination with CLIP data (Chi et al., 2012; Grosswendt et al., 2014; Helwak et al., 2013; Loeb et al., 2012; Moore et al., 2015) did observe significant repression mediated by non-canonical sites albeit the effect is still marginal. Since CLIP data only indicate the direct bindings, they should be analyzed together with gene expression data to access the functionality. Future studies should be performed carefully to clear out such issues whenever they analyze marginal effects from non-canonical interactions.
The modest effects from the widespread non-canonical sites are likely to be caused by reduced numbers of targets bound by Ago-miRNA (Chi et al., 2012; Grosswendt et al., 2014; Helwak et al., 2013; Loeb et al., 2012; Moore et al., 2015) probably due to low binding affinity. In support of this, any mismatches or wobbles in the seed region decrease target binding but enhance the turnover of the RISC complex (Wee et al., 2012), suggesting that non-canonical binding can induce intermediate affinity without affecting the concentration of the Ago complex. Additionally, this may be a mechanism that provides an unbound Ago-miRNA complex to adjacent target sites, as shown by the single-molecule analysis where the lateral diffusion from weakly bound Ago promoted cooperation between neighboring target sites (Chandradoss et al., 2015). As a result, combination of canonical and non-canonical sites may provide a variety of spectra in the regulation of gene expression, enabling a fine-tuning of repression activity. Moreover, relative to canonical seed sites, non-canonical sites have modest sequence conservation across species (Chi et al., 2012; Grosswendt et al., 2014; Loeb et al., 2012; Moore et al., 2015), suggesting that they may be evolutionary intermediates under selective pressure for a shift towards high affinity seed sites. In addition, gene ontology analysis of Ago HITS-CLIP showed that the majority of non-canonical targets have similar functions to the canonical ones, although they are slightly different in detail (Chi et al., 2012; Loeb et al., 2012; Moore et al., 2015). This indicates that non-canonical targets may have a different biological function that needs to be acquired to improve or compensate for the canonical targets.
Although several non-canonical miRNA-target sites were reported as functional, they did not receive much intention since they could not be definitely defined (Brodersen and Voinnet, 2009). However, Ago HITS-CLIP method, which can generate a precise transcriptome-wide map of miRNA target sites (Chi et al., 2009), unexpectedly revealed that large portion of miRNA-target interactions are non-canonical (Chi et al., 2012). Advances in Ago HITS-CLIP analyses further identified non-canonical “nucleation bulges” (Chi et al., 2012) and “seed-like motifs” (Grosswendt et al., 2014; Helwak et al., 2013; Loeb et al., 2012; Moore et al., 2015), expanding our knowledge in the understanding of miRNA targets and their functions. Moreover, the transitional nucleation model, yielded by the analytic process of explaining the pattern of nucleation bulges, offers a general molecular model that can be used to understand the mechanism of miRNA target recognition through seed regions (Chi et al., 2012). Extending this knowledge to applications of RNA silencing, modified siRNAs that contain abasic substitution in the pivot (position 6) were recently developed to completely eliminate miRNA-like off-target repression (Lee et al., 2015; Seok et al., 2016). Biological significance of the non-canonical interactions could be interpreted as evolutional intermediates with slightly distinct functions. However, the biological functions postulated here remain to be corroborated.
Mol. Cells 2016; 39(5): 375-381
Published online May 31, 2016 https://doi.org/10.14348/molcells.2016.0013
Copyright © The Korean Society for Molecular and Cellular Biology.
Heeyoung Seok1, Juyoung Ham2, Eun-Sook Jang3, and Sung Wook Chi1,*
1Division of Life Sciences, College of Life Sciences and Biotechnology, Korea University, Seoul 02841, Korea, 2Department of Health Sciences and Technology, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul 06351, Korea, 3EncodeGEN Co. Ltd., Seoul 06329, Korea
Correspondence to:*Correspondence: chi13@korea.ac.kr
MicroRNAs (miRNAs) are small non-coding RNAs (∼22 nucleotides) regulating gene expression at the post-transcriptional level. By directing the RNA-induced silencing complex (RISC) to bind specific target mRNAs, miRNA can repress target genes and affect various biological phenotypes. Functional miRNA target recognition is known to majorly attribute specificity to consecutive pairing with seed region (position 2?8) of miRNA. Recent advances in a transcriptome-wide method of mapping miRNA binding sites (Ago HITS-CLIP) elucidated that a large portion of miRNA-target interactions
Keywords: argonaute, CLIP, microRNA, non-canonical targets
microRNAs (miRNAs) are single stranded non-coding RNA molecules of ∼22 nucleotides (nt) that regulate gene expression via post-transcriptional and/or translational repression (Ambros, 2004). Primary miRNA (pri-miRNAs) are transcribed in the nucleus by RNA polymerase II or III, where ∼70 nt stem-loop miRNA precursors (pre-miRNAs) are subsequently excised by the microprocessor complex containing the RNase III enzyme Drosha, and exported to the cytoplasm via Exportin-5 (Kim et al., 2009). Dicer, another RNase III enzyme, further processes pre-miRNAs to produce mature miRNAs, the final product being a ∼22 base-pair duplex with 2 nt-long 3′ overhangs (He and Hannon, 2004). Then, one strand of the mature miRNA is loaded onto Argonaute (Ago, also known as Eif2c), a core protein of the RNA-induced silencing complex (RISC). miRNA forms base pairs with a target mRNA as a guide for Ago binding and to direct the specificity of the RISC effector, decreasing target mRNA levels and/or its translation (Fabian et al., 2010), where mRNA destabilization is the dominant mechanism (Eichhorn et al., 2014; Guo et al., 2010).
miRNAs are abundant in the mammalian genome (more than 2000 human miRNAs are currently reported in miRBase) (Kozomara and Griffiths-Jones, 2014) and their regulatory role is essential, affecting various biological phenomena (Kim, 2005; Sim et al., 2014). Supporting evidence derives from the fact that a lethal phenotype during early development was observed in Dicer1-null (Bernstein et al., 2003) or Ago2-null (Liu et al., 2004) mice, and various biological defects were also reported after losses of individual miRNAs (Park et al., 2010). In addition, alterations of miRNA regulation are related to many diseases such as neurological disorders (Hebert and De Strooper, 2009), various types of cancer (Croce, 2009), and cardiovascular diseases (Olson, 2014). Importantly, all of these defects were ultimately caused by a dysregulation in target gene expression. Therefore, identification of miRNA targets is the key for understanding miRNA function. However, the limitation here is our ability to delineate a general principle for identification of specific RNA targets upon which miRNAs act. The problem stems from the observation that most of miRNA target sites have partial complementarity (Ambros, 2004).
In contrast to plants, a near-perfect base pairing of miRNA to its target is rare in animals, making it a challenge to predict the target sites (Bartel, 2009). However, initial prediction attempts provided evidence that local short stretches (≥ 6 nt) of consecutive base-pairing significantly contribute to target recognition (John et al., 2004; Krek et al., 2005; Lewis et al., 2003; Stark et al., 2003). Conceptually termed as the “nucleus,” the short consecutive matches could initiate a miRNA-target duplex, followed by the propagation of partial annealing that may further stabilize miRNA-target hybridization (Filipowicz, 2005; Rajewsky, 2006). Intriguingly, nuclei were further found to be typically located in the 5′ end region of miRNAs called the “seed”, enabling the prediction of miRNA target sites (Lewis et al., 2003).
Functional miRNA-target interactions are known to majorly require as few as 6-nt matches within the seed region (position 2-8, Fig. 1A) (Bartel, 2009). There are possible 6-mers (positions 1?6, 2?7, and 3?8), 7-mers (positions 2?8 and 1?7), and 8-mer (position 1?8) matches in the seed. Otherwise, a 6-mer match to position 3?8 is called an “offset 6-mer seed” because of its position and a marginal effect on repression (Friedman et al., 2009). Such canonical seed sites were initially known by early biological studies (Lee et al., 1993; Poy et al., 2004; Wightman et al., 1993), which were further validated by microarray experiments that detected enrichment of seed matches in miRNA-dependent transcripts showing repression (Grimson, 2007; Lim, 2005), and also by bioinformatics analyses, which found widespread conservation of seed sites in 3′ untranslated regions (3′ UTRs) in multi-genome sequences (Lewis et al., 2005; Xie et al., 2005). Seed-pairing rules have been informative in prediction and analysis of canonical seed sites, often in combination with evolutionary conservation (Friedman et al., 2009; Lewis et al., 2005), secondary structure (Long et al., 2007), or neighboring context information (Grimson, 2007). However, since a 6 nt match presents on average every ∼4,000 nt, likely to be occurred often by chance, such strategies still suffer from both false-positive (∼40?66%) and false-negative predictions (∼50?70%) (Mourelatos, 2008) even in the usage of microarray or proteomic approaches (Baek et al., 2008; Selbach et al., 2008). Furthermore, seed-pairing rules cannot identify non-canonical target sites, which have been reported as functional (Brodersen and Voinnet, 2009).
Since seed-pairing rules are widely adopted, there has been an unintentional bias to study only the canonical seed matches, overlooking the non-canonical targets. Nevertheless, several biological studies have functionally validated that perfectly matched miRNA seeds are neither necessary nor sufficient for all functional miRNA-target interactions (Brodersen and Voinnet, 2009). For example, supplementary components in near-perfect sites compensate for imperfect seed matches and are functional for target cleavage (miR-196 for
When seed-pairing rules were applied, putative miRNA targets from microarrays that showed miRNA-dependent repression were often demonstrated to have high false-negatives, implicating prevalent usage of non-canonical target sites. In lieu of this, microarray analysis of miR-24-transfected K562 cells found that several miR-24 targets are repressed through non-canonical sites, named “seedless” recognition elements (Fig. 1B) (Lal et al., 2009). In addition, “centered sites”, comprising 11?12 consecutive base-pairing to the center of miRNA, were also identified by the analysis of microarray data where neither perfect seed nor 3′ compensatory pairing was observed (Fig. 1B) (Shin et al., 2010). However, the limitation of such studies is that, lacking information on precise binding sites, they are unable to distinguish between direct and indirect miRNA targets.
Uncertainty in direct miRNA target sites necessitates the development of experimental methods capable of recovering miRNAs physically associated with their targets (Easow et al., 2007). Initially, biochemical isolation of miRNA-mRNA complexes via Ago protein-specific immunoprecipitation was attempted in order to purify mRNAs bound by Ago-miRNA (Easow et al., 2007; Hammell, 2008; Hendrickson et al., 2008; Karginov, 2007). However, the integrity of the approach was questioned because of the possible high background caused by nonspecific RNA-protein interactions, especially mediated by
Ago HITS-CLIP was the first to offer a general means of mapping precise miRNA target sites and has been widely applied to cultured cells (Haecker et al., 2012; Hafner et al., 2010; Kim et al., 2015; Kishore et al., 2011; Leung et al.; Loeb et al., 2012; Riley et al., 2012a; Xue et al., 2013), tissues (Boudreau et al., 2014; Chi et al., 2009; Kameswaran et al., 2014), and even to a whole organism (
Non-canonical miRNA-target sites called “nucleation bulges” were identified by analyzing Ago HITS-CLIP “orphan clusters” (Chi et al., 2012). Initially, G-bulge sites for miR-124 were found to be abundant in the mouse brain, where the target sites matched to the seed (positions 2?7) contained a bulged-out G nucleotide corresponding to position between 5 and 6 of the miRNA (miR-124 for
Application of the pivot pairing rule successfully decoded the non-canonical nucleation bulge sites, comprising ≥15% of all Ago-miRNA-mRNA interactions in the mouse brain (Chi et al., 2012). Nucleation bulge sites were also observed in Ago HITS-CLIP analyses performed in the human brain (Boudreau et al., 2014) and several cell lines (Hafner et al., 2010). In addition, their sequences are evolutionally conserved (Chi et al., 2012). The pivot pairing rule improved both quality and quantity of miRNA target sites in their identification (Stefani and Slack, 2012) since it can serve as a general rule that can be incorporated in any computational analysis (Kim et al., 2013).
In addition to nucleation bulges, “seed-like motifs” that contain mismatches in seed pairing were found by examining differential Ago HITS-CLIP binding sites in miR-155 deficient T cells (miR-155 for
To explain the pivot pairing rule, a hypothetical phase named “transitional nucleation state” was proposed (Fig. 2) (Chi et al., 2012). Combining the concepts of a “nucleus” (Filipowicz, 2005; Rajewsky, 2006; Tomari and Zamore, 2005) and findings from structural studies for recognition mechanisms of Ago silencing complexes - nucleation, propagation and cleavage of target RNAs (Schirle et al., 2014; Wang et al., 2009), transitional nucleation is defined as a transient miRNA-target duplex with a 5-base-paired nucleation (position 2?6) (Fig. 2B). If the transitional nucleation becomes sufficiently stable to form, this state may be further transformed into a bulge formation where the originally matched pivot nucleotide in position 6 becomes bulged-out and subsequently extended to hybridization towards the 3′ end of the miRNA (further than position 6, Fig. 2) (Chi et al., 2012). This model is also well supported by several structural studies of Ago (Elkayam et al., 2012; Nakanishi et al., 2012; Schirle and MacRae, 2012) where nucleotides poised for transitional nucleation (position 2?6) are particularly prearranged becoming A-form helical structures, which are susceptible for base pairing. Intriguingly, such A-form-like helical geometry is disrupted after a pivot (between position 6 and 7) formed by a kink resulting from the insertion of the amino acid isoleucine (I365) from the human Ago2 protein (Elkayam et al., 2012; Schirle and MacRae, 2012). Thus, in theory, any target site pairing to the seed region (either a seed match or a nucleation bulge) requires a shift of this nuclear helix to overcome the kink. In support of this model, single-molecule analysis showed such stepwise processes whereby Ago2 initially scans for target sites using a small region (position 2?4) (Chandradoss et al., 2015) and subsequently mediates a rapid and stable binding to the seed region of a miRNA (Jo et al., 2015; Salomon et al., 2015), serving as a proofreading procedure for target recognition (Yao et al., 2015).
miRNA is reshaped by loading onto Ago, being divided into several functional domains-the anchor, seed, central, 3′ supplementary, and tail regions (Fig. 3) (Salomon et al., 2015; Schirle et al., 2014; Wee et al., 2012). Importantly, the seed region has two prearranged continuous base stacking configurations (positions 2?6 and 7?9) caused by kinks at nucleotides 6?7 and 9?10 (Fig. 3A) (Elkayam et al., 2012; Schirle and MacRae, 2012; Schirle et al., 2014). Therefore, transitional nucleation starts pairing through helix 2?6 and subsequently propagates to helices 7?9, overcoming the kink at 6?7 for cases of 5′ dominant interactions (Fig. 3B), such as the seed (Bartel, 2009) or nucleation bulge sites (Chi et al., 2012). The opposite may also happen for central dominant interactions (Shin et al., 2010) - the interaction could be initiated by paring through helix 7?9 along with the central region (positions 10?12) and further expand up to the 3′ supplementary region (positions 13?16) (Fig. 3C). In this case, crossing the barrier of the kink at position 9?10 may be required. For “seed-like motifs”, where seed sites contain mismatches, deletions, or wobble pairings (Grosswendt et al., 2014; Helwak et al., 2013; Loeb et al., 2012; Moore et al., 2015), transitional nucleation may require 3′ compensatory interactions (Fig. 3D), which could be a general determinant of additional specificity for Ago binding as shown by CLEAR-CLIP (Moore et al., 2015).
Biological systems initially generate marginally effective non-canonical regulations when existing biological mechanisms require alternative strategies to compensate for what major canonical pathways have been unable to accomplish. Followed by this notion, majority of non-canonical miRNA-target sites were shown to mediate gene repression at a modest level (Chi et al., 2012; Helwak et al., 2013; Lal et al., 2009; Loeb et al., 2012; Moore et al., 2015) only except for centered sites (Shin et al., 2010), which can trigger slicing activity of Ago but only exist as few in whole transcriptome. However, such modest repression, shown by which the most of non-canonical target sites including nucleation bulges for miRNAs, was often observed as insignificant in large-scale gene expression analyses (Agarwal et al., 2015). These were possibly because the marginal repression was confounded by the issue of cellular heterogeneity, variability derived from secondary effects of target repression, and differences in sensitivity and threshold used in the analyses, or non-canonical sites identified by Ago CLIP based methods could be the true interaction but may not be always functional as the consequence of transient bindings for searching targets (Chandradoss et al., 2015) or as requiring combinatorial occurrences of target sites (Krek et al., 2005). In fact, evaluation of target repression at individual single cell level (Moore et al., 2015) and gene expression analyses in combination with CLIP data (Chi et al., 2012; Grosswendt et al., 2014; Helwak et al., 2013; Loeb et al., 2012; Moore et al., 2015) did observe significant repression mediated by non-canonical sites albeit the effect is still marginal. Since CLIP data only indicate the direct bindings, they should be analyzed together with gene expression data to access the functionality. Future studies should be performed carefully to clear out such issues whenever they analyze marginal effects from non-canonical interactions.
The modest effects from the widespread non-canonical sites are likely to be caused by reduced numbers of targets bound by Ago-miRNA (Chi et al., 2012; Grosswendt et al., 2014; Helwak et al., 2013; Loeb et al., 2012; Moore et al., 2015) probably due to low binding affinity. In support of this, any mismatches or wobbles in the seed region decrease target binding but enhance the turnover of the RISC complex (Wee et al., 2012), suggesting that non-canonical binding can induce intermediate affinity without affecting the concentration of the Ago complex. Additionally, this may be a mechanism that provides an unbound Ago-miRNA complex to adjacent target sites, as shown by the single-molecule analysis where the lateral diffusion from weakly bound Ago promoted cooperation between neighboring target sites (Chandradoss et al., 2015). As a result, combination of canonical and non-canonical sites may provide a variety of spectra in the regulation of gene expression, enabling a fine-tuning of repression activity. Moreover, relative to canonical seed sites, non-canonical sites have modest sequence conservation across species (Chi et al., 2012; Grosswendt et al., 2014; Loeb et al., 2012; Moore et al., 2015), suggesting that they may be evolutionary intermediates under selective pressure for a shift towards high affinity seed sites. In addition, gene ontology analysis of Ago HITS-CLIP showed that the majority of non-canonical targets have similar functions to the canonical ones, although they are slightly different in detail (Chi et al., 2012; Loeb et al., 2012; Moore et al., 2015). This indicates that non-canonical targets may have a different biological function that needs to be acquired to improve or compensate for the canonical targets.
Although several non-canonical miRNA-target sites were reported as functional, they did not receive much intention since they could not be definitely defined (Brodersen and Voinnet, 2009). However, Ago HITS-CLIP method, which can generate a precise transcriptome-wide map of miRNA target sites (Chi et al., 2009), unexpectedly revealed that large portion of miRNA-target interactions are non-canonical (Chi et al., 2012). Advances in Ago HITS-CLIP analyses further identified non-canonical “nucleation bulges” (Chi et al., 2012) and “seed-like motifs” (Grosswendt et al., 2014; Helwak et al., 2013; Loeb et al., 2012; Moore et al., 2015), expanding our knowledge in the understanding of miRNA targets and their functions. Moreover, the transitional nucleation model, yielded by the analytic process of explaining the pattern of nucleation bulges, offers a general molecular model that can be used to understand the mechanism of miRNA target recognition through seed regions (Chi et al., 2012). Extending this knowledge to applications of RNA silencing, modified siRNAs that contain abasic substitution in the pivot (position 6) were recently developed to completely eliminate miRNA-like off-target repression (Lee et al., 2015; Seok et al., 2016). Biological significance of the non-canonical interactions could be interpreted as evolutional intermediates with slightly distinct functions. However, the biological functions postulated here remain to be corroborated.
Hyeonseo Hwang, Hee Ryung Chang, and Daehyun Baek
Mol. Cells 2023; 46(1): 21-32 https://doi.org/10.14348/molcells.2023.2157Chaehwan Oh, Dahyeon Koh, Hyeong Bin Jeon, and Kyoung Mi Kim
Mol. Cells 2022; 45(9): 603-609 https://doi.org/10.14348/molcells.2022.0056Woo Ryung Kim, Eun Gyung Park, Kyung-Won Kang, Sang-Myeong Lee, Bumseok Kim, and Heui-Soo Kim
Mol. Cells 2020; 43(11): 953-963 https://doi.org/10.14348/molcells.2020.0177