ARTICLE
Received 4 May 2015 | Accepted 12 Oct 2015 | Published 25 Nov 2015
Michael J. Moore1, Troels K.H. Scheel2,3,4, Joseph M. Luna1,2, Christopher Y. Park1,5, John J. Fak1, Eiko Nishiuchi2, Charles M. Rice2 & Robert B. Darnell1,5
microRNAs (miRNAs) act as sequence-specic guides for Argonaute (AGO) proteins, which mediate posttranscriptional silencing of target messenger RNAs. Despite their importance in many biological processes, rules governing AGOmiRNA targeting are only partially understood. Here we report a modied AGO HITS-CLIP strategy termed CLEAR (covalent ligation of endogenous Argonaute-bound RNAs)-CLIP, which enriches miRNAs ligated to their endogenous mRNA targets. CLEAR-CLIP mapped B130,000 endogenous miRNAtarget interactions in mouse brain and B40,000 in human hepatoma cells. Motif and structural analysis dene expanded pairing rules for over 200 mammalian miRNAs. Most interactions combine seed-based pairing with distinct, miRNA-specic patterns of auxiliary pairing. At some regulatory sites, this specicity confers distinct silencing functions to miRNA family members with shared seed sequences but divergent 30-ends. This work provides a means for explicit biochemical identication of miRNA sites in vivo, leading to the discovery that miRNA 30-end pairing is a general determinant of AGO binding specicity.
1 Laboratory of Molecular Neuro-Oncology and Howard Hughes Medical Institute, The Rockefeller University, 1230 York Avenue, Box 226, New York, New York 10065, USA. 2 Laboratory of Virology and Infectious Disease, Center for the Study of Hepatitis C, The Rockefeller University, New York, New York 10065, USA. 3 Copenhagen Hepatitis C Program (CO-HEP), Department of Infectious Diseases and Clinical Research Centre, Copenhagen University Hospital, 2650 Hvidovre, Denmark. 4 Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark. 5 New York Genome Center, 101 Avenue of the Americas, New York, New York 10013, USA. Correspondence and requests for materials should be addressed to M.J.M. (email: mailto:[email protected]
Web End [email protected] ) or to R.B.D. (email: mailto:[email protected]
Web End [email protected] ).
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 1
& 2015 Macmillan Publishers Limited. All rights reserved.
DOI: 10.1038/ncomms9864 OPEN
miRNAtarget chimeras reveal miRNA 30-end pairing as a major determinant of Argonaute target specicity
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864
microRNAs (miRNAs) are small, non-coding RNAs that mediate posttranscriptional RNA silencing by sequence-specic targeting of Argonaute (AGO) proteins to
mRNAs1. miRNAs regulate the development, homeostasis and pathologies of virtually all vertebrate tissues. Many miRNAs have specic or enriched expression in the central nervous system, regulating such diverse processes as neuronal differentiation, excitation, synaptogenesis and plasticity2. Accordingly, miRNA dysregulation is implicated in neurological disorders and many cancers including glioma and liver cancer35. However, miRNA function in these contexts remains unclear, as most in vivo mRNA targets are unknown.
Accurate miRNA target identication remains a formidable challenge6. Canonical miRNA binding involves base pairing of the miRNA seed region (nucleotides 28) to complementary target sites7,8. Such short motifs occur frequently in the transcriptome and are not sufcient to predict miRNA binding, leading to high false discovery rates for purely bioinformatic predictions9. To mitigate this limitation, evolutionary conservation and local AU sequence content are employed as screens for site functionality and accessibility, respectively7,10. However, the importance of non-conserved miRNA regulation, especially in the brain11, and limitations of context predictions without empirical binding information are well established12. Moreover, the assumption of uniform rules for all miRNAs ignores non-canonical miRNA binding, increasingly recognized as widespread1315. Rules beyond seed-based pairing such as supplementary pairing of miRNA 30-bases 1217 have been described but are generally considered rare1618. Other non-canonical binding modes include 30-end centric seedless
pairing19,20, centred miRNA pairing21 and nucleation bulges in the seed region13.
Empirical mapping of miRNA target sites in vivo was rst achieved with ultraviolet cross-linking and immunoprecipitation with high-throughput sequencing (HITS-CLIP) of AGO proteins2224. AGO HITS-CLIP generates two data setsa transcriptome-wide target binding map and an empirical catalogue of expressed miRNAsthat empower accurate identication of functional miRNA-binding sites. However, the inability to link miRNA and target unambiguously remains a limitation. Two groups reported experimental strategies to ligate miRNA to target RNA in puried AGO complexes. CLASH (cross-linking and sequencing of hybrids) identied thousands of miRNAtarget chimeras using dual-tagged AGO1 in HEK-293T cells, revealing frequent seed-independent miRNA binding19,25. Soon after, modied photoactivatable ribonucleoside-enhanced CLIP identied B3,600 unambiguous events in Caenorhabditis elegans26. Although identifying thousands of novel interactions, the reliance of these studies on exogenous AGO expression excludes them from analysis of human tissues and, currently, in vivo mouse models, and raises concerns about the stoichiometry of RNA-binding events.
We have developed modications of AGO HITS-CLIP, termed CLEAR (covalent ligation of endogenous Argonaute-bound RNAs)-CLIP, permitting isolation of miRNAtarget chimeras from endogenous AGOmiRNAmRNA complexes. CLEAR-CLIP identies tens of thousands of miRNA target sites in mouse brain including novel targets for many neuron-specic miRNAs. In mouse brain and human liver cells, we dene expanded pairing rules for over 200 mammalian miRNAs illustrating widespread use of miRNA 30-end auxiliary pairing in vivo and tolerance of diverse, although constrained, pairing patterns for many miRNAs. Integrated with HITS-CLIP binding information, CLEAR-CLIP provides an improved empirical basis for identication of physiologic canonical and non-canonical miRNA regulation.
ResultsCLEAR-CLIP denes miRNAtarget interactions in vivo. We modied AGO HITS-CLIP to facilitate direct ligation of miRNA and target RNA. Endogenous AGORNA complexes were puried from ultraviolet-irradiated mouse brain neocortex using monoclonal anti-AGO and were washed in stringent conditions that disrupt native AGOmRNA interactions (Fig. 1a)22,27. Complexes were treated with dilute RNAse to generate footprint-sized fragments. To test whether T4 RNA ligase I treatment could join free RNA ends, AGORNA was radiolabelled with polynucleotide kinase (PNK) and 32P-g-ATP, then treated with RNA ligase.
Complexes were treated with alkaline phosphatase and visualized by SDSpolyacrylamide gel electrophoresis (PAGE) and autoradiography to assess dephosphorylation. Compared with untreated samples, ligase-treated complexes were protected from dephosphorylation, indicating ligation of RNA ends (Supplementary Fig. 1a). Using optimized ligation conditions, 12 biological replicates from post-natal day 13 (P13)-aged mouse neocortex were prepared, along with two no-ligase control samples omitting RNA ligase I treatment. Pre-adenylated 30-adapter was added on-bead with truncated RNA ligase 2, which cannot catalyse standard RNARNA ligation28. Isolation, cloning and sequencing of AGO-bound RNA tags retrieved hundreds of thousands of miRNAtarget chimeric reads in addition to standard target and miRNA fragments (Supplementary Table 1). We termed this method CLEAR-CLIP.
CLEAR-CLIP yielded miRNAtarget chimeras in two orientations, termed miR-rst and miR-last based on the position of miRNA and target fragments (Fig. 1a). Most chimeras contained full-length miRNAs and miR-rst chimeras were on-average 14-fold more frequent than miR-last. Uniquely mapped miR-rst chimeras were B1.55% of total unique reads in ligase-treated samples, but only B0.20.3% in no-ligase samples. miR-last chimeras were B0.050.2% of unique reads, irrespective of ligase treatment. Thus, most miR-rst chimeras were dependent on exogenous ligase but miR-last chimeras were not. Importantly, chimeric and non-chimeric mRNA target sequences could
Figure 1 | CLEAR-CLIP unambiguously identies endogenous in vivo miRNAtarget interactions. (a) In CLEAR-CLIP, AGOtarget contacts are cross-linked in vivo by ultraviolet irradiation. Endogenous AGO is immunopuried from tissue lysates and washed under stringent conditions that disrupt the interaction of AGOmiRNA with non-cross-linked target RNAs. Target regions cannot be cloned from no-ultraviolet controls, indicating that cross-linking of AGO to target mRNA (shown as X) is required. Cross-linking of the miRNA may not be necessary, because the AGOmiRNA interaction is uniquely strong and survives stringent washing. After washing, RNA ends are modied to facilitate miRNAtarget ligation and joined with T4 RNA Ligase I treatment, yielding miRNAtarget chimeric RNAs in two orientations at the indicated frequencies. All depicted post-IP manipulations up to SDSPAGE occur on beads. Correlation plots of miRNA abundance of all miR-rst (b) and miR-last (c) chimeras versus small RNA sequencing data in the brain67. Pearsons correlation coefcients (r) are shown. CDF plots of cognate miRNA seed matches in target regions relative to ligation site for all miR-rst chimeras in plus-ligase (d) and no-ligase (e) samples, and for all miR-last chimeras in plus-ligase (f) and no-ligase (g) samples. (h) Distribution of standard AGO CLIP and miRNA target chimeras in transcript regions. (i) CLEAR-CLIP conrmed known miRNA regulation, here exemplied by miR-124 regulation of the Ptbp1 30-UTR.
Other examples are shown in Supplementary Fig. 3c.
2 NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864 ARTICLE
not be cloned from no-ultraviolet controls, indicating that in vivo AGOmRNA ultraviolet cross-linking was strictly required for CLEAR-CLIP.miRNA frequency in miR-rst chimeras correlated with brain miRNA abundance (Fig. 1b and Supplementary Fig. 1bd).
miR-rst chimeras were dominated by a small number brain-abundant miRNAs (Supplementary Fig. 1e). In contrast, miR-last chimeras did not correlate to miRNA abundance and were dominated by dubiously annotated miRNAs (Fig. 1c and Supplementary Fig. 1d). Target regions in miR-rst chimeras
miR-first chimeras miR-last chimeras
12
16 r = 0.72
r = 0.019
14
UV irradiation of live brain tissue
10
Chimera abundance
log 2(counts)
CDF: seed presence
CDF: seed presence
Chimera abundance
log 2 (counts)
12
10
8
8
6
6
4
4
2
2
4
6 8 10 12 14 16 18 4
2 6 8 10 12 14 16 18
miRNA abundance log2 (miRNA-seq)
miRNA abundance log2 (miRNA-seq)
Canonical seed matches in miR-first, plus-ligase
1. Tissue lysis, DNAse
1. 5 Phosphorylation
1. 3 Dephosphorylation (CIP)
2. 3 linker addition (on-bead, truncated) RNA ligase 2, pre-adenylated linker)
2. RNAse treatment 3. AGO IP4. High stringency washes
Purified AGO ternary complex
x=cross-link
0.5
0.5
CDF: seed presenceCDF: seed presence
6mer 6mer offset Total
8mer
6mer 6mer offset Total
0.4
8mer 7merA1 7merm8
0.4
7merA1 7merm8
0.3
0.3
P
0.2
0.2
5 HO
HO P
Ligation(T4 RNA ligase I)
P
3
x
x
0.1
0.1
0 200 100
200 100 200 100
0 100 200
0 200 100
0 100 200
(PNK, ase minus)2. Second stringent wash sequence
P
3
Distance from ligation point (nts)
Distance from ligation point (nts)
Canonical seed matches in miR-last, plus-ligase
Canonical seed matches in miR-first, no-ligase
canonical seed matches in miR-last, no-ligase
0.5
0.5
P
0.4
0.4
5 3
5
P
8mer
6mer 6mer offset Total
7merA1 7merm8
7merA1 7merm8
HO
8mer
6mer 6mer offset Total
0.3
0.3
0.2
0.2
0.1
0.1
0
0
0 100 200
0 100 200
Major product (>90%) Minor product (<10%)
Distance from ligation point (nts)
Distance from ligation point (nts)
Dual/other
3-UTR
CDS CDS
3-UTR/ 3-UTR/
Dual/other
P Px P HO x
3. PNK radiolabelling
SDSPAGE, nitrocellulose transfer, cloning per published HITS-CLIP protocol
5 3
miRNAtarget
5 3
targetmiRNA
Deep intergenic(5.6%)
(4.6%)
Deep intergenic(6.9%)
(4.6%)
3-UTR
(27.6%)
(26.3%)
Intron (41%)
Intron (36%)
other (3.5%)
other (3.2%)
(17.5%)
(23.2%)
AGO CLIP reads
AGO CLIP reads
miRNA-target chimeras
Major miR-first chimeras Minor miR-last chimeras
Unique chimeras Clustered events
39,413 10,458 6,619
363,953 166,580 130,120
Mapped chimeras
miR-124 miR-30c miRNAtargetchimeric reads
Ptbp1 3-UTR
miR-124
miR-124
miR-296 miR-9
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 3
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864
were also strongly enriched for canonical seed matches to their cognate miRNAs (Fig. 1d). Seed enrichment occurred within B75 nt of the miRNA ligation junction in the expected downstream (30) region, but not the upstream region (50) (Fig. 1d). Consistent with prior ndings, chimeras were present at low levels in no-ligase samples26, although with reduced seed enrichments (Fig. 1e). For miR-last chimeras, the reversed pattern of seed distribution around the ligation junction was expected; however, this pattern was weak in ligase-treated samples and was absent in no-ligase samples (Fig. 1f,g). As they better reected miRNA abundance and known miRNA targeting features, we focused exclusively on miR-rst chimeras (henceforth chimeras).
Notably, many CLEAR-CLIP target regions lacked canonical seed matches (Fig. 1d), consistent with similar analyses19,26. We took two approaches to assess miRNA ligation to non-cross-linked targets, which could falsely identify non-physiologic interactions. First, we tested chimera ligation after denaturing AGO complexes in 6M guanidine hydrochloride, as in CLASH19. Interactions from denatured samples were similar to other samples based on miRNA seed match frequency, indicating bona de interactions. However, compared with other samples, the yield of chimeric and non-chimeric CLIP reads was low (Supplementary Table 1) and skewed to non-genic sites (Supplementary Fig. 1f); thus, we pursued it no further.
Second, we performed mixing experiments to assess miRNA ligation to non-target sequences after postlysis re-association. CLEAR-CLIP was done on lysates from cross-linked mouse cortex mixed with Escherichia coli total RNA, which contains thousands of potential miRNA sites by random chance at a per-nucleotide frequency comparable to mouse. For two replicates each, equal mass amounts of mouse and E. coli RNA or a large excess of E. coli RNA (sixfold) were mixed. We conrmed that E. coli RNA was not degraded in brain lysates (Supplementary Fig. 2). Across four mouse-only control samples, 1% of chimeric CLIP reads mapped to the E. coli genome, establishing the background from cross-mapped reads and minute RNA contaminants from commercial enzymes29 (Supplementary Table 2). Average E. coli mapping rates were 1.9% in equal-mixture samples and 5.2% in excess-mixture samples. To examine a more complex competitor RNA pool, we performed CLEAR-CLIP on mixed lysates from ultraviolet-irradiated mouse brain and non-cross-linked Drosophila S2 cells containing equal amounts of RNA. Here, 0.7% of mouse-only chimeric sequences mapped to the Drosophila genome compared with2.9% of mixed mouse/y samples (Supplementary Table 2). Collectively, these experiments indicate low (o5%) false discovery comparable to related methods19.
CLEAR-CLIP enhances the brain miRNA regulatory map. Chimeras with the same miRNA and overlapping genomic coordinates were clustered to yield 130,120 brain miRNAtarget interactions (Fig. 1a and Supplementary Data 1). Seventy-nine per cent (102,882) of interactions were also supported by nonchimeric AGO CLIP reads. We combined chimeric CLEAR-CLIP reads with conventional CLIP reads from 15 total biological replicates, to generate an enhanced brain miRNA regulatory map. We identied 96,685 AGO peaks supported in at least 5 mice, dened as biological complexity (BC)Z5 (Supplementary Data2)22. Twenty-seven per cent of BCZ5 peaks (26,304) had chimera support unambiguously identifying the miRNA(s) and this proportion increased substantially for peaks with greater BC (Supplementary Fig. 1g). Consistent with our prior studies, B20% of brain AGO peaks were orphans lacking 6mer seed matches for the 35 most abundant miRNA families22. Chimera data linked miRNAs to 6,136 (B28%) orphan peaks,
disambiguating thousands of biologically robust non-canonical miRNA-binding sites.
Chimera-dened interactions and non-chimeric AGO CLIP reads were similarly distributed in the transcriptome (Fig. 1h). In addition to 30-untranslated region (UTR) and coding DNA sequence (CDS) sites, chimeras identied many intronic sites with miRNA-dependent AGO binding (Supplementary Fig. 3a)3032. Intronic interactions were not previously reported for CLASH in 293T cells, because reads were only aligned against mature transcripts19. Our alignment of raw CLASH data against a genomic reference recovered many intronic (B15%) and other non-30-UTR sites (460%), independently conrming such binding. To examine whether annotated intronic interactions in the brain fall in mis-annotated exons, we examined polyA
RNA sequencing from age-matched mouse cortex33. As polyA selection strongly enriches mature transcripts, introns show much lower coverage than coding or 30-UTR exons. Accordingly, chimera-identied intronic sites showed low RNA sequencing coverage relative to exonic sites (Supplementary Fig. 3b). For comparison, binding sites for NOVA and RBFOX in the brain, which also bind intronic and exonic sequences, showed similar patterns34,35.
CLEAR-CLIP retrieved known miRNA regulatory sites (Fig. 1i and Supplementary Fig. 3c) and functions for well-characterized neuronal miRNAs, such as miR-124 and miR-9, in neuron development, synapse formation and axon guidance (Supplementary Data 3)22,36,37. Gene Ontology analysis indicated neuronal regulatory functions for less-characterized brain miRNAs, including miR-26 (for example, axon development and locomotion), miR-138 (neurotransmitter transport and secretion, and calcium transport) and miR-9* (cell migration and motility; Supplementary Data 3). In addition, Kyoto Encyclopedia of Genes and Genomes (KEGG) database analysis recovered known associations of miR-124, miR-9 and miR-26 with glioma, including known and many novel targets (Supplementary Fig. 4).
CLEAR-CLIP-identied sites are functional. Chimera-identied sites from the brain are functional in global analyses of miRNA perturbation. For brain polyribosome-associated mRNAs from miR-128 knockout (KO) and wild-type (WT) mice, the presence of miR-128 chimeras in transcript 30-UTRs correlated with enhanced polysome association in miR-128 KO brain (Fig. 2a)2. Sites with canonical seed matches and non-canonical sites predicted signicant de-repression (Fig. 2b).
More detailed analysis was possible for miR-124 due to the large number of identied sites. In CAD neuroblastoma cells transfected with miR-124 mimic, the presence of miR-124 chimeras in 30-UTRs in mouse brain correlated with repressed transcript levels compared with control cells (Fig. 2c)38. Chimera sites identied once (cluster size, N 1) predicted signicant
regulation and sites identied multiple times (N41) or overlapping AGO CLIP peaks conferred stronger repression.
Consistent with our prior studies, AGO peaks encompassing miR-124 seed matches predicted signicant transcript repression in miR-124-transfected cells (Fig. 2d)22. Critically, when such peaks overlapped miR-124 chimeras, repression was signicantly greater. Thus, chimera information improved identication of functional miRNA sites in vivo. To examine different types of miR-124 sites, we dened mutually exclusive sets of transcripts possessing only chimera-dened canonical miR-124 sites or only non-canonical sites. Canonical sites correlated with signicant transcript repression (Fig. 2e). Non-canonical sites predicted only a small shift in RNA levels (Fig. 2f) due largely to bulged 8mer miR-124 sites, the only non-canonical group predicting
4 NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864 ARTICLE
Polysome association miR-128 KO vs WT
Polysome association miR-128 KO vs WT
0.0 0.4 0.2 0.0 0.2 0.4
1.0
1.0
0.8
0.8
P = 0.031 P = 0.018
P = 0.0025
0.6
0.6
CDF
CDF
0.4
Chimera sites (n)
0.4
Chimera sites (n)
Non-miR-128 (9,036) miR-128 (335)
Non-miR-128 (9,036)
0.2
0.2
miR-128 canon. (110) miR-128 non-canon. (225)
0.0 0.4 0.2 0.0 0.2 0.4
Fold change (log2)
Fold change (log2) Fold change (log2)
Fold change (log2)
RNA levels miR-124 transfected vs control
RNA levels miR-124 transfected vs control
0.0 1.0 0.5 0.0 0.5 1.0
1.0
1.0
0.8
P = 1.13e11 P = 4.82e14 P = 1.55e34
0.8
P = 6.7e22 P = 2.4e52 P = 6.0e37
0.6
Sites (n)
0.6
Sites (n)
CDF
CDF
0.4
Non-miR-124 chim. (5,572) miR-124 single chim. (1,298) miR-124 cluster chim. (280) miR-124 chim.+peak (1270)
Peak no miR-124 seed (21,929) miR-124 chim. (1,578)
Peak + miR-124 seed (2,082) Peak + miR-124 seed + chim. (406)
0.4
0.2
0.2
0.0 1.0 0.5 0.0 0.5 1.0
RNA levels miR-124 transfected vs control
RNA levels miR-124 transfected vs control
1.0
1.0
0.8
P = 1e14P = 5.4e19 P = 4.1e05 P = 0.0018 P = 1.1e09
Chimera sites (n)
0.8
P = 0.0081 P = 0.0056
Non-miR-124 (5,572)
0.6
0.6
Chimera sites (n)
CDF
CDF
0.4
miR-124 8mer (39) miR-124 7mer-m8 (103)
miR-124 7mer-A1 (65) miR-124 6mer (114) miR-124 6mer-off (73)
0.4
Non-miR-124 (5572) miR-124 non-canon. (1,160)
miR-124 bulged 8mer (44)
0.2
0.2
0.0 1.0 0.5 0.0 0.5 1.0
0.0 0.50 0.25 0.00 0.25 0.50
Fold change (log2)
Fold change (log2)
Figure 2 | miRNAtarget chimeras identify functional interactions. (a) Polyribosome association in miR-128 KO versus WT mouse brain2 plotted as a CDF for 30-UTR sites identied with miR-128 chimeras (red) and non-miR-128 30-UTR chimeras (black). (b) CDF as in a shown for canonical (blue) and non-canonical (orange) miR-128 sites. (c) Fold change in mRNA levels in CAD cells transfected with miR-124 mimic versus control38 are plotted as a CDF for 30-UTR sites identied with miR-124 chimeras in the brain. miR-124 sites identied once (red), multiple times (blue) or overlapping AGO CLIP peaks (magenta) are shown compared with non-miR-124 sites (black). (d) CDF of 30-UTR miR-124 sites as in c, showing miR-124 sites identied with chimeras (violet), peaks overlapping miR-124 seed matches (cyan) or peaks overlapping both seeds and miR-124 chimera(s) (orange). (e) CDF plots for transcripts with only chimera-dened canonical 30-UTR miR-124 sites, broken down by site type. (f) CDF as in e for all 30-UTR non-canonical sites (green) and bulged 8mer sites (cyan). In all panels, P-values from KolmogorovSmirnov testing comparing coloured subsets with control (black) sites are shown, along with the number of sites (n) in each set.
signicant transcript repression in this data set. These analyses show that AGO HITS-CLIP maps supplemented with chimera data improved identication of functional miRNA target sites, including specic non-canonical sites.
Diverse miRNAmRNA pairing patterns. In addition to canonical sites, motif searches allowing expanded seed match variants revealed a high proportion of single mismatch and bulged sites (430% together), and many (B20%) lacking appreciable seed homology (Fig. 3a). These patterns were similar across different transcript regions, showing that CDS and intronic AGO targeting follows similar rules to 30-UTR binding. For chimera clusters of increasing sizes (N) and chimeras overlapping AGO peaks, canonical sites were slightly enriched (Fig. 3b). Similar canonical motifs were used by all miRNAs but relative frequencies varied (Fig. 3c).
We determined overlap of chimera-dened sites with Target-Scan predictions, a purely bioinformatic approach, for six abundant brain miRNA families7. Chimera-identied sites in 30-UTRs for a given miRNA were much more likely to overlap
TargetScan-predicted sites for that miRNA than random control sites (Fig. 3d). Nonetheless, TargetScan supported only a minority of chimera-dened sites and concordance varied for different
miRNAs. A major source of discrepancy was the preponderance of 6mer and imperfect seed match variants in chimera-identied binding sites, functional categories not present in TargetScan. Detailed analysis of imperfect seed sites conrmed established patterns, such as the miR-124 target G bulge between miRNA positions 5 and 6 (Fig. 3e)13. Other motifs revealed strong miRNA-specic preferences for the location of bulged miRNA or target nucleotides (Fig. 3e and Supplementary Fig. 5a,b). Notably, 22 of the top 25 brain miRNAs disallowed bulging at one or more sites, most often position 5 (16/25). These preferences identify specic single-nucleotide target deletions that, presumably by forcing unfavourable miRNA bulges, should effectively abolish AGO binding and regulation. Compared with bulged motifs, seed mismatches were more evenly distributed and showed less miRNA-specic variation (Fig. 3e and Supplementary Fig. 5c). An exception was GU wobble interactions, which showed strong preferences such as miR-30 position 3 (Supplementary Fig. 3d).
Unbiased de novo motif analysis of chimera target regions identied strong enrichment of seed-complementary motifs (Fig. 3f)39. miRNAs without signicant seed binding were mostly low-abundance, often passenger-strand isoforms, which could be affected by sampling error. In addition, many miRNA targets had
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 5
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864
0.20.18
Fraction overlapping
TargetScan
8mer
7merm8
7merA1
6mer
5mer
Mismatch
Bulge
None
0.16
6mer off.
0.14
0.12
Overlap with TargetScan sites for same miRNA
Overlap with control TargetScan sites
Faction
0.1
0.08
0.06
3-UTR
0 0.2 0.4 0.6 0.8 1
5
0.04
0.02
CDS
0
Intron
miR-30
miR-26
miR-124
miR-128
miR-9
let-7
Other
All
Seed mismatch position miRNA bulge position Target bulge position
1
1
All
miR-124
miR-9
1
All
miR-124
miR-9
0.8
Fraction sites
All miR-124
miR-9
0.8
0.8
Fraction
0.6
0.6
0.6
0.4
0.4
0.4
Peaks
0 0.2 0.4 0.6 0.8 1
0.2
0.2
0.2
1
No. of chimeras
supporting site
0 3 4 5 6 7
0 3 4 5 6 7
0 2,3 3,4 4,5 5,6 6,7
2
A G G C AU U U G G A G G C A
U U U G G A/U G A
C G U A
3
1 24
miRNA position
4
miR124
miR9
miR26a
let7a
let7b
let7c
let7i
miR128
miR181a
miR30a
miR30b
miR30c
miR30d
0 10 20 30 40 50 % Motif presence
miR30e
miRNAs
miR138
miR125a
miR125b
miR16
miR9*
miR23a
miR23b
miR219
miR103
All
0 0.2 0.4 0.6 0.8 1
Fraction
Figure 3 | Motif analysis reveals miRNA binding dependent on seed and auxiliary pairing. The proportion of chimera-dened target regions with the indicated seed variants is plotted, broken down (a) by transcript region, (b) by the number of times interactions were identied with chimeras (N) or whether chimeras overlapped AGO CLIP peaks and (c) for the most abundant miRNA families in mouse brain, ranked from the top by decreasing abundance. (d) Overlap of 30-UTR chimera-identied sites in the brain with TargetScan predicted sites for the same miRNA (red) or three equally sized random control sets of TargetScan sites (black). Control sets were restricted to the top 20 brain miRNAs. Only target sites in mRNAs with detectable expression in the cortex were considered. (e) The distributions of mismatched and bulged nucleotides for chimera-identied sites with imperfect seed motifs are plotted for the top 25 mouse brain miRNAs (black), miR-124 (red) and miR-9 (blue). Error bars show the s.d. at each position for the top 25 miRNAs in the brain. miRNA seed sequences for miR-124 and miR-9 are shown below mismatch and miRNA bulge plots. Below the target bulge plot, the most frequently bulged target nucleotide at the indicated position is shown when strong preferences (450% of sites) were apparent. Sites from all transcript regions were included in this analysis. (f) De novo analysis of cognate miRNA-complementary-enriched 7mer motifs in all chimera target regions plotted as a heat map across the miRNA. Each line represents one miRNA and colour intensity scales with abundance in target sequences. miRNAs are ordered by hierarchical clustering.
strong enrichments for motifs complementary to miRNA 30-end sequences. Several auxiliary motifs included the classic supplementary pairing region from nucleotides 1316, but many different regions of auxiliary binding were evident17.
Expanded miRNAtarget pairing rules in the brain. Motif analysis revealed extensive seed-based and auxiliary miRNA targeting in vivo. For resolution of individual events, we performed duplex structure predictions for target regions and their
6 NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864 ARTICLE
All miR-124 miR-9 miR-181a
1 24
miRNA position
1
2
3
miR-9 miR-181a
0.15
0.10
Denstiy
miR-124
P ~ 0 P=1.8 10206 P=2.5 1077
Real
Shuffled
0.05
30
25
20
15
10
MFE (kcal mol1)
kcluster
miR124
All
miR9
*
*
*
*
*
*
miR26a
1
2
3
4
5
6
*
*
*
let7a
*
4
5
6
let7b
*
* *
*
let7c
*
*
let7i
*
*
miR128
miR181a
*
*
miR30a
*
*
*
*
*
*
miR30b
*
*
miR30c
*
*
miR30d
miR30e
*
*
*
*
*
miR138
miR125b
*
*
*
miR709
*
miR16
*
*
miR9*
miR23a
*
miR23b
*
*
miR125a
*
*
miR219
*
*
miR103
*
*
0 0.2 0.4 0.6 Fraction
0.8 1
16
Cntrl mimic miR-9 mimic
No seed
Cntrl mimic miR-181a mimic
No seed
14
1.2
1.2 **
** ** **
1
1
Auxiliary base pairs (#)
12
0.8
mRNA level
mRNA level
0.8
10
0.6
0.6
8
0.4
0.4
6
0.2
0.2
4
0
0
2
Target
Site type
* * ** ** * **
Gosr2
Map6d1
Nbea
Ctnnd1
Serinc3
Rfx7
Gnaq
Target
Comt
Gul
Chchd10
Smad7
Chd9
0
1 2 3 4 5 6 7
Seed
Both
Site type
Seed
Seed base pairs (#)
Figure 4 | Duplex structure prediction reveals diverse targeting patterns for brain miRNAs. (a) RNAhybrid miRNAtarget duplex structure predictions represented as heat maps40. Black pixels indicate base pairing and white pixels indicate gaps. Structures were partitioned by k-means clustering into six groups (see Methods). Interactions from all transcript regions were included in this analysis. (b) Structure maps for individual miRNAs compared with all. (c) Density plots of duplex minimum free energies (MFEs) are shown for the indicated miRNAtarget interactions (blue) or shufed interactions (red), where each chimeric target region was randomly re-assigned to an miRNA from a different chimeric interaction. MFEs were calculated with RNAhybrid. Axis labels are printed once, but apply to all plots. P-values from two-tailed t-tests are shown. (d) Distributions of the six identied k-clusters for the top brain miRNAs, ranked by decreasing abundance from the top to the bottom. Most brain miRNAs (B90%) and all shown here have signicant preferences versus the whole population (*positive enrichment, Po10 3, Fishers exact test; full set is in Supplementary Table 3). (e) Box plot comparing number of predicted seed region base pairs with predicted auxiliary base pairs for all brain miRNAtarget chimeras. (f) Experimental validation of chimera-identied seed-dependent and seedless (k 4, with no canonical seeds in 30-UTR) miR-9 and miR-181a targets was performed by transfecting miRNA mimics into
N2A cells and measuring endogenous targets by qRTPCR. The average fold change in miRNA mimic versus control mimic-transfected cells is shown from four independent transfections, s.e.m. *Po0.05 and **Po0.01, one-tailed t-test. Smad7, a previously conrmed miR-181a target, served as a positive control68.
cognate miRNAs using RNAhybrid (Supplementary Data 1)40. k-means clustering of structures revealed six major modes of miRNAtarget binding, with ve dominated by seed-site pairing combined with various auxiliary binding patterns (Fig. 4a,b). Four clusters (k 14) closely mirrored similar analyses of 293T
CLASH sites, including a seed-independent class (k 4)19. A fth
group identied by CLASH, encompassing B20% of interactions and lacking signicant miRNAtarget pairing, was not identied here. We also observed novel classes with seed pairing coupled with bipartite or tripartite auxiliary pairing patterns. These
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 7
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864
clusters, including the distinctive patterns of auxiliary binding, were not observed when target regions and miRNAs were shufed by randomly re-assigning each chimeric target region the miRNA from a different chimera. Shufed interactions showed signicantly lower duplex hybridization energies than true ones, consistent with the discovery of real binding events (Fig. 4c).
Remarkably, of 212 miRNAs with 450 identied target sites in the brain, 196 (B90%) showed signicant enrichment or depletion in one or more k-means binding class (Fig. 4d and Supplementary Table 3). For example, miR-124 was strongly enriched in groups 1 (P 1.6 10 245, Fishers exact test) and 5
(Po1.6 10 245), and marginally in group 2 (P 2.1 10 3).
In contrast, miR-124 was strongly depleted in groups 3 (Po1.6 10 245), 4 (P 3.7 10 140) and 6 (P 1.1 10
174). This pattern conrmed strong seed dependence for miR-124 binding and revealed distinct patterns of favoured auxiliary binding (Fig. 4b,d). Motif analysis also supported auxiliary pairing, showing an enriched 7mer motif complementary to miR-124 positions 14 to 20 (Fig. 3f). Structural inference revealed distinct binding patterns contributing to this motif consensus.
Some miRNAs tolerated striking diversity in pairing interactions. miR-9 was enriched in group 3 (P 3.3 10 130, Fishers
exact test), characterized by strong seed dependence and frequent auxiliary pairing from positions 14 to 22, and group 6 (P 3.9 10 17), characterized by a tripartite auxiliary pattern (Fig. 4b). miR-9 was also enriched for seedless binding (k 4, P 2.2 10 9).
Similarly, miR-181 family members were enriched in both seed-dependent and -independent classes. Globally, interactions with more predicted seed pairing exhibited fewer predicted auxiliary base pairs and vice versa (Fig. 4e). Canonical sites with less seed pairing (6mer and 7mer-A1) had slightly more predicted auxiliary pairing than stronger seed sites (8mer and 7mer-m8), consistent with supplementary 30-pairing (Supplementary Fig. 6a)17.
A stronger effect was evident for bulged or mismatched 8mer and 7mer motifs, which had more auxiliary pairing than their perfect match counterparts, indicating complementary pairing to offset imperfect seed matches (Supplementary Fig. 6bd)18.
Specic classes of CLEAR-CLIP-dened sites are preferentially conserved in mammals, consistent with functional signicance7,41,42. In both CDS and 30-UTRs, groups 1, 2 and 3 were modestly more conserved than groups 4, 5 and 6, with seedless interactions (k 4) showing lowest overall conservation
(Supplementary Fig. 7a,b). The 30-UTR sites with canonical seed matches and certain bulged or mismatched motifs were more conserved than sites lacking seed homology (Supplementary Fig. 7c). CDS sites showed a similar pattern, except for mismatched sites (Supplementary Fig. 7d). To compare conservation of seed and auxiliary pairing regions, we calculated conservation scores in the seed and auxiliary portions of 30-
UTR target sites. For 8mer and 7mer-m8 sites, target seed regions were modestly more conserved than the auxiliary region (Po0.05, one-tailed t-test). For other sites, seed and auxiliary regions were similarly conserved (Supplementary Fig. 7e), implying evolutionary pressure to maintain the whole miRNA binding site.
We conrmed chimera-identied regulation by transfecting miRNA mimics into mouse neuroblastoma (N2A) cells and measuring endogenous target mRNA levels by quantitative reverse transcriptasePCR (qRTPCR). miRNA mimics repressed most miR-9 (6/7) and miR-181a (5/6) targets examined, including all with canonical seeds and several with seedless interactions and no canonical seed matches in their 30-UTRs (Fig. 4f). These experiments support prior ndings that seed-independent miRNA targeting is functional but weaker than seed-dependent regulation14,19.
Endogenous miRNAtarget chimeras in human hepatoma cells. To independently assess miRNAtarget pairing patterns, we searched for miRNAtarget chimeras in standard HITS-CLIP libraries from human hepatoma (Huh-7.5) cells. miR-rst chimeras were present at B0.5% of unique reads, suggesting that on-bead RNA ligase I treatment for 30-linker addition in the standard protocol can form chimeras (Supplementary Table 4 and Supplementary Data 4). As in the brain, miR-rst chimera target regions were strongly enriched for cognate miRNA seed matches, whereas miR-last were less so (Fig. 5a,b). In total, 34,986 miRNAtarget interactions were identied in Huh-7.5 cells (Supplementary Fig. 8a)43, conrming that standard HITS-CLIP libraries contain miRNAtarget chimeras, albeit at reduced frequency26.
To further test the functionality of chimera-identied sites, we examined data from Huh-7.5 cells treated with locked nucleic acid (LNA) against miR-122 or miravirsen, a clinical miR-122 inhibitor44. AGO binding to 30-UTR regions with miR-122 7mer or 8mer seed matches was specically reduced in miR-122 LNA versus control cells (Fig. 5c). This effect was stronger for sites overlapping miR-122 chimeras and even stronger when both predictors were combined. When regions outside 30-UTRs were included, a signicant effect was only observed when miR-122 chimeras were present (Fig. 5d). These results indicate that chimeras enhanced prediction of 30-UTR and non-30-UTR sites.
For miravirsen treatment, miR-122 seed presence alone was predictive in all cases, but miR-122 chimeras enhanced these predictions (Supplementary Fig. 8b,c). This analysis provided further evidence that miRNA chimeras improve identication of miRNA regulatory sites.
miRNAtarget chimeras in the absence of exogenous ligase. Chimeras independent of exogenous ligase were present in small numbers in mouse brain and were reported in C. elegans26. These interactions showed signicant seed enrichment, suggesting many are real (Fig. 1e). We used CLEAR-CLIP in Huh-7.5 cells to investigate mammalian transfer RNA ligase HSPC117 as a potential source of these chimeras and a means to enhance chimera ligation45. As in mouse brain, Huh-7.5 CLEAR-CLIP yielded chimeras at B2% of mapped reads. Ligase-treated samples showed a B10-fold enrichment for miR-rst chimeras and a smaller enrichment for miR-last (Fig. 5e). CLEAR-CLIP without ligase addition was also done on Huh-7.5 cells with induced overexpression of HSPC117 or efcient depletion by RNA interference (Supplementary Fig. 8d). In both conditions, chimera frequencies were not signicantly different from controls with endogenous HSPC117 levels (Fig. 5e). We also searched for chimeras containing truncated miRNAs, in case RNAse cleavage was a prerequisite for HSPC117-mediated ligation26, yielding the same result (Fig. 5f). Interestingly, truncated chimeras in Huh-7.5 cells comprised an additional B1% of mapped reads, far more than in the brain, with most truncated one nucleotide (Supplementary Fig. 8e). This analysis ruled out HSPC117 as a major endogenous source of chimeras.
Expanded miRNAtarget pairing rules in human cells. Motif and structural analysis revealed global miRNAtarget pairing patterns in Huh-7.5 cells. As in mouse brain, seed-complementary motifs were identied for most miRNAs, in addition to many 30-auxiliary motifs (Fig. 6a). For structure clustering, informative binding classes in Huh-7.5 cells were most evident with seven k-groups, as opposed to six in mouse brain (Fig. 6b). Two Huh-7.5 groups (5A and 5B), similar to group 5 from mouse brain, showed bipartite auxiliary pairing but at distinct sites. The other clusters closely resembled corresponding groups in mouse
8 NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864 ARTICLE
Huh-7.5 miR-first
Huh-7.5 miR-last
0.5
0.5
CDF: seed presence
8mer 7merA1 7merm8 6mer 6mer offset Total
CDF: seed presence
8mer 7merA1 7merm8 6mer 6mer offset Total
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0 200 100 0 100 200
0 200 100 0 100 200
Distance from ligation point (nts)
Distance from ligation point (nts)
AGO cluster density miR-122 LNA vs mock 3-UTRs
AGO cluster density miR-122 LNA vs mock all regions
1.0
1.0
P = 0.0016 P = 0.0077 P = 0.00013
P = 0.00093 P = 0.4P = 8e07
0.8
0.8
0.6
0.6
CDF
CDF
AGO peaks (n):
miR-122 seed (68) miR-122 chimera (49) Seed + chimera (17)
AGO peaks (n):
All (1,007)
miR-122 chimera (133) Seed + chimera (48)
All (4,242) miR-122 seed (300)
0.4
0.4
0.2
0.2
0.0 2.0 1.0 0.0 1.0 2.0
0.0 2.0 1.0 0.0 1.0 2.0
Fold change (log2)
Fold change (log2)
miR-first miR-last
% miRNA chimeras of
All chimeras, 12 miRNA 5'nts Full-length miRNA chimeras
Truncated 4 nt miRNA chimeras
Truncated 2 nt miRNA chimeras
% miRNA chimeras of
total mapped collapsed
reads
10
0.1
total mapped collapsed
reads
10
0.1
0.01
Truncated 1 nt miRNA chimeras
1
1
Exogenous ligase HSPC117 RNAse A RNAse T1
+ +
++ ++ ++ + +
+ + ++
+
+ +
++ ++++ + +
+ + ++
+
Exogenous ligase HSPC117
RNAse A RNAse T1
+ + + ++ ++
++
++
+ + + + +
+
+
Figure 5 | miRNAtarget chimeras identify functional interactions in Huh-7.5 cells. CDF seed-enrichment plots as in Fig. 1dg for miR-rst (a) and miR-last (b) chimera target regions from Huh-7.5 HITS-CLIP. CDF plots of LNA-122 induced changes in AGO binding across 30-UTRs (c) or all regions (d) for sites with miR-122 7-8mer seeds (magenta), miR-122 chimeras (red) or the combination of both (blue). P-values are shown for Kolmogorov
Smirnov tests comparing indicated subsets to control (black) sets. (e) miR-rst and miR-last chimeras in CLEAR-CLIP on Huh-7.5 cells as a percentage of total unique reads, varying the presence of exogenous T4 RNA ligase and RNAse. For HSPC117 ligase, ( ) represents endogenous levels, ( ) represents
siRNA knockdown and ( ) represents overexpression as shown in Supplementary Fig. 8d. (f) Analysis of CLEAR-CLIP derived miR-rst chimera
truncation and the effects of HSPC117 manipulation. Percentage of chimeras harbouring full-length miRNAs was compared with chimeras with the indicated 30-truncations, or with all putative chimeric reads with at least 12 nts miRNA sequence starting at the 50-miRNA end. In e,f, the mean values of two biological replicates is shown for each sample, with error bars indicating s.d.
brain. The appearance of more diversity in Huh-7.5 cells may reect the diversity of their miRNA proles, which included many miRNAs expressed at high to moderate levels (Supplementary Fig. 8f). Comparably, brain miRNAtarget interactions involved fewer, very abundant miRNAs, consistent with a narrower range of structures (Supplementary Fig. 1e).
Of 83 human miRNAs detected in 50 or more chimeras, 75 (90%) were signicantly enriched or depleted in specic binding classes (Fig. 6c and Supplementary Table 5). To assess the reproducibility of chimera-dened pairing patterns in different biologic settings, motif enrichments were compared for the 12 miRNAs among the 50 most abundant in both mouse brain and Huh-7.5 cells (Fig. 6d). Overall binding patterns were preserved across species and tissue types in 9 of 12 cases, supporting the robustness of our methods. The remaining three miRNAs showed similar enrichment of auxiliary motifs but divergent seed enrichments, which may reect the different target populations in these settings.
Auxiliary pairing regulates miRNAtarget specicity in vivo. As a striking indication that auxiliary pairing regulates miRNA target specicity, duplex structure analysis revealed distinct binding patterns for members of miRNA seed families (for example, let-7, miR-30, miR-181 and miR-125) (Fig. 4d). As CLEAR-CLIP does not yet provide comprehensive coverage of all miRNA-binding sites, it was not possible to compare the overlap of different miRNA paralogues by occupancy analysis. Instead, we used de novo motif analysis to search for distinguishing features of the target regions of individual paralogues. For most miRNA family members, motifs complementary to divergent 30-sequences were highly enriched in cognate target regions but not their paralogues (Fig. 7a,b, below charts). Next, we reasoned that if inter-family preferences existed, family members should form more stable duplex structures with their own identied target regions than other paralogues. We calculated duplex energies for CLEAR-CLIP target regions of each abundant let-7 family member in the brain with each let-7 miRNA in a four-way pair-
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 9
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864
1 24
miRNA position
1 24
miRNA position
miRNAs
kcluster
1
2
3
4
5A
5B
6
0 20 50
Presence (%)
k clusters
miR17 1
2
3
4
5A
5B
6
0 0.2 0.4 0.6 0.8 1
Mouse brain
*
*
1 24
miRNA position
miR320a
*
*
*
miR92a
*
Human hepatoma Huh7.5
miR194
*
*
hsamiR16
mmumiR181b
miR26a
*
hsamiR23a
miR19a
hsamiR24
*
hsamiR26a
miR16
*
hsamiR26b
miR122
*
*
hsamiR27a
hsamiR30a
miR23a
*
hsamiR30b
miR27a
*
*
hsamiR30c
miR186
hsamiR130a
*
*
*
hsamiR181a
miR30a
*
*
hsamiR181b
miR21
*
miR93
*
*
miR130a
*
miR374a
*
*
*
mmumiR16
mmumiR23a
mmumiR24
miR18a
*
Mouse brain
miR30c
*
mmumiR26a
miR20a
mmumiR26b
*
*
mmumiR27a
miR103a
*
mmumiR30a
miR192
*
mmumiR30b
mmumiR30c
miR106b
*
*
mmumiR130a
miR574
*
mmumiR181a
All
Fraction
Figure 6 | Expanded miRNA pairing rules for human miRNAs. (a) De novo analysis of cognate miRNA-complementary-enriched 7mer motifs in chimera target regions plotted as a heat map across the miRNA. Each line represents one miRNA, with colour intensity indicating abundance in target sequence. miRNAs are ordered by hierarchical clustering. Interactions from all Huh-7.5 HITS-CLIP and CLEAR-CLIP experiments from all transcript regions were included in these analyses. (b) RNAhybrid miRNAtarget duplex structure predictions represented as heat maps as in Fig. 4a, partitioned by k-means clustering40. (c) Distributions of the seven identied k-clusters for top Huh-7.5 miRNAs ranked by abundance in chimeras from top to bottom. Most miRNAs (B90%) and all shown here have distinct preferences versus the whole population. Interactions from all transcript regions were included in this analysis (*positive enrichment, Po10 3, Fishers exact test; full results in Supplementary Table 5). (d) Comparative motif analysis heat map for the12 miRNAs that were among the 50 most abundant in both mouse brain and Huh-7.5 cells.
wise comparison (Fig. 7c). In all cases, let-7 family miRNAs formed more stable structures with their cognate target regions than other paralogues. This observation is striking in that some paralogues (for example, let-7b and let-7c) have higher GC content and thus intrinsic potential for more stable structures. Shufing analysis of miR-30 family members revealed similar specicity, although certain preferences were more signicant than others (Fig. 7d). Specically, miR-30b and miR-30c showed more signicant differences from miR-30a, miR-30d and miR-30e than from each other and vice versa. Analysis of miR-125 and miR-181 families revealed additional intra-family target preferences (Supplementary Fig. 9ad). Thus, motif and structure information indicate distinct targeting preferences for
miRNA paralogues controlled by differential miRNA 30-end pairing.
We validated functional specicity of miRNA family members using uorescence reporters with paralogue-specic target sites in their 30-UTRs (Fig. 8a)46. We examined miR-30a, miR-30c and miR-125a targets sites predicted to form more stable pairing with a specic paralogue and which were ligated to only that paralogue in at least two CLEAR-CLIP experiments. Reporters were cotransfected into N2A cells with plasmids expressing miRNA family members or a control C. elegans miRNA. miRNA expression was conrmed by northern blotting (Supplementary Fig. 10a) and silencing activity was conrmed using reporters with perfect complementary sites (Supplementary Fig. 10b,c). For
10 NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864 ARTICLE
1
1
0.8
0.8
Fraction base paired
Fraction base paired
0.6
0.6
0.4
0.4
0.2
let7a let7b let7c let7i
0.2
miR30a miR30b miR30c miR30d miR30e
0 1 5 10 15 20
0 1 5 10 15 20
miRNA position
miRNA position
let-7a UGAGGUAGUAGGUUGUAUAGUU UGUAAACAUCC ACU AAG
miR-30a
let-7b
5 3 5 3
***
10107
1075
let-7c
let-7i
1012
UGAGGUAGUAGGUUGUGUGGUU
UGAGGUAGUAGGUUGUAUGGUU
UGAGGUAGUAGUUUGUGCUGUU
miR-30b
miR-30c
miR-30d
10182
1091
10253
1073
1043
UGUAAACAUCC ACU GCU
UGUAAACAUCC ACU CAG
UGUAAACAUCC ACU AAG
UCG GG
UAC CA
UAC CU
CCG GG
UUG GG
miR-30e
UGUAAACAUCC ACU AAG
15
10
***
****
**
****
****
******
****
****
**
****
****
****
17
***
12
**
**
Minimum free energy
(kcal mol1 )
Minimum free energy
(kcal mol1 )
*****
*** ** ***
19
***
***
***
**
**
*
***
14
21
16
23
25
18
20
miRNA paired
27 a b c i a b c i a b c i a b c i
Targets
miRNA paired
ab c de ab c de ab c de ab c de ab c de
Targets
let-7a
let-7b
let-7c
let-7i
miR-30c
miR-30b
miR-30a
miR-30d
miR-30e
# Sites (n) 1,397 3,814 4,015 1,089 1,397 3,814 4,015 1,089
# Sites (n)
Figure 7 | CLEAR-CLIP reveals target specicity among miRNA family members. Base pairing proles from duplex structure maps for let-7 (a) and miR-30 (b) family members are shown. For each miRNA, the fraction of interactions with base pairing at each miRNA position is plotted. miRNA sequences are shown below with coloured bases indicating divergent nucleotides. De novo motif analysis of target regions for indicated miRNAs revealed family-member-specic motifs complementary to divergent parts of the miRNAs. For easier interpretation, the target motifs were reverse complemented to match the miRNA sequences. P-values for enrichment over background (AGO-binding regions in brain) from HOMER are indicated. No unique auxiliary motif was found for let-7a, the only such case. (c,d) Predicted minimum free energies (MFEs) from pairwise analysis of duplex structures for chimera-dened targets and the indicated let-7 (c) or miR-30 (d) family members is shown. Targets paired with their chimera-identied, cognate let-7 family member are shaded darker. Interactions from all transcript regions were included in these analyses. Box plots depict interquartile (2575) values (*Po0.05, **Po0.001, ***Po10 10 and ****Po10 50, one-tailed t-test).
CLEAR-CLIP-dened sites, repression was specic or more signicant for the predicted paralogue in several cases (Fig. 8dg,k). Effects included supplementary 30-pairing enhancing canonical repression (Fig. 8f,g) and paralogue-specic regulation at non-canonical sites (Fig. 8d,e,k). For other sites, repression in the presence of canonical (Fig. 8l,m) or non-canonical (Fig. 8h,i) sites was similar for different family members. When predicted pairing for one paralogue was signicantly more stable (4 6 kcal mol 1
D minimum free energy), paralogue-specic activity was usually observed. An exception was an 8mer mismatch miR-30c site with GU wobble pairing at miRNA position 3, which showed similar repression by both miR-30a and miR-30c despite extensive predicted 30-pairing with miR-30c (Fig. 8i). The strong repression by both paralogues was comparable to that of a perfect 8mer site
(Fig. 8b), consistent with our nding that GU pairing is well-tolerated at specic seed positions (Supplementary Fig. 5d). Conversely, more subtle differences in predicted pairing(2.8 kcal mol 1) enhanced miR-30c activity at a 6mer site with predicted supplementary 30-pairing (Fig. 8f). This complexity underscores the need for empirical binding maps to supplement structure- and sequence-based predictions. More broadly, these results illustrate paralogue-specic miRNA activity and diverse functional classes of non-canonical sites.
DiscussionCLEAR-CLIP gains its power from the formation of sequential covalent bonds that reect in vivo interactions. The utility of
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 11
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864
pCMV
miRNA genomicfragment Flow cytometry
+
tagBFP
pTRE-3G-BI
tagRFP
pEF-1
AGO binding site
GFP
miR-30 8mer
(+ve control)
miR-125 site (ve control)
Non-canonical
miR-30a sites
8mer mismatch
16
16
celmiR67 miR30c miR30a
celmiR67 miR30c miR30a
16
16
celmiR67 miR30c miR30a
G =12.6
G=21.3
celmiR67 miR30c miR30a
G =10.9
G=23.5
14
14
14
14
Log 2 (TagRFP)
12
Log 2 (TagRFP)
12
Log 2 (TagRFP)
12
Log 2 (TagRFP)
12
10
10
10
10
8
8
8
8
8 9 11 13 15
8 9 11 13 15
8 9 11 13 15
8 9 11 13 15
Log2 (TagBFP)
Log2 (TagBFP) Log2 (TagBFP) Log2 (TagBFP)
Log2 (TagBFP)
Log2 (TagBFP) Log2 (TagBFP) Log2 (TagBFP)
Log2 (TagBFP)
Log2 (TagBFP) Log2 (TagBFP) Log2 (TagBFP)
miR-30c sites
7mer-m8 w/ 3 pairing
6mer w/ 3 pairing 8mer mismatch (wobble)
7mer-m8 bulged
16
16
16
16
celmiR67 miR30c miR30a
G =19.7
G=16.9
celmiR67 miR30c miR30a
G =21.6
G=15.9
celmiR67 miR30c miR30a
G =17.5
G=12.3
celmiR67 miR30c miR30a
G =25.6
G=16.6
Log 2 (TagRFP)
Log 2 (TagRFP)
14
14
14
14
12
Log 2 (TagRFP)
12
Log 2 (TagRFP)
12
Log 2 (TagRFP)
12
10
10
10
10
8
8
8
8
9 10 12 14 16
9 10 12 14 16
8 9 11 13 15
9 10 12 14 16
miR-125a sites
miR-30 site (ve control)
Bulged 7mer-m8 w/ 3 pairing
7mer-m8 Offset 6mer
16
celmiR67 miR125a miR125b
16
16
celmiR67 miR125a miR125b
G =29.4
G=21.5
celmiR67 miR125a miR125b
G =26.4
G=22.7
celmiR67 miR125a miR125b
G =25.6
G=23.7
16
14
14
Log 2 (TagRFP)
14
14
12
Log 2 (TagRFP)
12
Log 2 (TagRFP)
12
12
10
10
10
10
8
8
8
8
9 10 12 14
7 9 11 13 15
8 9 11 13 15
8 10 12 14 16
Figure 8 | miRNA family member specicity conrmed by single cell measurements. (a) A system for single cell measurements of miRNA-mediated repression, adapted from ref. 46. tagRFP and tagBFP are expressed from the doxycycline-inducible bidirectional pTRE-3G-BI promoter. CLEAR-CLIP-dened AGO-binding sites were cloned into the 30-UTR of the tagRFP cassette, with tagBFP used for internal normalization. Plasmids co-expressing miRNAs and
GFP were co-transfected and measurements were taken 48 h later. (bm) Log-transformed plots of tagRFP versus tagBFP uorescence, with minimum free energies (MFEs) (DG) for predicted base pairing between duplex structures for indicated paralogues. A description of the site type is shown above each plot, with bold labelling denoting successful validation of paralogue specicity. Evaluation of miR-30a (red), miR-30c (blue) and negative control miRNA (black) overexpression on (b) a full miR-30 8mer site as a positive control for miR-30 paralogues; (c) a miR-125 site as a negative control for miR-30 paralogues; (d,e) sites with predicted miR-30a preference; and (fi) sites with predicted miR-30c preference. Evaluation of miR-125a (blue), miR-125b (red) and negative control miRNA (black) overexpression on (j) a miR-30 site as a negative control for miR-125 paralogs and (km) sites with predicted miR-125a preference. Representative plots from at least two independent experiments for each construct are shown.
miRNAtarget chimeras was demonstrated in two prior studies using CLASH and in vivo photoactivatable ribonucleoside-enhanced CLIP19,26. In mixing experiments, CLEAR-CLIP showed low false target identication rates similar to these
approaches without relying on specialized tagging strategies. CLEAR-CLIP thus provides a snapshot of true, physiologic miRNAtarget interactions and is uniquely applicable to all mammalian model systems and human samples47. In contrast to
12 NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864 ARTICLE
CLASH, CLEAR-CLIP does not require fully denaturing AGO and involves a single purication step. Our experiments with denatured AGO and analyses of published CLASH data showed low yield of standard non-chimeric CLIP reads compared with standard AGO HITS-CLIP, hindering robust AGO-binding peak identication. With straightforward modications of HITS-CLIP, CLEAR-CLIP simultaneously generates chimera information and high-quality, transcriptome-wide AGO HITS-CLIP maps. These dual data sets improved identication of functional miRNA target sites compared with HITS-CLIP or chimeras alone (Figs 2d and 5c,d), a key advantage, as miRNAtarget ligation remains limiting. Optimized ligation conditions yielded at least tenfold enrichment in ligase-treated versus no-ligase samples, a substantial improvement over prior methods26, but insufcient for comprehensive coverage. A key future goal is further improvement of this efciency to reduce false negatives and achieve the global coverage of HITS-CLIP maps.
CLEAR-CLIP yielded insights into pairing rules for over 200 mammalian miRNAs. Enriched target motifs revealed seed-dependence for most miRNAs, with widespread bulged or mismatched pairing, and extensive 30-auxiliary interactions (Figs 3 and 6). miRNAtarget duplex structure prediction claried that most interactions employed seed and auxiliary pairing in combination (Figs 4 and 6). Most miRNAs were signicantly enriched or depleted in one or more binding class, with many favouring two or more categories. This tolerance for distinct but constrained pairing structures was most apparent for abundant miRNAs with robust maps, suggesting that increased CLEAR-CLIP and CLASH efciency and/or proles in additional cell types will reveal similarly diverse pairing rules for other miRNAs. Similar pairing patterns applied to conventional 30-UTR targeting, as well as CDS and intronic binding. The latter indicates extensive, miRNA-dependent nuclear targeting of AGO. Although previous studies established AGO nuclear localization and RNA binding22,30,31,48, its mechanistic dependence on miRNA guidance was previously unclear.
Motifs and structure inference showed extensive pairing of miRNA 30-ends with targets. Such auxiliary interactions can stabilize or enhance miRNAtarget pairing, in particular together with imperfect seed pairing18. Global analysis of bulged and mismatched seed interactions from CLEAR-CLIP shows this phenomenon is common (Supplementary Figs 5 and 6). The importance of 30-auxiliary binding is still debated, with some reports demonstrating signicant effects18,49 and others concluding limited ones7. Analyses of miRNA mimic transfections found that supplementary pairing of miRNA bases 1217 marginally enhanced target repression in rare instances17,50. However, the sensitivity of such analyses may be limited by stringent requirements for continuous spans of auxiliary binding7. CLEAR-CLIP revealed diverse, often discontinuous auxiliary pairing that could hinder the detection of motif presence or conservation above background (Figs 4a and 6b). A second consideration is the heavy reliance of prior conclusions on acute overexpression of miRNAs, which may perturb endogenous AGOmiRNAtarget stoichiometry or interrogate different target repertoires than are available in vivo. Recent evidence for co-evolution of miRNAs and targets, in particular in neurons, underscores the importance of examining physiologic interactions51. The use of transcript destabilization in vitro as a sole functional readout may also overlook other AGO functions, including translational control, targeting to non-30-
UTR regions and interactions with other RNA-binding proteins42.
As a striking indication that auxiliary interactions regulate miRNA target specicity, we observed specicity among paralogues in miRNA seed families (Fig. 7). Such specicity was
previously illustrated for two let-7 family targets in Drosophila and has been speculated elsewhere18. Functional single-cell assays conrmed paralogue specicity for several sites from brain CLEAR-CLIP (Fig. 8). Other sites were similarly regulated by different paralogues, indicating miRNA family members are functionally redundant at certain sites and specic at others. Indeed, the strict conservation of miRNA families and their unique expression patterns in vivo, including across brain regions, supports specic functions52,53.
The predominance of canonical seed pairing in mediating mRNA target level repression is supported by CLEAR-CLIP-dened sites (Fig. 2). In addition, CLEAR-CLIP data demonstrated widespread, functional non-canonical miRNA targeting and substantial diversity in canonical and non-canonical interactions among different miRNAs. CLEAR-CLIP identied functional, non-canonical regulation globally for miR-128 and miR-124 (Fig. 2), and for individual miR-9, miR-181, miR-30 and miR-125 targets (Fig. 4f and Fig. 8bm). Non-canonical sites included diverse seed mismatch and bulged variants, and seedless interactions in both mouse brain and Huh-7.5 cells. Interestingly, a number of major miRNAs enriched for seedless interactions (for example, miR-9, miR-181, miR-30 and miR-186) have AU-rich seed sites, indicating that weak seed-pairing stability may favour seedless non-canonical interactions10. Our results support growing evidence of widespread non-canonical miRNA regulation that is likely to have a large collective impact1315,17,19,20,21. We expect CLEAR-CLIP and similar methods will facilitate discovery of these sites and rene in vivo miRNA regulatory maps in future studies.
Methods
Mice. All mouse experiments were approved by The Rockefeller University Institutional Animal Care and Use Committee regulations. P13-aged C57BL6/J mice were used for all experiments, except for BR21, BR22 and BR23 (Drosophila mixing), which used 6-week-old mice.
CLEAR-CLIP. Tissue cross-linking and lysis. Neocortex was dissected and cross-linked as described and snap frozen54. Frozen pellets were re-suspended in threefold volume (w/w) lysis buffer (1 PBS/1% Igepal/0.5% sodium
deoxycholate/0.1% SDS) containing Complete protease inhibitors (Roche). Lysates were treated with 30 ml RQ1 DNAse (Promega) at 37 C for 5 min with shaking.
Pre-immunoprecipitation RNAse treatment. For samples BR1, BR2, BR4, BR13, BR14, BR15, BR16, BR17, BR18, BR19, BR20, BR21, BR22 and BR23, RNAse A (USB Products) was added to lysates at 0.0001 U ml 1 and incubated at 37 C for 5 min. RNAsin Plus (Promega) was added at 0.5 U ml 1 and lysates were cleared by ultracentrifugation (50 000g). For remaining samples, RNAse treatment was done after immunoprecipitation (see below).
Immunoprecipitation and washing. Cleared lysates were rocked with Dynal Protein A beads (Life Technologies) prepared with 2A8 anti-AGO27,55 for 90 min at 4 C, then washed:
Three times lysis buffer containing 5 Denhardts solution Twice high-detergent buffer (1 PBS/1% Igepal/1% sodium deoxycholate/0.2%
SDS).
Three times low-salt buffer (15 mM Tris pH 7.5, 5 mM EDTA) Twice high-salt buffer (1 PBS/1% Igepal/0.5% sodium deoxycholate/0.1%
SDS, 1 M NaCl (nal, including PBS)).
Twice PNK wash buffer (50 mM Tris pH 7.5, 10 mM MgCl2, 0.5% Igepal)
On-bead RNAse treatment. For samples BR3, BR5, BR6, BR7, BR8, BR9, BR10, BR11 and BR12, beads were re-suspended in 0.5 ml lysis buffer containing 2 mg ml 1 BSA and RNAse A at 0.00002 U ml 1. Samples were treated at 37 C for 5 min with shaking, transferred to ice and supplemented with 0.5 U ml 1 RNAsin
Plus. Beads were rocked for 20 min at 4 C, to recover any dissociated antigen, then washed:
Twice high-detergent buffer Three times low-salt buffer Once high-salt buffer Twice PNK buffer
50-End phosphorylation and chimera ligation. Beads were treated with PNK (30-phosphatase minus) (NEB) and 1 mM ATP to phosphorylate cleaved mRNA 50-ends. Beads were washed three times in PNK buffer, then chimera ligation was
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 13
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864
performed overnight at 16 C with 0.625 U ml 1 T4 RNA Ligase I, 1 mM ATP and0.1 mg ml 1 BSA in a 100 ml total volume. The following morning, fresh RNA Ligase I (25 U) and ATP (1 mM) were added to each sample and incubation was continued 46 h. For minus-ligase controls (BR4 and BR5), RNA ligase was omitted. Beads were washed:
Twice lysis buffer Once PNK/EDTA/EGTA buffer (50 mM Tris pH 7.5, 10 EDTA,10 mM EGTA,
0.5% Igepal)
Twice PNK buffer
Alkaline phosphatase treatment and 30-linker ligation. Alkaline phosphate treatment was performed to remove 30-phosphate groups27. Pre-adenylated 30-linker (50-rAppGTGTCAGTCACTTCCAGCGG-30) was added using truncated RNA Ligase 2 (NEB), with 2.5 ml 20 mM linker and 4 U enzyme per 40 ml reaction (16 C overnight).
Radiolabelling of AGORNA complexes. AGORNA complexes were radiolabelled directly with PNK treatment in the presence of [g-32P]-ATP, followed by cold chase, exactly as described27.
SDSPAGE and amplication of RNA footprints. SDSPAGE, nitrocellulose transfer, extraction of AGO-bound RNA, 50-linker ligation and RTPCR steps were performed exactly as described27.
Addition of high-throughput sequencing adapters. Adapters for high-throughput sequencing were added to libraries with additional PCR cycles. PCR conditions were exactly as described, but indexed primers specied in Supplementary Table 7 allowed sample multiplexing. Libraries were sequenced on the Illumina Hiseq 2500 platform with 100-nucleotide single-end reads or on the Illumina Miseq with 75-nucleotide single-end reads.
CLEAR-CLIP with AGO denaturation. AGORNA complexes were puried as described up through PNK treatment, then eluted from beads with denaturation buffer (50 mM Tris pH 7.5, 0.1% Igepal, 6 M guanidine HCl, 300 mM NaCl). Samples were diluted vefold in 1 PBS/0.1% Igepal and run over a buffer
exchange column (Pierce) equilibrated with lysis buffer. AGORNA complexes were re-captured on fresh beads conjugated to 2A8 antibody, which was conrmed by western blotting. Subsequent steps were performed as described above.
CLEAR-CLIP mixing experiments. Total E. coli RNA was isolated with the RNAsnap method56. Either equal amounts or a sixfold excess of E. coli RNA (by mass) was equilibrated in lysis buffer and added to brain lysates. CLEAR-CLIP was then performed exactly was described, starting with DNAse treatment. For analyses in Supplementary Fig. 2, RNA was extracted after DNAse treatment (with or without RNAse) with Trizol LS and analysed by Bioanalyzer (Agilent) and qRT PCR. For Drosophila mixing experiments, lysates from non-cross-linked S2 cells and cross-linked mouse brain containing equal mass amounts RNA were combined immediately post lysis and CLEAR-CLIP was performed starting at DNAse treatment.
CLEAR-CLIP in Huh7.5 cells. Huh7.5 CLEAR-CLIP was done as above with the following modications. Cells (2 107) growing in 150 mm plates were irradiated
once for 400 mJ cm 2 and once for 200 mJ cm 2 using a Spectrolinker XL-1500 (Spectronics Corporation). Cells were trypsinized, pelleted and stored at 80 C.
Lysis was done in 1 ml lysis buffer. RNAse A (0.00040.00004 U ml 1; see Supplementary Table 4) or 0.1 U ml 1 RNAse T1 (Ambion) was used for RNAse treatment.
AGO HITS-CLIP in Huh7.5 cells. Standard AGO CLIP was done as per the previously published protocol27, except for multiplexing modications described above.
Plasmids. pRetroX-TRE3G-HSPC117 plasmid was constructed by inserting the HSPC117 (c22orf28) sequence from pLX304-c22orf28-H9 (ref. 45) into the doxycycline-inducible retroviral vector pRetroX-TRE3G (Clontech).
The dual-colour reporter vector was described elsewhere57. Inserts corresponding to CLEAR-CLIP-dened binding sites were synthesized as gBlocks (IDT) (Supplementary Table 6) and cloned into the 30-UTR of tagRFP by Gibson
Assembly (NEB) using EcoRV-linearized vector and inserts at a 1:5 molar ratio. Transformed clones were grown as maxi-preps at 30 C and conrmed by restriction digests and sequencing.
Mouse miR-125a construct was purchased from SBI (MMIR-125a-PA-1). Genomic fragments for miR-125b, miR-30a and miR-30c spanning B200 nucleotides upstream and downstream of primary hairpins were synthesized as gBlocks (IDT) and inserted into the SBI vector between EcoRI and BamHI. Constructs expressing miR-30a from the miR-30c locus and miR-125b from the miR-125a locus were also made, in an effort to control for processing efciency. However, miR-30a was only expressed from its endogenous locus (Supplementary Fig. 10). Therefore, endogenous fragments were used in all reporter experiments. The cel-miR-67 hairpin was cloned into the miR-30c genomic locus. Efcient expression of cel-miR-67 was conrmed by qRTPCR using the miScript system (not shown).
Cell culture and transfections. N2A mouse neuroblastoma (ATCC) and Huh7.5 human hepatoma cells58 were maintained in standard conditions.
N2A miRNA mimic reverse transfections were done with Dharmafect1 reagent and miRIDIAN mouse miRNA mimics or negative control mimic #1 (Dharmacon). Complexes were pre-formed in 24-well dishes, according to manufacturers instructions, and 120 000 cells per well were added giving a nal mimic concentration of 25 nM.
To generate N2A cells stably expressing the Tet-3G activator construct (Clontech), N2A cells were transfected with Xtremegene 9 (6:1 reagent:plasmid ratio, 375 ng plasmid per 24-well) and split at varying dilutions into G418 media 48 h later. Functional clones were identied by transfecting pTRE-BI-RFP construct and screening for doxycycline-inducible red uorescent protein (RFP) expression.
For inducible expression of HSPC117, Huh7.5 cells expressing Tet-3G activator (kind gift from C. Takacs) were transduced with pRetroX-TRE3G-HSPC117. HSPC117 expression was induced by 3 mg ml 1 doxycycline.
For Huh7.5 cell miRNA inhibitor experiments, cells were seeded the day before and transfected with LNA-122 or miravirsen/SPC3649 (50-CcAttGTcaCaCtCC-30;
LNA in upper case and DNA in lower case, Exiqon) at 30 nM using RNAi/Max (Life Technologies). No signicant cytotoxicity was observed from the applied concentrations of LNA and miravirsen/SPC3649, as determined using CellTiter-Glo (Promega).
qRTPCR analysis. For miRNA mimic experiments, RNA was extracted from N2A cells 24 h post transfection with Trizol (Ambion). RNA was further puried with DNAse treatment on High Pure RNA Isolation columns (Roche). Total RNA(0.5 mg) was reverse transcribed with the iScript kit (Biorad). qPCR was done with SYBR Green Mix (Life Technologies) on the iQ Cycler (Biorad). Gene-specic primers (Supplementary Table 7) were designed with Primer3 and tested to conrm efcient amplication of single products59. The following programme was carried to 40 cycles: 30 s 95 C (denaturation); 30 s 58 C (annealing); and 20 s72 C (extension). Results were analysed by DDCt, using RPL10A mRNA, an abundant transcript with negligible AGO binding in its 30-UTR in brain, for normalization.
For E. coli/mouse mixing experiments in Supplementary Fig. 2, RNA was extracted with Trizol LS (Ambion). Equal volumes re-suspended RNA were reverse transcribed with the iScript kit and analysed by qPCR as above.
Western blotting. For western blottings, 10 mg protein from cleared Huh-7.5 lysates were run per lane of a 412% NuPage gel (Life Technologies) and blotted onto a polyvinylidene diuoride membrane. HSPC117 was detected using Anti-C22orf28 antibody (Abcam, ab98231, 1 mg ml-1) and Goat-anti-Rabbit-HRP (Pierce 31462, 1:50,000).
Flow cytometry. N2A-Tet3G cells were co-transfected with miRNA (250 ng) and reporter (125 ng) plasmids in media with 1 mg ml 1 doxycycline (Sigma). At 24 h media was refreshed and at 48 h cells were trypsinized, harvested and xed with
Cytox/Cytoperm buffer (BD Biosciences). Cells were analysed on the MACSQuant cytometer (Miltenyi Biotec). Data were processed as described46,57. Briey, single cells were gated in FlowJo software and uorescence values were exported for analysis with custom R scripts. Cells were binned on the basis of tagBFP uorescence and mean tagRFP uorescence was calculated for each bin. Binned tagRFP means were plotted against binned tagBFP means.
Northern blotting. RNA was extracted from transfected N2A cells or brain with Trizol. Thirty micrograms of RNA per sample were run on 15% urea PAGE gels and then transferred to nylon membranes (Perkin Elmer). Hybridization of 32P-labelled DNA oligonucleotide probes (Supplementary Table 7) was done at 37 C in Ultrahyb-Oligo buffer (Ambion) overnight. Membranes were washed four times with 2 SSC/0.1% SDS and exposed to lm.
Bioinformatic analysis. Initial bioinformatic processing was performed exactly as described27. An additional de-multiplexing step was added after 30-adapter removal using a simple search for sample-specic indices (Supplementary Table 1). Peak calling for brain AGO HITS-CLIP was done as described, using pooled reads from ten biological samples in the present study and ve from a prior one22.
Identication of miRNAmRNA chimeras. Reads containing miRNA sequences were identied by reverse mapping mature miRNA sequences against sample libraries using Bowtie60. Changes to default parameters were as follows: maximum mismatches allowed in the seed ( n 1), seed length ( l 8), maximum total of
quality values at mismatched read positions ( e 35) and maximum reported
alignments ( k 1). Reads mapped to more than one miRNA, usually
members of the same miRNA family, were collapsed to a single, randomly chosen hit for initial analyses. Chimeric sequences upstream (50) and/or downstream (30)
of miRNAs were extracted, ltered for a minimum length of 18 nt and mapped against the appropriate reference genome (mouse mm9, human hg18, Drosophila dm3 or E. coli (Genbank CP000948.1)) with Bowtie. Only single, uniquely mapped hits were allowed and PCR duplicates were consolidated as described27. Fragments mapping to miRNA genes were removed.
14 NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864 ARTICLE
P-values below 1e 10 were rarely observed and c-values meeting the threshold
were never observed. Heat maps were created in the R gplots package.
RNA duplex structure prediction. Duplex structure predictions for miRNA and target region were made with RNAhybrid40. The rst miRNA nucleotide was trimmed, as this position does not basepair to targets62. Target regions (75 nt) were examined. Clusters4100 nt in length (o0.5% of total) were omitted. Clusters 475 nt and r100 nt were trimmed symmetrically from both ends to a lengthof 75 nt.
We reasoned that canonical seed matches and variants were likely to be engaged in base pairing when present. Default RNAhybrid settings identied most seed matches in target regions (B71% of total and B80% of 8mers). To improve concordance with motif presence, pairing was forced at appropriate seed positions when 8mer, 7mer, 6mer or 5mer matches were present, improving concordance to B95%. For targets with mismatch (8mer, 7mer and 6mer) or bulged (8mer and 7mer) motifs, two duplexes were predicted with forced pairing at positions 3 and 4 (setting f 3,4) or positions 5 and 6 (f 5,6). Predicted structures were usually identical, but when different the lower energy structure was used. For targets lacking seed homology, seed pairing was not forced (f option omitted).
For duplex heat maps, base-paired (WatsonCrick or G:U) miRNA sites were assigned a score of 1 and unpaired sites a score of 0. k-means clustering of the resulting matrix was done with Cluster 3.0 and visualized with Java TreeView63,64. Cluster numbers (k) 312 were tested, with k 6 providing the most meaningful
set of distinct categories in the brain. Enrichments of miRNAs in different k groups were evaluated by Fishers exact test, comparing the distribution of each miRNA against all interactions. Analyses for Huh7.5 data were done identically, but k 7
yielded more intuitive clustering of interactions.
Conservation analysis. Conservation scores (phlyoP) for duplex regions dened by RNAhybrid were downloaded from UCSC Genome Browser65,66. Plotted conservation scores for target regions were calculated by averaging base-wise phyloP scores across intervals.
Analysis of miRNA family specicity. To remove ambiguity in assigning chimeras among family members, Bowtie alignments were repeated with no mismatch allowance. For miRNA base-pairing proles, the percentage of chimera-identied interactions with base pairing at each miRNA position was calculated from duplex map predictions. For pairwise comparisons of predicted structures, target regions for each miRNA family member were used to predict duplex structures with each miRNA with RNAhybrid. Here, simplied settings were used without consideration of canonical seeds (f settings omitted). For motif analysis, enriched 6mer, 8mer, 10mer and 12mer motifs in target regions were determined with HOMER, using AGO-binding regions in the brain as the background39.
miR-rst chimeras in the brain were present in B14-fold excess of miR-last (Supplementary Table 1). This result differs from reported CLASH results, where miR-rst and miR-last species were present at comparable levels19. This difference may reect an idiosyncrasy of AGO1, the only AGO paralogue analysed by CLASH, or denaturation of AGO in the CLASH protocol, which may expose the buried miRNA 50-end. In CLEAR-CLIP, miR-last chimeras frequently involved dubiously annotated miRNAs, did not reect endogenous miRNA abundance and were not formed by exogenous ligase. They were therefore excluded from subsequent analyses. Unique miR-rst chimeric reads linked to same miRNA and with overlapping genomic coordinates were clustered together, using the GenomicRanges package in R61.
Analysis of chimera targets in miRNA perturbation experiments. Normalized microarray values for polyribosome proles in miR-128 KO and WT mouse brains were obtained from GEO2. Genes with contradictory probe information (different signs) were ltered and probe log2 fold-change (log2FC) values for remaining genes were averaged. For cumulative distribution function (CDF) analysis (Fig. 2a,b), log2FC ratios (KO/WT) in transcript polysome association were plotted for miR-128 30-UTR chimera sites. Non-miR-128 30-UTR chimeras were plotted as controls.
Normalized microarray values for CAD neuroblastoma cells transfected with miR-124 or control mimics were obtained from GEO and processed as for miR-128 proles38. In Fig. 2c, transcripts were divided into mutually exclusive sets based on the number of times (N 1 or N41, where N is the number of times an interaction
was identied by CLEAR-CLIP) the most frequently identied chimera site in their 30-UTRs occurred. Log2FC ratios (miR-124/control) were plotted as CDFs. miR-124 sites overlapping AGO-binding peaks, regardless of cluster size (N), were also plotted. The control set (non-miR-124, black) for all analyses were sites from transcripts lacking miR-124 chimeras. In Fig. 2d, CDFs were plotted for chimera-identied miR-124 sites, peak-identied sites overlapping miR-124 seed matches and the intersection of those sets. In Fig. 2e,f, transcripts were divided into mutually exclusive sets based on the presence of only canonical miR-124 chimera sites (e) or only non-canonical sites (f) in 30-UTRs.
For LNA-122- or miravirsen-treated Huh7.5 cells, standard AGO CLIP data from four biological replicates each of mock, LNA-122 and miravirsen were analysed, with alignment and peak calling as described above. Clusters were normalized to the read depth of their respective libraries after adding a pseudo-count of 1. Canonical miRNA seed searches were carried out within robust AGO clusters (32 nts). AGO clusters overlapping miRNA chimeras were identied with the genomeIntervals R package30. For the CDF plots shown, a minimum BC of 4 and a cluster density of 40 was required.
Sequence extraction and analysis. Sequence extraction and seed motif searches, including for mismatch and indel variants, were done with the GenomicRanges and BioStrings packages in the R Bioconductor suite61,30. Only single-nucleotide mismatches or indels were allowed. Clustered target regions up to 75 nt downstream of the ligation site, which sometimes extended beyond the sequenced reads, were searched. The selection of this interval was based on our observation that the vast majority of 8mer and 7mer-m8 seed matches fell within this region. These 75 nt regions were used subsequently for motif and structure analysis.
TargetScan 6.2 overlap. Genomic coordinates for mouse TargetScan 6.2 sites were ltered for genes expressed in P13 cortex7. Per cent overlap of 30-UTR
CLEAR-CLIP regions for the indicated miRNAs (collapsed by seed family, Fig. 3d) and TargetScan sites for that miRNA was calculated. For each miRNA, overlap was also calculated for three negative control sets of equal size, randomly selected from TargetScan sites for the top 20 abundant miRNAs (also only in cortex-expressed transcripts).
Motif analysis. For de novo motif analysis in chimera target sequences, chimeras were grouped for each miRNA present in at least 50 individual chimeras and40 individual sites. Background sequences totaling ve times the number of foreground (target) sequences were selected from other miRNA chimeras, excluding other miRNAs with the same seed site. De novo motif discovery was performed on three independent background sets using Homer39, expecting 7mer motifs and checking motifs for complementarity to the cognate miRNA, using commands similar to:
perl bin/ndMotifs.pl foreground/hsa-miR-122-5p.txt fasta output/hsa-miR-122-5p/ -fasta background/hsa-miR-122-5p.txt -mcheck motifs/hsa-miR-122-5p.motif -norevopp -noknown -len 7 -bits
Reverse complement miRNA sequences were added to the Homer list of known motifs using commands similar to:
perl bin/seq2prole.pl CAAACACCATTGTCACACTCCA 0 hsamiR-122-5p 4 motifs/hsa-miR-122-5p.motif
Information from Homer output les was extracted using regular expressions in R and a combined condence parameter, c, was calculated as:
c ( log10(p) 10)/10 (s 0.35) 6.7,where p is the P-value and s is the match score with the given miRNA from Homer.
Motifs with sZ0.35, information content per bpZ1.75 and cZ1 were retained. In seven iterations of random comparisons of background sequences,
References
1. Fabian, M. R., Sonenberg, N. & Filipowicz, W. Regulation of mRNA translation and stability by microRNAs. Annu. Rev. Biochem. 79, 351379 (2010).
2. Tan, C. L. et al. MicroRNA-128 governs neuronal excitability and motor behavior in mice. Science 342, 12541258 (2013).
3. Im, H. I. & Kenny, P. J. MicroRNAs in neuronal function and dysfunction. Trends Neurosci. 35, 325334 (2012).
4. Mizoguchi, M. et al. MicroRNAs in human malignant gliomas. J. Oncol. 2012, 732874 (2012).
5. Setty, M. et al. Inferring transcriptional and microRNA-mediated regulatory programs in glioblastoma. Mol. Syst. Biol. 8, 605 (2012).
6. Hausser, J. & Zavolan, M. Identication and consequences of miRNA-target interactions--beyond repression of gene expression. Nat. Rev. Genet. 15, 599612 (2014).
7. Friedman, R. C., Farh, K. K., Burge, C. B. & Bartel, D. P. Most mammalian mRNAs are conserved targets of microRNAs. Genome Res. 19, 92105 (2009).
8. Nielsen, C. B. et al. Determinants of targeting by endogenous and exogenous microRNAs and siRNAs. RNA 13, 18941910 (2007).
9. Yue, D., Liu, H. & Huang, Y. Survey of computational algorithms for microRNA target prediction. Curr. Genomics 10, 478492 (2009).
10. Garcia, D. M. et al. Weak seed-pairing stability and high target-site abundance decrease the prociency of lsy-6 and other microRNAs. Nat. Struct. Mol. Biol. 18, 11391146 (2011).
11. Lopez, J. P. et al. miR-1202 is a primate-specic and brain-enriched microRNA involved in major depression and antidepressant treatment. Nat. Med. 20, 764768 (2014).
12. Majoros, W. H. et al. MicroRNA target site identication by integrating sequence and binding information. Nat. Methods 10, 630633 (2013).
13. Chi, S. W., Hannon, G. J. & Darnell, R. B. An alternative mode of microRNA target recognition. Nat. Struct. Mol. Biol. 19, 321327 (2012).
14. Loeb, G. B. et al. Transcriptome-wide miR-155 binding map reveals widespread noncanonical microRNA targeting. Mol. Cell 48, 760770 (2012).
15. Betel, D., Koppal, A., Agius, P., Sander, C. & Leslie, C. Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites. Genome Biol. 11, R90 (2010).
16. Bartel, D. P. MicroRNAs: target recognition and regulatory functions. Cell 136,
215233 (2009).
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 15
& 2015 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864
17. Grimson, A. et al. MicroRNA targeting specicity in mammals: determinants beyond seed pairing. Mol. Cell 27, 91105 (2007).
18. Brennecke, J., Stark, A., Russell, R. B. & Cohen, S. M. Principles of microRNA-target recognition. PLoS Biol. 3, e85 (2005).
19. Helwak, A., Kudla, G., Dudnakova, T. & Tollervey, D. Mapping the human miRNA interactome by CLASH reveals frequent noncanonical binding. Cell 153, 654665 (2013).
20. Lal, A. et al. miR-24 Inhibits cell proliferation by targeting E2F2, MYC, and other cell-cycle genes via binding to seedless 30UTR microRNA recognition elements. Mol. Cell 35, 610625 (2009).
21. Shin, C. et al. Expanding the microRNA targeting code: functional sites with centered pairing. Mol. Cell 38, 789802 (2010).
22. Chi, S. W., Zang, J. B., Mele, A. & Darnell, R. B. Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. Nature 460, 479486 (2009).
23. Licatalosi, D. D. et al. HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature 456, 464469 (2008).
24. Hafner, M. et al. Transcriptome-wide identication of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell 141, 129141 (2010).
25. Kudla, G., Granneman, S., Hahn, D., Beggs, J. D. & Tollervey, D. Cross-linking, ligation, and sequencing of hybrids reveals RNA-RNA interactions in yeast. Proc. Natl Acad. Sci. USA 108, 1001010015 (2011).
26. Grosswendt, S. et al. Unambiguous identication of miRNA:target site interactions by different types of ligation reactions. Mol. Cell 54, 10421054 (2014).
27. Moore, M. J. et al. Mapping Argonaute and conventional RNA-binding protein interactions with RNA at single-nucleotide resolution using HITS-CLIP and CIMS analysis. Nat. Protoc. 9, 263293 (2014).
28. Viollet, S., Fuchs, R. T., Munafo, D. B., Zhuang, F. & Robb, G. B. T4 RNA ligase 2 truncated active site mutants: improved tools for RNA analysis. BMC Biotechnol. 11, 72 (2011).
29. Ule, J., Jensen, K., Mele, A. & Darnell, R. B. CLIP: a method for identifying protein-RNA interaction sites in living cells. Methods 37, 376386 (2005).30. Gagnon, K. T., Li, L., Chu, Y., Janowski, B. A. & Corey, D. R. RNAi factors are present and active in human cell nuclei. Cell Rep. 6, 211221 (2014).
31. Taliaferro, J. M. et al. Two new and distinct roles for Drosophila Argonaute-2 in the nucleus: alternative pre-mRNA splicing and transcriptional repression. Genes Dev. 27, 378389 (2013).
32. Tan, G. S. et al. Expanded RNA-binding activities of mammalian Argonaute 2. Nucleic Acids Res. 37, 75337545 (2009).
33. Yan, Q. et al. Systematic discovery of regulated and conserved alternative exons in the mammalian brain reveals NMD modulating chromatin regulators. Proc. Natl Acad. Sci. USA 112, 34453450 (2015).
34. Weyn-Vanhentenryck, S. M. et al. HITS-CLIP and integrative modeling dene the Rbfox splicing-regulatory network linked to brain development and autism. Cell Rep. 6, 11391152 (2014).
35. Zhang, C. et al. Integrative modeling denes the nova splicing-regulatory network and its combinatorial controls. Science 329, 439443 (2010).
36. Cheng, L. C., Pastrana, E., Tavazoie, M. & Doetsch, F. miR-124 regulates adult neurogenesis in the subventricular zone stem cell niche. Nat. Neurosci. 12, 399408 (2009).
37. Gao, F. B. Context-dependent functions of specic microRNAs in neuronal development. Neural. Dev. 5, 25 (2010).
38. Makeyev, E. V., Zhang, J., Carrasco, M. A. & Maniatis, T. The MicroRNA miR-124 promotes neuronal differentiation by triggering brain-specic alternative pre-mRNA splicing. Mol. Cell 27, 435448 (2007).
39. Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576589 (2010).
40. Kruger, J. & Rehmsmeier, M. RNAhybrid: microRNA target prediction easy, fast and exible. Nucleic Acids Res. 34, W451W454 (2006).
41. Lewis, B. P., Burge, C. B. & Bartel, D. P. Conserved seed pairing, often anked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120, 1520 (2005).
42. Gaidatzis, D., van Nimwegen, E., Hausser, J. & Zavolan, M. Inference of miRNA targets using evolutionary conservation and pathway analysis. BMC Bioinformatics 8, 69 (2007).
43. Hsu, S. D. et al. miRTarBase update 2014: an information resource for experimentally validated miRNA-target interactions. Nucleic Acids Res. 42, D78D85 (2014).
44. Janssen, H. L. et al. Treatment of HCV infection by targeting microRNA.N. Engl. J. Med. 368, 16851694 (2013).45. Popow, J. et al. HSPC117 is the essential subunit of a human tRNA splicing ligase complex. Science 331, 760764 (2011).
46. Mukherji, S. et al. MicroRNAs can generate thresholds in target gene expression. Nat. Genet. 43, 854859 (2011).
47. Erhard, F. et al. Widespread context dependency of microRNA-mediated regulation. Genome Res. 24, 906919 (2014).
48. Huang, V. & Li, L. C. Demystifying the nuclear function of Argonaute proteins. RNA Biol. 11, 1824 (2014).
49. Broderick, J. A., Salomon, W. E., Ryder, S. P., Aronin, N. & Zamore, P. D. Argonaute protein identity and pairing geometry determine cooperativity in mammalian RNA silencing. RNA 17, 18581869 (2011).
50. Elefant, N., Altuvia, Y. & Margalit, H. A wide repertoire of miRNA binding sites: prediction and functional implications. Bioinformatics 27, 30933101 (2011).
51. Barbash, S., Shifman, S. & Soreq, H. Global coevolution of human microRNAs and their target genes. Mol. Biol. Evol. 31, 12371247 (2014).
52. He, M. et al. Cell-type-based analysis of microRNA proles in the mouse brain. Neuron 73, 3548 (2012).
53. Roush, S. & Slack, F. J. The let-7 family of microRNAs. Trends Cell Biol. 18, 505516 (2008).
54. Ule, J. et al. CLIP identies Nova-regulated RNA networks in the brain. Science 302, 12121215 (2003).
55. Nelson, P. T. et al. A novel monoclonal antibody against human Argonaute proteins reveals unexpected characteristics of miRNAs in human blood cells. RNA 13, 17871792 (2007).
56. Stead, M. B. et al. RNAsnap: a rapid, quantitative and inexpensive, method for isolating total RNA from bacteria. Nucleic Acids Res. 40, e156 (2012).57. Luna, J. M. et al. Hepatitis C virus RNA functionally sequesters miR-122. Cell 160, 10991110 (2015).
58. Blight, K. J., McKeating, J. A. & Rice, C. M. Highly permissive cell lines for subgenomic and genomic hepatitis C virus RNA replication. J. Virol. 76, 1300113014 (2002).
59. Untergasser, A. et al. Primer3--new capabilities and interfaces. Nucleic Acids Res. 40, e115 (2012).
60. Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efcient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
61. Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).
62. Parker, J. S., Parizotto, E. A., Wang, M., Roe, S. M. & Barford, D. Enhancement of the seed-target recognition step in RNA silencing by a PIWI/MID domain protein. Mol. Cell 33, 204214 (2009).
63. Eisen, M. B., Spellman, P. T., Brown, P. O. & Botstein, D. Cluster analysis and display of genome-wide expression patterns. Proc. Natl Acad. Sci. USA 95, 1486314868 (1998).
64. Saldanha, A. J. Java Treeview--extensible visualization of microarray data. Bioinformatics 20, 32463248 (2004).
65. Rhead, B. et al. The UCSC Genome Browser database: update 2010. Nucleic Acids Res. 38, D613D619 (2010).
66. Pollard, K. S., Hubisz, M. J., Rosenbloom, K. R. & Siepel, A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res. 20, 110121 (2010).
67. Zhang, Z., Lee, J. E., Riemondy, K., Anderson, E. M. & Yi, R. High-efciency RNA cloning enables accurate quantication of miRNA expression by deep sequencing. Genome Biol. 14, R109 (2013).
68. Parikh, A. et al. microRNA-181a has a critical role in ovarian cancer progression through the regulation of the epithelial-mesenchymal transition. Nat. Commun. 5, 2977 (2014).
Acknowledgements
We thank members of the Darnell and Rice laboratories, in particular Jennifer Darnell, for thoughtful insights and support. This work was supported by grants from the US National Institutes of Health, NINDS (NS034389 and NS081706 to R.B.D.), NCI (CA057973 to C.M.R.), NIAID (AI091707 and AI090055 to C.M.R.), Ofce of the Director through the NIH Roadmap for Medical Research (DK085713 to C.M.R.), The Starr Foundation and the Simons Foundation (SFARI 240432 to R.B.D.). R.B.D. is an Investigator of the Howard Hughes Medical Institute. M.J.M. was supported by the Jane Cofn Childs Memorial Fund. T.K.H.S. was supported by a Postdoctoral Fellowship and a Sapere Aude Research Talent Award from The Danish Council for Independent Research. J.M.L. was supported by a David Rockefeller Graduate Student Fellowship.
Author contributions
M.J.M., T.K.H.S., C.M.R. and R.B.D. designed the studies and interpreted the results. M.J.M., T.K.H.S., J.M.L., J.J.F. and E.N. performed the experiments and analysed data. C.Y.P. developed bioinformatic analysis pipelines and analysed data. M.J.M., T.K.H.S. and R.B.D. wrote the manuscript and made the gures.
Additional information
Accession codes: High-throughput sequencing data are available at NCBI GEO under the accession number GSE73059.
Supplementary Information accompanies this paper at http://www.nature.com/naturecommunications
Web End =http://www.nature.com/ http://www.nature.com/naturecommunications
Web End =naturecommunications
Competing nancial interests: The authors declare no competing nancial interests.
16 NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2015 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms9864 ARTICLE
Reprints and permission information is available online at http://npg.nature.com/reprintsandpermissions/
Web End =http://npg.nature.com/ http://npg.nature.com/reprintsandpermissions/
Web End =reprintsandpermissions/
How to cite this article: Moore, M. J. et al. miRNAtarget chimeras reveal miRNA 30-end pairing as a major determinant of argonaute target specicity. Nat. Commun.
6:8864 doi: 10.1038/ncomms9864 (2015).
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the articles Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
Web End =http://creativecommons.org/licenses/by/4.0/
NATURE COMMUNICATIONS | 6:8864 | DOI: 10.1038/ncomms9864 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 17
& 2015 Macmillan Publishers Limited. All rights reserved.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright Nature Publishing Group Nov 2015
Abstract
microRNAs (miRNAs) act as sequence-specific guides for Argonaute (AGO) proteins, which mediate posttranscriptional silencing of target messenger RNAs. Despite their importance in many biological processes, rules governing AGO-miRNA targeting are only partially understood. Here we report a modified AGO HITS-CLIP strategy termed CLEAR (covalent ligation of endogenous Argonaute-bound RNAs)-CLIP, which enriches miRNAs ligated to their endogenous mRNA targets. CLEAR-CLIP mapped ∼130,000 endogenous miRNA-target interactions in mouse brain and ∼40,000 in human hepatoma cells. Motif and structural analysis define expanded pairing rules for over 200 mammalian miRNAs. Most interactions combine seed-based pairing with distinct, miRNA-specific patterns of auxiliary pairing. At some regulatory sites, this specificity confers distinct silencing functions to miRNA family members with shared seed sequences but divergent 3'-ends. This work provides a means for explicit biochemical identification of miRNA sites in vivo, leading to the discovery that miRNA 3'-end pairing is a general determinant of AGO binding specificity.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer