Content area
Issue Title: Special Issue: Production and Labeling of Biological Macromolecules for NMR Investigations/Guest Edited by Gerhard Wagner
The known biological functions of RNA have expanded in recent years and now include gene regulation, maintenance of sub-cellular structure, and catalysis, in addition to propagation of genetic information. As for proteins, RNA function is tightly correlated with structure. Unlike proteins, structural information for larger, biologically functional RNAs is relatively limited. NMR signal degeneracy, relaxation problems, and a paucity of long-range ^sup 1^H-^sup 1^H dipolar contacts have limited the utility of traditional NMR approaches. Selective isotope labeling, including nucleotide-specific and segmental labeling strategies, may provide the best opportunities for obtaining structural information by NMR. Here we review methods that have been developed for preparing and purifying isotopically labeled RNAs, as well as NMR strategies that have been employed for signal assignment and structure determination.[PUBLICATION ABSTRACT]
J Biomol NMR (2010) 46:113125 DOI 10.1007/s10858-009-9375-2
PERSPECTIVE
Isotope labeling strategies for NMR studies of RNA
Kun Lu Yasuyuki Miyazaki Michael F. Summers
Received: 30 June 2009 / Accepted: 20 August 2009 / Published online: 30 September 2009 The Author(s) 2009. This article is published with open access at Springerlink.com
Abstract The known biological functions of RNA have expanded in recent years and now include gene regulation, maintenance of sub-cellular structure, and catalysis, in addition to propagation of genetic information. As for proteins, RNA function is tightly correlated with structure. Unlike proteins, structural information for larger, biologically functional RNAs is relatively limited. NMR signal degeneracy, relaxation problems, and a paucity of long-range 1H1H dipolar contacts have limited the utility of traditional NMR approaches. Selective isotope labeling, including nucleotide-specic and segmental labeling strategies, may provide the best opportunities for obtaining structural information by NMR. Here we review methods that have been developed for preparing and purifying isotopically labeled RNAs, as well as NMR strategies that have been employed for signal assignment and structure determination.
Keywords NMR RNA structure Enzymatic ligation
Isotope labeling
Introduction
RNA molecules play pivotal roles in a large and growing number of known biological functions. Initially relegated to roles in genetic information transfer, RNA species comprising as few as *19 nucleotides (nt), or as many as *4,700 nt or more, are now known to play roles in
transcriptional regulation, metabolite and protein recognition, catalysis, maintenance of sub-cellular and viral structure, and other essential functions (Hassouna et al. 1984; Doudna and Rath 2002; Bartel 2004; Kim 2005; Brodersen and Voinnet 2006; Boisvert et al. 2007; Edwards et al. 2007; Korostelev and Noller 2007; Wakeman et al. 2007; Bessonov et al. 2008; Steitz 2008). In addition, a signicant portion of the transcribed Eukaryotic genome produces non-protein-coding RNAs, of which only a few have known biological roles (Ponting et al. 2009). Current understanding at the atomic level of how RNAs function is generally limited, a problem that is compounded by the relative paucity of high-resolution structural information. To date, only *1,300 RNA structures have been deposited in Nucleic Acid Database (NDB), whereas, more than 55,000 protein structures have been deposited in the Protein Databank (PDB). Conformational heterogeneity and a relatively uniform, negative surface charge are two signicant impediments to structural studies by X-ray crystallography.
NMR is a potentially powerful tool for studying the structure and dynamics of RNA (Wthrich 1986). Unfortunately, because most RNAs contain only four types of residues, chemical shift dispersion of 1H, 13C, 15N and 31P nuclei is relatively low. Overall proton density is also much lower than that of proteins, and non-exchangeable proton density is concentrated mainly within the major grooves of A-helices. In addition, interproton distances between different secondary structure elements are typically greater than 5, which limits the utility of NOE experiments for establishing overall RNA folds (Allain and Varani 1997; Lukavsky and Puglisi 2005). Finally, NMR assignments are critically dependent on analysis of signals for aromatic CH groups, and 1H13C dipolar coupling can severely limit the sensitivity and resolution of 1H13C correlation
K. Lu Y. Miyazaki M. F. Summers (&)
Howard Hughes Medical Institute and Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250, USAe-mail: [email protected]
123
114 J Biomol NMR (2010) 46:113125
NMR spectra obtained for these groups in larger RNAs with longer rotational correlation times. For these reasons, high-resolution NMR-based structural studies have been applied mainly to relatively small RNAs: Of the 298 RNA NMR structures that have been deposited to date in the Nucleic Acid Database, only three are comprised of 50 or more nucleotides.
The study of RNA structure has typically relied on the divide-and-conquer strategy, in which functional RNAs are dissected into smaller sub-domains (for examples, see: (Aboul-ela et al. 1996; Amarasinghe et al. 2000; Spriggs et al. 2008). Unfortunately, the isolated sub-domains typically lack function, and it is not always clear that their folding matches the folding that occurs in the context of the intact RNA. In fact, examples have emerged in which the isolated and intact structures differ signicantly (Abramovitz and Pyle 1997). Structures of small RNAs can also be susceptible to crystal packing effects (Shah and Brunger 1999).
Nucleotide specic 13C, 15N, and 2H isotope labeling
was introduced in the early 90 s as a means of overcoming signal degeneracy problems (Batey et al. 1992, 1995; Nikonowicz and Pardi 1992; Michnicka et al. 1993; Kim et al. 1995). More recently, segmental labeling approaches have been developed for studying sub-domain structure in the context of intact, functional RNAs (Xu and Crothers 1996; Kim et al. 2002; Lukavsky et al. 2003). This strategy should work well for RNAs with subdomains that have relatively short rotational correlation times, and could be applicable to RNAs with longer rotational correlation times using appropriate TROSY-based NMR experiments (Pervushin et al. 1997; Riek et al. 1999; Fiala et al. 2000). This review outlines the methods that have been developed for the preparation and purication of milligram quantities of isotopically-labeled RNAs, as well as the NMR assignment strategies that could eventually allow high resolution structure determination of RNA subdomains within very large ([700 nt) RNAs.
RNA synthesis
Chemical synthesis
Chemical synthesis of RNA became commercially feasible in the late 80s with the development of efcient ribose 20-hydroxyl protecting groups (Usman et al. 1987; Scaringe et al. 1998). Although modied NTPs can readily be incorporated into specic positions, low coupling efciencies limit this approach to relatively small RNAs. At present, chemical synthesis is practical for RNAs comprising fewer than *55 nt. Site-specic labeling of even these relatively small RNAs could be extremely useful for
incorporation into larger RNAs by enzymatic ligation (see below). Unfortunately, isotopically-labeled phosphoramidites for RNA synthesis are not commercially available.
Enzymatic synthesis using T7 RNA polymerase
RNAs for NMR-based structure determination are now commonly prepared by in vitro transcription using the T7 RNA polymerase (RNAP) enzyme. Milligram quantities of RNAs ranging in length from 10 to 30,000 nt can be prepared, and it is straightforward to incorporate commercially available 13C-, 15N-, and 2H-isotopically labeled nucleotides (Milligan and Uhlenbeck 1989; Pokrovskaya and Gurevich 1994). T7 RNAP is highly specic for its own promoters and exhibits no afnity for the closely related T3 promoters. The error frequency in transcripts for wild type T7 RNA polymerase is as low as 6 9 10-5.
Although T7 RNAP-dependent in vitro transcription is currently the most popular and widely used method for preparing RNAs, the approach does have some drawbacks. First, to achieve the best yields, the most commonly used T7 class III promoter requires guanosine at the rst and second positions of the sequence. As such, non-native 50-guanosines sometimes must be added to the RNA sequence of interest. Similarly, the T7 U2.5 class II promoter requires a 50-AG sequence for efcient transcription.
These 50-sequence requirements can be overcome by engineering the appropriate 50-GG (or 50-AG) start sequence into a hammerhead (HH) ribozyme sequence, which itself is appended to the 50 end of the fully native target RNA sequence (discussed further below).
A second drawback is that T7 RNAP often adds one or more non-templated nucleotides (typically Cs) to the 30-terminus of the product RNA (Milligan et al. 1987;
Milligan and Uhlenbeck 1989; Pokrovskaya and Gurevich 1994). Although 30-heterogeneity is not a signicant problem for most structural studies, it can be a major problem if the RNA is to be ligated to the 50-end of a second RNA molecule. 50-end heterogeneity can also be a problem for transcripts that have more than 3 consecutive 5-guanosines (Pleiss et al. 1998), or for constructs with other 50-terminal sequences such as 50-CACUGU, 50-CA
GAGA or 50-GAAAAA (Helm et al. 1999). 50-end heterogeneity does not appear to be a problem when using the
T7 U2.5 class II promoter.
Control of 50-end homogeneity
The efcient way to produce RNA transcripts with uniform 50-ends is to construct a DNA template with a self-cleaving hammerhead sequence immediately upstream of the RNA sequence of interest. The HH ribozyme is a relatively small structural element ( *50 nt) that catalyzes phosphodiester
123
J Biomol NMR (2010) 46:113125 115
bond hydrolysis specically at its 30-GUC sequence. The HH ribozyme folds into the catalytically active structure immediately upon transcription and self-cleaves in the presence of Mg2?. The desired RNA product contains a 50 hydroxyl group which, as discussed below, can be benecial for subsequent ligation (Fig. 1a) (Birikh et al. 1997). Another advantage of using a 50 hammerhead cassette is that the optimized transcription start sequence (e.g., 50-GG
for the T7 class III promoter) can be incorporated into the ribozyme sequence, rather than the RNA of interest.
Control of 30 end homogeneity
Several methods are presently available for controlling 30 end homogeneity. For smaller RNAs (\60 nt) that can be generated from synthetic templates, non-templated 30
Fig. 1 Method used to produce RNA with homogeneous 50 and 30 ends. a Hammerhead (HH)
cleavage produces homogeneous 50-end; b HDV/
HH ribozyme, RNase P cleavage or 20-O-Me of the last two nucleotides on the DNA template can produce 30-
homogeneous ends. c RNaseH cleavage of the transcribed RNA can produce both homogeneous 50 and 30 ends at the cleaving point. For details see Hartmann et al. (2005)
123
116 J Biomol NMR (2010) 46:113125
nucleotide addition can be inhibited by incorporating two 20-O-methoxyribose-modied nucleotides (commercially available) at the 50-end of the template strand (Kao et al. 1999) (Fig. 1b). For larger RNAs, the DNA template can be modied such that it encodes for a HH ribozyme at the 30-end of the RNA of interest. However, HH ribozymes preferentially cleave at sites that contain a 30-GUC sequence, and this would restrict sequence options for the 30-end of the desired RNA.
The Hepatitis delta virus (HDV) ribozyme has no such sequence requirements and can be used to control 30 end homogeneity of any RNA sequence of interest. The HDV ribozyme is active during transcription in the presence of Mg2?. The product RNA contains a 20,30-cyclic phosphate at its 30 end (Fig. 1b), which can be advantageous for controlling subsequent ligation reactions (see below). A frequently observed problem is that the upstream RNA sequence can interfere with the proper folding of the HDV ribozyme, leading to poor cleavage efciency. This problem can sometimes be overcome by cycling the temperature to refold the RNA, which allows the ribozyme domain to adopt the active structure (at least transiently) for self-cleavage. HDV cleavage efciency can also be improved by addition of Ca2? (rather than Mg2?) as a cofactor (Cerrone-Szakal et al. 2009). An additional approach we have used is to redesign the HDV ribozyme so that it does not form secondary structures with the appended RNA of interest, based on MFOLD predictions (Zuker 2003).
The RNase P ribozyme can also be employed to control 30-end homogeneity. In this approach, a tRNA sequence is appended to the 30-end of the desired RNA sequence.
RNase P recognizes and cleaves the 50-end of the tRNA, affording the product RNA of interest. Like HDV, RNase P does not have specic sequence requirements for residues upstream of the cleavage site. The major advantage of this method over HDV-dependent cleavage is that there is generally no folding inference, due to the fact that tRNAs adopt highly stable, autonomous structures. The product RNAs contain 30-hydroxyl termini (Fig. 1b), in contrast to the 20, 30-cyclic phosphates generated by HDV and HH cleavage, which is suitable for many downstream applications.
Control of both 50 and 30 end homogeneity
RNase H can recognize and bind to DNA/RNA hybrid duplexes and cleave the backbone of the RNA, leaving a 50-phosphate and 30-hydroxyl group (Berkower and Leis 1973) (Fig. 1c). This enzyme can therefore be used to control homogeneity at both the 50 and 30 ends of the target
RNA. When the target RNA and an oligonucleotide
containing a short stretch of DNA anked by 20-O-Me
RNA sequences form a hybrid duplex (Fig. 1c), the site of cleavage is specic but apparently dependent on the commercial source of the RNase H (Inoue et al. 1987; Hart-mann et al. 2005). RNase H can be used to remove non-native 50-elements that contain optimized T7 transcription start sites (Lapham and Crothers 1996).
RNA purication methods
Purication of large quantities of RNAs is typically achieved by denaturing polyacrylamide gel electrophoresis (PAGE; 13 by 16.500 gel box size) (Wyatt et al. 1991). This is a robust method that works well for a wide range of RNA sizes. Nucleotide-level resolution can typically be achieved for RNAs of up to *30 residues. This method can be laborious, and often requires the use of several gels per
NMR sample. The standard procedure includes an initial ethanol precipitation step, which in our hands often leads to the formation of insoluble RNA aggregates. We typically skip this step, and directly add the transcription reaction mixture to the gel. For RNAs that contain self-complementary dimerization elements, best results have been obtained by running the gels in a warm environment (e.g., at *50C in a small enclosed area with a space heater).
This is necessary because RNA fragments resulting from aborted synthesis can associate with the full-length construct of interest if both contain the self-complementary dimerization sequence. Ion-pair reversed-phase HPLC and anion-exchange HPLC have also been employed to purify RNA under denaturing conditions (Anderson et al. 1996; Shields et al. 1999; Azarani and Hecker 2001). These approaches are much less laborious and time-consuming, but the best results have been achieved when applied to relatively small RNAs. Of course, purication under denaturing conditions requires desalting and refolding of the RNA prior to structural studies.
Several additional chromatographic methods have been developed for purifying RNA under non-denaturing conditions, including size exclusion (Lukavsky and Puglisi 2004; Kim et al. 2007; McKenna et al. 2007), DNA-afnity (Cheong et al. 2004), and protein-afnity (Crowe et al. 1994; Batey and Kieft 2007) column chromatography (for a recent review see (Dayie 2008)). An afnity-tag based purication procedure was also developed recently, in which protein-RNA interactions immobilize RNAs carrying both a modied HDV sequence and a 30-afnity tag (Batey and Kieft 2007). After the immobilization, the target RNA is cleaved by the addition of imidazole, which activates the modied ribozyme. This purication approach also affords product RNAs with homogeneous 30-ends.
123
J Biomol NMR (2010) 46:113125 117
Isotope labeling strategies
Residue-specic isotopic labeling has greatly facilitated structural studies of large proteins and protein assemblies by reducing NMR spectral complexity and unfavorable relaxation behavior (Fiaux et al. 2004; Sprangers and Kay 2007). Proteins enriched with 2H, 15N and/or 13C can be directly prepared in E. Coli by commercially available materials (Sprangers and Kay 2007), and site-specically labeled proteins can be synthesized by cell-free methods (Kainosho et al. 2006). A variety of site-specically deuterated and isotopically labeled amino acids are currently available for amino acid-specic labeling. In comparison, relatively few reagents are commercially available for the preparation of isotopically labeled RNAs. Until very recently, it has only been possible to obtain uniformly enriched rNTPs (13C, 15N, 13C/15N, or 2H) from commer
cial sources (and at considerable expense).
Ribose labeling
Ribonucleotide triphosphates that contain 2H and/or 2H,
13C specically labeled ribose moieties can be prepared
enzymatically, but the process is laborious and requires more than a dozen enzymatic reactions (Fig. 2) (Scott et al. 2000). Procedures have been developed by Williamson and co-workers to prepare variety of combinations of site-specically deuterated and/or 13C-enriched NTPs that can subsequently be used for in vitro transcription (Scott et al. 2000). Incorporation of rNTPs with selectively deuterated ribose groups led to dramatic simplication of NMR spectra obtained for a 30 KD (86 nt) GGAA tetra loop receptor complex (Fig. 3), and enabled its structure determination (Davis et al. 2005).
Perdeutarated NTPs
Perdeutarated NTPs are commercially available and can be used in combination with protonated NTPs to enzymatically synthesize RNAs in which specic types of nucleotides are protonated (with the remainder being partially or fully deuterated). The main advantage of this approach is that it allows spectral editing without the signicant signal broadening associated with 13C incorporation. In fact, 1H NMR signals of partially deuterated samples are typically narrower than those observed for fully protonated samples,
Fig. 2 Reaction scheme for the enzymatic conversion of glucose to the four NTPs used to make RNA. Glucose and all intermediates are shown in boldface type, and enzymes denoted in italics. Reprinted with permission from Scottet al. (2000)
123
118 J Biomol NMR (2010) 46:113125
Fig. 3 Relief of spectral crowding by site-specic deuteration. a Secondary structure of a 30 KDa GAAA tetra loopreceptor RNA. The helical regions are shown in black, the tetra loop is shown in red and the receptor is in green. b, c: portions of the 2D NOESY spectra
obtained for the fully-protonated b and selectively deuterated c tetra loop receptor RNA. Reprinted, with permission, from Davis et al. (2005)
due to limited 1H1H spin diffusion. Sequential nucleotides of a given type can be readily assigned via the standard sequential walk strategy (Wthrich 1986). A signicant disadvantage is that samples containing more than one type of protonated NTP may be required in order to make sequential signal assignments for adjacent residues of different types. Fortunately, the C82H deuterons can be exchanged with protons, providing a convenient means of assigning fully protonated nucleotides that are adjacent to C8-protonated, ribose-deuterated purines (Fig. 4). This approach, combined with traditional 3D and 4D 13C-edited NOESY experiments, were used to complete the NMR signal assignment and structure determination of a 101-nt viral RNA encapsidation signal (Fig. 4) (DSouza et al. 2004). Studies of a 30-kDa GAAA tetraloop-receptor complex (Davis et al. 2005) required more sensitive two-dimensional ltered/edited NOESY experiments developed by Feigon and co-workers (Peterson et al. 2004), but even with these experiments many signals were broadened beyond detection.
H8 enrichment of perdeuterated purines
Perdeuterated purine rNTPs that are presently available from commercial sources typically contain a low to modest levels of protonation at the C8 position (ca. 530%). As indicated above, specic protonation at this site can greatly facilitate assignment and analysis of 2D NOESY spectra obtained for partially deuterated RNAs, and a greater level of protonation at this site can be advantageous. It is well
Fig. 4 Simplication of the 2D NOESY spectrum obtained for nucleotide-specic protonated, 2H-labeled RNA. a Secondary structure of the 101-nt core encapsidation signal of the Moloneymurine leukemia virus. b Portion of the 2D NOESY spectrum of the G-protonated, C,U,A-perdeuterated RNA. Breakthrough peaks from a small amount of A-H8 substitution facilitated assignment of the NMR signals. Reprinted, with permission, from DSouza et al. (2004)
123
J Biomol NMR (2010) 46:113125 119
known that purine H8 protons readily exchange with deuterium under basic conditions (Rabi and Fox 1973; Goodman 1974). Triethylamine (TEA) is a suitable reagent for promoting 1H/2H exchange at the C8 position because of its ease of handling (Huang et al. 1997). TEA is volatile and can be removed after 1H/2H exchange simply by lyophilization. In our laboratory, commercially obtained, perdeuterated GTP and ATP are incubated with 15 equivalent TEA in water at 60C for 24 h and for 5 days, respectively, which results in essentially complete exchange of D8 to H8 in both rNTPs with minimal to no rNTP hydrolysis or degradation.
2D NOESY spectra obtained for a 132 nucleotide dimeric RNA prepared with and without nucleotide-specic deuteration are shown in Fig. 5b. The ngerprint region of the fully protonated sample exhibited extreme signal overlap and was unassignable. However, well-resolved and assignable signals were observed for a sample prepared with fully perdeuterated rCTP and rUTP, H8-exchanged/perdeuterated rATP, and fully protonated rGTP (Fig. 5c). Although this approach facilitated assignment of the purine H8 and H10 signals, assignment of the pyrimi-dine aromatic signals was problematic due to the large intrinsic linewidths of the H5 and H6 proton signals and the associated signal overlap.
H6 and H5 enrichment of perdeuterated pyrimidines
For larger RNAs, the 1H NMR signals of the H5 and H6 aromatic protons can be broadened due to 1H1H dipolar coupling. Three-bond 1H1H scalar coupling also contributes to the apparent linewidths of these protons, and it would be advantageous to use samples in which the C5 or C6 protons were selectively substituted by 2H. Unfortunately, procedures have not been reported for specically substituting the H5 or H6 protons of rNTPs by deuterium. Procedures have been reported for specic C5 or C6 deuterium substitutions with pyrimidine 50-monophosphates (Cullis et al. 1995; Rabi and Fox 1973). H5 substitution by deuterium was reportedly achieved at *90% levels, and H6 substitution was achieved at 6575% levels (Huang et al.
1997). In preliminary experiments, our laboratory has had difculties achieving these levels of deuterium exchange without signicant concomitant sample degradation. Preparation of selectively C6-protonated/perdeuterated rNTPs would require several steps, including hydrolysis of commercially available, perdeuterated nucleotides, 1H/2H exchange, removal of degradation contaminants, and rephosphorylation. The eld would clearly benet by the commercial availability of perdeuterated, C6-protonated (or protonated, C5-deuterated) pyrimidine triphosphate
Fig. 5 H8 enrichment of perdeuterated purines simplies the NMR spectrum and enables assignment of larger RNAs.a Predicted secondary structure of stem loops C and D of the Moloneymurine leukemia virus 50-UTR. b 2D NOESY spectrum of the fully protonated SL-CD dimer. c 2D NOESY spectrum of the SL-CD dimer synthesized using fully-protonated GTP, H8-protonated, perdeuterated ATP, perdeuterated CTP and perdeuterated UTP. Black and red lines denote inter-guanosine and adenosine-to guanosine-H10 connectivities, respectively (Miyazaki and Summers, unpublished)
123
120 J Biomol NMR (2010) 46:113125
ribonucleotide. Some of these desired rNTPs may soon be available from commercial sources.
Segmental labeling strategies
One method for overcoming signal overlap problems associated with larger RNAs involves chemically ligating a small, isotopically labeled RNA fragment to a larger, unlabeled (or differentially labeled) fragment. Segmental ligation has been most commonly performed using either T4 DNA ligase or T4 RNA ligase, each of which has advantages and disadvantages. Deoxyribozymes that catalyze RNA ligation with high efciencies and turnover rates have also been developed recently.
T4 DNA ligase
Moore and Sharp rst used T4 DNA ligase to link two RNA strands (Moore and Sharp 1992). T4 DNA ligase requires a DNA splint or cDNA template to align and hold the RNA fragments together (Fig. 6a). The RNA fragments to be ligated need to have correct termini at the junction site (30-OH and 50-monophosphate). Xu et al. used T4 DNA ligase to synthesize two different segmentally 15N labeled samples (Xu and Crothers 1996) corresponding to the 50 half of Caenorhabditis elegans spliced leader RNA. They rst prepared two full-length RNAs by in vitro transcription: one that was fully labeled and one that was unlabeled. RNase H was then used to cleave both RNAs into two fragments, both of which contained homogeneous ends at the cleavage site. The cleaved 50-fragment of the unlabeled
RNA was ligated to the cleaved 30 fragment of the labeled RNA using T4 DNA ligase (yield *1854%). Comparison of the 2D 1H15N HSQC spectra obtained for the uniformly labeled and segmentally labeled samples enabled the investigators to unambiguously distinguish between two competing secondary structure models.
Although DNA ligase has been widely used to incorporate radioactive groups, nucleotide analogs, and cross-linking groups at specic sites (Query et al. 1994; Gozani et al. 1996; Pasman and Garcia-Blanco 1996; Maroney
et al. 2000; Frilander and Steitz 2001), the ligation yield is relatively low and the reaction is slow. As such, this approach is not always ideal for preparing quantities of RNAs needed for structural analysis. In addition, the intrinsic secondary structures of the DNA splint and the RNA fragments to be ligated can interfere with hybridization, resulting in reduced ligation efciencies. The DNA splint length can also affect the ligation yield (Kurschat et al. 2005).
T4 RNA ligase
T4 RNA ligase can also be used to ligate isotopically labeled and unlabeled RNA fragments. Like DNA ligase, T4 RNA ligase requires a 50-monophosphate on the donor fragment and a 30-hydroxyl group on the acceptor element.
Unlike DNA ligase, RNA ligase can ligate single stranded nucleotides at the junction site. (Fig. 6b) It can work effectively both in a templated (Bain and Switzer 1992) and non-templated fashion. For templated ligation, the single-stranded RNA acceptor and donor are brought in close proximity using a DNA template (also called a splint) or tRNA (Wittenberg and Uhlenbeck 1985; Ohtsuki et al. 1996; Ohtsuki et al. 1998; Kim et al. 2002). The template is designed such that it does not base pair with residues near the penultimate nucleotides that are to be ligated (typically 26 residues on both the donor and acceptor strands). Ligation efciency is reduced when the ultimate and penultimate nucleotides of the acceptor molecule are pyrimi-dines (Wittenberg and Uhlenbeck 1985).
Several groups have used T4 RNA ligase for non-templated ligation. Watanabe and co-workers prepared nematode mitochondrial tRNAMet constructs containing one or two isotopically labeled nucleotides. Labeling was achieved by ligating labeled nucleotides to one RNA segment, then appending a second RNA strand (Ohtsuki et al. 1996, 1998). The un-reacted substrate from the rst ligation step was oxidized with periodate to avoid a purication step (Kurata et al. 2003). Based on NMR data obtained for these specically-labeled samples, a tertiary structural model for the tRNA could be constructed (Ohtsuki et al. 1996, 1998).
Fig. 6 Commonly used methods for large-scale RNA ligation. a DNA ligase mediated ligation. b RNA ligase mediated ligation. c Deoxyribozyme mediated RNA ligation. RNA ligase mediated ligation can occur with or without DNA splint templating. c is reproduced from Purtha et al. (2005) with permission
123
J Biomol NMR (2010) 46:113125 121
Puglisi and co-workers used T4 RNA ligase to segmentally label a 100 kDa IRES RNA (Kim et al. 2002). A simple one step reaction was used to prepare the RNA fragments for ligation. For the acceptor, a hammerhead ribozyme was attached to the 50 without the need to treat with calf intestinal alkaline phosphatase to dephosphate. For the donor, they attached a 30-hammerhead sequence, which leaves a 20,30-cyclic phosphate after cleavage, and blocks the 30 terminus from self-ligation. GMP was used to prime the transcription reaction for the donor fragment in order to obtain the appropriate 50-monophosphate. The acceptor and donor
RNAs were then ligated into a 100 kDa RNA by T4 RNA ligase, with ligation yields of up to *50%. NMR data obtained for the ligated construct indicated that domain II of the IRES forms an independent structure that does not interact with other parts of IRES (Kim et al. 2002).
Lukavsky and co-workers developed a more convenient ligation strategy, in which the donor and acceptor RNA fragments are obtained from a single template (Lukavsky and Puglisi 2005; Tzakos et al. 2006, 2007). In the plasmid template, a T7 promoter sequence is followed by a donor sequence and a 30-hammerhead ribozyme (Fig. 7). The 30 ribozyme is connected to a 50 ribozyme sequence via a short, exible linker. The reaction is primed with GMP, and the primary transcript is excised by the HH ribozymes to yield the desired donor fragment with 50-monophosphate and 30-cyclic phosphate. The acceptor fragment contains hydroxyl groups on both termini. With this strategy, four ligation fragments (two enriched with 15N and two with
13C) can be obtained from only two transcription reactions. The method was used to prepare segmentally labeled
samples of a 25 kDa brain cytoplasmic RNA. A potential concern with this approach is that it does not provide a means of controlling 30-terminus run-off transcription, and non-native nucleotides could therefore be incorporated into the middle of the full length RNA.
Our group has used T4 RNA ligase to prepare the segmentally labeled HIV-1 50-UTR. A HDV ribosome sequence was designed into the 30-ends of the acceptor and donor fragments to control the homogeneity and prevent self-ligation. Following HDV cleavage, the acceptor 30-cyclic phosphate was removed using T4 polynucleotide kinase. The ligation yield of using this approach (without a splint) exceeded 95%. High quality 1H,13C-correlated HMQC spectra were obtained for the 712 nucleotide dimeric 50-UTR (Fig. 8), showing clearly that signal assignments and structural studies should be feasible for very large RNAs with appropriate internal dynamics (unpublished results).
Deoxyribozyme-catalyzed synthesis of 30-50 RNA linkages
DNA and RNA ligases do not always provide acceptable yields, and deoxyribozymes have been developed as a potentially attractive alternative for RNA ligation. Deoxyribozymes are DNA enzymes that typically catalyze reactions involving nucleic acid substrates (Hbartner and Silverman 2007). Silverman and co-workers engineered two RNA ligase deoxyribozymes that generate 30-50 linkages rapidly and in high yield (Fig. 6c) (Purtha et al. 2005).
The deoxyribozyme has modest sequence requirements for
Fig. 7 Scheme for preparation of segmentally labeled RNAs from a single template.(a) Plasmid template, in which the 30-hammerhead (HH) and 50-HH were engineered into the donor and acceptor RNAs respectively. (bd): Representative aromatic regions of 1H,13C-TROSY spectra of BC1 DTE RNA recorded with (b) a uniformly 13C-/15 N-labeled sample, (c) a 13C
SI,15N-SII segmentally labeled sample and (d) a 15N-SI,13C-SII segmentally labeled sample. Reproduced from Tzakos et al. (2007) with permission
123
122 J Biomol NMR (2010) 46:113125
Fig. 8 Representative 1H13C HMQC NMR spectrum obtained for the AUG region of the intact, 712 nt dimeric HIV-1 50-UTR. a One of several predicted secondary structures of HIV-1NL4-3 50-UTR; DIS,
dimer initiation site; SD, major splice donor site; AUG, (green, bold), gag start codon. 13C-labeled residues are shown in green. b 1H13C HMQC spectrum obtained for the dimeric HIV-1 50-UTR at low ionic strength (10 mM TrisHCl, pH 7.0) (Lu and Summers, unpublished)
the RNA substrates, and the 50- end of the donor does not require to be monophosphate.
Three fragment RNA ligation
Rader and coworkers found that the DNA splint normally used for DNA ligase can signicantly improve the catalytic efciency of T4 RNA ligase-mediated RNA ligation (Stark et al. 2006). A 128 nt ligated RNA was generated from three RNA fragments using a single template with yields as high as 7080% (5 min reaction time). All three fragments were generated by chemical synthesis, so end heterogeneity was not an issue. Also, because the DNA splint brings the appropriate 50- and 30- ends into close proximity, self-ligation was not a signicant problem. In addition, the
RNA ligase did not exhibit sequence specicity for the ligating substrates. Similar yields were obtained for a two-
step ligation procedure, in which two of the fragments were ligated rst, followed by ligation of the third fragment.
Wijmenga and co-workers also demonstrated that isotopically labeled uridine can be incorporated into the central position of the 20 kDa e-RNA of Duck Hepatitis B virus via three fragment RNA ligation (Nelissen et al. 2008). The optimal ligation site was chosen based on the DINAMelt (Markham and Zuker 2005; Markham and Zuker 2008) predicted tendency of the 30- and 50-ends of the isolated fragments to remain unstructured prior to ligation. Two protocols were evaluated: a two-step protocol that uses T4 DNA ligase and RNA ligase 1, and a one-pot protocol, that uses T4 RNA ligase 1 only. The one-pot protocol gave somewhat lower yields and more side products. This method allowed direct observation of a specic imino proton in the 20 kD aligated RNA (Nelissen et al. 2008).
Conclusions
Although the eld of RNA structural biology has not kept pace with that of proteins, recent advances in nucleic acid molecular biology, coupled with the development of new NMR methodologies, should signicantly facilitate future productivity. Computational tools and reagents for the design, synthesis and ligation of RNA fragments are convenient to use and cost-efcient. New approaches for assigning NMR spectra and for determining RNA structure and dynamics that have been developed over the past two decades are capable of providing mechanistic insights that cannot be obtained by other methods. The stage appears to be nearly set for answering some of the major questions in the expanding eld of RNA biology.
A signicant limitation to studies of larger RNAs by NMR is that the variety of commercially available, isotopically labeled rNTPs is limited, and those that can be purchased are relatively expensive. Aside from reducing the costs, the most immediate need is for pyrimidine rNTPs that are either selectively protonated at C6 and perdeuterated at all (or most) other non-exchangeable CH sites, or selectively deuterated at C5 and protonated at all (or most) other sites. Such reagents would complement the C8-protonated purine rNTPs that can readily be generated from commercial, perdeuterated rNTPs. A second major need is for commercially available, isotopically labeled ribonucleotide phosophoramidites that could be used to chemically synthesize relatively short RNAs with labels at specic positions. Such fragments could then be enzymatically ligated with larger fragments to allow direct observation of specic nucleotides of interest under native-like (functionally active) conditions.
123
J Biomol NMR (2010) 46:113125 123
Open Access This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
References
Aboul-ela F, Karn J, Varani G (1996) Structure of HIV-1 TAR RNA in the absence of ligands reveals a novel conformation of the trinucleotide bulge. Nucleic Acids Res 397:43981
Abramovitz DL, Pyle AM (1997) Remarkable morphological variability of a common RNA folding motif: the GNRA tetraloop-receptor interaction. J Mol Biol 266:493506
Allain FH-T, Varani G (1997) How accurately and precisely can RNA structure be determined by NMR? J Mol Biol 267:338351 Amarasinghe GK, De Guzman RN, Turner RB, Summers MF (2000)
NMR structure of stem loop SL2 of the HIV-1 Y-RNA packaging signal reveals a novel A-U-A base triple platform. J Mol Biol 299:145156Anderson AC, Scaringe SA, Earp BE, Frederick CA (1996) HPLC purication of RNA for crystallography and NMR. RNA 2: 110117Azarani A, Hecker KH (2001) RNA analysis by ion-pair reversed-phase high performance liquid chromatography. Nucleic Acid Res 29:e7Bain JD, Switzer C (1992) Regioselective ligation of oligoribonucleotides using DNA splints. Nucleic Acids Res 20:4372 Bartel DP (2004) MicroRNAs: genomics, biogenesis, mechanism, and function. Cell 116:281297Batey RT, Kieft JS (2007) Improved native afnity purication of
RNA. RNA 13:13841389Batey RT, Inada M, Kujawinski E, Puglisi JD, Williamson JR (1992)
Preparation of isotopically labeled ribonucleotides for multidimensional NMR spectroscopy of RNA. Nucleic Acids Res 20:45154523Batey RT, Battiste JL, Williamson JR (1995) Preparation of isotopically enriched RNAs for heteronuclear NMR. Methods Enzymol 261:300322Berkower I, Leis JH J (1973) Isolation and characterization of an endonuclease from Escherichia coli specic for ribonucleic acid in ribonucleic acid-deoxyribonucleic acid hybrid structures. J Biol Chem 248:59145921Bessonov S, Anokhina M, Will CL, Urlaub H, Luhrmann R (2008)
Isolation of an active step I spliceosome and composition of its RNP core. Nature 452:846850Birikh KR, Heaton PA, Eckstein F (1997) The structure, function and application of the hammerhead enzyme. Eur J Biochem 245:116 Boisvert FM, van Koningsbruggen S, Navascues J, Lamond AI (2007)
The multifunctional nucleolus. Nat Rev Mol Cell Biol 8:574585 Brodersen P, Voinnet O (2006) The diversity of RNA silencing pathways in plants. Trends Genet 22:268280 Cerrone-Szakal AL, Chadalavada DM, Goldend BL, Bevilacqua PC
(2009) Mechanistic characterization of the HDV genomic ribozyme: The cleavage site base pair plays a structural role in facilitating catalysis. RNA 14:17461760Cheong HK, Hwang E, Lee C, Choi BS, Cheong C (2004) Rapid preparation of RNA samples for NMR spectroscopy and X-ray crystallography. Nucleic Acids Res 32:e84Crowe J, Dbeli H, Gentz R, Hochuli E, Stuber D, Henco K (1994)
6xHis-NiNTA chromatography as a superior technique in recombinant protein expression/purication. Methods Mol Biol 31:371387
Cullis PM, Malone ME, Podomore ID, Symons MCR (1995) Site of protonation of one-electron-reduced cytosine and its derivatives in aqueous methanol glasses. J Phys Chem 99:92939298
DSouza V, Dey A, Habib D, Summers MF (2004) NMR structure of the 101 nucleotide core encapsidation signal of the Moloney Murine Leukemia Virus. J Mol Biol 337:427442
Davis JH, Tonelli M, Scott LG, Jaeger L, Williamson JR, Butcher SE
(2005) RNA helical packing in solution: NMR structure of a 30 kDa GAAA tetraloop-receptor complex. J Mol Biol 351: 371382Dayie KT (2008) Key labeling technologies to tackle sizeable problems in RNA structural biology. Int J Mol Sci 9:12141240 Doudna JA, Rath VL (2002) Structure and function of the eukaryotic ribosome: the next frontier. Cell 109:153156Edwards TE, Klein DJ, Ferre-DAmare AR (2007) Riboswitches: small-molecule recognition by gene regulatory RNAs. Curr Opin Struct Biol 17:273279Fiala R, Czernek J, Sklenar V (2000) Transverse relaxation optimized triple-resonance NMR experiments for nucleic acids. J Biomol NMR 16:291302Fiaux J, Bertelsen EB, Horwich AL, Wuthrich K (2004) Uniform and residue-specic 15 N-labeling of proteins on a highly deuterated background. J Biomol NMR 29:289297Frilander MJ, Steitz JA (2001) Dynamic exchanges of RNA interactions leading to catalytic core formation in the U12-dependent spliceosome. Mol Cell 7:217226Goodman L (1974) Chemical synthesis and transformations of nucleosides. In: Tso PO (ed) Basic principles in nucleic acid chemistry. Academic Press, New York, pp 170208Gozani O, Feld R, Reed R (1996) Evidence that sequence-independent binding of highly conserved U2 snRNP proteins upstream of the branch site is required for assembly of spliceosomal complex A. Genes Dev 10:233243Hartmann RK, Bindereif A, Schn A, Westhof E (2005) Hand book of RNA biochemistry. Wiley, VCHHassouna N, Michot B, Bachellerie JP (1984) The complete nucleotide sequence of mouse 28S rRNA gene. Implications for the process of size increase of the large subunit rRNA in higher eukaryotes. Nucleic Acids Res 12:35633583Helm M, Brul H, Gieg R, Florentz C (1999) More mistakes by T7
RNA polymerase at the 5 ends of in vitro-transcribed RNAs. RNA 5:618621Hbartner C, Silverman SK (2007) Review: recent advances in DNA catalysis. Biopolymers. Biopolymers 87:279291Huang XY P, LeProust E, Gao X (1997) An efcient and economic site-specic deuteration strategy for NMR studies of homologous oligonucleotide repeat sequences. Nucleic Acid Res 25:47584763Inoue H, Hayase Y, Iwai S, Ohtsuka E (1987) Sequence-dependent hydrolysis of RNA using modied oligonucleotide splints and RNase H. FEBS Lett 215:327330Kainosho M, Torizawa T, Iwashita Y, Terauchi T, Mei Ono A,
Guntert P (2006) Optimal isotope labelling for NMR protein structure determinations. Nature 440:5257Kao C, Zheng M, Rdisser S (1999) A simple and efcient method to reduce non templated nucleotide addition at the 3 terminus of RNAs transcribed by T7 RNA polymerase. RNA 5:12681272 Kim VN (2005) Small RNAs: classication, biogenesis, and function.
Mol Cells 19:115Kim I et al (1995) Selective deuteration of RNA for NMR signal assignment. Nucleic Acids Symp Ser 34:123124Kim I, Lukavsky PJ, Puglisi JD (2002) NMR study of 100 kDa HCV
IRES RNA using segmental isotope labeling. J Am Chem Soc 124:93389339
123
124 J Biomol NMR (2010) 46:113125
Kim I, McKenna SA, Viani-Puglisi E, Puglisi JD (2007) Rapid purication of RNAs using fast performance liquid chromatography (FPLC). RNA 13:289294
Korostelev A, Noller HF (2007) The ribosome in focus: new structures bring new insights. Trends Biochem Sci 32:434441
Kurata S, Ohtsuki T, Suzuki T, Watanabe K (2003) Quick two-step
RNA ligation employing periodate oxidation. Nucl Acids Res 31:e145Kurschat WCM J, Wombacher R, Helm M (2005) Optimizing splinted ligation of highly structured small RNAs. RNA 11:19091914Lapham J, Crothers DM (1996) RNase H cleavage for processing of in vitro transcribed RNA for NMR studies and RNA ligation. RNA 2:289296Lukavsky PJ, Puglisi JD (2004) Large-scale preparation and purication of polyacrylamide-free RNA oligonucleotides. RNA 10:889893Lukavsky PJ, Puglisi JD (2005) Structure determination of large biological RNAs. Methods Enzymol 394:399415Lukavsky PJ, Kim I, Otto GA, Puglisi JD (2003) Structure of HCV
IRES domain II determined by NMR. Nature Struct Biol 10:10331038Markham NR, Zuker M (2005) DINAMelt web server for nucleic acid melting prediction. Nucleic Acids Res 33:W577W581 Markham NR, Zuker M (2008) UNAFold: software for nucleic acid folding and hybridization. Methods Mol Biol 453:331 Maroney PA, Romfo CM, Nilsen TW (2000) Functional recognition of 50 splice site by U4/U6.U5 tri-snRNP denes a novel ATP-dependent step in early spliceosome assembly. Mol Cell 6: 317328McKenna SA et al (2007) Purication and characterization of transcribed RNAs using gel ltration chromatography. Nat Protoc 2:32703277Michnicka MJ, Harper JW, King GC (1993) Selective isotopic enrichment of synthetic RNA: application to the HIV-1 TAR element. Biochemistry 32:395400Milligan JF, Uhlenbeck OC (1989) Synthesis of small RNAs using T7
RNA polymerase. Meth Enzymol 180:5162Milligan JF, Groebe DR, Witherell GW, Uhlenbeck OC (1987)
Oligoribonucleotide synthesis using T7 RNA polymerase and synthetic DNA templates. Nucleic Acids Res 15:87839798 Moore MJ, Sharp PA (1992) Site-specic modication of pre-mRNA: the 2-hydroxyl groups at the splice sites. Science 256:992997 Nelissen FHT, van Gammeren AJ, Tessari M, Girard FC, Heus HA,
Wijmenga SS (2008) Multiple segmental and selective isotope labeling of large RNA for NMR structural studies. Nucl Acids Res 36:e89Nikonowicz EP, Pardi A (1992) Three-dimensional heteronuclear
NMR studies of RNA. Nature 355:184186
Ohtsuki T, Kawai G, Watanabe Y, Kita K, Nishikawa K, Watanabe K
(1996) Preparation of biologically active Ascaris suum mitochondrial tRNAMet with a TV-replacement loop by ligation of chemically synthesized RNA fragments. Nucl Acids Res 24:662667Ohtsuki T, Kawai G, Watanabe K (1998) Stable isotope-edited NMR analysis of Ascaris suum mitochondrial tRNAMet having a TV-replacement loop. J Biochem 124:2834Pasman Z, Garcia-Blanco MA (1996) The 50 and 30 splice sites come together via a three dimensional diffusion mechanism. Nucleic
Acids Res 24:16381645Pervushin K, Riek R, Wider G, Wthrich K (1997) Attenuated T2 relaxation by mutual cancellation of dipoledipole coupling and
chemical shift anisotropy indicates an avenue to NMR structures of very large biological macromolecules in solution. Proc Natl Acad Sci USA 94:1236612371Peterson RD, Theimer CA, Wu H, Feigon J (2004) New applications of 2D ltered/edited NOESY for assignment and structure elucidation of RNA and RNAprotein complexes. J Biomol NMR 28:5967Pleiss JA, Derrick ML, Uhlenbeck OC (1998) T7 RNA polymerase produces 5 end heterogeneity during in vitro transcription from certain templates. RNA 4:13131317Pokrovskaya ID, Gurevich VV (1994) In vitro transcription: preparative RNA yields in analytical scale reactions. Anal Biochem 220:420423Ponting CP, Oliver PL, Reik W (2009) Evolution and functions of long no ncoding RNAs. Cell 136:629641Purtha WE, Coppins RL, Smalley MK, Silverman SK (2005) General deoxyribozyme-catalyzed synthesis of native 30-50 RNA linkages. J Am Chem Soc 127:1312413125
Query CC, Moore MJ, Sharp PA (1994) Branch nucleophile selection in pre-mRNA splicing: evidence for the bulged duplex model. Genes Dev 8:587597
Rabi JA, Fox JJ (1973) Nucleosides. LXXIX. Facile base-catalyzed hydrogen isotope labeling at position 6 of pyrimidine nucleo-sides. J Am Chem Soc 95:16281632
Riek R, Wider G, Pervushin K, Wuthrich K (1999) Polarization transfer by cross-correlated relaxation in solution NMR with very large molecules. Proc Natl Acad Sci USA 96:49184923
Scaringe SA, Wincott FE, Caruther MH (1998) Novel RNA synthesis method using 50-O-Silyl-20-O-orthoester protecting groups. J Am
Chem Soc 120:1182011821Scott LG, Tolbert TJ, Williamson JR (2000) Preparation of specically
2H- and 13C-labeledribonucleotides. MethodsEnzymol 317:1838 Shah SA, Brunger AT (1999) The 1.8 crystal structure of a statically disordered 17 base pair RNA duplex: Principles of RNA crystal packing and its effect on nucleic acid structure. J Mol Biol 285:15771588Shields TP, Mollova E, Ste marie L, Hansen MR, Pardi A (1999)
High-performance liquid chromatography purication of homogeneous-length RNA produced by trans cleavage with a hammerhead ribozyme. RNA 5:12591267Sprangers R, Kay LE (2007) Quantitative dynamics and binding studies of the 20s proteasome by NMR. Nature 445:618622 Spriggs S, Garyu L, Connor R, Summers MF (2008) Potential intraand intermolecular interactions involving the Unique-50 Region of the HIV-1 50-UTR. Biochemistry 46:1306413073Stark MR, Pleiss JA, Deras M, Scaringe SA, Rader SD (2006) An
RNA ligase-mediated method for the efcient creation of large, synthetic RNAs. RNA 12:20142019Steitz TA (2008) A structural understanding of the dynamic ribosome machine. Nat Rev Mol Cell Biol 9:242253Tzakos AG, Grace CRR, Lukavsky PJ, Riek R (2006) NMR techniques for very large proteins and RNAs in solution. Annu Rev Biophys Biomol Str 35:319342Tzakos AG, Easton LE, Lukavsky PJ (2007) Preparation of large
RNA oligonucleotides with complementary isotope-labeled segments for NMR structural studies. Nat Protoc 2:21392147 Usman N, Ogilvie KK, Jiang MY, Cedergren RJ (1987) The automated chemical synthesis of long oligoribuncleotides using 20-O-silylated ribonucleoside 30-O-phosphoramidites on a controlled-pore glass support: synthesis of a 43-nucleotide sequence similar to the 3-half molecule of an Escherichia coli formylmethionine tRNA. J Am Chem Soc 109:78457854
123
J Biomol NMR (2010) 46:113125 125
Wakeman CA, Winkler WC, Dann CE III (2007) Structural features of metabolite-sensing ribo switches. Trends Biochem Sci 32:415424
Wittenberg WL, Uhlenbeck OC (1985) Specic replacement of functional groups of uridine-33 in yeast phenylalanine transfer ribonucleic acid. Biochemistry 24:27052712
Wthrich K (1986) NMR of proteins and nucleic acids. Wiley,
New YorkWyatt JR, Chastain M, Puglisi JD (1991) Synthesis and purication of large amounts of RNA oligonucleotides. BioTechniques 11: 764769
Xu JL J, Crothers DM (1996) Determining RNA solution structure by segmental isotopic labeling and NMR: application to Caenorhabditis elegans spliced leader RNA 1. Proc Natl Acad Sci USA 93:4448
Zuker M (2003) Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 31:34063415
123
Springer Science+Business Media B.V. 2010