ARTICLE
Received 25 Nov 2016 | Accepted 25 Apr 2017 | Published 6 Jun 2017
DOI: 10.1038/ncomms15741 OPEN
Architecture of the RNA polymerase II-Paf1C-TFIIS transcription elongation complex
Youwei Xu1, Carrie Bernecky1, Chung-Tien Lee2,3, Kerstin C. Maier1, Bjrn Schwalb1, Dimitry Tegunov1, Jrgen M. Plitzko4, Henning Urlaub2,3 & Patrick Cramer1
The conserved polymerase-associated factor 1 complex (Paf1C) plays multiple roles in chromatin transcription and genomic regulation. Paf1C comprises the ve subunits Paf1, Leo1, Ctr9, Cdc73 and Rtf1, and binds to the RNA polymerase II (Pol II) transcription elongation complex (EC). Here we report the reconstitution of Paf1C from Saccharomyces cerevisiae, and a structural analysis of Paf1C bound to a Pol II EC containing the elongation factor TFIIS. Cryo-electron microscopy and crosslinking data reveal that Paf1C is highly mobile and extends over the outer Pol II surface from the Rpb2 to the Rpb3 subunit. The Paf1-Leo1 heterodimer and Cdc73 form opposite ends of Paf1C, whereas Ctr9 bridges between them. Consistent with the structural observations, the initiation factor TFIIF impairs Paf1C binding to Pol II, whereas the elongation factor TFIIS enhances it. We further show that Paf1C is globally required for normal mRNA transcription in yeast. These results provide a three-dimensional framework for further analysis of Paf1C function in transcription through chromatin.
1 Department of Molecular Biology, Max-Planck-Institute for Biophysical Chemistry, Max Planck Society, Am Fassberg 11, Gttingen 37077, Germany.
2 Bioanalytical Mass Spectrometry, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, Gttingen 37077, Germany. 3 Bioanalytics Group, Institute for Clinical Chemistry, University Medical Center, Gttingen, Robert-Koch-Strasse 40, Gttingen 37075, Germany. 4 Department of Molecular Structural Biology, Max-Planck-Institute for Biochemistry, Am Klopferspitz 18, Martinsried 82152, Germany. Correspondence and requests for materials should be addressed to P.C. (email: mailto:[email protected]
Web End [email protected] ).
NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 1
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741
The polymerase-associated factor 1 (Paf1) complex (Paf1C) is a general and conserved RNA polymerase II (Pol II) transcription elongation factor1. Paf1C was rst identied
through its copurication with Pol II from yeast cells2,3. Yeast Paf1C comprises the subunits Paf1, Leo1, Ctr9, Cdc73 and Rtf1 (refs 1,4). Paf1C shows genetic interactions with the yeast transcription elongation factors Spt4-Spt5 and Spt16-Pob3, the counterparts of human DSIF and FACT, respectively5. Paf1C also associates with transcribed regions in vivo6, suggesting that it is a transcription elongation factor. Paf1C subunits are required for efcient transcription in vivo7.
Paf1C has multiple roles in chromatin transcription. Yeast Paf1C functions in methylation of histone H3 by Set1 and Dot1, thus linking transcription elongation to chromatin methylation8. In yeast, Rtf1 binds the chromatin remodeller Chd1 (ref. 9) and is required for ubiquitination of histone H2B (refs 1012) and histone methylation13. Drosophila Rtf1 also functions in histone methylation, gene expression and Notch signalling14. Human Paf1C binds to histone H3 tails with dimethylated histone H3 arginine17 (ref. 15).
Paf1C also has important functions that are not directly related to chromatin. Paf1C is required for cotranscriptional RNA 30-processing16,17. Human Cdc73 physically interacts with protein complexes required for 30-processing18. Paf1C also represses cryptic transcription19 and is implicated in cellular differentiation20 and human cancer2123. Paf1C represses gene silencing by small RNAs in Schizosaccharomyces pombe24, and
Leo1 is involved in heterochromatin spreading25. Paf1C has recently been found to regulate Pol II phosphorylation, promoterproximal pausing and release into gene bodies26,27. Paf1C is also involved in the resolution of transcription replication conicts28.
Paf1C is generally recruited to transcribed units, apparently entering the Pol II elongation complex (EC) downstream of the transcription start site, and exiting at the polyadenylation (pA) site29. There is evidence that Paf1C recruitment to Pol II requires direct contacts with Pol II and additional contacts with Pol II-associated factors. Paf1C recruitment in vivo requires the Bur1-Bur2 kinase30, and is aided by Spt4 (ref. 31). Paf1C and its Cdc73 subunit bind the phosphorylated C-terminal domain of Pol II and the phosphorylated C-terminal repeat region (CTR) of Spt5, which is also a general elongation factor32. A Plus3 domain in Rtf1 can bind the Spt5 CTR33. Rtf1 is however not stably associated with Paf1C in all species and is not required for Paf1C recruitment in human cells, where it has non-overlapping functions34. In ssion yeast, Rtf1 also has other functions35. The C-terminal GTPase-like domain of Cdc73 is important for chromatin association of Paf1C (ref. 36). Leo1 is also important for Paf1C recruitment and binds RNA37.
Structural studies have revealed that Paf1C is a modular and exible complex with several structured regions. The crystal structure of a complex of regions in Paf1 and Leo1 revealed antiparallel b-sheets for heterodimerization38. This study also showed that Ctr9 is a scaffold for Paf1C onto which the Paf1-
a
371 462 534 731 801 874 913
33 620 1015 1059
56 183
89 251 338
Ctr9
1077
150 232
H. sapiens PDB: 4M6T
17 90
362 441
230
Paf1 Leo1 binding
445
Cdc73
GTPase-like domain
393
S. cerevisiae PDB: 3V46
2 184
74 139
238
Plus3 Domain
370
442
Rtf1
HMD
Paf1C interaction
558
TPR
Low complexity
S. cerevisiae PDB: 5E8B
H. sapiens PDB: 4L1P
16 46
Leo1
164 259 336
H. sapiens PDB: 4M6T
464
Paf1 binding
b
c d
Absorbance (mAU)
150
100
50
0
kDa 140 115
80
65
50
30
25
15
40
kDa 140
115
80
65
50
30
25
15
40
A280 nm A260 nm
Ctr9
Rtf1
Leo1
Paf1
Cdc73
Ctr9-C913
Leo1
Rtf1
Paf1
Cdc73
Paf1C
5
10 20
15
Elution volume (ml)
Paf1C
Paf1C-Ctr9-C913
Figure 1 | Preparation of recombinant Paf1C. (a) Predicted domain structure of the ve Paf1C subunits from S. cerevisiae. The predicted tetratrico peptide repeat motifs (yellow) and low complexity regions (green) were dened using SMART42. Structurally resolved regions are indicated with black lines. The Plus3 domain in Rtf1 (red) binds single-stranded DNA40 and the Spt5 CTR41. A Paf1C-interacting region (orange) in Rtf1 (ref. 13) was conrmed in this work. Histone modication domain (HMD, pink) in Rtf1 directly contacts ubiquitin conjugase Rad6 in vivo44. (b) Size-exclusion chromatogram (Superose 6 10/300; GE Healthcare) of recombinant full-length Paf1C indicates a pure complex free of nucleic acids. (c) Coomassie-stained SDSPAGE analysis of recombinant ve-subunit Paf1C after size-exclusion chromatography reveals the presence of all ve subunits in apparently stoichiometric amounts. Molecular weight markers are indicated on the left. The identity of subunits was conrmed by mass spectrometry. (d) Same as c but for puried complex Paf1C-Ctr9-DC913.
2 NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741 ARTICLE
Leo1 heterodimer and Cdc73 assemble. The structure of the GTPase-like domain in the C-terminal region of yeast Cdc73 was also solved36,39. In addition, structures were reported for the Plus3 domain of human Rtf1 (refs 40,41), and for the Plus3 domain in complex with a phosphorylated Spt5 CTR repeat41. There is however no structural information on Ctr9, the largest Paf1C subunit.
Despite its critical role in chromatin transcription and transcription-coupled events, neither the structure of Paf1C nor its location on the Pol II EC are known. Here we reconstitute yeast Paf1C from recombinant subunits, assemble a complex of Paf1C with the Pol II-TFIIS EC and locate Paf1C on the polymerase surface with a combination of electron microscopy (EM) and crosslinking. From this work emerges a trilobal architecture of Paf1C and the location of Paf1C on the elongating polymerase, providing a basis for a mechanistic analysis of transcription elongation through chromatin.
ResultsRecombinant Paf1C. Paf1C from the yeast Saccharomyces cerevisiae (S. cerevisiae) has a molecular weight of 340 kDa and consists of ve subunits, Paf1, Leo1, Ctr9, Cdc73 and Rtf1 (Fig. 1a). Analysis of the primary and secondary structures of these subunits42,43 predicted known structured domains36,3841,44 and eight tetratrico peptide repeats in Ctr9 (Fig. 1a). In addition, multiple regions of low sequence complexity were detected in all Paf1C subunits except Cdc73, consistent with the known exibility of Paf1C (Supplementary Fig. 1a). To study Paf1C structurally, we established preparation of pure recombinant Paf1C after coexpression of its subunits in Escherichia coli (E. coli) (see Fig. 1b,c and Methods section). The ve Paf1C subunits were coexpressed from three vectors in E. coli and the complex puried using chromatographic methods (see Fig. 1b and Methods section). We obtained B0.4 mg of pure Paf1C per liter of E. coli cell culture. Puried Paf1C contained all ve subunits in apparently stoichiometric amounts (Fig. 1c).
Paf1C structural core and exible periphery. To map a stable structural core of Paf1C, we identied a Ctr9 variant that lacked 164 amino acids at its C terminus (Ctr9-DC913), but was sufcient to form a stable complex with the other subunits after coexpression (Paf1C-Ctr9-DC913) (Fig. 1d). We also obtained a recombinant Paf1C variant that additionally lacked subunit Rtf1 (Paf1C-Ctr9-DC913-DRtf1; Supplementary Fig. 1b). To further delineate the core of Paf1C, we used limited proteolysis and Edman sequencing (see Supplementary Fig. 1a and Methods section) and designed deletion mutants of the remaining three subunits. The limited proteolysis also showed that Leo1 and Paf1 are unstable. Combining iterative truncations of Leo1, Paf1 and Ctr9-DC-913, coexpression and copurication, we dened a structural core of Paf1C that additionally lacked the C-terminal region of Paf1 and both terminal regions of Leo1.
The resulting Paf1C core contained Ctr9-DC913, Paf1-DC361, Leo1-DN118-DC376 and Cdc73 (Supplementary Fig. 1c). The dened Paf1C core comprises 1,926 amino-acid residues out of a total of 2,937 residues, that is, 65% of the total protein mass. Thus, one-third of Paf1C forms exible regions on the periphery of the complex. The C-terminal region of Rtf1 (Rtf1-DN441) can interact with the Paf1C core, indicating that the C-terminal region of Paf1 and both terminal regions of Leo1 are not crucial for Rtf1 binding (Supplementary Fig. 1d). In addition, we expressed and puried the Ctr9-Paf1-Leo1 trimer (Supplementary Fig. 1e). Taken together, these results show that Paf1C contains a structured core and several exible regions around its periphery.
TFIIS enhances Pol II-Paf1C binding. To investigate whether puried Paf1C binds to yeast Pol II in vitro, we performed pull-down assays using biotinylated Pol II coupled to streptavidin beads (see Methods section). Paf1C interacted with Pol II, albeit in a substoichiometric manner (Supplementary Fig. 2a, lane 4). Considering the known interaction of human Paf1C with TFIIS45, we tested whether TFIIS enhances Pol II-Paf1C binding. Indeed, TFIIS strongly enhanced binding of Paf1C to Pol II in our assay (Supplementary Fig. 2a, lane 5). In contrast, an N-terminal deletion variant of TFIIS lacking 130 residues (TFIIS-DN130)
could not enhance Pol II-Paf1C interaction (Supplementary Fig. 2b, lanes 7 and 8). These results suggest that the N-terminal domain I of TFIIS, which is mobile in the Pol II-TFIIS structure46, interacts with Paf1C to increase its afnity to PolII. In addition, Paf1C lacking the C-terminal domain of Ctr9 and Rtf1 retained Pol II binding (Supplementary Fig. 2c, lanes 6 and 8) and TFIIS-enhanced binding (Supplementary Fig. 2c, lanes 7 and 9). These results show that recombinant Paf1C interacts with Pol II in vitro, that this interaction does not require Rtf1 and that it is strongly enhanced by TFIIS (Supplementary Fig. 2d).
Cryo-EM analysis. To determine the structure of Paf1C bound to a Pol II EC, we prepared a complex containing the complete 12-subunit Pol II47, a DNA-RNA scaffold48, full-length TFIIS carrying two point mutations that render it inactive in RNA cleavage stimulation46,49 and recombinant Paf1C lacking the Ctr9 C terminus (Paf1C-Ctr9-DC913) (see Methods section), which is more stable than Paf1C with full-length Ctr9. The complex contained all 18 polypeptides in apparently stoichiometric amounts after sucrose gradient ultracentrifugation (Fig. 2a). After gradient xation50 (Supplementary Fig. 3a), the sample contained single particles, as revealed by EM in negative stain (Supplementary Fig. 3b), and was subjected to cryo-EM data collection (see Methods section). A total of 2,741 micrographs were collected on a Titan Krios equipped with a Gatan K2 direct electron detection device (Supplementary Fig. 3c).
From the cryo-EM micrographs, a total of 947,597 particles were extracted. Particle images were processed and subjected to reference-free two-dimensional (2D) classication in RELION51, yielding 370,841 particles after clearance. In several 2D classes, smeared densities for the peripheral Paf1C and the Pol II stalk subcomplex Rpb4-Rpb7 were observed on the polymerase surface (Fig. 2b). As a reference, we used a reconstruction of bovine Pol II48 ltered to low resolution (50 ). After particle polishing in RELION, three-dimensional (3D) classication was used to separate out 158,422 particles that contained Pol II-TFIIS EC that lost Paf1C.
We rened the remaining 212,419 Paf1C containing particles, which led to a structure at 5.5 global resolution that contained the Pol II-TFIIS EC that closely resembled the crystal structure (Supplementary Fig. 4). Although the polymerase, TFIIS and the nucleic acids are very well dened, the local resolution of different parts of Paf1C is lower, B1020 , indicating exibility. A globally ltered 18 resolution reconstruction of the Pol II-Paf1C-TFIIS EC complex shows the location of Paf1C on the Pol II-TFIIS EC (Fig. 2c). To improve Paf1C density, we performed sub-3D classication. This resulted in three different Pol II-Paf1C-TFIIS EC reconstructions that we refer to as A, B and C (see Supplementary Fig. 4 and Methods section). These reconstructions showed average resolutions of 5.7, 5.9 and 6.2 , respectively (Fourier shell correlation (FSC) 0.143; see Supplementary
Fig. 5a,b and Methods section) and enabled rigid-body tting of the Pol II-TFIIS crystal structure49,52 and the DNA-RNA model48 (see Supplementary Fig. 5c,d and Methods section). The reconstructions did not reveal structural changes within the Pol II-TFIIS complex compared to the known crystal structure.
NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 3
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741
a
b c
kDa
Paf1C
Paf1C
Rpb1
Rpb3TFIIS (DE-AA)
Rpb4
Rpb5
Rpb6/7
Rpb9/11 Rpb10/12
Stalk
*
140
65
50
40
30
25
15
Rpb2
Leo1
Paf1
Cdc73
Paf1C
115
80
Ctr9-C913
External 2
External 2
Rtf1
Lobe
Lobe
Protrusion
Protrusion
TFIIS
TFIIS
Rpb3
Rpb3
Rpb11
Rpb11
Downstream DNA
Downstream DNA
Upstream DNA
Upstream DNA
RNA exit
RNA exit
Pol II
Pol II
Top view
Stalk
Stalk
Rpb8
10 nm
Figure 2 | Reconstitution and cryo-EM analysis of Pol II-Paf1C-TFIIS EC. (a) SDSPAGE analysis (Coomassie staining) of Pol II-Paf1C-TFIIS EC after sucrose gradient ultracentrifugation. Subunits from Pol II and Paf1C are labelled in black and pink, respectively. The inactive variant of TFIIS is labelled in green. The asterisk marks a degradation product of Rpb1. (b) Ten representative reference-free 2D cryo-EM class averages reveal the known exibility of the Pol II stalk subcomplex Rpb4/7 (white arrow) and density for Paf1C (pink arrow). Scale bar, 10 nm. (c) Location of Paf1C on the Pol II-TFIIS EC.
A reconstruction of Pol II-Paf1C-TFIIS EC was rened from 212,419 particles after the rst 3D classication and globally ltered to 18 resolution. Density for the Pol II-TFIIS EC subcomplex is shown in grey. Continuous Paf1C density at the outer surface of Pol II is shown in pink. The structure of the Pol II-TFIIS EC is tted and shown in ribbon representation (non-template DNA, cyan; template DNA, blue; RNA, red, TFIIS, green; Rpb2 lobe, yellow; Rpb2 protrusion, orange; Rpb2 external 2, green; Rpb3, red; Rpb11, yellow).
Architecture of Pol II-Paf1C-TFIIS EC. Reconstructions A, B and C revealed different portions of Paf1C on the Pol II surface that we refer to as Paf1C parts A, B and C, respectively (Fig. 3a). The observation that these three parts of Paf1C could only be observed in different cryo-EM reconstructions shows that their relative orientation is exible. Owing to this mobility on the Pol II surface, the local resolution of Paf1C was low (Supplementary Fig. 5eg). When contoured at a level where no noise peaks are observed (Fig. 3b), the volumes of parts A, B and C could account for B50, 45 and 90 kDa of folded protein (see Methods section). Because part C overlaps with part A by B10 kDa, a total of B175 kDa of Paf1C is visible by cryo-EM, corresponding to B50% of the total mass of Paf1C, suggesting that most of the structured core (65%) is revealed (Supplementary Fig. 1). We combined reconstructions AC into a composite map that is further interpreted below.
Paf1C binds to the outer surface of Pol II on the Rpb2 side, spanning from the Rpb2 external 2 and protrusion domains to subunit Rpb3 and reaching near the rim of the funnel opposite the active centre cleft (Fig. 3a,b). Paf1C part A contacts mainly the external 2 domain and likely the protrusion domain of Rpb2. Part B contacts the protrusion and part C contacts Rpb11 and Rpb3 around its helix a3. Part B also bridges between parts A and
C, which otherwise do not contact each other. Taken together, cryo-EM revealed that the Paf1C core may be divided into three parts, with parts A and C contacting Pol II at the Rpb2 lobe and external 2 domains and Rpb3, respectively, and part B being highly mobile and bridging between parts A and C.
Crosslinking analysis. To conrm the contacts of Paf1C with Pol II, and to assign Paf1C subunits to the three parts of Paf1C, we
subjected the Pol II-Paf1C-TFIIS EC to chemical crosslinking coupled to mass spectrometry (XL-MS; see Methods section). We used the crosslinking reagent bis(sulfosuccinimidyl) suberate (BS3), which reacts with lysine side chains and N termini. In each crosslinking data set, two replicates were measured. All the spectra of the crosslinks were ltered at a false discovery rate cutoff of 1% and the maximum score value (negative logarithm of E-value) 45. Each crosslink was required to have a minimal spectral count of 2 in the two replicates of each data set (see Methods section). We obtained 239 unique intersubunit cross-links (Fig. 4a, Table 1 and Supplementary Data 1). We rst mapped crosslinks between Pol II subunits and between Pol II and TFIIS onto the Pol II-TFIIS EC crystal structure49. The CaCa distances between crosslinked residues were within the allowed distances of 30 for 35 out of 38 intersubunit pairs. The other three crosslinks that showed longer CaCa distances fell in regions with structural exibility and higher crystallographic B-factors (Fig. 4b,c and Supplementary Data 1). In addition, we observed that Leo1 was crosslinked to two lysine residues (K78, K80) in domain I of TFIIS, consistent with our pull-down assays (see above). These results provided an internal, positive control for our crosslinking approach, and demonstrated the proximity of Paf1C to TFIIS.
Paf1C subcomplex architecture. The data also contained 136 intersubunit crosslinks within Paf1C (Fig. 4a, Table 1 and Supplementary Data 1). These included 25 crosslinks between Paf1 and Leo1, consistent with the known dimerization of Paf1 and Leo1 subunits. Crosslinks of the Leo1 residue K225 to Paf1 residues K167 and K170 can be rationalized with the Paf1-Leo1
4 NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741 ARTICLE
a
Paf1
Paf1C - A Paf1
Paf1C - B Paf1
Paf1C - C
Pol II
Pol II
Stalk
Stalk
Pol II
Pol II
Stalk
Stalk
Pol II
Pol II
Stalk
Stalk
Top viewPol II-Paf1C-TFIIS-A
Top viewPol II-Paf1C-TFIIS-B
Top viewPol II-Paf1C-TFIIS-C
b
Paf1C
Paf1C
B
Paf1C
Paf1C AB
A
C Protrusion
Protrusion
External 2
External 2
90
135
External 2
External 2
Lobe
Lobe
Downstream DNA
Downstream DNA
Protrusion
Protrusion
Downstream DNA
Downstream DNA
Stalk
Stalk
C
TFIIS
TFIIS
Rpb3
Rpb3
Rpb11
Rpb11
Pol II
Pol II
Upstream DNA
Upstream DNA RNA exit
RNA exit
Rpb3
Rpb3
Pol II
Pol II
Rpb11
Rpb11
TFIIS
TFIIS
Stalk
Stalk
Front-side view
Top view
Figure 3 | Cryo-EM structure of Pol II-Paf1C-TFIIS EC. (a) Three reconstructions of Pol II-Paf1C-TFIIS EC after sub-3D classication. Density for the Pol IITFIIS EC subcomplex is shown as a silver semitransparent surface. The electron densities for Paf1C are shown as a solid surface coloured in pink, magenta and purple for Paf1C parts A, B and C, respectively. All three maps are unsharpened and ltered according to local resolution. The tted structure of the Pol II-TFIIS EC is shown in ribbon representation (non-template DNA, cyan; template DNA, blue; RNA, red, TFIIS, green; Rpb2 lobe, yellow; Rpb2 protrusion, orange; Rpb2 external 2, green; Rpb3, red; Rpb11, yellow). (b) Two views of the composited EM density for the Pol II-Paf1C-TFIIS EC from three reconstructions in a and the tted structure of the Pol II-TFIIS EC. These densities of locally ltered Pol II and parts AC are shown at the same threshold. The view on the left is from the top80, and the view on the right is related by two rotations as indicated. The same threshold level in chimera was used to contour densities.
X-ray structure. A total of 27 crosslinks were observed between the Paf1-Leo1 dimer and Ctr9, revealing their proximity. The subassembly Paf1-Leo1-Ctr9 crosslinked to the C-terminal regions of Cdc73 and Rtf1. Consistent with this, the C-terminal region of Rtf1 sufces to bind Paf1C, because its coexpression with the Paf1C core resulted in a stable complex (Supplementary Fig. 1c). We also crosslinked free Paf1C and the obtained cross-linking pattern was similar (Supplementary Data 1).
We also obtained 49 intersubunit crosslinks between Paf1C and Pol II (Fig. 4a, Table 1, Supplementary Fig. 6 and Supplementary Data 1). Forty-four of these crosslinks map to the Rpb2 lobe, protrusion and external 2 domains, and subunits Rpb3 and Rpb11 (Supplementary Data 1), consistent with the location of Paf1C observed by cryo-EM. The lysine-rich region (K298K369) of Leo1
gives rise to many crosslinks to the protrusion, lobe and external 2 of Rpb2, which together with extensive crosslinking between the Paf1-Leo1 dimer and the Pol II protrusion and lobe show that the Paf1-Leo1 heterodimer resides in part A. Crosslinks between Cdc73 and Pol II subunits Rpb3 and Rpb11 reveal that Cdc73 resides in part C. Indeed, part C contains a globular density that contacts Rpb3 and corresponds in size to the crystal structure of the C-terminal GTPase-like domain of Cdc73 (ref. 36) (Fig. 4d). We could t this crystal structure to the globular density53 such that the crosslinks between Cdc73 and Pol II were explained. Three crosslinks occurred between the tted structure and Rpb3, whereas Cdc73 residues in the adjacent N-terminal tail crosslinked to the nearby Rpb11 subunit (Fig. 4d). Finally, Ctr9 can be assigned to part B because it crosslinks to both Paf1-Leo1 in
NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 5
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741
a
Rpb1
1 200 400 600 800 1,000 1,200 1,400 1,600 1,733
Rpb2
1 200 400 600 800 1,000 1,224
1 200 400 600 800 913
Paf1
1 200 445
Ctr9-C913
Ctr9-C913
1 200 400 600 800 913
Paf11 200 445
Rtf1
Cdc73
1 200 393
Protrusion Lobe External 2
Rpb11 1 120
Cdc73
1 200 393
1 200 400 558
1 200 400 464
Rpb3 1 200 318
Leo1
Rtf1
1 200 400 558
Leo11 200 400 464
TFIIS 1 200 309
TPR Low complexity Paf1C interaction Paf1 binding
Plus3 domain GTPase-like domain
Leo1 binding
b
BS3 allowed distance
C distances ()
d
18
Paf1C
Paf1C
A C
B
16
14
Number of crosslinks
12
10
8
Cdc73
Cdc73
6
4
External 2
External 2
K263
K263
2
* *
Lobe
Lobe
Protrusion
Protrusion
05 15
10 20 25 35 40
30
218
218
TFIIS
TFIIS
205
205
194
194
K451
K451
c
Template DNA
Template DNA Non-template DNA
Non-template DNA
Downstream DNA
Downstream DNA
Jaw_up
Jaw_up
Flexible N-terminal
RNA
RNA
RNA exit
RNA exit
Active site
Active site
Funnel
Funnel
Linker
Linker
Domain II
Domain II
III
TFIIS
TFIIS
Pol II
Pol II
Stalk
Stalk
Pol II
Pol II
Top view
Figure 4 | Crosslinking analysis of Pol II-Paf1C-TFIIS EC. (a) Network diagrams of pairwise crosslinks (black lines) obtained after incubation of Pol IIPaf1C-TFIIS EC with BS3. The left diagram depicts intersubunit crosslinks between Pol II and Paf1C, Pol II and TFIIS, and between Paf1C and TFIIS. The right diagram depicts crosslinks between Paf1C subunits. Crosslinks between Pol II subunits were excluded for clarity. Paf1C subunits coloured as in Fig. 1. In addition, the regions of protrusion, lobe and external 2 in Rpb2 are indicated with orange, yellow and green bars below Rpb2 diagram, respectively. (b) CaCa distance distribution of observed Pol II-Pol II and Pol II-TFIIS crosslinks for lysine residues that were resolved in the Pol II-TFIIS crystal structure.
These crosslinks serve as a positive control for our crosslinking data. Asterisks indicate crosslinks to exible protein regions. (c) Crosslinks between Pol II and TFIIS (black lines) are consistent with the known Pol II-TFIIS complex crystal structure. Colour coding as in Fig. 3. (d) Position of the Cdc73 C-terminal GTPase-like domain (purple) contacting Rpb3. The crystal structure of the GTPase-like domain was docked into density of the correct size and oriented according to crosslinks to Rpb3 and Rpb11 (black lines). Paf1C density is shown in mesh in the same orientation as in Fig. 3. View is from the top. The distances between site of crosslinking in Cdc73 and Rpb3/Rpb11 are revealed. The distance between K263 of Cdc73 and K199 of Rpb1 in this t is 18.5 . the distances between K236 and K385 in the Cdc73 GTPase-like domain structure and K55 of Rpb11 are around 40 due to the exibility of both terminal regions of the GTPase-like domain.
part A and Cdc73 in part C. Thus, Ctr9 bridges between Paf1-Leo1 and Cdc73, which both contact Pol II. Ctr9 did not crosslink efciently to Pol II, consistent with the bridging density B that forms only limited contacts with the Pol II surface.
Combining the crosslinking data with the cryo-EM results and our mapping of exible regions in Paf1C enabled us to derive the
overall architecture of Paf1C bound to Pol II. The sizes of the EM densities can be reconciled with the Paf1C subunit molecular weights as follows. Part A reveals only about half of the Paf1-Leo1 heterodimer (B50 out of 105 kDa), the other half is exible. Part
B reects about half of the mass (B45 kDa) of Ctr9-DC913 (105 kDa), whereas the other half of Ctr9 resides in part C, which
6 NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741 ARTICLE
Table 1 | Summary of BS3 crosslinks in Pol II-Paf1C-TFIIS EC.
Protein 1 Protein 2 Unique crosslinks BS3 Paf1C Pol II 49
Ctr9 Rpb1 1
Rpb3 1 Rtf1 Rpb2 1 Leo1 Rpb1 2
Rpb2 35 Paf1 Rpb2 3 Cdc73 Rpb11 4
Rpb2 1 Rpb3 1 TFIIS Paf1C 2
Leo1 2 Paf1C Paf1C 136 Ctr9 Rtf1 36
Leo1 7 Paf1 20 Cdc73 19 Rtf1 Leo1 6
Paf1 19 Leo1 Paf1 25
Cdc73 4 Pol II Pol II 47 Pol II TFIIS 5
Total 239
BS3, bis(sulfosuccinimidyl) suberate; EC, elongation complex; Paf1C, polymerase-associated factor 1 complex; Pol II, polymerase II.
The number of interprotein crosslinks are given. Interprotein crosslinks between Pol II subunits are not shown here for clarity. Each unique crosslink is identied by more than one crosslink. BS3 is a crosslinker with 11.4 spacer arm and reacts efciently with amino groups (lysine and N terminus).
is much larger than the Cdc73 GTPase-like domain alone (B20 kDa). Finally, Rtf1 (65 kDa) remains exible.
Competitive Pol II binding of Paf1C and TFIIF. Superposition of the Pol II-Paf1C-TFIIS EC onto our previously reported Pol II initiation complex structure54 reveals a clash between Paf1C part A and the dimerization domain of the initiation factor TFIIF bound to the outer lobe of Pol II at an overlapping position (Fig. 5a). This suggested that Paf1C and TFIIF may bind to Pol II in a competitive manner. To investigate this, we performed binding assays using analytical sucrose gradient ultracentrifugation (see Methods section). We incubated preformed Pol II-Paf1C-TFIIS complex with a 1.8-fold molar excess of TFIIF and separated the resulting complexes on a 1030% sucrose gradient. Based on subsequent SDS polyacrylamide gel electrophoresis (SDSPAGE) analysis, we estimate that TFIIF replaced Paf1C in about half of the Pol II complexes (Fig. 5b). When we instead incubated preformed Pol II-TFIIF complex with Paf1C and TFIIS, some TFIIF was displaced, TFIIS was able to join the complex and Paf1C bound to a low extent (Fig. 5c). Analysis of the relevant gradient fractions by native PAGE shows that TFIIF can almost completely compete off the Pol II-bound Paf1C, but not the other way around (Fig. 5d,e). These results indicate that Paf1C and TFIIF bind to Pol II in a competitive manner, consistent with the cryo-EM reconstruction.
Paf1C is globally required for Pol II transcription. Thus far, it had not been demonstrated whether Paf1C is a general transcription factor in yeast. To investigate whether Paf1C is globally required for transcription in vivo, or whether it has gene-specic functions, we monitored RNA synthesis with 4-thiouracil-Seq (4tU-Seq) in yeast5557. This method uses metabolic RNA labelling with 4tU coupled to strand-specic sequencing of labelled, newly synthesized RNA. We used 4tU-Seq to monitor
RNA synthesis in strains lacking either Paf1 (paf1D) or Rtf1 (rtf1D), and compared this to a wild-type strain using global normalization based on spike-in probes58. Two biological replicates were measured. We found that knockout of Paf1 or Rtf1 led to a strong, global decrease in synthesis of mRNA transcripts (Fig. 6a), showing that Paf1C is globally required for normal Pol II transcription. Furthermore, the analysis of signicantly downregulated transcripts in paf1D and rtf1D strains with a Venn diagram shows strong overlap between paf1D and rtf1D strains, but that the rtf1D strain shows 35% more downregulated RNAs, consistent with the idea that Rtf1 has Paf1C-independent roles34 (Fig. 6b).
DiscussionHere, we use a combination of cryo-EM and crosslinking to show that Paf1C forms a tripartite architecture. Parts A and C contain Paf1-Leo1 and Cdc73, respectively, and contact Pol II near the Rpb2 external 2 domain and Rpb3, respectively. Ctr9 forms part B and extends into part C, forming a exible bridge between parts A and C that is not directly contacting Pol II. These results explain published data. Ctr9 has been predicted from biochemical data to form a scaffold that bridges between Paf1-Leo1 and Cdc73 (ref. 38). The Cdc73 GTPase-like domain binds Pol II, consistent with the requirement for this domain to recruit Paf1C to chromatin36. The C-terminal region of Cdc73 (201393 amino acids) has been reported to bind the Pol II C-terminal domain35, but this region could not be localized here. Most of Rtf1 is exible, including the Plus3 domain that binds the phosphorylated CTR of Spt5 (ref. 41), which is also exibly connected to the EC. Our results also explain how Paf1C and TFIIS can cooperatively bind to Pol II45. Pull-down assays show that TFIIS facilitates Pol II-Paf1C binding via its domain I. Correspondingly, we observe crosslinking between Leo1 and TFIIS domain I, and this interaction occurs near the jaw-lobe module of Pol II. We arrive at an overall architecture of the Paf1C on the Pol II EC that explains known interactions (Fig. 7).
NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 7
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741
a
Paf1C
Paf1C
B C
A
Tfg2
Tfg2
External 2
External 2
Tfg1
Tfg1
Protrusion
Protrusion
Part A
Part A
Tfg1
Tfg1
Lobe
Lobe
Downstream DNA
Downstream DNA
Tfg2
Tfg2
TFIIS
TFIIS
Rpb3-Rpb11
Rpb3-Rpb11
External 2
External 2
RNA exit
RNA exit
Lobe
Lobe
Upstream DNA
Upstream DNA
Close-up view
Pol II
Pol II
Stalk
Stalk
Top view
b
c
Pol II-Paf1C-TFIIS + TFIIF
Pol II-TFIIF
+ buffer control
Pol II-TFIIF
+ Paf1C-TFIIS
Fractions
F13 F14
Fractions
F13 F14
Fractions
F13 F14
Fractions
F13 F14
MW
Pol II-Paf1C-TFIIS + buffer control
F10 F11 F12
F10 F11 F12
MW
F10 F11 F12
F10 F11 F12
kDa
Rpb1
Rpb3TFIIS (DE-AA)
kDa
Rpb1
Rpb3TFIIS (DE-AA)
140
65
50
40
Rpb2
Leo1
Paf1
Cdc73
140
65
50
40
Rpb2
Leo1
Paf1
Cdc73
Ctr9C913
Rtf1
Ctr9C913
Rtf1
115
80
115
80
Tfg1
Tfg2
Tfg1
Tfg2
d
e
Pol II-Paf1C-TFIIS + TFIIF
Pol II-TFIIF
+ buffer control
Pol II-TFIIF
+ Paf1C-TFIIS
MW
720
kDa
Pol II-Paf1C-TFIIS + buffer control
F11 F12
F11 F12
kDa
F11
F10 F12
F11
F10 F12
1,048
1,048
Pol II-Paf1C-TFIIS
Pol II-TFIIF
Pol II
MW
720
Pol II-TFIIF
Pol II
Paf1C
Figure 5 | Paf1C and TFIIF compete for Pol II binding. (a) Positioning the initiation factor TFIIF dimerization domain as in the Pol II initiation complex (PDB accession: 5FYW)54 onto the Pol II-Paf1C-TFIIS EC cryo-EM reconstruction results in a clash of TFIIF with Paf1C (top view). Colours as in Fig. 3. The TFIIF dimerization domain is coloured in medium purple (TFIIF subunit Tfg1) and plum (Tfg2). A close-up view of the clashing region is shown on the right. (b,c) SDSPAGE analysis of TFIIF and Paf1C competitive binding to Pol II after analytical sucrose gradient ultracentrifugation. In each panel, the gels on the left depict complexes without competitor protein. The gels on the right represent complexes formed after competitor addition. In b, a 1.8-fold molar excess of TFIIF was added to preformed Pol II-Paf1C-TFIIS EC. In c, Paf1C-TFIIS was added to preformed Pol II-TFIIF complex. (d,e) Native PAGE analysis of indicated fractions from the analysis in b. In our minimal in vitro system, TFIIF is able to largely compete out Paf1C. The identity of the proteins in the indicated complexes was conrmed by mass spectrometry.
8 NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741 ARTICLE
a
1.5
Expression fold change (read counts)
WT rtf1
paf1
5,000
4tU-Seq Coverage
4,000
1.0
3,000
2,000
1,000
0.5
0
Gene body
TSS pA
0.0
Genomic position scaled and aligned at TSS and pA of transcripts
paf1/WT rtf1/WT
Figure 6 | Paf1C is globally required for Pol II transcription in yeast. (a) Left panel: coverage of newly synthesized RNA measured by 4tU-Seq in WT (solid blue line), rtf1D (dashed red line) and paf1D yeast cells (dashed green line). 4tU-Seq signals were globally normalized using spike-in probes. The gene body is dened as the region spanning from the transcription start site (TSS) to the pA site and is depicted as a grey box. Right panel: box plot showing the expression fold change of read counts for paf1D and rtf1D versus the wild-type (WT) strain. Dashed lines indicate median fold (0.52 for paf1D and 0.23 for rtf1D). The majority of mRNAs have signals below 1 (indicated by the solid black line). mRNA transcripts that are downregulated make up 85% for paf1D and 94% for rtf1D of all mRNAs with a coverage of at least 2 in the WT strain. (b) Analysis of signicantly downregulated transcripts in paf1D and rtf1D strains using a Venn diagram. Most of the downregulated transcripts in paf1D (4,607 out of 4,733) strain are also affected by the rtf1D mutation, although 2,640 transcripts are uniquely downregulated in the rtf1D strain.
We further show that Paf1C is generally required for transcription in vivo, and thus our results elucidate aspects of the general transcription cycle. In particular, they provide insights into the transition from transcription initiation to elongation, when initiation factors are replaced by elongation factors on the Pol II surface29,59,60. We suggest that in the initiation complex, Paf1C cannot bind Pol II because the site for binding part A is occupied by TFIIF, which binds the lobe. Similarly, TFIIE blocks the site for binding the elongation factor Spt4-Spt5 on the clamp61,62. These observations explain why Paf1C and Spt4-Spt5 bind the EC only upon disassembly of the initiation complex. These elongation factors appear to bind Pol II weakly in isolation, but interactions between them32,33 and between Paf1C and TFIIS45 may enhance their polymerase association. We note that the second contact of Paf1C with Pol II, formed between part C and Rpb3, does not overlap with known factor positions on Pol II. This contact may therefore help to retain Paf1C in the EC when TFIIF reassembles during elongation63.
Despite our success in obtaining structural information on the highly exible EC containing Paf1C, the mechanisms Paf1C uses to facilitate chromatin transcription remain unclear. Insights into the mechanisms of chromatin transcription will only come from a structural analysis of the interface of Paf1C with nucleosomes and other chromatin-associated factors. This poses a highly
formidable challenge for the future. In the meantime, our results provide the overall Paf1C architecture and location on the polymerase, elucidate the molecular basis for transcription factor exchange during the initiationelongation transition, and provide a framework for further dissection of the multiple functions of Paf1C in transcription.
Methods
Paf1C expression and purication. The gene sequences encoding for Ctr9, Rtf1 and Cdc73 were amplied from the S. cerevisiae genomic DNA by polymerase chain reaction (PCR). Open reading frames of Paf1 and Leo1 were chemically synthesized by GeneArt (Thermo Fisher Scientic) to optimize codon usage for more efcient bacterial expression. Full-length Ctr9 was cloned into pET24b (Novagen) resulting in a non-cleavable hexahistidine tag fused to the Ctr9 carboxyl terminus. The PCR products of full-length Rtf1 or Cdc73 were cloned into an in-house modied version of the pET21b vector (Novagen). Ribosome binding sequence-Cdc73, which contains an ribosome binding sequence on the amino terminus of Cdc73 was inserted sequentially after Rtf1. Additionally, Cdc73 was cloned into multiple cloning sites 2 of pETDuet-1 (Novagen) for the coexpression of a four-subunit Paf1 complex lacking Rtf1 (Paf1C-DRtf1). Leo1 and Paf1 were
PCR amplied and cloned into two multiple cloning sites of pCDFDuet-1 vector (Novagen) separately and sequentially. The ve-subunit Paf1C and its variants were heterologously coexpressed in E. coli BL21 CodonPlus(DE3)RIL cells (Stratagene). Cultures of bacteria transformed with pET24b-Ctr9 and pCDFDuet-1-Leo1-Paf1 were grown in LuriaBertani medium at 37 C to an OD600 nm of B0.8.
The temperature was reduced to 18 C, and protein overexpression was induced by addition of 1 mM isopropyl-b-D-thiogalactoside and with continued growth at
NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 9
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741
Paf1C
Paf1C
C
N
Figure 7 | Model of the Pol II EC with bound elongation factors. Summary of our current understanding of elongation factor location and interactions on the Pol II EC surface. In addition to direct Pol II interactions by domains in TFIIS, Spt5 and Paf1C, elongation factors contain many exible domains and regions that interact. In particular, the Paf1-Leo1 heterodimer may contact the N-terminal domain I of TFIIS and the C-terminal region of Rtf1 is anchored to Paf1C mainly by Paf1 and Ctr9 subunits shown in XL-MS, whereas its Plus3 domain binds the exible CTR of Spt5. The N-terminal tail of Rtf1 is important for recruiting the chromatin remodeller Chd1 (ref. 9). Dashed lines in this model indicate exible regions.
18 C overnight. The same strategy was used for the coexpression of Rtf1 and Cdc73.
Cells were collected and colysed by sonication in buffer A (50 mM Tris (pH8.7), 600 mM potassium acetate (KOAC), 2 mM dithiothreitol (DTT), 2 mM MgCl2, 10 mM imidazole, 10 mM ZnCl2) containing a 1:100 dilution of protease inhibitor cocktail (1 mM leupeptin, 2 mM pepstatin A, 100 mM phenylmethylsulfonyl uoride, 280 mM benzamidine), 1,000 U benzonase and0.4 mg ml 1 DNaseI. The extract was cleared by centrifugation (20 000g, 30 min) and the supernatant was loaded onto a 2-ml Ni-NTA agarose bead column (Qiagen), equilibrated in buffer A. The column was washed extensively with buffer A containing 20 mM imidazole. The complex was eluted with buffer A containing 150 mM imidazole. The eluted protein was diluted 10-fold and was further puried by cation exchange chromatography using a 1-ml HiTrap SP HP column (GE Healthcare). The column was equilibrated in buffer B (50 mM Tris (pH 8.7),70 mM KOAC, 2 mM MgCl2, 2 mM DTT, 10 mM ZnCl2) and proteins were eluted with a linear gradient from 70 mM to 2 M KOAC in buffer B. Fractions containing the protein of interest were concentrated and loaded onto a Superose 6 10/300 (GE Healthcare) size-exclusion chromatography column equilibrated with buffer C (10 mM Tris (pH 8.5), 200 mM NaCl, 2 mM MgCl2, 2 mM DTT, 10 mM ZnCl2) or buffer D (10 mM HEPES (pH 7.5), 100 mM NaCl, 2 mM MgCl2, 2 mM DTT,
10 mM ZnCl2). The protein complex was concentrated by centrifugation in Amicon Ultra 4-ml concentrators (molecular weight cutoff 50 kDa; Millipore) to
3 mg ml 1. Protein was aliquoted, ash frozen in liquid nitrogen and stored at
80 C. The similar strategy was used for purifying the ve-subunit Paf1C variants.
The four-subunit Paf1 complex (Paf1C-DRtf1) and its variants were expressed and puried as described for the ve-subunit complex (Paf1C), except that we cotransformed pET24b-Ctr9, pCDFDuet-1-Leo1-Paf1 and pETDuet-1-Cdc73 plasmids together in E. coli BL21 CodonPlus (DE3) RIL cells. All the components were separated by SDSPAGE and conrmed by mass spectrometry.
Limited proteolysis and Edman sequencing. Limited proteolysis experiments were performed in buffer C (10 mM Tris (pH 8.5), 200 mM NaCl, 2 mM MgCl2,
2 mM DTT, 10 mM ZnCl2) containing 1.6 mg ml 1 Paf1C and different concentration of respective protease. A measure of 0.040.4 ng ml 1 subtilisin,110 ng ml 1 trypsin and 2 ng ml 1 ArgC were incubated with Paf1C at room temperature for 2 min, 2 min and a 520 min time course, respectively. The reactions were stopped using SDS loading buffer. Samples were analysed by SDS
PAGE. The separated limited proteolysis products on the SDS gel were blotted onto a polyvinylidene diuoride membrane. The membrane was stained with Ponceau S before the fragments were excised and sequenced using a Procise cLC (Applied Biosystems, Foster City, CA, USA).
Pull-down assay. Pol II containing a biotin acceptor peptide at the N terminus of Rpb3 was puried. Enzymatic biotinylation was carried out in vitro in 100 ml reactions containing 124 mg of Pol II, 8 mg BirA, 2 mM ATP and 100 mM biotin supplemented with dilution buffer (20 mM HEPES (pH 7.5), 16 mM MgCl2, 10 mM
ZnCl2, 10 mM DTT). The mixture was incubated for 2 h in a thermomixer at 20 C and 600 r.p.m. Buffer exchange to Pol II buffer (5 mM HEPES (pH 7.5), 10 mM ZnCl2, 40 mM (NH4)2SO4, 10 mM DTT) was carried out using Micro Bio-SpinTM
Chromatography columns (Bio-Rad). In each pull-down assay, 5.8 mg biotinylated Pol II was immobilized on 20 ml Dynabeads M-280 streptavidin resin (Thermo Fisher Scientic), equilibrated in buffer P (50 mM HEPES (pH 7.5), 0.1% NP-40, 150 mM KOAC, 2 mM DTT). Fivefold molar excess of puried Paf1C or/and TFIIS were incubated with immobilized Pol II or control resin at 4 C for 1 h. Beads were washed ve times. Input and the bound proteins were subjected to SDSPAGE analysis. Relative binding fold changes between Pol II and Paf1 complexes when TFIIS or TFIIS-DN130 presents were quantied using Image J. The amount of Leo1 (Paf1C subunit) is quantied relative to the amount of Pol II (subunit Rpb4) in each lane. The fold change is set to 1 in cases lacking TFIIS (representing unchanged amounts). Error bars indicate the s.e.m. from three replicates.
EC preparation. The nucleic acid scaffold (Integrated DNA Technologies) used for transcribing mammalian RNA Pol II48, which contains an 11 nucleotide mismatch transcription bubble and 20 nucleotide RNA (bubble-RNA) was used to assemble Pol II-Paf1C-TFIIS EC (template DNA sequence 50-AAGCTCAAGTACTTA
AGCCTGGTCATTACTAGTACTGCC-30, non-template DNA sequence 50-GGCAGTACTAGTAAACTAGTATTGAAAGTACTTGAG CTT-30 and RNA sequence 50-UAUAUGCAUAAAGACCAGGC-30). Equimolar amounts of DNA and RNA were mixed and annealed in a Biometra T3000 Thermocycler that heated to 95 C and cooled in 1 C increments every 30 s until 10 C were reached. The Pol II-bubble-RNA was assembled by incubating 250 pmol puried Pol II with equimolar bubble-RNA in Pol II buffer for 10 min at 25 C. A 1.8-fold molar excess of Paf1C-Ctr9-DC913 and a 1.8-fold molar excess of TFIIS (DE-AA)-inactive mutant were incubated with Pol II-bubble-RNA in assembly buffer D (10 mM
10 NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741 ARTICLE
HEPES (pH 7.5), 100 mM NaCl, 2 mM MgCl2, 2 mM DTT, 10 mM ZnCl2) for15 min at 20 C in a 65 ml reaction volume. We then centrifuged the reaction for 10 min at 4 C at 15,000 r.p.m. and kept the supernatant for size-exclusion chromatography or gradient xation (GraFix) before XL-MS or EM, respectively.
Crosslinking and mass spectrometry analysis. The assembled Pol II-Paf1CTFIIS complex was injected onto size-exclusion chromatography to obtain a homogeneous complex. The fractions containing target complex were collected and crosslinked at various concentrations of BS3 (Thermo Fisher Scientic) to determine empirically the optimal reaction conditions. The best condition, 0.5 mM BS3, was sufcient to convert most of individual component into a high-molecular-weight band in SDSPAGE and was chosen for nal sample preparation. The Pol II-Paf1C-TFIIS at a concentration of 425 mg ml 1 was crosslinked with 0.5 mM
BS3 and incubated for 30 min at 30 C. The reaction was quenched by adding 50 mM ammonium bicarbonate. The crosslinked sample was repuried by size-exclusion chromatography on a Superose 6 PC 3.2/300 column (GE Healthcare) equilibrated in buffer D (10 mM HEPES (pH 7.5), 100 mM NaCl, 2 mM MgCl2,
2 mM DTT, 10 mM ZnCl2). Crosslinked sample was digested with trypsin.
Crosslinked peptides were enriched and divided into two parts. Both halves of the sample were measured on an Orbitrap Fusion LC-MS/MS instrumentation platform (Thermo Fisher Scientic) and the data sets were analysed with pLink1.23 (ref. 64) against a database containing the sequences of the proteins components in the complex separately. An initial false discovery rate cutoff of 1% was set. E-value was calculated in this processing. To visualize this score better, the negative logarithm of E-value was used. The nal set of crosslinks were required to satisfy 3 criteria: (1) appeared in both replicates; (2) the max score value from each data set was higher than 5; (3) each crosslink must have a minimal spectral count of 2 in each data set. The nal result was subsequently visualized using the xiNET online server65. The same strategy was used for Paf1C XL-MS, except that the samples were measured on an AB Sciex Triple-ToF instrument (AB SCIEX).
Gradient xation. To reconstitute a homogeneous Pol II-Paf1C-TFIIS EC, the sucrose GraFix was carried out as described50. Each sucrose gradient for GraFix was generated by mixing equal volumes of light solution (10 mM HEPES (pH 7.5), 100 mM NaCl, 2 mM DTT, 2 mM MgCl2, 10 mM ZnCl2, 10% (w/v) sucrose) and heavy solution 1 (10 mM HEPES (pH 7.5), 100 mM NaCl, 2 mM DTT, 2 mM MgCl2, 10 mM ZnCl2, 30% (w/v) sucrose, 0.075% (w/v) glutaraldehyde) using a gradient mixer (Gradient Master 108; BioComp Instruments). This resulted in a dual gradient of 1030% sucrose and 00.075% glutaraldehyde in an 11 60 mm2
ultracentrifuge tube (Beckman Coulter). Next, 60 ml of the in vitro reconstituted Pol II-Paf1C-TFIIS EC were applied on top of the gradient. After ultracentrifugation at 32,000 r.p.m. in a SW60 swinging bucket rotor (Beckman Coulter) for 16 h at 4 C, 200 ml fractions of the gradient were collected by pipetting carefully from top to bottom of the tube. Parallel sucrose gradient fractions of samples applied to gradients either containing or lacking glutaraldehyde showed the same sedimentation prole when analysed by SDSPAGE. The crosslinking reaction was quenched by adding 0.5 M (pH 7.8) aspartate to a nal concentration of 12.5 mM. Micro spin chromatography columns (Bio-Rad) were used for buffer exchange to remove sucrose, glutaraldehyde and aspartate. The samples were concentrated using a GE concentrator (molecular weight cutoff 100 kDa; GE Healthcare) and
immediately used for cryo-EM.
Negative stain EM. Pol II-Paf1C-TFIIS EC was prepared as described above. Negative staining of the specimen was carried out as described48. Micrographs were obtained using an FEI Philips CM-200 at 160 kV with a original magnication of
88 000 (corresponding to a calibrated sampling of 2.51 per physical pixel) and a defocus B2.0 mm.
Cryo-EM specimen preparation and data acquisition. An FEI Vitrobot Mark IV plunger (FEI) was used for preparation of frozen-hydrated specimens. Four microlitres of sample was placed onto Quantifoil Cu R3.5/1 and Cu R2/1 glow-discharged 200 mesh holey carbon grids, which were then blotted for 8.5 s with blot force 13 to remove the excess solution before they were ash frozen in liquid ethane. The Vitrobot chamber was operated at constant 4 C and 100% humidity during blotting. The grids were transferred and stored in liquid nitrogen before data acquisition.
Two cryo-EM data sets were acquired on a 300 keV FEI Titan Krios electron microscope equipped with a K2 Summit direct electron counting camera (Gatan) positioned post a GIF Quantum energy lter (Gatan) to increase the signal-to-noise ratio. Automated data collection was carried out using the TOM toolbox66. Movie images were recorded at a nominal magnication of 37,000 (corresponding to a
calibrated sampling of 1.35 per physical pixel) in super-resolution mode, thus yielding a pixel size of 0.675 per pixel. For the rst data set, two movies were acquired in each hole and a total of 595 movie stacks with a defocus range of 0.7
to 4.2 mm were collected from Quantifoil Cu R3.5/1 grids at a dose rate of 7.6
electrons per pixel per second. Each movie encompassed a total dose of B33 electrons per 2 with a total exposure time of 10.8 s fractionated into 27 frames. Each frame had an exposure time of 0.4 s. For the second data set collected from the Quantifoil Cu R2/1 grids, one movie was acquired in each hole and 2,146 movie
stacks were recorded with an exposure time of 12 s fractionated into 30 frames, a dose rate of 4.2 electrons per pixel per second. Each frame had an exposuretime of 0.4 s, resulting in a total accumulated dose of B28 electrons per 2 per stack. Defocus values ranged from 0.6 to 4.2 mm. Movie stacks from two data
sets were aligned and binned as previously described with the frame-based motion-correction algorithm to generate drift-corrected micrographs for further processing67, except images were not partitioned into quadrants.
Cryo-EM image processing. Contrast transfer function was estimated using CTFFIND3 (ref. 68) and CTFFIND4 (ref. 69) for the data sets from Quantifoil Cu R3.5/1 grids (R3.5) and Quantifoil Cu R2/1 grids (R2), respectively. For the R3.5 data set, 123 aligned micrographs were excluded because of contaminations or bad ice quality. After removing these micrographs, we used e2boxer.py from EMAN2 package70 to semiautomatically pick 84,362 particles with a box size of 2402 pixels from the remaining micrographs. 2D reference-free classication within RELION1.4 (ref. 51) was used to remove micelles or other false positives. After this step 79,024 particles remained. We deleted 512 aligned micrographs from the R2 data set and semiautomatically picked 21,301 particles using e2boxer.py from EMAN2 package with a box size of 2402 pixels.
Reference-free 2D classes were generated, and nine representative classes were low-pass ltered to 20 and used as templates for autopicking71. The resulting 863,235 particles were screened manually and subjected to reference-free 2D classication, yielding 291,817 particles. We then merged particles from two datasets. 370,841 particles were used as an input for the subsequent 3D reconstruction performed with RELION 1.4 (ref. 51), unless noted otherwise.
A published reconstruction of bovine Pol II48 ltered to 50 was used as an initial reference for 3D renement. The aligned particles were subjected to particle polishing using RELION 1.4 (ref. 72) to reduce the noise and correct the local motion and radiation damage.
3D classications were carried out without image alignment. The rst 3D classication was performed to separate out particles lacking the Paf1C density. This led to the dismissal of 158,422 particles. We rened the remaining 212,419 Paf1C-containing particles. Gold-standard FSC was calculated after the 3D renement in RELION. A 5.5 average resolution map was observed at the gold-standard FSC criteria of 0.143. In the second step, a soft mask encompassing Paf1C was generated using the volume erase option in UCSF Chimera53 and RELION51. After 3D classication within this mask, we obtained three Pol II-Paf1C-TFIIS maps resolving different parts of Paf1C density (parts A, B and C). The individual classes were autorened using the respective masks. The complex containing part A was rened from 114,672 particles to an average resolution of 5.7 . The complex containing part B was rened from 54,722 particles to an average resolution of5.9 . The complex containing part C was rened from 43,025 particles to an average resolution of 6.2 . All three maps were unsharpened and ltered according to local resolution73. All structural gures were generated using UCSF Chimera53 and PyMOL (Schrdinger LLC.).
Structural modelling. The X-ray crystallographic structure of the Pol II-TFIIS complex (PDB entry: 3PO3; ref. 49) was used as the starting reference model and placed into the cryo-EM map using UCSF Chimera53. Amino acids 6889 and 132168 of the protrusion domain of Rpb2 were replaced by the coordinates from the PDB entry 5C4J52. Template DNA (nucleotides 133), non-template DNA (nucleotides 739) and RNA (nucleotides 720) were derived from the PDB entry 5FLM (ref. 48). The models were tted as rigid bodies into the density map using Fit in Map in UCSF Chimera53. The C-terminal GTPase-like domain of Cdc73 (PDB entry: 3V46) was tted into the map of the complex containing part C using UCSF Chimera53 and adjusted in COOT74 based on the XL-MS results. We also calculated the approximate molecular weight of the density, which is present in addition to Pol II, with Chimera using TFIIS as a reference53.
Analytical sucrose gradient ultracentrifugation. To investigate the binding of TFIIF and Paf1C to Pol II, analytical sucrose gradient ultracentrifugation was carried out. A sucrose gradient was generated by mixing equal volumes of light solution (10 mM HEPES (pH 7.5), 100 mM NaCl, 2 mM DTT, 2 mM MgCl2, 10 mM
ZnCl2, 10% (w/v) sucrose) and heavy solution 2 (10 mM HEPES (pH 7.5), 100 mM NaCl, 2 mM DTT, 2 mM MgCl2, 10 mM ZnCl2, 30% (w/v) sucrose) using a gradient mixer (Gradient Master 108; BioComp Instruments).
Reconstituted Pol II-Paf1C-TFIIS EC was divided into portions. The rst portion was incubated with a 1.8-fold molar excess of TFIIF54, while the other portion was incubated with buffer D (10 mM HEPES (pH 7.5), 100 mM NaCl,2 mM MgCl2, 2 mM DTT, 10 mM ZnCl2) as a control. Next, the reconstituted complexes were applied on top of the gradient. After ultracentrifugation at 32,000 r.p.m. in a SW60 swinging bucket rotor (Beckman Coulter) for 16 h at 4 C, 200 ml fractions were collected by pipetting carefully from top to bottom of the tube before analysing them by SDSPAGE and native PAGE. The assay was repeated but starting from a Pol II-TFIIF complex and then incubating with excess Paf1CTFIIS.
4tU-Seq data collection. 4tU labelling of cellular RNA was performed as described75 with minor changes. Forty microlitres of each replicate culture were
NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 11
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741
used for metabolic RNA labelling. The strains used in this study paf1D (MATa his3D1 leu2D0 met15D0 ura3D0 paf1::kanMX) and rtf1D (MATa his3D1 leu2D0 met15D0 ura3D0 rtf1::kanMX) were purchased from Euroscarf. Yeast cells were grown in yeast extract peptone dextrose medium overnight, diluted to an OD600 nm
of 0.1 and grown to an OD600 nm of 0.8. 4tU (Sigma; 2 M in dimethylsulfoxide) was added to the media at a nal concentration of 5 mM, and cells were collected after 6 min of labelling by centrifugation at 2,465g and 30 C for 1 min. The supernatant was discarded and the pellet resuspended in RNAlater solution (Ambion/Applied Biosystems) and then ash frozen in liquid nitrogen. Cell number was determined from an aliquot with a Cellometer N10 (Nexus) cell counter. Total RNA was extracted with phenol chloroform. RNA spike-ins were added to cell pellets at the rst step of RNA purication58. Briey, six articial RNAs (spike-ins: ERCC-00043, ERCC-00170, ERCC-00136, ERCC-00145, ERCC-00092 and ERCC-00002) from the ERCC RNA spike-in mix (Life Technologies) were mixed in equal amount to all samples. Each spike-in was subjected to in vitro transcription beforehand with the Megascript T7 Transcription Kit (Life Technologies) with either 1:10 4sUTP:UTP ratio for spike-ins ERCC-00043, ERCC-00136 and ERCC-00092 or only UTP for spike-ins ERCC-00170, ERCC-00145 and ERCC-00002; resulting in labelled and non-labelled spike-ins, respectively. The amount of spike-ins was adjusted to the cell number of each sample (120 ng of spike-in mix to 2.5 108 cells for all
samples). Labelled RNA was chemically biotinylated and puried using strepatavidin-coated magnetic beads as described76. Sequencing libraries were prepared according to the manufacturers recommendations using the Ovation Universal RNA-Seq System (NuGEN). Libraries were quantied with Qubit 1.0. Libraries were pooled and sequenced on an Illumina HiSeq 2500 sequencer.
4tU-Seq data analysis. Data analysis was performed as described57, with minor modications. Briey, paired-end 50 bp reads with additional 6 bp of barcodes were obtained for labelled RNA. Reads were demultiplexed and aligned to theS. cerevisiae genome (sacCer3, version 64.2.1) using STAR (version 2.3.0)77. SAMTools was used to quality lter SAM les77. Alignments with MAPQ smaller than 7 ( q 7) were skipped and only proper pairs ( f99, f147, f83, f163)
were selected. Further processing of the RNA-Seq data was carried out using the R/ Bioconductor environment. Piled-up counts for every genomic position were summed up over replicates, using physical coverage, that is, counting both sequenced bases covered by reads and unsequenced bases spanned between proper mate-pair reads. We used a spike-in (RNAs) normalization strategy as described before58 to allow for observation of global changes in the 4sU-Seq signal. Read counts for all spike-ins necessary to that end were calculated using HTSeq78 after mapping. To estimate these normalization factors, we used a statistical model that describes the read counts for each gene in each sample by gene-specic amounts of labelled RNA. The model includes a parameter for the length of the respective feature. The expectation of the number of reads is tted by maximum-likelihood assuming negative binomial distribution with dispersion parameters as calculated by DESeq79.
Data availability. The cryo-EM data has been deposited in Electron Microscopy Data Bank (EMDB) under accession codes EMD-3626, EMD-3627, EMD-3628 and EMD-3629 for the RNA polymerase II-Paf1C-TFIIS, RNA polymerase II-Paf1CTFIIS-A, -B and -C structures, respectively. The sequencing data and the annotation le have been deposited in the Gene Expression Omnibus database under accession code GSE95556. The data that support the ndings of this study are available from the corresponding author on request.
References
1. Tomson, B. N. & Arndt, K. M. The many roles of the conserved eukaryotic Paf1 complex in regulating transcription, histone modications, and disease states. Biochim. Biophys. Acta 1829, 116126 (2013).
2. Wade, P. A. et al. A novel collection of accessory factors associated with yeast RNA polymerase II. Protein Expr. Purif. 8, 8590 (1996).
3. Shi, X. et al. Cdc73p and Paf1p are found in a novel RNA polymerase II-containing complex distinct from the Srbp-containing holoenzyme. Mol. Cell Biol. 17, 11601169 (1997).
4. Jaehning, J. A. The Paf1 complex: platform or player in RNA polymerase II transcription? Biochim. Biophys. Acta 1799, 379388 (2010).
5. Squazzo, S. L. et al. The Paf1 complex physically and functionally associates with transcription elongation factors in vivo. EMBO J. 21, 17641774 (2002).
6. Pokholok, D. K., Hannett, N. M. & Young, R. A. Exchange of RNA polymerase II initiation and elongation factors during gene expression in vivo. Mol. Cell 9, 799809 (2002).
7. Rondon, A. G., Gallardo, M., Garcia-Rubio, M. & Aguilera, A. Molecular evidence indicating that the yeast PAF complex is required for transcription elongation. EMBO Rep. 5, 4753 (2004).
8. Krogan, N. J. et al. The Paf1 complex is required for histone H3 methylation by COMPASS and Dot1p: linking transcriptional elongation to histone methylation. Mol. Cell 11, 721729 (2003).
9. Simic, R. et al. Chromatin remodeling protein Chd1 interacts with transcription elongation factors and localizes to transcribed genes. EMBO J. 22, 18461856 (2003).
10. Ng, H. H., Dole, S. & Struhl, K. The Rtf1 component of the Paf1 transcriptional elongation complex is required for ubiquitination of histone H2B. J Biol Chem 278, 3362533628 (2003).
11. Wood, A., Schneider, J., Dover, J., Johnston, M. & Shilatifard, A. The Paf1 complex is essential for histone monoubiquitination by the Rad6-Bre1 complex, which signals for histone methylation by COMPASS and Dot1p. J. Biol. Chem. 278, 3473934742 (2003).
12. Xiao, T. et al. Histone H2B ubiquitylation is associated with elongating RNA polymerase II. Mol. Cell Biol. 25, 637651 (2005).
13. Warner, M. H., Roinick, K. L. & Arndt, K. M. Rtf1 is a multifunctional component of the Paf1 complex that regulates gene expression by directing cotranscriptional histone modication. Mol. Cell Biol. 27, 61036115 (2007).
14. Tenney, K. et al. Drosophila Rtf1 functions in histone methylation, gene expression, and Notch signaling. Proc. Natl Acad. Sci. USA 103, 1197011974 (2006).
15. Wu, J. & Xu, W. Histone H3R17me2a mark recruits human RNA polymerase-associated factor 1 complex to activate transcription. Proc. Natl Acad. Sci. USA 109, 56755680 (2012).
16. Sheldon, K. E., Mauger, D. M. & Arndt, K. M. A requirement for the Saccharomyces cerevisiae Paf1 complex in snoRNA 30 end formation. Mol. Cell 20, 225236 (2005).
17. Penheiter, K. L., Washburn, T. M., Porter, S. E., Hoffman, M. G. & Jaehning, J. A. A posttranscriptional role for the yeast Paf1-RNA polymerase II complex is revealed by identication of primary targets. Mol. Cell 20, 213223 (2005).
18. Rozenblatt-Rosen, O. et al. The tumor suppressor Cdc73 functionally associates with CPSF and CstF 30 mRNA processing factors. Proc. Natl Acad. Sci. USA 106, 755760 (2009).
19. Chu, Y., Simic, R., Warner, M. H., Arndt, K. M. & Prelich, G. Regulation of histone modication and cryptic transcription by the Bur1 and Paf1 complexes. EMBO J. 26, 46464656 (2007).
20. Kubota, Y. et al. The PAF1 complex is involved in embryonic epidermal morphogenesis in Caenorhabditis elegans. Dev. Biol. 391, 4353 (2014).
21. Chaudhary, K., Deb, S., Moniaux, N., Ponnusamy, M. P. & Batra, S. K. Human RNA polymerase II-associated factor complex: dysregulation in cancer. Oncogene 26, 74997507 (2007).
22. Tan, J., Muntean, A. G. & Hess, J. L. PAFc, a key player in MLL-rearranged leukemogenesis. Oncotarget 1, 461465 (2010).
23. Takahashi, A. et al. SHP2 tyrosine phosphatase converts parabromin/Cdc73 from a tumor suppressor to an oncogenic driver. Mol. Cell 43, 4556 (2011).
24. Kowalik, K. M. et al. The Paf1 complex represses small-RNA-mediated epigenetic gene silencing. Nature 520, 248252 (2015).
25. Verrier, L. et al. Global regulation of heterochromatin spreading by Leo1. Open Biol. 5, 150045 (2015).
26. Chen, F. X. et al. PAF1, a molecular regulator of promoterproximal pausing by rna polymerase II. Cell 162, 10031015 (2015).
27. Yu, M. et al. RNA polymerase II-associated factor 1 regulates the release and phosphorylation of paused RNA polymerase II. Science 350, 13831386 (2015).
28. Poli, J. et al. Mec1, INO80, and the PAF1 complex cooperate to limit transcription replication conicts through RNAPII removal during replication stress. Genes Dev. 30, 337354 (2016).
29. Mayer, A. et al. Uniform transitions of the general RNA polymerase II transcription complex. Nat. Struct. Mol. Biol. 17, 12721278 (2010).
30. Laribee, R. N. et al. BUR kinase selectively regulates H3 K4 trimethylation and H2B ubiquitylation through recruitment of the PAF elongation complex. Curr. Biol. 15, 14871493 (2005).
31. Qiu, H., Hu, C., Wong, C. M. & Hinnebusch, A. G. The Spt4p subunit of yeast DSIF stimulates association of the Paf1 complex with elongating RNA polymerase II. Mol. Cell Biol. 26, 31353148 (2006).
32. Qiu, H., Hu, C., Gaur, N. A. & Hinnebusch, A. G. Pol II CTD kinases Bur1 and Kin28 promote Spt5 CTR-independent recruitment of Paf1 complex. EMBO J. 31, 34943505 (2012).
33. Mayekar, M. K., Gardner, R. G. & Arndt, K. M. The recruitment of the Saccharomyces cerevisiae Paf1 complex to active genes requires a domain of Rtf1 that directly interacts with the Spt4-Spt5 complex. Mol. Cell Biol. 33, 32593273 (2013).
34. Cao, Q. F. et al. Characterization of the human transcription elongation factor Rtf1: evidence for nonoverlapping functions of Rtf1 and the Paf1 Complex. Mol. Cell Biol. 35, 34593470 (2015).
35. Mbogning, J. et al. The PAF complex and Prf1/Rtf1 delineate distinct Cdk9-dependent pathways regulating transcription elongation in ssion yeast. PLoS Genet. 9, e1004029 (2013).
36. Amrich, C. G. et al. Cdc73 subunit of Paf1 complex contains C-terminal Ras-like domain that promotes association of Paf1 complex with chromatin. J. Biol. Chem. 287, 1086310875 (2012).
37. Dermody, J. L. & Buratowski, S. Leo1 subunit of the yeast paf1 complex binds RNA and contributes to complex recruitment. J. Biol. Chem. 285, 3367133679 (2010).
38. Chu, X. et al. Structural insights into Paf1 complex assembly and histone binding. Nucleic Acids Res. 41, 1061910629 (2013).
12 NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms15741 ARTICLE
39. Chen, H. et al. Crystallographic analysis of the conserved C-terminal domain of transcription factor Cdc73 from Saccharomyces cerevisiae reveals a GTPase-like fold. Acta Crystallogr. D 68, 953959 (2012).
40. de Jong, R. N. et al. Structure and DNA binding of the human Rtf1 Plus3 domain. Structure 16, 149159 (2008).
41. Wier, A. D., Mayekar, M. K., Heroux, A., Arndt, K. M. & VanDemark, A. P. Structural basis for Spt5-mediated recruitment of the Paf1 complex to chromatin. Proc. Natl Acad. Sci. USA 110, 1729017295 (2013).
42. Letunic, I., Doerks, T. & Bork, P. SMART: recent updates, new developments and status in 2015. Nucleic Acids Res. 43, D257D260 (2015).
43. Alva, V., Nam, S. Z., Soding, J. & Lupas, A. N. The MPI bioinformatics Toolkit as an integrative platform for advanced protein sequence and structure analysis. Nucleic Acids Res. 44, W410W415 (2016).
44. Van Oss, S. B. et al. The histone modication domain of Paf1 complex subunit Rtf1 directly stimulates H2B ubiquitylation through an interaction with Rad6. Mol. Cell 64, 815825 (2016).
45. Kim, J., Guermah, M. & Roeder, R. G. The human PAF1 complex acts in chromatin transcription elongation both independently and cooperatively with SII/TFIIS. Cell 140, 491503 (2010).
46. Kettenberger, H., Armache, K. J. & Cramer, P. Architecture of the RNA polymerase II-TFIIS complex and implications for mRNA cleavage. Cell 114, 347357 (2003).
47. Sydow, J. F. et al. Structural basis of transcription: mismatch-specic delity mechanisms and paused RNA polymerase II with frayed RNA. Mol. Cell 34, 710721 (2009).
48. Bernecky, C., Herzog, F., Baumeister, W., Plitzko, J. M. & Cramer, P. Structure of transcribing mammalian RNA polymerase II. Nature 529, 551554 (2016).
49. Cheung, A. C. & Cramer, P. Structural basis of RNA polymerase II backtracking, arrest and reactivation. Nature 471, 249253 (2011).
50. Stark, H. GraFix: stabilization of fragile macromolecular complexes for single particle cryo-EM. Methods Enzymol. 481, 109126 (2010).
51. Scheres, S. H. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 180, 519530 (2012).
52. Barnes, C. O. et al. Crystal structure of a transcribing RNA polymerase II complex reveals a complete transcription bubble. Mol. Cell 59, 258269 (2015).
53. Pettersen, E. F. et al. UCSF Chimeraa visualization system for exploratory research and analysis. J. Comput. Chem. 25, 16051612 (2004).
54. Plaschka, C. et al. Transcription initiation complex structures elucidate DNA opening. Nature 533, 353358 (2016).
55. Eser, P. et al. Determinants of RNA metabolism in the Schizosaccharomyces pombe genome. Mol. Syst. Biol. 12, 857 (2016).
56. Miller, C. et al. Dynamic transcriptome analysis measures rates of mRNA synthesis and decay in yeast. Mol. Syst. Biol. 7, 458 (2011).
57. Schulz, D. et al. Transcriptome surveillance by selective termination of noncoding RNA synthesis. Cell 155, 10751087 (2013).
58. Schwalb, B. et al. TT-seq maps the human transient transcriptome. Science 352, 12251228 (2016).
59. Guo, J. & Price, D. H. RNA polymerase II transcription elongation control. Chem. Rev. 113, 85838603 (2013).
60. Kwak, H. & Lis, J. T. Control of transcriptional elongation. Annu. Rev. Genet. 47, 483508 (2013).
61. Grohmann, D. et al. The initiation factor TFE and the elongation factor Spt4/5 compete for the RNAP clamp during transcription initiation and elongation. Mol. Cell 43, 263274 (2011).
62. Martinez-Rucobo, F. W., Sainsbury, S., Cheung, A. C. & Cramer, P. Architecture of the RNA polymerase-Spt4/5 complex and basis of universal transcription processivity. EMBO J. 30, 13021310 (2011).
63. Zawel, L., Kumar, K. P. & Reinberg, D. Recycling of the general transcription factors during RNA polymerase II transcription. Genes Dev. 9, 14791490 (1995).
64. Yang, B. et al. Identication of cross-linked peptides from complex samples. Nat. Methods 9, 904906 (2012).
65. Combe, C. W., Fischer, L. & Rappsilber, J. xiNET: cross-link network maps with residue resolution. Mol. Cell Proteomics 14, 11371147 (2015).
66. Korinek, A., Beck, F., Baumeister, W., Nickell, S. & Plitzko, J. M. Computer controlled cryo-electron microscopyTOM(2) a software package for high-throughput applications. J. Struct. Biol. 175, 394405 (2011).
67. Li, X., Zheng, S. Q., Egami, K., Agard, D. A. & Cheng, Y. Inuence of electron dose rate on electron counting images recorded with the K2 camera. J. Struct. Biol. 184, 251260 (2013).
68. Mindell, J. A. & Grigorieff, N. Accurate determination of local defocus and specimen tilt in electron microscopy. J. Struct. Biol. 142, 334347 (2003).69. Rohou, A. & Grigorieff, N. CTFFIND4: fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216221 (2015).
70. Tang, G. et al. EMAN2: an extensible image processing suite for electron microscopy. J. Struct. Biol. 157, 3846 (2007).
71. Scheres, S. H. Semi-automated selection of cryo-EM particles in RELION-1.3. J. Struct. Biol. 189, 114122 (2015).
72. Scheres, S. H. Beam-induced motion correction for sub-megadalton cryo-EM particles. Elife 3, e03665 (2014).
73. Cardone, G., Heymann, J. B. & Steven, A. C. One number does not t all: mapping local variations in resolution in cryo-EM reconstructions. J. Struct. Biol. 184, 226236 (2013).
74. Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D 66, 486501 (2010).
75. Sun, M. et al. Comparative dynamic transcriptome analysis (cDTA) reveals mutual feedback between mRNA synthesis and degradation. Genome Res. 22, 13501359 (2012).
76. Dolken, L. et al. High-resolution gene expression proling for simultaneous kinetic parameter analysis of RNA synthesis and decay. RNA 14, 19591972 (2008).
77. Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 1521 (2013).
78. Anders, S., Pyl, P. T. & Huber, W. HTSeqa Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166169 (2015).
79. Love, M. I., Anders, S., Kim, V. & Huber, W. RNA-Seq workow: gene-level exploratory analysis and differential expression. F1000Res 4, 1070 (2015).80. Cramer, P. et al. Architecture of RNA polymerase II and implications for the transcription mechanism. Science 288, 640649 (2000).
Acknowledgements
We thank Thomas Schulz for yeast fermentation, Carina Burzinski for Pol II and TFIIF purication, Thomas Frhlich (LAFUGA, Gene Center Munich) and Monika Raabe from the Urlaub laboratory for protein identication, Romina Hofele from the Urlaub Laboratory for Paf1C XL-MS analysis, Simon Neyer for help with EM data processing and suggestions on the manuscript. P.C. was supported by the Deutsche Forschungsgemeinschaft (SFB860, SPP1935), the Advanced Grant TRANSREGULON from the European Research Council (Grant Agreement No. 693023) and the Volkswagen Foundation.
Author contributions
Y.X. designed and performed experiments, unless stated otherwise. C.B. helped with EM data collection. J.M.P. provided access to the EM facility and supervised EM data collection. C.-T.L. performed MS analysis. H.U. supervised MS analysis. K.C.M. carried out the 4tU-Seq experiment. B.S. analysed 4tU-Seq data. C.B. and D.T. helped with EM data processing. P.C. designed and supervised research. Y.X. and P.C. prepared the manuscript, with the input from all authors.
Additional information
Supplementary Information accompanies this paper at http://www.nature.com/naturecommunications
Web End =http://www.nature.com/ http://www.nature.com/naturecommunications
Web End =naturecommunications
Competing interests: The authors declare no competing nancial interests.
Reprints and permission information is available online at http://npg.nature.com/reprintsandpermissions/
Web End =http://npg.nature.com/ http://npg.nature.com/reprintsandpermissions/
Web End =reprintsandpermissions/
How to cite this article: Xu, Y. et al. Architecture of the RNA polymerase II-Paf1CTFIIS transcription elongation complex. Nat. Commun. 8, 15741 doi: 10.1038/ ncomms15741 (2017).
Publishers note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional afliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articles Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articles Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
Web End =http://creativecommons.org/ http://creativecommons.org/licenses/by/4.0/
Web End =licenses/by/4.0/
r The Author(s) 2017
NATURE COMMUNICATIONS | 8:15741 | DOI: 10.1038/ncomms15741 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 13
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright Nature Publishing Group Jun 2017
Abstract
The conserved polymerase-associated factor 1 complex (Paf1C) plays multiple roles in chromatin transcription and genomic regulation. Paf1C comprises the five subunits Paf1, Leo1, Ctr9, Cdc73 and Rtf1, and binds to the RNA polymerase II (Pol II) transcription elongation complex (EC). Here we report the reconstitution of Paf1C from Saccharomyces cerevisiae, and a structural analysis of Paf1C bound to a Pol II EC containing the elongation factor TFIIS. Cryo-electron microscopy and crosslinking data reveal that Paf1C is highly mobile and extends over the outer Pol II surface from the Rpb2 to the Rpb3 subunit. The Paf1-Leo1 heterodimer and Cdc73 form opposite ends of Paf1C, whereas Ctr9 bridges between them. Consistent with the structural observations, the initiation factor TFIIF impairs Paf1C binding to Pol II, whereas the elongation factor TFIIS enhances it. We further show that Paf1C is globally required for normal mRNA transcription in yeast. These results provide a three-dimensional framework for further analysis of Paf1C function in transcription through chromatin.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer




