Introduction
Glioblastoma (GBM) is characterized by intra-tumoural heterogeneity and stem cell-like properties that underpin treatment resistance and poor prognosis (Bulstrode et al., 2017; Suvà et al., 2014). GBM is divided into distinct transcriptional subtypes that span a continuum of stem cell/developmental and injury response/immune evasion cell states (Richards et al., 2021; Verhaak et al., 2010; Wang et al., 2021). Genetically, activation or amplification of
Although a long-recognized feature of cancer (Cox et al., 1965), ecDNA are particularly common in GBM, with 90% of patient-derived GBM tumour models harbouring ecDNA (Turner et al., 2017). However, there is much broader interest in mechanisms of ecDNA function across many solid tumours, as ecDNA enable rapid oncogene amplification in response to selective pressures, and have been shown to correlate with poor prognosis and treatment resistance (Kim et al., 2020; Nathanson et al., 2014; Vicario et al., 2015). EcDNA are centromere-free DNA circles of around 1–3 Mb in size that frequently exist as doublets (double minutes), but also as single elements (Hamkalo et al., 1985; Verhaak et al., 2019; Vogt et al., 2004). EcDNA can be composed of multiple genetic fragments generated as a result of chromothripsis (Gibaud et al., 2010; Shoshani et al., 2021; Rosswog et al., 2021). Although ecDNA were previously identified in 1.4% of cancers, more recent studies have shown their prevalence to be significantly higher (Fan et al., 2011; Kim et al., 2020; Turner et al., 2017). EcDNA can lead to oncogene copy number being amplified to >100 in any given cell, with significant copy number heterogeneity between cells (Lange et al., 2022; Turner et al., 2017). Freed from the constraints imposed by being embedded within a chromosome, ecDNA have spatial freedom and can adapt to targeted therapeutics (Lange et al., 2022; Nathanson et al., 2014). For example, the
As well as their resident oncogenes, ecDNA also harbour regulatory elements (enhancers) required to drive oncogene expression (Morton et al., 2019; Zhu et al., 2021). Consistent with this, ecDNA have been found to have regions of largely accessible chromatin (assayed by ATAC-seq), indicative of nucleosome displacement by bound transcription factors, and to be decorated with histone modifications associated with active chromatin (Wu et al., 2019). Transcription factors densely co-bound at enhancers have been suggested to nucleate condensates or ‘hubs’ (Cho et al., 2018; Rai et al., 2018; Strom and Brangwynne, 2019), enriched with key transcriptional components such as mediator and RNA polymerase II (PolII) to drive high levels of gene expression (Cho et al., 2018; Chong et al., 2018; Sabari et al., 2018). Given the colocation of enhancers and driver oncogenes on ecDNA, it has therefore been suggested that ecDNA cluster together in the nucleus, driving the recruitment of a high concentration of RNA PolII and creating ecDNA-driven nuclear hubs that in turn enhance the transcriptional output from ecDNA (Adelman and Martin, 2021; Hung et al., 2021; Yi et al., 2021; Zhu et al., 2021).
Here, using super-resolution imaging of primary GBM cell lines, we find that ecDNA are widely dispersed throughout the nucleus and we find neither evidence of ecDNA clustering together nor any significant spatial overlap between ecDNA and large PolII hubs. As expected, we show that expression from genes on ecDNA, both at mRNA and protein level, correlates with ecDNA copy number in the tumour cell lines. However, transcription of genes present on each individual ecDNA molecule appears to occur at a similar efficiency (transcripts per copy number) to that of the equivalent endogenous chromosomally located gene. These data suggest that it is primarily the increased copy number of ecDNA in GBM stem cells, and not a specific property of nuclear colocalization, that drives the increased transcriptional capacity of their resident oncogenes.
Results
EcDNA are more frequently located centrally in the nucleus in GBM stem cells
We characterized two GBM-derived glioma stem cell (GSC) primary cell lines containing multiple
Figure 1.
The nuclear localization of extrachromosomal DNA (ecDNA) in glioblastoma (GBM) cell lines.
(A) Whole genome sequencing (WGS) and AmpliconArchitect analysis for ecDNA regions for E26 and E28 cell lines showing an
Figure 1—figure supplement 1.
Additional
Radial distribution, normalized to DAPI, across bins of equal area eroded from the edge (1) to the centre (5) of the nucleus for (A) chromosome 7 (TxR mean normalized intensity per nucleus) or (B)
Median and quartiles are shown. Statistical significance was examined by Kruskall-Wallis. **** p<0.0001. Statistical data relevant for this figure are in Figure 1—source data 1.
Human chromosomes have non-random nuclear organization, with active regions preferentially located towards the central regions of the nucleus (Boyle et al., 2001; Croft et al., 1999). We sought to determine the nuclear localization of ecDNA in GBM cell lines as compared with the endogenous chromosomal
It has been suggested that ecDNA cluster into ‘ecDNA hubs’ within nuclei of cancer cells, including for
Figure 2.
EGFR-containing extrachromosomal DNA (ecDNA) do not cluster in the nucleus.
(A) Representative images shown as maximum intensity projection of DNA FISH for
Figure 2—figure supplement 1.
Additional analysis of
(A) Cumulative frequency distribution of shortest
The analysis above quantified distances between FISH hybridization signals but does not determine whether there is a non-random distribution of foci in the nuclei at distances in keeping with transcription hubs. We therefore used 3D Ripley’s K function to determine the observed spatial pattern of the foci in each nucleus and compared this with a random null distribution of 10,000 simulations of the same number of foci in the same volume. We powered this to identify any significant clustering at each radius in 0.1 μm increments between 0.1 and 1 μm (examples of E26 and E28 nuclei and their corresponding Ripley’s K function in Figure 2C). The E26 cell line had some nuclei with significant non-random distribution of ecDNA, but only at ≥400 nm radial distances, and E28 only had occasional nuclei with significant non-random distribution of ecDNA at ≥700 nm (Figure 2D). We repeated this analysis, reducing the focus spot size from 300 to 150 nm diameter to ensure no small FISH foci were omitted that might skew our analysis. No significant clustering was observed at <300 nm (Figure 2—figure supplement 1F).
Different ecDNA populations do not cluster in the nucleus of GBM stem cells
To ensure that multiple ecDNAs are not so tightly clustered that they cannot be resolved by FISH, we analysed another primary GBM cell line (E25) which has two different oncogenes carried on separate ecDNA populations:
Figure 3.
Two separate extrachromosomal DNA (ecDNA) populations do not cluster in the nucleus.
(A) Representative maximum intensity projection images of DNA FISH for
Figure 3—figure supplement 1.
Additional analysis of the distribution of
(A) DNA FISH for
Figure 3—figure supplement 2.
DNA FISH on metaphase spreads of the E20 cell line showing hybridization signal for
Scale bar = 10 μm. (A) Metaphase spread representative of most cells with
We used 3D Ripley’s K function to evaluate point patterns in the E25 dual ecDNA oncogene cell line (Figure 3C). Some nuclei had a significant non-random distribution of
To further validate this, we repeated 3D Ripley’s function analysis in a second GBM cell line (E20) harbouring
Our analysis of two independent GBM cell lines harbouring different ecDNA populations (
ecDNA do not colocalize with large RNA PolII hubs in GBM stem cells
DNA FISH detects all ecDNA, so it might be that only transcriptionally active elements cluster. Therefore, we used RNA FISH to detect nascent
Figure 4.
Extrachromosomal DNA (ecDNA) do not colocalize with large foci of the transcriptional machinery.
(A) Representative maximum intensity projection image of nascent
Figure 4—figure supplement 1.
Analysis of sites of EGFR nascent transcription relative to RNA polymerase II in GBM cell lines.
(A) Number of nascent
We next assessed whether ecDNA foci, albeit not clustered with each other, colocalize with high focal concentrations of the transcriptional machinery to create ecDNA/large PolII transcription hubs. First, we examined the presence of such hubs by immunofluorescence for RPB1 (POLR2A), the largest subunit of RNA PolII. The large RPB1 foci we detected were sparse with only a few clearly visible per nucleus (Figure 4—figure supplement 1C).
We used 3D analysis of immunoFISH in NSCs and compared this to E26 and E28 GBM cells to establish whether ecDNA and large RPB1 foci colocalized. There was no obvious overlap between foci of RPB1 and
To ensure this result was not specific to this PolII antibody, we repeated this analysis using E28 cells in which mCherry was fused by knock-in to endogenous POLR2G, a key subunit of RNA PolII (Cramer et al., 2000). The mean distance between
Levels of EGFR transcription from ecDNA reflect copy number, not enhanced transcriptional efficiency
Having shown a lack of colocalization of ecDNA, either with each other or with large PolII foci, we proceeded to characterize the levels of
Previous studies have reported that ecDNA have greater transcript production per oncogene than chromosomal loci (Wu et al., 2019). We therefore sought to characterize the transcriptional efficiency (per copy number) of chromosomal and ecDNA-located
Figure 5.
Levels of transcription from extrachromosomal DNA (ecDNA) reflect copy number but not enhanced transcriptional efficiency.
(A) Representative maximum intensity projection (MIP) images of nascent
Figure 5—figure supplement 1.
EGFR levels, ecDNA number, and ecDNA SNP allele frequency in E26 and E28 cell lines.
(A) Histogram of flow cytometry with EGF-647 showing signal in neural stem cell (NSC), E28 and E26 cell lines from live cells, normalized to peak count per cell line. Median EGF-647 – NSC = 172.2; E28 = 985.64; E26 = 7191.81. (B) Flow cytometry with EGF-647; gates showing negative, normal (NSC), and elevated (glioma stem cell [GSC]) EGF-647 signal in NSC, E28 and E26 cell lines. (C) Fluorescence activated cell sorting (FACS) into EGF-647 high and low populations from E26 and E26 cell lines. The percentage of total live cell population in each sorted population are shown. (D) Representative EGFR DNA FISH images (shown in greyscale) of E26 and E28 cells sorted via flow cytometry with EGF-647 into EGFR high and low cells. MIP, scale bar = 5 μm. (E) Number of
When comparing the RNA:DNA ratio of all nuclei, only E26 had a higher ratio than NSCs (Figure 5B). To explore whether
To test this using an independent method, we took advantage of WGS and RNA-seq data (Figure 5D) and called SNPs present in the amplicon region at 40% to 60% allele frequencies in patient control blood WGS (control) samples. Most of the allele frequencies of these SNPs were >80% in GBM samples in the main part of the amplicons, in line with the amplification being derived from one parental allele (Figure 5—figure supplement 1F). We then selected those SNPs located in expressed exons of the amplicon, including several in
Discussion
Understanding the importance of ecDNA in the etiology of cancer, and whether this poses an interesting target for therapeutic interventions, depends on deeper analysis of ecDNA activity (Nathanson et al., 2014; Kim et al., 2020). Clustering of ecDNA into ‘ecDNA hubs’ based on imaging and chromosome conformation capture data has been reported in a range of established cancer cell lines, and has been suggested to underlie the ability of ecDNA to drive very high levels of transcription (Hung et al., 2021; Yi et al., 2021; Zhu et al., 2021). However, in multiple primary human GBM cells studied here, we observe no significant colocalization at distances (~200 nm) thought to be functionally important in driving transcription. We reach this conclusion for both cells with single ecDNA species, as well as with heterogeneous ecDNA harbouring different oncogenes. EcDNA were not colocalized with, or notably close to, large PolII foci. Moreover, taking advantage of the unique transcripts from ecDNA, and the presence of SNPs in these transcripts, to compare ecDNA-derived and chromosomal transcripts, we demonstrate that increased copy number primarily drives increased transcription of ecDNA-located genes rather than increased transcriptional efficiency of ecDNA in GBM stem cells.
Our data support a regional, rather than clustered, spatial organization of ecDNA in GBM stem cells. We observe that oncogenes on ecDNA are distributed more towards the centre of the nucleus than the corresponding endogenous gene loci. This is consistent with an actively transcribing state (Boyle et al., 2001; Croft et al., 1999) and independence from the constraints of chromosome territories (Kalhor et al., 2011; Mahy et al., 2002).
We sought to maximize our opportunity of observing ecDNA clustering at close distances by performing 3D spot analysis, using Ripley’s K to call instances of significant clustering at given distances using ecDNA x,y,z coordinates, and utilizing cells with two distinct ecDNA species to ensure we were not under-scoring colocalization. 3D analysis ensures a false positive clustering effect is avoided that might be seen when 3D images are combined via tools such as maximum intensity projection (MIP). Other tools to assess clustering have noted the possibility of the 2D Ripley’s K function resulting in over-counting, leading to the development of alternative auto-correlation tools, but this was not observed in this 3D Ripley’s K analysis (Veatch et al., 2012). It is possible that multiple clustered DNA/RNA foci appear as a single DNA/RNA FISH signal that we cannot resolve. We controlled for this by repeating cluster analysis with smaller spot sizes, analyzing cell lines with two ecDNA populations and using super-resolution imaging (optical resolution ~120 nm). We did observe ecDNA clustering at close distances (≤200 nm) in a small proportion of E20 dual-ecDNA cells, but in the case of
Our findings may reflect fundamentally different functional characteristics of the ecDNA in patient-derived primary GBM cell cultures used in our experiments versus previously published studies (Hung et al., 2021; Yi et al., 2021). These might include the size of the ecDNA, or the number of oncogene loci per ecDNA (which was singular in our cell lines, with the exception of ~10% E20
Recent work proposing that ecDNA act as mobile super-enhancers for chromosomal targets has raised the possibility that ecDNA can actively recruit RNA PolII to drive ‘ecDNA-associated phase separation’ (Zhu et al., 2021). A live-cell ecDNA-labelling strategy reported colocalization of ecDNA and RNA PolII (Yi et al., 2021). We did not detect evidence of a close relationship between ecDNA, or their nascent transcript, with large PolII foci, but cannot exclude that there are smaller, sub-diffraction limit sized transcriptional hubs associated with our ecDNA.
We observe that while the copy number of
Overall, our data suggest that in primary GBM stem cells, ecDNA can succeed at driving oncogene expression without requiring close colocalization with each other, or with transcriptional hubs. It is the increased copy number that is primarily responsible for higher levels, rather than ecDNA-intrinsic features or nuclear sub-localization.
Materials and methods
Key resources table
Reagent type (species) or resource | Designation | Source or reference | Identifiers | Additional information |
---|---|---|---|---|
Antibody | mCherry (Rabbit poly-clonal) | abcam | ab167453 | IF (1 in 500) |
Antibody | Rpb1 NTD (D8L4Y) (Rabbit mono-clonal) | Cell Signaling Technology | #14958 | IF (1 in 1000) |
Antibody | Anti-Digoxigenin (Sheep poly-clonal) | Roche | Ref 11333089001 | DNA FISH (1 in 10) |
Antibody | Secondary Antibody – Alexa Fluor 647 (Donkey anti-Sheep IgG poly-clonal) | Thermo Fisher Scientific | A-21448 | DNA FISH (1 in 10) |
Antibody | Secondary Antibody – Alexa Fluor 568 (Donkey anti-Rabbit IgG poly-clonal) | Thermo Fisher Scientific | A-10042 | IF (1 in 1000) |
Antibody | Secondary Antibody – Alexa Fluor 488 (Donkey anti-Rabbit IgG poly-clonal) | Thermo Fisher Scientific | A-21206 | IF (1 in 1000) |
Antibody | Secondary Antibody – Alexa Fluor 488 (Donkey anti-Rat IgG poly-clonal) | Thermo Fisher Scientific | A-21208 | IF (1 in 1000) |
Genetic reagent (human) | Fosmid FISH probe (Human) | BACPAC resource | https://bacpacresources.org/library.php?id=275 | See Materials and methods - Supplementary file 1 |
Cell line ( | E20, E25, E26, E28, NSC – GCGR Human Glioma Stem Cells | This paper, Glioma Cellular Genetics Resource, CRUK, UK | http://gcgr.org.uk; pending publication | |
Other | DMEM/HAMS-F12 | Sigma-Aldrich | Cat#: D8437 | Cell culture, media |
Chemical compound, drug | Pen/Strep | GIBCO | Cat#: 15140–122 | Cell culture, media supplement |
Other | BSA Solution | GIBCO | Cat#: 15260–037 | Cell culture, media supplement |
Other | B27 Supplement (×50) | LifeTech/GIBCO | Cat#: 17504–044 | Cell culture, media supplement |
Other | N2 Supplement (×100) | LifeTech/GIBCO | Cat#: 17502–048 | Cell culture, media supplement |
Other | Laminin | Cultrex | Cat#: 3446-005-01 | Cell culture, media supplement, and pre-lamination of culture vessels |
Peptide, recombinant protein | EGF | Peprotech | Cat: 315–09 | Cell culture, media supplement |
Peptide, recombinant protein | FGF-2 | Peprotech | 100-18B | Cell culture, media supplement |
Other | Accutase | Sigma-Aldrich | Cat#: A6964 | Cell culture, cell dissociation agent |
Other | DMSO | Sigma-Aldrich | Cat#: 276855 | Cell culture, freeze media, and drug diluent |
Other | Triton X-100 | Merck Life Sciences | Cat#: X-100 | Cell permeabiliz-ation agent following cell fixation |
Other | Paraformaldehyde Powder 95% | Sigma | Cat#: 158127 | Cell fixation agent |
Other | Tween 20 | Cambridge Bioscience | Cat#: TW0020 | DNA FISH (hybridization mix) |
Other | PBS Tablets | Sigma-Aldrich | Cat#: P4417 | Diluent and washing agent |
Other | Ethanol | VWR | Cat#: 20821–330 | DNA FISH |
Other | Methanol | Fisher Chemical | M/4000/17 | Used 3:1 with acetic acid for metaphase spreads |
Other | Acetic acid | Honeywell Research Chemicals | 33209-1L | See above |
Peptide, recombinant protein | Alexa Fluor 647 EGF complex | Thermo Fisher Scientific | E35351 | Flow cytometry |
Other | Green496-dUTP | ENZO Life Sciences | ENZ-42831L | Direct labelling of Fosmid DNA FISH probes via nick translation |
Other | ChromaTide Alexa Fluor 594–5-dUTP | Thermo Fisher Scientific | C11400 | Direct labelling of Fosmid DNA FISH probes via nick translation |
Peptide, recombinant protein | DNA Polymerase 1 | Invitrogen | 18010–017 | |
Peptide, recombinant protein | DNase I recombinant, RNase-free | Roche | 04716728001 | |
Genetic reagent (human) | Human Cot-1 DNA | Thermo Fisher Scientific | 15279011 | |
Genetic reagent (salmon) | Salmon Sperm DNA | Invitrogen | 15632011 | |
Chemical compound, drug | Paclitaxel | Cambridge Bioscience | CAY10461 | 10–100 nM |
Chemical compound, drug | Nocodazole | Sigma-Aldrich | SML1665 | 50–100 ng/ml |
Other | XCP 7 Orange Chromosome Paint | MetaSystems Probes | D-0307-100-OR | DNA FISH (see Figure 1 and Materials and methods referring to this) |
Commercial assay or kit | Stellaris RNA-FISH probes (Custom Assay with Quasar 570 Dye) | LGC Biosearch Technologies | SMF-1063–5 | RNA FISH |
Commercial assay or kit | Stellaris RNA FISH Hybridization Buffer | LGC Biosearch Technologies | SMF-HB1-10 | RNA FISH |
Genetic reagent (human) | Alt-R CRISPR-Cas9 crRNA | IDT-Technologies | Alt-R CRISPR-Cas9 crRNA | |
Genetic reagent (human) | Alt-R CRISPR-Cas9 tracrRNA | IDT-Technologies | 1072532 | |
Commercial assay or kit | SG Cell Line 4D-NucleofectorTM X Kit S | Lonza Bioscience | V4XC-3032 | |
Genetic reagent (human) | Chromosome 7 Control Probe | Pisces Scientific | CHR07-10-DIG | Probe and hybridization mix |
Other | DAPI (4',6-Diamidino-2-Phenylindole, Dihydrochloride) | Thermo Fisher Scientific | D1306 | Nuclear staining; 50 ng/ml and 5 ng/ml (as indicated in Materials and methods) |
Sequence-based reagent | mCherry_PolR2G crRNA and dsDNA (donor) | Twist Bioscience | See Materials and methods and Supplementary file 1 | |
Other | WGS and RNAseq | This paper | GEO: GSE215420 | See Materials and methods |
Other | Erosion Territories analysis | This paper | Code available at: https://github.com/IGC-Advanced-Imaging-Resource/Purshouse2022_paper | |
Other | Cluster analysis | This paper | Code available at: https://github.com/SjoerdVBeentjes/ripleyk | |
Other | RNA-seq/WGS analysis | This paper | Code available at: https://github.com/kpurshouse/ecDNAcluster | |
Software, algorithm | GraphPad Prism 9.0 | GraphPad Software, Inc | https://www.graphpad.com/ | |
Software, algorithm | FCS Express | FCS Express 7 | https://denovosoftware.com/ | |
Software, algorithm | Fiji/ImageJ | Open Source | https://imagej.net/Fiji | |
Software, algorithm | BioRender | BioRender | https://biorender.com/ | |
Software, algorithm | Python v3.9 | Open Source | https://www.python.org | |
Software, algorithm | Algorithm - RipleyK package | Python Package Index | https://pypi.org/project/ripleyk/ | |
Software, algorithm | Imaris x64 v9.4.0 | Imaris Microscopy Image Analysis Software | https://imaris.oxinst.com/ | |
Software, algorithm | UCSC Genome Browser | Kent et al., 2002 | https://genome.cshlp.org/content/12/6/996 | |
Software, algorithm | STAR 2.7.1a | Dobin et al., 2013 | https://github.com/alexdobin/STAR; Dobin et al., 2013 | |
Software, algorithm | Picard | Broad Institute | https://broadinstitute.github.io/picard/ | |
Software, algorithm | AmpliconArchitect | Deshpande et al., 2019 | https://github.com/virajbdeshpande/AmpliconArchitect; Deshpande et al., 2019 (with Python v2.7) | |
Software, algorithm | AmpliconClassifier | Kim et al., 2020 | https://github.com/jluebeck/AmpliconClassifier (with Python v2.7) | |
Software, algorithm | deepTools v3.4 | Ramírez et al., 2016 | https://deeptools.readthedocs.io/en/develop/ | |
Software, algorithm | HOMER2 4.10 | Heinz et al., 2010 | http://homer.ucsd.edu/homer/ | |
Software, algorithm | SAMtools v1.10 | Li et al., 2009 | http://www.htslib.org | |
Software, algorithm | BEDTools v2.3 | Quinlan and Hall, 2010 | http://code.google.com/p/bedtools | |
Software, algorithm | bcftools | Danecek et al., 2021 | https://doi.org/10.1093/gigascience/giab008 | |
Software, algorithm | strelka v2.9.10 | Kim et al., 2018 | https://doi.org/10.1038/s41592-018-0051-x |
Lead contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contacts, Wendy Bickmore (
Materials availability
This study generated a new CRISPR engineered knock-in reporter cell line – E28 mCherry_POLR2G.
Experimental model and subject details
GSC and NSC lines from the Glioma Cellular Genetics Resource (GCGR) (https://gcgr.org.uk) were cultured in serum-free basal DMEM/F12 medium (Sigma) supplemented with N2 and B27 (Life Technologies), 2 μg/ml laminin (Cultrex), and 10 ng/ml growth factors EGF and FGF-2 (Peprotech) (Pollard et al., 2009). Cells were split with Accutase solution (Sigma), and centrifuged approximately weekly as previously reported. All GBM cell lines were derived from treatment-naive patients, and the NSC cell line GCGR-NS9FB_B was derived from 9 week of gestation forebrain. GSC cell lines were selected on the basis of predominantly (E26) or entirely (E28, E25, and E20) harbouring oncogenes on ecDNAs (rather than HSRs) via metaphase spread analysis (see Materials and method below). Human GBM tissue was obtained with informed consent and ethical approval (East of Scotland Research Ethics service, REC reference 15/ES/0094). Human embryonic brain tissue was obtained with informed consent and ethical approval (South East Scotland Research Ethics Committee, REC reference 08/S1101/1). Cell lines were regularly tested for mycoplasma.
Method details
Metaphase spreads and interphase nuclei
Cell lines were optimized to generate metaphase spreads. Briefly, cells at near confluence in a T75 flask were incubated between 4 and 16 hr in the presence of 10–100 nm paclitaxel (Cambridge BioScience) with or without 50–100 ng/ml nocodazole (Sigma-Aldrich). Along with the media, cells dissociated with accutase were centrifuged, washed in PBS, and resuspended in 10 ml potassium chloride (KCl) 0.56%, with sodium citrate dihydrate (0.9%) if required, for 20 min. After further centrifugation, cells were resuspended in methanol:acetic acid 3:1 and dropped onto humidified slides.
For all other fixed cell experiments described below, cells were seeded overnight onto glass cover-slips or poly-L-lysine coated glass slides (Sigma-Aldrich). Cells were fixed with 4% paraformaldehyde (PFA – 10 min) and permeabilized with 0.5% Triton X-100 (15 min) with thorough PBS washes in-between. Where cells were dried (see FISH methods), this only occurred following PFA fixation in order to preserve 3D structures and minimize cell and nuclear flattening.
DNA FISH
A detailed method for DNA FISH has been described elsewhere (Jubb and Boyle, 2020). Briefly, DNA stocks of fosmid clones targeting EGFR (WI2-2910M03), CDK4 (WI2-0793J08), and PDGFRA (WI2-2022O22) (Supplementary file 1) were prepared via an alkaline lysis miniprep protocol (Jubb and Boyle, 2020). Each fosmid DNA probe was labelled via Nick Translation directly to a fluorescent dUTP (Green496-dUTP, ENZO Life Sciences; ChromaTide Alexa Fluor 594-5-dUTP, Thermo Fisher Scientific) and incubated with unlabelled dATP, dCTP, and dGTP, ice-cold DNase and DNA PolI for 90 min at 16°C. The reaction was quenched with EDTA and 20% SDS, TE buffer added, and the reaction mix run through a Quick Spin Sephadex G50 column.
Cells on slides or cover-slips were prepared by incubating for 1 hr in ×2 trisodium citrate and sodium chloride (SSC)/RNaseA 100 μg/ml at 37°C, then dehydrated in 70%, 90%, and 100% ethanol. Slides were warmed at 70°C prior to immersion in a denaturing solution (×2 SSC/70% formamide, pH 7.5) heated to 70°C (methanol:acetic acid-fixed cells) or 80°C (PFA-fixed cells), the duration of which was optimized to each cell line. After denaturing, slides were immersed in ice-cold 70% ethanol, then 90% and 100% ethanol at room temperature before air drying.
FISH probes were prepared by combining 100 ng of each directly labelled fosmid probe (per slide), 6 μg Human Cot-1 DNA (per probe), 5 μg sonicated salmon sperm (per slide), and 100% ethanol. Once completely dried, the resulting pellet was suspended in hybridization mix (50% deionized formamide [DF], ×2 SSC, 10% dextran sulfate, 1% Tween 20) for 1 hr at room temperature, denatured for 5 min at >70°C and annealed at 37°C for 15 min. Where relevant, FISH probes were instead hybridized in Chromosome 7 paint (XCP 7 Orange, Metasystems). The probes were incubated overnight at 37°C. The following day, the slides were washed in ×2 SSC (45°C), 0.1% SSC (60°C) and finally in ×4 SSC/0.1% Tween 20 with 50 ng/ml 4′,6-diamidino-2-phenylindole (DAPI). Slides were mounted with Vectashield.
RNA FISH
RNA FISH probes (Custom Assay with Quasar 570 Dye) targeting the first intron (pool of 48 22-mer probes) of
Combined RNA:DNA FISH
Nascent EGFR RNA FISH was performed as above, and nuclei imaged as described below. The x,y,z coordinates for each image were recorded via NIS software at the time of imaging. After removing the cover-slips and washing the slides in PBS, EGFR DNA FISH was performed whereby the probe preparation was as above. Centromere 7 (CEN7 – CHR07-Dig Control) FISH probe (Pisces Scientific) was prepared, denatured for 5 min at 80°C and snap-frozen on crushed ice. Slides were transferred from PBS wash to denaturing solution at 80°C for 15–30 min, washed in ×2 SSC, and incubated overnight with the probe(s) at 37°C. The subsequent stringency washes were as described above. Slides were then incubated in blocking buffer (×4 SSC/5% Marvel) for 5 min, followed by anti-digoxigenin antibody (Roche; 1 in 10; 1 hr at humidified 37°C) and anti-sheep Alexa Fluor 647 secondary antibody (Thermo Fisher Scientific; 1 in 10; 1 hr at humidified 37°C) with ×4 SSC/0.1% Tween 20 washes in between. After the final washes, slides were stained with DAPI and mounted as described above. The stored x,y,z coordinates were used to relocate and image each nucleus. Owing to the irregularity of the tumour nuclei, it was possible to be confident in re-imaging the correct nucleus – nuclei were excluded where this was not the case, or where nuclei were lost between RNA and DNA FISH. Spot counting was subsequently performed as described below with RNA and DNA foci being defined and counted separately to avoid influencing the outcome. For CEN7, nuclei were excluded if the number of foci could not be clearly identified.
Immunofluorescence and immuno-FISH
Slides were blocked in 1%BSA/PBS/Triton X-100 0.1% for 30 min at 37°C before overnight incubation with the primary antibody at 4°C (Rpb1 NTD (D8L4Y) #14958, Cell Signaling Technology, 1 in 1000; mCherry [ab167453], abcam, 1 in 500). The following day, slides were washed in PBS before incubation with an appropriate secondary antibody (1 in 1000 Alexa Fluor) for 1 hr at 37°C. After further PBS washes and DAPI staining, slides were mounted with Vectashield.
For immuno-FISH (DNA), the IF signal was fixed via incubation with 4% PFA for 30 min. Following thorough PBS washes, the DNA FISH protocol was then followed as above.
For immuno-FISH (RNA), the antibodies were added at the same concentration as described above to the hybridization mix (primary antibody) and ×2 SSC/10% DF washes (secondary antibody).
Flow cytometry and FACS
Cells were prepared by adding EGF-free media for 30 min before lifting and suspending cells in 0.1% BSA/PBS. Cells were incubated in 100 ng/ml EGF-647 (E35351, Thermo Fisher Scientific) in 0.1%BSA/PBS, with cells incubated in 0.1% BSA/PBS as a negative control, for 25 min. Cells were washed three times in 0.1%BSA/PBS before being analysed on the BD FACSAria III FUSION. Where indicated, cells were sorted by EGF-647 gated into high and low groups, and a sort check was performed to verify these were true populations prior to expanding these cells onto 22×22 mm2 cover-slips. Fifteen days after the cells were sorted, the slides were fixed, permeabilized, and DNA FISH performed as above.
mCherry_POLR2G knock-in cell line
crRNA and donor DNA was designed using the previously reported TAG-IN tool (Dewari et al., 2018), with the corresponding fluorescent reporter gene sequences for mCherry implemented into the existing tool (Supplementary file 1). Output sequences from the TAG-IN tool were manufactured by Twist Bioscience. Gene-specific crRNA (100 pmoles – IDT Technologies, Coralville, IA, USA) and universal tracrRNA (100 pmoles, IDT Technologies, Coralville, IA, USA) were assembled to a cr:tracrRNA complex by annealing at the following settings on a PCR block: 95°C for 5 min, step down cooling from 95°C to 85°C at 0.5°C/s, step down cooling from 85°C to 20°C at 0.1°C/s, store at 4°C. Recombinant Cas9 protein (10 μg, purified in house – see Dewari et al., 2018) was added to form the ribonucleoprotein (RNP) complex at room temperature for 10 min, then stored on ice; 300 ng of donor dsDNA were denatured in 30% DMSO by incubating at 95°C for 5 min followed by immediate immersion in ice. The donor dsDNA and RNPs were electroporated into E28 cells using the 4D Amaxa X Unit (programme DN-100). After 2 weeks of serial expansion of cells in 2D culture, assessment of knock-in efficiency was assessed by suspending 5–7 × 105 cells in 0.2% BSA/PBS and analysed on BD LSRFortessa Cell Analyzer, with cells electroporated with tracrRNA:Cas9 only as a negative control. Cells were then further sorted into a pure KI population, and mCherry KI was verified by immunofluorescence for mCherry and Rpb1.
Imaging
Slides were imaged on epifluorescence microscopes (Zeiss AxioImager 2 and Zeiss AxioImager.A1) and the SoRa spinning disk confocal microscope (Nikon CSU-W1 SoRa). For 3D image analysis, images were taken with the SoRa microscope and a 3 μm section across each nucleus was imaged in 0.1 μm steps. Images were denoised and deconvolved using NIS deconvolution software (blind preset or Lucy-Richardson) (Nikon). 3D images are shown in the figures as MIP prepared using ImageJ.
Quantification and statistical analysis
Image analysis of nuclear localization
Images were analysed using Imarisv9.7 and Fiji. The scripts used to perform nuclear territory analysis have been described elsewhere (Boyle et al., 2001; Croft et al., 1999; see also Data availability). Briefly, single-slice images were taken with a ×20 lens using the Zeiss AxioImager 2, imaging at least 50 nuclei per cell line. The images were segmented first to individual nuclei, and subsequently the area of the DAPI signal was segmented to define the nuclear area. This area was segmented into concentric shells of equal area from the periphery to the centre of each nucleus. The signal intensity of each FISH probe or chromosome paint signal was calculated, with normalization for the DAPI signal in each shell.
Image analysis of ecDNA and large PolII foci
For 3D analysis, deconvolved images were analysed using Imaris (v9.7) and all analysis was performed on the full 3D image. RNA and DNA FISH foci, and where relevant, large PolII foci, were defined, counted and distances between them calculated, using the Spots function within Imaris. Imaris spot size diameter was selected by single plane measurement of representative foci and this defined diameter was applied to all nuclei of a given experiment for 3D analysis. For DNA FISH analysis, E26, E28, and E25 spot size was 300 nm diameter, and where indicated in the text, reanalysed with 150 nm spot diameter. For E20 and all RNA FISH experiments, a spot size diameter of 200 nm was used. For RPB1 and POLR2G foci (IF), large foci were defined as those ≥500 nm diameter (Cho et al., 2018; Sabari et al., 2018).
For 3D cluster analysis of FISH spots, Ripley’s K function was performed using the x,y,z coordinates for each FISH spot using the Imaris Spots function to determine observed and null distribution values.
Ripley’s K function compares the number of points at a distance smaller than a given radius r, relative to the average number of points in the volume. This average is the density lambda, in this case the number of foci, n, divided by the volume. In the above equation,
is the indicator function which equals 1 if the distance between points i and j is no larger than r, and 0 otherwise. A high value of Ripley’s K function represents clustering at the given radius r, whereas a low value represents dispersion. Consequently, a high Ripley’s K function at a given radius is indicative of clustering at this radius. By comparing the observed value of Ripley’s K function at a given radius with that computed on the same number of foci and with the same volume but drawn from a uniform null distribution, the presence of significant clustering in the given cluster at the given radius can be detected.
The code written to perform this analysis was formed using a script written in Python (v3.9) and has been made available on GitHub (see Data availability). Ripley’s K function was determined across a radius of 0.1–1 µm in 0.1 µm increments. After calculating the observed Ripley’s K function value, a null distribution of no clustering, estimated on uniformly distributed samples with the same number of spots, was generated using the coordinates for each given nucleus to calculate 10,000 Ripley’s K function values at each radial increment. We tested a sample of nuclei with 50,000 values and confirmed that 10,000 values would provide sufficient accuracy. Having sampled that nucleus shape and size did not affect the significance of a result at each increment in the given range of radii, a bounding radius of 5 was used for all samples. Only nuclei with greater than 20
All other statistical analysis was performed with GraphPad Prism v9.0. The statistical details for each experiment can be found in the relevant figure legends and in the Source Data. For figures, p-values are represented as follows: *<0.05, **<0.01, ***<0.001, ****<0.0001. Where appropriate, Bonferroni correction for multiple hypothesis testing was performed, and, where relevant, corrected p-values are those plotted in the figures and are given in the Source Data in brackets next to the uncorrected p value.
RNA and WGS sequencing sample preparation, analysis, and processing
The preparation of these cell lines for RNA-seq has been described in detail elsewhere (Gangoso et al., 2021). WGS was undertaken by BGI Tech Solutions with PE100 and normal library construction. WGS, RNA-seq, and AmpliconArchitect data for GBM39 was taken from data made available via publication and in the NCBI Sequence Read Archive (BioProject: PRJNA506071) (Wu et al., 2019).
Sequences were aligned to hg38 with STAR 2.7.1a with settings ‘--outFilterMultimapNmax 1’ used for WGS and RNA-seq data and settings ‘
Source data
Source data regarding the statistical tests applied, the exact sample number, p-values of tests (and adjustments for multiple hypothesis testing), and details of replicates are included where indicated in the article. N=number of nuclei.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022, Purshouse et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Extrachromosomal DNA (ecDNA) are frequently observed in human cancers and are responsible for high levels of oncogene expression. In glioblastoma (GBM), ecDNA copy number correlates with poor prognosis. It is hypothesized that their copy number, size, and chromatin accessibility facilitate clustering of ecDNA and colocalization with transcriptional hubs, and that this underpins their elevated transcriptional activity. Here, we use super-resolution imaging and quantitative image analysis to evaluate GBM stem cells harbouring distinct ecDNA species (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer