Introduction
Transgenesis, which is the specific and heritable introduction of foreign DNA into genomes, has been a central tool for functional analysis and genetic engineering for nearly 40 y. The power of transgenesis is due in part to the wide variety of assays and techniques that are built upon controlled introduction of novel DNA sequences into a native genome. While there are many uses for transgenesis, in practice most can be grouped into those inserting a small number of known sequences (specific transgenesis) and those introducing many sequence variants from experimental libraries (exploratory transgenesis). While the ability to perform specific transgenesis has become a de facto requirement for all model organisms, exploratory transgenesis remains effectively limited to single-cell models (both prokaryotic and eukaryotic) because of biological limitations generated by inheritance in multicellular organisms. In single-cell models, high-throughput transgenesis has been used for exploratory sampling of sequence space using protein interaction libraries (Joung et al., 2000), barcode-lineage tracking libraries (Levy et al., 2015; Nguyen Ba et al., 2019), directed evolution (Packer and Liu, 2015), synthetic promoter library screens (Wu et al., 2019), and mutagenesis screens (Bock et al., 2022; Erwood et al., 2022; Kim et al., 2022; Sánchez-Rivera et al., 2022). Despite the usefulness of such experiments in single-celled systems, either in microorganisms or in cell culture, increasing transgenic throughput in multicellular models holds the potential to expand the impact of exploratory transgenesis in functional domains, such as inter-tissue signaling, neuronal health, and animal behavior, that are dependent on multicellular interactions and therefore difficult to replicate in single-cell models.
Exploratory transgenesis in single-cell models has been facilitated by the availability of in vitro-generated DNA libraries, selectable markers, plasmids, in vivo homologous recombination, and most importantly, the ability to massively parallelize transgenesis using microbial transformation or eukaryotic cell transfection/transduction. Currently, there is no practical means to make populations of uniquely transgenic individuals from sequence libraries at a similar scale in animal systems due to the Weismann barrier (Weismann, 1893): the split between soma and germline. The requirement that the germline be accessible and editable has forced animal systems into a transgenic bottleneck compared to single-cell systems because it is very difficult to introduce exogenous DNA directly into the germline in a high-throughput manner, relying instead on injection, bombardment, or some other physical intervention. This low-throughput limitation in animals dramatically reduces the sequence diversity that can be sampled, effectively preventing large-scale exploratory experiments from being performed. Attempts have been made to parallelize transgenic creation in multicellular model organisms, for example, the development of Brainbow (Livet et al., 2007; Weissman and Pan, 2015), ifgMosaic analysis (Pontes-Quero et al., 2017), P[acman] libraries in
Here, we present ‘Transgenic Arrays Resulting in Diversity of Integrated Sequences’ (TARDIS) (Stevenson et al., 2021), a simple yet powerful alternative to traditional single-copy transgenesis. TARDIS greatly expands throughput by explicitly separating and reordering of the conceptual steps of transgenesis (Figure 1). To increase throughput, TARDIS begins with an in vitro-generated DNA sequence library that is introduced into germ cells via traditional low-throughput methods (i.e., germline transformation, Figure 1). While traditional transgenesis typically couples the physical introduction of DNA into cells with the integration of a selected sequence from the original library, the DNA sequences in TARDIS are designed to be incorporated in large numbers into diverse, heritable sub-libraries (TARDIS libraries), rather than be directly integrated into the desired genomic locus. In addition to the sequence library, a functioning selectable marker is also included to stabilize the inheritance of the TARDIS library over generations. These TARDIS libraries function to create ‘metaploidy’ – expanding the total number of alleles available for inheritance, essentially making the worm genetically ‘bigger on the inside.’ TARDIS library-bearing animals are then allowed to propagate under selection to generate a large population of TARDIS library carriers. After population expansion, genome integration of a single-sequence unit is performed by inducing a double-strand break at a genetically engineered landing pad. This landing pad is designed to both integrate a sequence unit and act as a second selectable marker. We chose
Figure 1.
Transformation compared to Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS).
For transformation, a large population of cells are individually transformed with a DNA library, resulting in a diverse population of individuals. TARDIS achieves a diversity of individuals by splitting transgenesis into two separate processes: (1) the introduction of a diverse library, which is formed into a TARDIS library array, passed down to future generations and thus replicated; and (2) an event that triggers the integration a sequence from the library at random, resulting in a diversity of integrated sequences.
We demonstrate the functionality of TARDIS for two use cases: unique animal barcoding and promoter library transgenesis. Barcoding has been widely adopted in microbial systems for evolutionary lineage tracking (Jahn et al., 2018; Levy et al., 2015; Nguyen Ba et al., 2019) and for developmental lineage tracking in animals (Kebschull and Zador, 2018; McKenna et al., 2016). In microbial systems, barcode libraries have relied on highly diverse randomized oligo libraries, compared to animal systems, which have relied on CRE recombinases or randomized Cas9-induced mutations. Here, we present a novel TARDIS barcoding system for an animal model that mimics the scope and diversity previously only possible using microbial systems. Our results show that large, heritable libraries containing thousands of barcodes can be created and maintained as extrachromosomal arrays. Individual sequences are selected and removed from the library upon experimental induction of Cas9 in a proportion consistent with the composition of the TLA with rare overrepresented sequences. We found that TARDIS is also compatible with the integration of large promoters and can be used to simultaneously integrate promoters into multiple genomic locations, providing a tool for multiple insertions at defined locations across the genome. While we demonstrate the system’s advantages in
Results
Generation of barcode landing pad
We designed a specific landing pad for the introduction and selection of small barcode fragments from high-diversity, multiplexed barcode libraries (Figure 2). This landing pad was designed to be targeted by Cas9 and requires perfect integration on both the 5′ and 3′ ends of a synthetic intron for functional hygromycin B resistance. Current split selection landing pads only provide selection on one side of the double-strand break, which can result in a small percentage of incomplete integrations (Stevenson et al., 2020). To fully test a large library approach, the requirement of genotyping to identify correct integrations must be overcome. A split-selection, hygromycin resistance (HygR) system was chosen for its simplicity and integration-specific selection. A unique synthetic CRISPR guide RNA target sequence was created by removing coding sequence on both sides of an artificial intron, resulting in a nonfunctional HygR gene. By removing critical coding sequence on both sides of the gene, only ‘perfect’ integration events will result in hygromycin resistance (Figure 2A). The synthetic landing pad was integrated at Chromosome II: 8,420,157, which has previously been shown to be permissive for germline expression (Dickinson et al., 2015; Frøkjær-Jensen et al., 2012; Frøkjaer-Jensen et al., 2008).
Figure 2.
Barcode landing pad and diverse donor library.
(A) Schematic design for the barcode landing pad and integration. A broken hygromycin resistance gene is targeted by Cas9, which repairs off the Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS) array, integrating a barcode and restoring the functionality of the gene. (B) The TARDIS multiplex library was created from a randomized oligo library, which underwent 10 cycles of PCR to make a dsDNA template. The barcode fragment was then added into a three fragment overlap PCR to add homology arms and make the final library for injection.
Figure 2—figure supplement 1.
Schematic layout for the two separate PCR processes for identification of barcode counts in arrays (Amplicon One-Array) and integrants (Amplicon One-Integrant).
Generation of high-diversity donor library and TARDIS arrays
Transgenes or DNA sequences can be cloned into plasmid vectors for injections in
We injected our complexed barcodes and isolated individual TARDIS array lines, each containing a subset of the barcode library (Figure 3). Individual injected worms were singled, and we identified four arrays from three plates. Arrays 1 and 2 were identified on separate plates, and were therefore derived from independent array formation events, while array 3, profile 1 and array 3, profile 2 were both identified on the same plate. Analysis of array diversity within these lines shows, somewhat unexpectedly, that during array formation a subset of barcode sequences tended to increase in frequency (Figure 3A and B). Higher frequency barcodes in arrays tend to be independent of the jackpotted sequences of the injection mix as very few are represented in the set of high-frequency barcodes from the injection mix. The high-frequency barcodes also varied between arrays.
Figure 3.
Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS) library arrays can contain large barcode diversity.
(A) Frequency distribution of 1319 unique barcodes in array 1 (PX816). (B) Frequency distribution of the 3001 unique barcode sequences in array 2 (PX817). (C) Sequence logo probabilities of the 15 base pair positions of the barcodes in the injection mix, array 1 and array 2.
Figure 3—figure supplement 1.
Barcode frequency in injection mix.
Barcode frequencies for Injection mix used for arrays 1–3. There are approximately 1 million reads represented. In total, 797,353 unique sequences were identified. A few of these unique sequences were represented at a higher frequency with a count cutoff greater than 50.
Figure 3—figure supplement 2.
Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS) array 3.
Two individual arrays were isolated from the same plate. Both show considerably less diversity than TARDIS arrays 1 and 2. Distribution of unique barcode frequencies, sequence logo base pair probability, and count cutoff for (A) TARDIS array 3 profile 1 and (B) TARDIS array 3 profile 2.
Figure 3—figure supplement 3.
Determination of proper count cutoff for (A) Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS) array 1 and (B) TARDIS array 2.
We found that array formation does not seem to favor any particular barcode sequence motif (Figure 3C) and that arrays can range considerably in diversity. Array 1 had 1319 unique barcode sequences, array 2 had 3001 unique barcode sequences, array 3 profile 1 had 91 unique barcode sequences, and array 3 profile 2 had 204 unique barcode sequences (Figure 3—figure supplement 2). Across the four arrays, we found a total of 4395 unique barcode sequences. When we compared the individual sequences incorporated during the three independent injections, we found little overlap. 96.5% (4395/4553) of the identified sequences were unique to one injection, 3.0% (136) were incorporated twice, and 0.5% (22) were recovered from all three injections. In contrast to the diversity between injection events, a similar comparison of the two profiles derived from a single injection for array 3 showed considerable overlap, with 68% (62/91) of the profile 1 sequences also being present in profile 2. Overall, our results suggest our complexing PCR oligo library can produce a highly diverse library and that arrays can store a large diversity of unique sequences.
The distribution of element frequency within a given array follows a clear Poisson distribution. Arrays 1 and 2 show more diversity, with barcode frequencies more similar to one another than the two profiles isolated from array 3 (Figure 3—figure supplement 2). The null assumption is that the array is formed from a simple sample of the injected barcodes in equal proportions. However, arrays have been already reported to jackpot certain sequences. For example, when Lin et al., 2021 injected fragmented DNA, they found that larger fragments were favored in the assembly. In our case, we find some barcode sequences become jackpotted, despite being identical in size. A possible explanation is that early in formation, arrays are replicating sequences, possibly to reach a size threshold. Consistent with this hypothesis, arrays with higher barcode diversity had frequencies closer to one another, while arrays with lower diversity had wider frequency ranges.
Integration from TARDIS array to F1
Our primary motivation in developing the TARDIS method was to utilize individual sequences from the TARDIS array as integrated barcodes. To assay the integration efficiency, we performed TARDIS integration on two biological replicates from a TARDIS array line (PX786) synchronized in the presence of G-418. Out of the 100 L1’s per plate initially plated on antibiotic free plates, an average of 41 worms (N = 255 plates) for replicate 1 and 62 worms for replicate 2 (N = 125 plates) survived to the next day. These surviving individuals contained the array, allowing them to survive early-life G-418 exposure and generally showed fluorescent co-marker expression as well. Following heat shock to induce Cas9, replicate 1 produced 104 plates with hygromycin-resistant individuals, indicating barcode integration, and replicate 2 produced 71. These results suggest that approximately 200–300 worms need to be heat-shocked to obtain an integrated line when using 150 bp homology arms and relatively small inserts such as the barcodes. To assay the integration frequency from the array to the F1, we performed TARDIS integration on four biological replicates derived from PX786. We found that the frequency of integration for barcodes in F1 individuals was strongly correlated with the barcodes’ frequency in the TLA (Figure 4A;
Figure 4.
Integration frequency from Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS) library array to F1.
(A) Frequency of integration from TARDIS library array to the F1, R ≈ 0.96, p≈5.7 × 10-154. Different colors represent four biological replicates. Line shading represents 95% confidence interval. (B) Sequence probabilities of PX786 compared to the F1 integrations (91 unique barcodes were identified in the array and 118 in the F1s, with a five read threshold).
Figure 4—figure supplement 1.
F1 integration events followed a consistent pattern, with replicated outlier barcode sequences.
Generally, the same barcodes integrated at approximately the same frequency across the four replicates.
Generation and integration of TARDIS promoter library
For testing insertion of promoter libraries via TARDIS, two separate landing pad sites utilizing split selection were engineered in chromosome II (Figure 5A). The first contained the 3′ portions of both the
Figure 5.
Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS) promoter library.
(A) Overview of the two split landing pads and their associated promoter insertion vectors. Both the selective marker and the fluorophore expression are restored upon correct integration. (B) Transcriptional reporters for nine genes were recovered from a single heatshock of a single TARDIS array line (PX819). Integration was into the single mScarlet-I/HygR landing pad. Main images show mScarlet-I expression for the indicated reporter while insets show polarized image of the same region. (C) Example simultaneous, dual integration from a single TARDIS array into the double landing pad strain with PEST. ceh-10p::mNeonGreen::PEST is false-colored green and ceh-40p::mScarlet-I::PEST is false-colored magenta. All scale bars represent 20 µm.
Figure 5—figure supplement 1.
Transformation efficiency for promoter arrays.
(A) Promoters from seven array-bearing lines were amplified using universal primers and show distinct profiles. Note that not all promoters found in the arrays were detected in this screen. (B) Promoters contained in each array-bearing line as determined by promoter specific PCR (C) Line PX819 was heat-shocked to trigger integration and four hygromycin-resistant progeny were singled from each of the 59/60 plates with hygromycin-resistant individuals. Singled worms were screened by PCR, and select promoters were chosen for sequencing based on their size profile. PCR and sequencing result are shown for representative plates. *Due to their size of the egl-46 and nhr-67, promoters do not reliably amplify with the universal primers. Therefore, samples with no or weak amplification were rescreened with primers specific to these two promoters (not shown).
The initial promoter library tested was composed of 13 promoters targeted to a single landing pad site with split
Table 1.
Characteristics of injected promoters and presence in tested array line (PX819) and integrated lines derived from that array.
| Promoter | Promoter size (bp) | Expected expression location | Array | Integrated |
|---|---|---|---|---|
|
| 330 | Neurons, hypodermis, intestine, pharynx (Jiang et al., 2001) | Y | Y |
|
| 514 | Head neurons (Bertrand et al., 2011) | Y | N |
|
| 965 | Dopaminergic neurons (Sarov et al., 2012) | Y | Y |
|
| 1172 | Neurons, seam cells (Reece-Hoyes et al., 2007) | Y | Y |
|
| 1387 | ALM and RME neurons (Huang et al., 2004) | Y | N |
|
| 2000 | Neurons, body wall, pharynx (Reece-Hoyes et al., 2007) | Y | Y |
|
| 2001 | Neurons, gonad (Hwang et al., 2007) | Y | N |
|
| 2015 | Neurons, seam cells, vulva (Reece-Hoyes et al., 2007) | Y | Y |
|
| 2096 | Neurons, anterior hypodermis (Reece-Hoyes et al., 2007) | Y | Y |
|
| 2524 | Nead neurons, coelemocytes, pharynx (Klabonski et al., 2016) | Y | Y |
|
| 2857 | Neurons, uterus, vulva, head muscle (Gupta et al., 2003) | Y | Y |
|
| 4477 | Neurons (Wu et al., 2001) | N | N |
|
| 5545 | Neurons, excretory cell, rectal valve cell, vulva (Fernandes and Sternberg, 2007) | Y | Y |
Y, yes; N, no.
To test whether TARDIS could be used to target multiple sites simultaneously, a second promoter library containing seven promoters targeted to each site (
When transcriptional reporter lines were examined by fluorescent microscopy, expression of the fluorophores was concentrated in but not exclusive to the nucleus, consistent with the presence of nuclear localization signals (NLS) on the fluorophores. For all promoters, expression was seen in at least one previously reported tissue (Table 1) but was absent in one or more tissues for several of the promoters. Expression of single-copy reporters is frequently more spatially restricted than that from integrated or extrachromosomal arrays (Aljohani et al., 2020). The differences in expression pattern may also reflect the differences in the region used as the promoter or the fact that only a single developmental stage (late L4/early adult) was examined. Overall, we find that TARDIS can be used to screen functional libraries, either individually or in combination.
Discussion
Here, we present the first implementation of a practical approach to large-scale library transgenesis in an animal system (Figure 1). Building on over a half century of advancements in
TARDIS as a method for creating barcoded individuals
Genetic barcode libraries have been applied to many high-throughput investigations to reduce sequencing costs and achieve a higher resolution within complex pools of individuals. By focusing the sequencing reads on a small section of the genome, a larger number of individual variants can be identified or experimentally followed. This critical advancement has led to the widespread use of barcoding for evolutionary lineage tracking in microbial systems (Blundell and Levy, 2014; Kasimatis et al., 2021; Levy et al., 2015; Levy, 2016; Nguyen Ba et al., 2019; Venkataram et al., 2016) – uncovering the fitness effects of thousands of individual lineages without requiring large coverage depth of the whole genome. In addition to this application, using barcoded individuals can be used to facilitate any application that involves screening a large pool of diverse individuals within a shared environment. For example, barcodes have been used in microbial studies investigating pharmaceutical efficacy (Smith et al., 2011) and barcoded variant screening (Emanuel et al., 2017). The TARDIS-based system presented here provides an approximately 1000×-fold increase in barcoding throughput in
While we designed our barcode sequence units for the purpose of barcoding individuals, this approach could also prove useful in future optimization and functional understanding of array-based processes. In particular, the high-sequence diversity but identical physical design of the synthetic barcode library may provide a unique window into extrachromosomal array biology that would be helpful in optimizing sequence units for incorporation into heritable TLAs. For example, an unexpected result of the barcoding experiment was the discovery that a small minority of sequences were overrepresented, or ‘jackpotted,’ in the TLA relative to their frequency in the injection mix (Figure 3 and Figure 3—figure supplement 1). Our expectation was that arrays would form in an equal molar fashion proportional to the injection mix based on the model that arrays are formed by physical ligation of the injected DNA fragments (Mello et al., 1991). Deviations from random array incorporation have been observed before, and a bias for incorporating larger fragments has been proposed as an explanatory mechanism (Lin et al., 2021). Our results suggest that the ultimate array composition is not directly proportional to the molarity of the injected fragments or strictly weighted towards the size of the fragment as has been suggested. In contrast, we propose that array size affects the maintenance of extrachromosomal arrays. As such, selection can act to increase the rate of recovery for arrays that have increased their size through random amplification of some sequences by an unknown process early in the formation of the array or by expansion of similar sequences by DNA polymerase slippage during replication, as has been well documented for native chromosomes (Levinson and Gutman, 1987). These hypotheses would be consistent with observations of Lin et al., 2021 if the underlying mechanism for their observation is that inclusion of larger fragments tends to be positively correlated with ultimate array size, and therefore likelihood of maintenance.
TARDIS as a method for the introduction of promoters and other large constructs
While the barcode approach demonstrates the potential for using TARDIS to integrate large numbers of 433 bp PCR products, previous work using CRISPR/Cas9-initiated homology-directed repair has suggested that integration efficiencies decrease with the size of the insert (Dickinson and Goldstein, 2016). We therefore implemented TARDIS for integrating promoters cloned into a vector backbone and ranging in size from 330 bp to 5.5 kb to determine TARDIS functionality under a physically different use case directed specifically at functional analysis. We found that promoter libraries could be integrated into either single sites or two sites simultaneously. Unsurprisingly, the frequency at which various promoters were recovered varied from array to array (e.g.,
For both the one- and two-site promoter library integrations, transgenic individuals were readily detected, suggesting that the TARDIS method for integration was highly efficient. It has long been understood that successful CRISPR editing at one site significantly increases the chances of successful editing at a second site. This is the premise behind commonly used co-conversion screening strategies (also referred to as co-CRISPR), such as the
In order to recover individual edits most efficiently, given the high frequency of integration using TARDIS, we recommend to either heat-shock small cohorts of array-bearing individuals, such that most cohorts only yield one edited individual or to screen multiple individuals per cohort. Additionally, while split-selection methods allow for direct verification of integration, depending on the downstream use case, integrations should be confirmed by sequencing as errors can still occur, including internal deletions within the insert.
Expansion of TARDIS to other multicellular systems
Unlocking the investigative potential of transgenesis in animal systems would enable exploratory experiments normally restricted to single-cell models. For example, alanine scanning libraries and protein–protein interactions (Cunningham and Wells, 1989; Matthews, 1996; Wells, 1991), CRISPR library screening (Bock et al., 2022), and promoter library generation (Delvigne et al., 2015; Zaslaver et al., 2006). While we demonstrate the use of TARDIS in
The second component of the TARDIS integration system is a pre-integrated landing pad sequence. We have generally favored split selectable landing pads (SSLPs) that use HygR for its effectiveness (Mouridi et al., 2022; Stevenson et al., 2020; Stevenson et al., 2021). The SSLPs are engineered to accept experiment-specific units from the array. For example, here we used SSLPs designed to accept barcodes for experimental lineage tracking and promoters for generation of transcriptional reporters. To translate TARDIS to other systems, a genomic site needs to be engineered to act as a landing pad that can utilize sequence units from the TLA and can be customized to the specific system and use. Because TLAs allow the experimenter to design the library of interest and the landing pad to recapitulate the strengths of single-cell systems, adoption of TARDIS in multicellular animal experiments can leverage the high-resolution, high-diversity exploratory space of DNA synthesis. In addition to adapting assays currently restricted to single-cell models, TARDIS also opens the door to animal-specific uses, such as developmental biology, neurobiology, endocrinology, and cancer research.
In developmental genetics, the lack of large-library transgenesis has resulted in ‘barcode’ libraries in a different form, utilizing randomized CRISPR-induced mutations to form a unique indel. For example, GESTALT (McKenna et al., 2016) creates a diversity of barcodes in vivo via random indel formation at a synthetic target location. LINNAEUS (Spanjaard et al., 2018) similarly utilizes randomized targeting of multiple RFP transgenes to create indels, allowing for cells to be barcoded for single-cell sequencing. TARDIS barcodes do not rely on randomized indel generation and thus can be much simpler to implement with sequencing approaches outlined above.
In vivo cancer models have also adopted the high-resolution, high-variant detection of barcodes for the study of tumor growth and evolution. Rogers et al. developed Tuba-seq (Rogers et al., 2017; Winslow, 2022), a pipeline that takes advantage of small barcodes allowing for in vivo quantification of tumor size. In Tuba-seq, barcodes are introduced via lentiviral infection, leading to the barcoding of individual tumors. TARDIS brings the multiplexed library into the animal context without requiring viral vectors or intermediates, thereby allowing large in vivo library utilization and maintenance. Capitalizing on the large-sequence diversity possible within synthesized DNA libraries with a novel application in multicellular systems generates new opportunities for experimental investigation in animal systems heretofore only possible within microbial models.
Conclusion
In conclusion, here we have presented TARDIS, a simple yet powerful approach to transgenesis that overcomes the limitations of multicellular systems. TARDIS uses synthesized sequence libraries and inducible extraction and integration of individual sequences from these heritable libraries into engineered genomic sites to increase transgenesis throughput up to 1000-fold. While we demonstrate the utility of TARDIS using
Materials and methods
Key resources table
| Reagent type (species) or resource | Designation | Source or reference | Identifiers | Additional information |
|---|---|---|---|---|
| Genetic reagent ( |
| wormbase.org | WBGene00000095 | |
| Genetic reagent ( |
| wormbase.org | WBGene00001960 | |
| Genetic reagent ( |
| wormbase.org | WBGene00000461 | |
| Genetic reagent ( |
| wormbase.org | WBGene00000435 | |
| Genetic reagent ( |
| wormbase.org | WBGene00000096 | |
| Genetic reagent ( |
| wormbase.org | WBGene00003163 | |
| Genetic reagent ( |
| wormbase.org | WBGene00001207 | |
| Genetic reagent ( |
| wormbase.org | WBGene00000443 | |
| Genetic reagent ( |
| wormbase.org | WBGene00000463 | |
| Genetic reagent ( |
| wormbase.org | WBGene00000903 | |
| Genetic reagent ( |
| wormbase.org | WBGene00003000 | |
| Genetic reagent ( |
| wormbase.org | WBGene00001210 | |
| Genetic reagent ( |
| wormbase.org | WBGene00003657 | |
| Strain, strain background ( |
| Caenorhabditis Genetics Center | ||
| Strain, strain background ( |
| doi:10.17912/micropub.biology.000518 | Available from the | |
| Strain, strain background ( | PX740 | This paper | N2-PD1073 fxIs47 [ | |
| Strain, strain background ( | GT331 | This paper | aSi9[ | |
| Strain, strain background ( | GT332 | This paper | aSi10[ | |
| Strain, strain background ( | GT336 | This paper | aSi12 | |
| Strain, strain background ( | GT337 | This paper | aSi13[ | |
| Strain, strain background ( | QL74 | Gift from QueeLim Ch’ng | oxEx1578 [ | |
| Strain, strain background ( | PX786 | This paper | fxEx23 [TARDIS #5 | |
| Strain, strain background ( | PX816 | This paper | fxEx25 [TARDIS #1 | |
| Strain, strain background ( | PX817 | This paper | fxEx26 [TARDIS #2 | |
| Strain, strain background ( | PX818 profile 1 | This paper | fxEx27 [TARDIS #3 | |
| Strain, strain background ( | PX818 profile 2 | This paper | fxEx28 [TARDIS #4 | |
| Strain, strain background ( | PX819 | This paper | N2 fxEx24 [( | |
| Strain, strain background ( | EG4322 | doi.org/10.1038ng.248; Caenorhabditis Genetics Center | ||
| Strain, strain background ( | PXKR1 | This paper | NA22 transformed with pUC19 | |
| Recombinant DNA reagent | Plasmid pDSP15 | This paper | 193853 (Addgene) |
|
| Recombinant DNA reagent | Plasmid pDSP16 | This paper | 193854 (Addgene) |
|
| Recombinant DNA reagent | Plasmid pMS84 | This paper | 193852 (Addgene) | |
| Recombinant DNA reagent | Plasmid pZCS36 | This paper | 193048 (Addgene) |
|
| Recombinant DNA reagent | Plasmid pZCS38 | This paper | 193049 (Addgene) |
|
| Recombinant DNA reagent | Plasmid pZCS41 | This paper | 193050 (Addgene) | |
| Sequence-based reagent | ZCS422 | This paper | Design and construction of barcode donor library | |
| Commercial assay or kit | DNA Clean and Concentrator | Zymo Research | Cat# D4004 | |
| Commercial assay or kit | Genomic DNA Clean and Concentrator | Zymo Research | Cat# D4011 | |
| Commercial assay or kit | Zymoclean Gel DNA Recovery Kit | Zymo Research | Cat# D4008 | |
| Commercial assay or kit | Zyppy Plasmid Miniprep Kit | Zymo Research | Cat# D4019 | |
| Software, algorithm | Cutadept | doi.org/10.14806/ej.17.1.200 | Version 4.1 | |
| Software, algorithm | AmpUMI | doi.org/10.1093/bioinformatics/bty264 | Version 1.2 | |
| Software, algorithm | Starcode | doi.org/10.1093/bioinformatics/btv053 | Version 1.4 | |
| Software, algorithm | Google colab | colab.research.google.com | ||
| Software, algorithm | Python (version) | Guido van Rossum, 1991 | Version 3.7.13 | |
| Software, algorithm | Juypter Notebook (IPython) | doi:10.3233/978-1-61499-649-1-87 | Version 7.9.0 | |
| Software, algorithm | matplotlib | doi:10.5281/zenodo.3898017 | Version 3.7.13 | |
| Software, algorithm | Fiji | imagej.net/software/fiji/ | Version 2.9.011.53t | |
| Chemical compound, drug | G-418 | GoldBio (CAS number 108321-42-2) | Cat# G-418-5 | |
| Chemical compound, drug | Hygromycin B | GoldBio (CAS number 31282-04-9) | Cat# H-270-10-1 |
General TARDIS reagents
Strains generated for this publication along with key plasmids and reagents are listed in the Key Resources Table. A full list of all plasmids is given in Supplementary file 1. All plasmids were cloned by Gibson Assembly following the standard NEB Builder HiFi DNA Assembly master mix protocol (New England Bio Labs [NEB], MA), unless otherwise indicated. All plasmids have been confirmed by restriction digest, Sanger sequencing, and/or full plasmid sequencing. All primers used in the construction and validation of plasmids are listed in Supplementary file 2.
To generate our heatshock-inducible Cas9,
To generate a standard empty guide vector,
To generate
Genomic DNA isolation for array and integrant characterization
For processing large populations of worms, a widely used bulk lysis protocol was adapted (Fire Lab 1997 Vector Supplement, February 1997). In brief, 450 µl of worm lysis buffer (0.1 M Tris-Cl pH 8.0, 0.1 M NaCl, 50 mM EDTA pH 8.0, and 1% SDS) and 20 µl 20 mg/ml proteinase K were added to approximately 50 µl of concentrated worm pellet. Samples were inverted several times to mix and incubated at 62°C for 2 hr. After incubation, samples were checked under the microscope to ensure no visible worm bodies were left in the solution. ChIP DNA binding buffer (Zymo, CA) was added in a 2:1 ratio and gently inverted to mix. Samples were then purified with Zymo-Spin IIC-XLR columns following the manufacturer’s protocol. Samples were eluted in 50 µl of water. Each sample was then digested with 10 mg/ml RNase A (Thermo Fisher Scientific, MA, Cat# EN0531) at 42°C for 2 hr. Genomic DNA was then reisolated by adding a 2:1 ratio of ChIP DNA binding buffer and purifying with Zymo-Spin IIC-XLR columns. Final genomic samples were quantified by Nanodrop.
For individual worm lysis, individual array-bearing worms were isolated and lysed in 4 µl of EB (Zymo, Cat# D3004-4-16) buffer with 1 mg/ml proteinase K (NEB). Each sample was rapidly frozen in liquid nitrogen and then thawed to disrupt the cuticle and then incubated at 58°C for 1 hr, with a subsequent incubation at 95°C for 20 min to inactivate the proteinase K.
TARDIS integration: General protocol
On day 0, TARDIS array-bearing
Construction of landing pad for barcodes
To create the barcode landing pad, an intermediate Chr. II insertion vector, pZCS30, was built from pMS4 by using PCR to remove the
The barcode landing pad TARDIS strain, PX740, was created by injecting a mixture of 10 ng/µl pZCS32, 50 ng/µl pMS8, and 3 ng/µl pZCS16 (Addgene ID154824) (Stevenson et al., 2020) into the gonad of young adult N2-PD1073 (Teterina et al., 2022) hermaphrodites. Screening and removal of the SEC were performed following Dickinson et al., 2015. Presence of the correct insertion was confirmed by Sanger sequencing using the primers listed in Supplementary file 3.
To create the barcode landing pad targeting guide RNA,
Design and construction of barcode donor library
Oligo ZCS422 was ordered with 11 randomized N’s (hand-mixed bases) (Integrated DNA Technologies [IDT], IA) and has the following sequence:
For ‘barcode-15X,’ to generate the complete donor homology, the double-stranded barcode template was combined with both the left and right homology arms for a three-fragment overlap PCR. To maximize diversity, high concentrations of the individual templates were used. The reaction contained 52 fmol/µl of barcode template and 22 fmol/µl of left right arms in a single 50 µl Q5 reaction. A total of 15 cycles were performed. The lower cycle was again done to reduce PCR jackpotting. The single product was gel extracted as a 433 bp fragment. The final donor fragment is referred to as ‘barcode-15X.’
To generate ‘barcode-20X,’ a similar three-fragment overlap PCR was used. 4.3 fmol/µl of barcode template, 15.33 fmol/µl of left arm, and 3.3 fmols/µl of right arm were combined across six Q5 50 ul reactions and a total of 20 cycles were performed. The right arm concentration was lower caused by low concentration extraction. The single product was gel extracted as a 433 bp fragment. The final donor fragment is referred to as ‘barcode-20X.’
Generation of barcode TLA lines
The TARDIS array-bearing line PX786 was created by injecting 50 ng/µl of barcode-15X, 10 ng/µl pZCS38, 15 ng/µl pZCS41, 5 ng/µl pZCS16, and 20 ng/µl pZCS36 into young adult PX740 hermaphrodites. Individual injected worms were grown at 15°C for 4 d and then treated with G-418 (1.56 mg/ml). A single stable array line was isolated and designated PX786.
The TARDIS array-bearing lines PX816, PX817, PX818 profile 1 and PX818 profile 2 were created by injecting 100 ng/µl of barcode-20X, 10 ng/µl pZCS38, 15 ng/µl pZCS41, and 20 ng/µl pZCS36. Individual injections were grown at 15°C for 4 d and then treated with G-418 (1.56 mg/ml). Full genotypes are provided in Supplementary file 7 as the full genotypes cannot be contained within a table.
Estimation of barcode integration frequency population sample preparation
PX786 was grown to gravid adults in the presence of G-418 with concentrated NA22 transformed with pUC19 for ampicillin resistance as a food source (designated PXKR1). Once gravid, the strain was hypochlorite synchronized and grown overnight in 15 ml NGM buffer with G-418 at 15°C with nutation. For each of the four replicates, a synchronized L1 population was divided in half. The first half was pelleted by centrifugation (2400 ×
PCR for barcode quantification
Several different PCRs were performed depending on the intended downstream sequencing quantification. See Figure 2—figure supplement 1 for a schematic layout of the different PCR steps. The primers used for barcode quantification are given in Supplementary file 5. To quantify the diversity of arrays from either a bulk population or individual worms, two separate PCRs were performed to quantify the diversity of arrays.
The first PCR (Amplicon one array) was performed for three cycles to add Unique Molecular Identifiers (UMI), allowing for downstream de-duplication. For each sample, either 100 ng of genomic DNA (bulk samples) or the entirety of the worm lysate (single worms) was used as the template. PCR samples were then purified using the Zymo DNA Clean and Concentrator-5 Kit (Cat# D4004) following the manufacturer’s protocol and eluted with 24 µl water. Samples were not quantified prior to the next step as most DNA was not from the target PCR product. A second PCR (Amplicon two) using the entire 24 µl of the extract from the previous step was performed for 24 cycles to add indices. In some cases, a smaller, nonspecific product was also formed, so each sample was run on a 2% agarose gel and extracted for the 169 bp size product.
Two separate PCRs were performed to quantify the diversity of integrated barcode sequences. The first PCR (Amplicon one integrant) was performed for three cycles to add UMI sequences. For each sample, 100 ng of genomic DNA was used as the template. PCR products were then purified as described above and followed the Amplicon two protocol. Each product was quantified on a Synergy H1 plate reader using software Gen 5 3.11. Samples were mixed at an equal molar ratio for a 20 nM final concentration for Illumina sequencing.
Illumina sequencing and data processing for barcode characterization
To quantify the diversity of barcodes in each sample, PCR products were sequenced on either a single NextSeq 500 lane or NovaSeq SP, with single read protocols performed by the Genomics and Cell Characterization Facility (GC3F) at the University of Oregon. Compressed fastq files were processed with cutadept 4.1 (Martin, 2011) to remove low-quality reads (quality score < 30, max expected error = 1, presence of ‘N’ within the read) and trimmed to 87 bp. For the NextSeq lane, the specific nextseq trim = 30 command was used. The sequences were then demultiplexed using cutadept. For duplicate removal, AmpUMI (Clement et al., 2018) in ‘processing mode’ was used with umi regex ‘CACIIIIIIIIIIGAC’ for individual index files. De-duplicated reads were then trimmed to 15 base pairs with cutadapt for each file. Starcode (Zorita et al., 2015) was then used for mutation correction and counting of each barcode sequence. Each unique sequence was only kept if its final length was 15 base pairs. For the injection mix, each unique barcode was kept regardless of total reads. For all TARDIS arrays and F1 integrations, we used the observed plateau in the number of observed unique barcodes for various count cutoffs to establish a conservative threshold of five or more reads for true barcode sequence (Figure 3—figure supplement 3). Visualizations were created with Python 3.7.13 (Guido van Rossum, 1991) and matplotlib 3.5.2 (Hunter, 2007). Sequence logos were created with Logomaker (Tareen and Kinney, 2020). Correlation and p-values were generated by scipy input stats.pearsonr (Virtanen et al., 2020). This statistical test was chosen because the relationship from array to integration is approximately linear. All data were processed in Jupyter Notebooks (Kluyver et al., 2016) utilizing Google Colaboratory (colab.research.google.com). All Python code is available on Figshare.
Design of landing pads for transcriptional reporters
The utilized fluorophores, mScarlet-I (Bindels et al., 2017) and mNeonGreen (Shaner et al., 2013), were synthesized with the desired modifications as genes incorporated into the pUCIDT-KAN plasmid (IDT). First, a SV40 nuclear localization sequence (NLS) was added after the 13th codon of the
Landing pads were built using a modification of our previous split landing pad strategy (Stevenson et al., 2020). Each landing pad contained the 3′ portion of a selectable marker followed by a validated guide sequence and the 3′ portion of a fluorophore. The guide sequence (
Construction of split
The split
Next, the 3′ 949 bases of the
Construction of Split
To construct the
Chromosomal regions II:9830799–9831548 and II:9831573–9832322 were amplified from genomic DNA for use as homology arms. The self-excising cassette (SEC) was PCR amplified from pDD282 such that the loxP sites were replaced by lox2272 sites. An MCS was amplified from pDSP2 while a linear vector backbone fragment was amplified from pDSP1. All five of these PCR fragments were assembled into a circular plasmid, which was immediately used as a template for seven synonymous single-nucleotide substitutions into the terminal 21 bp of the
The 3′ 846 bases of the
Following the protocol from Dickinson et al., 2015, the landing pad from pDSP63 was integrated into the GT331 strain using pDSP45 as the guide plasmid. Upon integration, this yielded strain GT336. Activation of the Cre recombinase within the SEC by heat shock caused both the removal of the SEC from the
Design and construction of promoter library
Targeting vectors were constructed to provide the 5′ portions of each split gene pairing. Both targeting vectors had the same multiple cloning site, allowing promoter amplicons to be assembled into either vector using the same set of primers. In addition, each selectable marker gene is flanked by a lox site that matches the sequence and orientation of the lox site flanking the 3′ portion of the marker in the genomic landing pad, allowing for the optional post-integration removal of the selectable marker gene using Cre recombinase.
To construct the split
To construct the split
The entire intergenic region was used for
Insertion of promoter libraries by TARDIS
For integration of a promoter library into a single landing pad site, a mixture consisting of 15 ng/μl guide plasmid (pMS84), 20 ng/µl
For integration of a promoter library into two landing pad site, a mixture consisting of 15 ng/μl guide plasmid (pMS84), 20 ng/µl
For both scenarios, candidate worms (those which had both hygromycin resistance and wild-type movement) were singled and screened by PCR. The identity of the integrated promoters was determined by Sanger sequencing of the PCR product. The primers used for genotyping are given in Supplementary file 6.
Microscopy
Individual late L4/young adults were mounted on 2% agarose pads and immobilized with 0.5 M levamisole. Imaging was performed on a DeltaVision Ultra microscope (Cytiva, MA) using the 20x objective and Acquire Ultra software version 1.2.1. Fluorescent images were acquired using the orange (542/32 nm) and green (525/48 nm) filter sets for mScarlet-I and mNeonGreen, respectively. Light images were captured at 5% transmission and a 0.01 s exposure. Fluorescent images were captured at 5% transmission and a 2 s (
Accessibility of reagents, data, code, and protocols
The authors affirm that all data necessary for confirming the conclusions of the article are present within the article, figures, and tables. Plasmids pDSP15 (Addgene ID 193853), pDSP16 (Addgene ID193854), pMS84 (Addgene ID 193852), pZCS36 (Addgene ID 193048), pZCS38 (Addgene ID193049), and pZCS41 (Addgene ID 193050) are available through Addgene and can be freely viewed and edited in ApE (Davis and Jorgensen, 2022) and other compatible programs. Strains PX740, GT332, and GT337 are available from the Caenorhabditis Genetics Center (cgc.umn.edu). Strains and plasmids not available at a public repository are available upon request. Illumina sequencing data are available at BioProject ID: PRJNA893002. All other data, code, plasmid and landing sequences, and original microscopy images are available on Figshare (Stevenson et al., 2022). We plan to continue to develop TARDIS technology and provided descriptions of updated libraries and advancements at https://github.com/phillips-lab/TARDIS, (copy archived at ZCST, 2022).
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2023, Stevenson et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
High-throughput transgenesis using synthetic DNA libraries is a powerful method for systematically exploring genetic function. Diverse synthesized libraries have been used for protein engineering, identification of protein–protein interactions, characterization of promoter libraries, developmental and evolutionary lineage tracking, and various other exploratory assays. However, the need for library transgenesis has effectively restricted these approaches to single-cell models. Here, we present Transgenic Arrays Resulting in Diversity of Integrated Sequences (TARDIS), a simple yet powerful approach to large-scale transgenesis that overcomes typical limitations encountered in multicellular systems. TARDIS splits the transgenesis process into a two-step process: creation of individuals carrying experimentally introduced sequence libraries, followed by inducible extraction and integration of individual sequences/library components from the larger library cassette into engineered genomic sites. Thus, transformation of a single individual, followed by lineage expansion and functional transgenesis, gives rise to thousands of genetically unique transgenic individuals. We demonstrate the power of this system using engineered, split selectable TARDIS sites in
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer




