Introduction
Over 85% of living animal species belong to the superphylum ecdysozoa. This group includes nematodes, arthropods, tardigrades, and five other phyla (Telford et al., 2008; Aguinaldo et al., 1997). They are defined by having a common ancestor and a specialized extracellular matrix that covers their body called the cuticle. The ecdysozoan cuticle is shed and regrown to accommodate juvenile growth in a process called ecdysis or molting.
Cuticle shape is patterned by the tissue beneath it, but also takes on additional diversity beyond the underlying tissue shape. One example of this structural diversity is the mouthparts of nematodes. Many carnivorous nematodes and nematode parasites of animals have cuticle-based teeth that bite into their prey or host (Sieriebriennikov and Sommer, 2018; John and Petri, 2006). Nematode parasites of plants have needle-like cuticle stylets that pierce plants and act as a syringe to deposit effectors and suck out vital nutrients (Mejias et al., 2019). Bacterivorous nematodes, like the model nematode
All epithelia in
The non-chitinous body cuticle has multiple layers that include an outer carbohydrate-rich glycocalyx, a lipid-rich epicuticle, and multiple inner collagenous layers (Altun and Hall, 2020; Page and Johnstone, 2007; Cox et al., 1981). By contrast, the pharyngeal cuticle is not collagenous (Altun and Hall, 2020; Cox et al., 1981) and instead contains a chitin-chitosan matrix that likely helps maintain luminal integrity (Zhang et al., 2005; Heustis et al., 2012). The pharyngeal cuticle is layered (Sparacio et al., 2020; Wright and Thomson, 1981), but the molecular composition of the different layers is unknown. Like other ecdysozoans,
In addition to chitin, the pharyngeal cuticle contains a group of largely disordered proteins called the APPGs (also known as the
IDRs are defined here as a 30 or more continuous residues whose primary sequence fails to form a stereotypical stable tertiary structure and instead rapidly interconverts between heterogenous conformations (van der Lee et al., 2014). Despite lacking ordered structure, IDRs can interact with other IDRs through local areas of hydrophobicity, complementary charge, hydrogen-bond formation, and pi-stacking interactions along the respective peptide chains (Vernon and Forman-Kay, 2019). IDRs often harbor repeating sequence features that can facilitate the formation of multivalent interaction networks with multiple binding partners (Vernon and Forman-Kay, 2019). Depending on the local environment, multivalent IDRs, and particularly low-complexity IDRs, can phase separate to form liquid–liquid phase-separated droplets (LLPS) (i.e., liquid condensates) or gels, which can then transition to more solid structures, including fibers (Mittag and Parker, 2018; Banani et al., 2017). LLPS has been shown to be an important first step in the self-assembly of IDR-rich proteins into the extracellular matrices of insects, arachnids, and molluscs (reviewed in Muiznieks et al., 2018). For example, IDR-rich proteins that form liquid condensates fill a porous chitin-based matrix in a key step of squid beak development (Tan et al., 2015). Given that the affinity of any one interaction along an IDR is relatively weak, the ability of IDRs to form these phase-separated networks is easily modulated by a variety of factors, including pH, ions, temperature, protein concentration, and post-translational modifications (Murray et al., 2017).
Here, we describe the spatiotemporal logic of pharyngeal cuticle construction that we have uncovered by mining published mRNA expression datasets and canonical amyloid and chitin-binding dyes. We identify six families of low-complexity proteins that are likely secreted into the developing cuticle, including the IDPAs, IDPBs, and IDPCs, each of which are described for the first time here, and the APPGs, NSPBs, and the FIPRs. These six families peak in expression level in successive waves over the course of each larval stage. Computational analyses predict that the IDPA, IDPB, IDPC, and APPG families, and 12 other singletons are IDR-rich proteins capable of phase separation. We speculate that the malleable properties of the disordered phase-separating proteins are especially suited to a flexible cuticle that must be rapidly destroyed and reconstructed during molting.
Results
Validating fluorescent dyes as probes of pharyngeal cuticle structure
Earlier transmission electron microscopy of the
Figure 1.
The pharyngeal cuticle and surrounding cells.
(A) A schematic of the relative position of the
We confirmed that the four dyes specifically bind components within the pharyngeal cuticle in two ways. First, we performed pulse-chase experiments with the dyes to determine whether the dye’s fluorescent signal would be lost as the larvae shed their old cuticle during their transition to the next developmental stage (see ‘Materials and methods’ for details). After the 18 hr chase, very few animals who were initially L3s had CFW, EY, CR, or ThS signal (Figure 2, Figure 2—figure supplement 1). By contrast, the dyes’ signal persisted in animals that were initially young adults (Figure 2). The loss of the four dyes from the larvae but not adults in the pulse-chase experiments indicates that the dyes bind the pharyngeal cuticle.
Figure 2.
Pulse-chase and cuticle mutant experiments show dye association with the cuticle.
(A) Schematic showing the pulse-chase assay. Synchronized populations of L3 or adult worms were incubated with a dye for 3 hr (the ‘pulse’), after which worms were washed with M9 and run on normal plates with food for 1 hr. Worms are transferred to fresh plates and the presence of the dye was scored (see ‘Materials and methods’ for details). Then, 18 hr later (i.e., after the chase), worms were again scored for the presence of the dye. (B–Q) In each of the four groups of eight micrographs with the dye indicated in the header, the top two rows show the pulse-chase experiment done starting with L3s, and the bottom two rows show the pulse-chase experiment done with adults. The filter used to visualize the dyes is indicated at the header of the rightmost column in each of the four panel sets. In all panels, white arrows highlight the presence of the dye in the cuticle and black arrows show cuticle without dye signal. The scale bar is indicated. (R) The fraction of worms with stained cuticle before and after the chase for each dye is shown; a minimum of four repeats (N) were done with a sample size of 7–34 animals (average = 13) (n) per repeat. Asterisk denotes statistically significant difference relative to the pre-chase values (p<0.05). Standard error of the mean is shown. (S–X) Wildtype animals treated with
Figure 2—figure supplement 1.
The fluorescence and filter controls for dye staining.
Images of young adult wildtype worms incubated with the indicated dyes for 3 hr at the following concentrations: calcofluor white (CFW) (0.01% w/v), eosin Y (EY) (0.15 mg/mL), Congo Red (CR) (0.02% w/v), and thioflavin S (ThS) (0.01%). Also shown is an unstained ABU-14::GFP animal. Green arrows highlight the buccal collar dots that are often visible by differential interference contrast (DIC).
The scale in (A) applies to all panels.
Second, we tested whether the dyes bind the pharyngeal cuticle after the cuticle has separated from the animal, the attachment of which persists in
Cuticle dyes stain distinct structures within the pharyngeal cuticle
We examined the colocalization of the four dyes in wildtype animals and correlated the resulting patterns to the ultrastructural features observed in a series of unpublished TEM images by Kenneth A. Wright and Nicole Thomson (Wright and Thomson, 1981; Figure 3). These TEM images show that the cuticle of the buccal cavity and the channels is a mixture of electron-light and electron-dense (dark) material, with the dark material forming circumferential ribs (white arrows) and ‘flaps’ (yellow arrows).
Figure 3.
Probing pharyngeal cuticle composition with characterized dyes.
(A–D) Images of the buccal and mc1 channel cuticles and surrounding cells. The dyes or GFP-fusion protein examined is indicated. DIC, differential interference contrast; CFW, calcofluor white; ThS, thioflavin S. The scale shown in (A) is the same for (B–D). (E) Serial coronal sections of unpublished transmission electron micrographs taken by Wright and Thomson, 1981. The scale in (E(i)) applies to all images in the E series. (F–H) Magnifications of the boxed areas highlighted in the images to the left. (G) represents a slightly different plane than that depicted in (E(iv)) and was chosen because of the clearly visible filaments. The scale in (F) is the same as that for (G) and (H). For all panels: a, arcade cells; ch, mc1 channel; cu, cuticle; e, e epithelium; gl, gland cell; bc, buccal cavity; cl central lumen; orange arrows, the anterior enlargement of the buccal cuticle; green arrows, the prominent ring at the base of the buccal cavity; yellow arrows, the electron-dense flaps at the base of the buccal cavity; red arrows, the electron-dense material at the anterior end of the channel cuticle; black arrows, pharyngeal cuticle when too small to be labeled with ‘cu’; white arrows, the ribbing of the pharyngeal cuticle; pink arrows, the cytoplasmic filaments that correspond to the abutment of the ribbing.
Two features suggest that the chitin-binding dyes may bind components within the electron-light material. First, the expansive electron-light material at the anterior half of the buccal cuticle correlates with the expanded CFW and EY signal (orange arrows in Figure 3A and E). Second, CFW and EY brightly stain a prominent collar at the base of the buccal cavity (green arrows in Figure 3A and E). The amyloid-binding dyes stain the collar less (Figure 3B and C), and ABU-14::GFP fails to mark the collar (Figure 3D). In the TEM images, this collar is composed of light material. Hence, the electron-light material is likely enriched with chitin.
The CR dye and the ABU-14::GFP localize to the cuticle flaps (yellow arrows in Figures 1E and 3B and D), which are composed of the darker electron-dense material in the TEM (Figure 3E). The dark material of the flaps is contiguous with the dark ribbing of the buccal cuticle and the luminal-facing coating of the cuticle, all of which encapsulate the less electron-dense material (Figure 3E). An analogous organization is present in the cuticle that lines the channels (Figure 3E). Together, these observations suggest that the electron-dense material may be enriched in amyloid-like proteins and establish CR, ThS, CFW, and EY as useful markers of pharyngeal cuticle structure.
Mining expression datasets yields a spatiotemporal map of pharyngeal cuticle development
To better understand pharynx cuticle construction, we built a spatiotemporal map of cuticle-centric gene expression by combining four published datasets (see Figure 4—source data 1). First, we anchored the map using a dataset that tracked gene expression levels in synchronized animals every hour for 16 hr from the mid L3-stage to adulthood at 25°C (Hendriks et al., 2014). This study identified 2718 genes whose expression oscillates during larval development with a peak in expression every 8 hr (p<0.001); this period corresponds to the 8 hr duration of the third and fourth larval stages at 25°C. Two of these 2718 genes have been retired due to reannotation. The 2716 genes can be grouped into bins of genes that peak at different larval development phases. For example, some genes peak during the first and ninth hour, others peak during second and tenth hour etc., such that there are successive waves of genes that oscillate through time (see Figure 1e of Hendriks et al., 2014). We present the 2716 genes from this dataset in the temporal order in which the genes peak in their expression over the 8 hr cycle (Figure 4A). We note that since we initiated our study an additional temporally resolved dataset has been published (Meeuse et al., 2020).
Figure 4.
An informatic reconstruction of the pharyngeal cuticle.
(A) A chart of 2716 genes whose expression oscillates over larval development with a periodicity that corresponds to larval stages. See text for details. Each row represents a single gene. Rows are arranged along the y-axis in order of the time at which each gene reaches its peak expression level with those earliest in the time course at the bottom and those latest in the period at the top. Because the periodicity is a continuum during larval development, Hendriks et al., 2014 represented time as degrees of a circle. That concept is preserved here, and the degree is indicated along the y-axis and divided into bins of time relative to the molting period. The first data column (red) represents the 602 oscillating genes that also were found to be upregulated in expression during the L4 lethargus (molting) period (see Supplemental Table 1 in George-Raizen et al., 2014); the scale of the relative expression level from this independent study (George-Raizen et al., 2014) (is indicated in the legend). The second data column (black) represents the 367 genes from the set of 2716 that are enriched in expression in the pharynx (data from Cao et al., 2017; see Figure 4—source data 1 for enrichment). The purple columns show the 226 genes (of the 367 pharynx-enriched set) that are predicted to be secreted. They are duplicated to show the 26 protein–protein interactions (PPI) among the 17 oscillating pharynx-secreted proteins identified through Genemania (see Figure 5C, Figure 4—source data 1, and Figure 5—source data 1 for the details of which protein pairs interact). The identity of the interacting proteins is indicated with bold lettering and a dotted arrow on the right of the graph. The last column (blue) represents those pharynx-enriched genes that lack an obvious domain as predicted by WormBase, PFAM, and SMART databases (see text for details). 78 pharynx-expressed genes of special interest are indicated with arrows to the right of the graph. The color of the arrows and text corresponds to broad categories indicated in the legend. (B–E) The average number of transcripts produced by genes whose expression is enriched in the indicated tissue as a function of developmental time. In all graphs, results are binned according to the hours indicated in (A), the global average transcript number (49.33) is indicated by the red dotted line. Statistical differences were measured using a Student’s
Figure 4—figure supplement 1.
Tissue-enriched expression levels of tissue-enriched classes of genes.
Each plot shows the expression level (as measured in single-cell sequencing analyses; Cao et al., 2017) (y-axes) of the tissue-enriched set of genes in each tissue type (x-axis). Each dot represents the expression level for a single gene; each column reports the expression level of all genes in the respective set. For example, each of the 470 pharynx-enriched genes are expressed in the pharynx tissue at least 1.5-fold more than all other tissues combined. The plots demonstrate the enrichment of expression in the respective tissue type in each tissue-enriched gene set. The mean expression level for each tissue type and standard error of the mean is shown.
Second, we defined the interval on the map that corresponds to the molt by overlaying a dataset of genes that are upregulated during the L4 molt (p<0.001) (George-Raizen et al., 2014). The overlay indicates that molting peaks in the sixth hour on the map (Figure 4A and B,, Figure 4—source data 1). The fact that the genes that are upregulated during the L4 molt are clustered on the map provides reciprocal validation for both datasets (George-Raizen et al., 2014; Hendriks et al., 2014). We herein routinely refer to hour 6 as the reference peak molting hour.
Third, we identified the genes on the temporal map whose expression is enriched in the cells surrounding the pharyngeal cuticle relative to all other tissues. We did this by overlaying single-cell expression data from cells isolated from L2-staged animals (Cao et al., 2017). We found 367 ‘pharynx’-enriched transcripts (
Fourth, we determined the likelihood of gene products being secreted using Signal P (v4.1) predictions extracted from the WormBase Parasite database to identify signal peptides (with scores of 0.45 or more) genome-wide (Hertz-Fowler and Hall, 2004). We recognize that while this approach is systematic, Signal P does not identify all secreted or plasma membrane-associated transmembrane proteins. The oscillating pharynx-enriched set contained 226 genes (62%) that encode a signal peptide (Figure 4A, Figure 4—source data 1). By comparison, only 39% of the remaining oscillating gene set (n = 2349) and only 17% of the entire non-oscillating genes of the genome (n = 17,614) encode a signal peptide (Figure 4—source data 1). The temporal map shows a concentration of genes that peak in expression from the pharynx and are secreted at the time of molting (Figure 4A).
We investigated the change in transcript abundance in the pharynx over the cyclical 8 hr window of larval development for the oscillating genes. We found a nearly 30-fold increase in transcript abundance for those gene products predicted to be secreted relative to the global average of pharynx gene expression during the peak molting hour (Figure 4B). There is a shoulder of peak expression at hour 7 for those non-secreted gene products (Figure 4B) that may correspond to the increase in tissue growth after the molt. Cao et al., 2017 further dissected their single-cell sequencing data into tissue subtypes. We find that the expression of predicted secreted products from the pharynx epithelial cells peaks dramatically during the peak molting hour, whereas pharynx gland transcription peaks in the preceding hour (Figure 4). Non-secreted epithelial and muscle products peak in expression during hour 7 (Figure 4C and E). Given that mRNA expression levels are positively correlated with protein abundance in invertebrate systems (Ho et al., 2018; Schrimpf et al., 2009), we conclude that there is a likely a burst of proteins secreted in preparation for the molt.
Orthogonal data validate the spatiotemporal map
We explored the validity of the spatiotemporal map in four ways. First, previous work established that the molting of the body cuticle precedes that of the pharyngeal cuticle (Wright and Thomson, 1981). We therefore expected a peak in gene expression from the hypodermis that precedes that of the pharynx, which is what we observe (Figure 5A).
Figure 5.
The spatiotemporal map has predictive power.
(A) Average gene expression in each of the indicated tissue types plotted as a function of developmental time. In the first hour of the time course, for example, 219 genes peak in expression and the average expression of each of these 219 genes in each of the indicated tissues is plotted for hour 1 on the graph. Standard error of the mean is shown. The peak pharynx molting hour is highlighted by the transparent red box. Significant differences relative to the global mean is calculated with a Student’s
Figure 5—figure supplement 1.
The pharynx secretome has a dense protein–protein interaction (PPIs) network relative to other secretomes.
(A) The PPIs among gene products whose corresponding mRNAs are enriched in the indicated tissue (as derived from the L2 single-cell sequencing dataset; Cao et al., 2017) and secreted (as defined by the presence of a signal sequence) (see Supplementary file 1 for details) as revealed through Genemania (El-Gebali et al., 2019). Genemania settings used are ‘equal by network weighting’, and max resultant genes and attributes set to zero. Only networks that include clusters of three or more genes are considered here. Each interaction must be represented by multiple independent lines of experimental evidence. (B) A comparison of the indicated properties between the group of 20 interacting proteins of the pharynx secretome and the entire proteome. All differences are significant (p<0.001) as measured by a Student’s
Second, we systematically investigated published reports of expression (not including the datasets used to build the spatiotemporal map) for the 226 oscillating pharynx secretome genes. In this analysis, we also included the 17 additional genes of special interest called out in Figure 4A that include
Third, we reasoned that the pharynx secretome might be rich in protein–protein interactions (PPIs) because many of the secreted proteins likely interact to form a matrix. We explored PPIs systematically using Genemania, which is an online tool that facilitates the analysis of experimentally derived interaction data curated from the literature (Franz et al., 2018). To analyze each tissue’s secretome, we returned to the Cao et al., 2017 single-cell sequence datato parse the proteome into proteins that are enriched in the major tissues using the same criteria described above for the pharynx (Figure 4—source data 1). These tissues included the pharynx (470 proteins), body wall muscles (BWMs) (326 proteins), glia (426 proteins), gonad (832 proteins), hypodermis (411 proteins), intestine (781 proteins), and neurons (965 proteins) (Figure 4—figure supplement 1). We separated out the 166 collagens from the proteome because of their unique sequence properties. The remaining 15,892 proteins are binned into a non-specific group. For each of these groups, we parsed them into those encoding a signal peptide, and those without. Genemania reports multiple lines of evidence for 36 PPIs among a network of 20 proteins within the pharynx secretome (Figure 5C). This interaction network is denser than that from most other secretomes (Figure 5—figure supplement 1, Figure 5—source data 1).
Fourth, literature searches reveal that the spatiotemporal map includes many genes with known roles in pharynx development (
Finally, we further confirmed the map’s ability to predict spatial expression patterns by inserting green fluorescent protein coding sequence in frame with five poorly characterized gene products, namely, IDPA-3, IDPB-3, IDPC-1, FIPR-4, and NSPB-12 (Figure 6). We also included the previously characterized ABU-14::GFP (Figure 6A). We counterstained the resulting transgenic animals with CFW to interrogate the spatial overlap of the tagged proteins with the chitinous cuticle. As predicted, we found that all five reporters are expressed exclusively in association with the pharynx and overlap in their localization with the pharynx cuticle. Briefly, tagged IDPA-3 was enriched in the grinder, overlapping the CFW-stained component and lining of the terminal bulb cuticle. In addition, we observed enrichment of tagged IDPA-3 in the presumptive ECM that lies between the terminal bulb and the intestinal valve (white arrow in Figure 6B). Tagged IDPB-3 was expressed weakly and localized exclusively to the pm6 cells and material surrounding the CFW-stained grinder (Figure 6C). Tagged IDPC-1 had a similar pattern to that of tagged ABU-14; associating with both the anterior and posterior components of the pharyngeal cuticle. However, tagged ABU-14 appears to localize adjacent to CFW-stained components whereas tagged IDPC-1 overlaps CFW-stained components (Figure 6A and D, Figure 6—figure supplement 1). Tagged NSPB-12 localized to the anterior pharynx cuticle components exclusively, including that of the buccal cavity, flaps, and anterior channels (Figure 6E). Tagged FIPR-4 localized to both anterior and posterior pharynx cuticle components (but not the grinder teeth proper) and the presumptive pharynx-intestinal valve ECM (Figure 6F). Together, these analyses provide confidence in the predictive value of the spatiotemporal map.
Figure 6.
The localization of six fluorescently tagged pharynx cuticle components.
Each of the six large horizontal boxes contain data about the six predicted gene products indicated on the left. In each box, the four images on the left are of the head of a single worm, imaged first with differential interference contrast (DIC), then with calcofluor white (CFW) in green, then the fluorescently tagged protein (tp) protein of interest in fuchsia, followed by a merged (m) image as indicated at the top of the columns. A blue horizontal line indicates the intersection of two cropped images to show different relevant focal planes of the same animal. The scale is indicated. The middle set of eight images correspond to magnified buccal cavity and channels (top) and terminal bulb and grinder (bottom). The scale is indicated. Colored arrows are used for reference in (A) and used to draw attention to particular features in (B–F): Orange, buccal cavity; yellow, flaps; green, collar; red, anterior channels; blue, grinder; white, presumptive ECM between the terminal bulb and intestinal valve. The graph on the right is a UMAP plot of the pharynx mRNA expression pattern for the respective gene (see text for details). The relative expression level is indicated. (A) An RP3439 animal harboring the
Figure 6—figure supplement 1.
A comparison of the tagged ABU-14 and IDPC-1 localization patterns.
(A) The
The pharynx secretome is enriched in proteins with high predictions of phase separation
To better understand the types of proteins that are secreted by the pharynx, we manually curated the domain organization of all 367 oscillating pharynx-enriched gene products as reported by the WormBase, SMART, and PFAM protein databases (Letunic and Bork, 2018; El-Gebali et al., 2019; Figure 4—source data 1). We found that 106 of the 226 secreted proteins (47%) lacked any defined domain (last column of the chart in Figure 4A, Figure 4—source data 1). This prompted a systematic investigation of low-complexity sequence within the pharynx secretome using NCBI’s SEG algorithm (Wootton and Federhen, 1993). Indeed, we found the pharynx secretome to be greatly enriched with low-complexity regions (LCRs) (p=1E-69) (Figure 7A). Given that low complexity is tightly associated with intrinsic disorder, we used the Spot-Disorder algorithm (Hanson et al., 2017) to systematically analyze whether the pharynx secretome is also enriched for IDRs and found that it is (p=8E-10) (Figure 7B).
Figure 7.
The pharynx secretome is enriched with intrinsically disordered proteins with phase separation capability.
(A–E) An analysis of the entire proteome for the indicated properties. The tissue type examined, as well as the number of genes in each bin, is indicated at the bottom of the graph in (E) (hypoderm., hypodermis; non-spec., non-specific). Statistical differences, indicated with an asterisk at the bottom of each graph, were measured using a Student’s
Low-complexity intrinsically disordered protein regions often provide multivalency that can enable a protein to transition from being soluble to becoming a phase-separated liquid, gel, stable polymeric matrix, or an insoluble amyloid (Muiznieks et al., 2018). We explored the potential of the different protein sets to phase separate using three different predictive algorithms, including PSPredictor (Chu et al., 2022), PLAAC (Lancaster et al., 2014), and LLPhyScore (Cai et al., 2022). PLAAC was originally designed to scan for prion-like sequences, but has been retrospectively used as a reliable tool to predict phase separation (Vernon and Forman-Kay, 2019). Each algorithm reveals that the pharynx secretome is enriched in proteins with phase separation capability (p=2E-46, p=2E-52, and p=2E-31, respectively) (Figure 7C–E).
We also examined low-complexity, intrinsic disorder and phase-separation propensity as a function of developmental time. The peak molting hour corresponds to a clear peak in low-complexity and intrinsic disorder of secreted products (Figure 7A’ and B’). The other three predictors also show significant peaks in phase separation propensity of secreted products during the peak molting hour, but variably show peaks at other time points as well (Figure 7C’–E’). To better understand the relative abundance of gene products with the specific sequence features highlighted in Figure 7A’–E’, we multiplied the trait value for each gene with the relative number of transcripts for each respective gene. In this light, we see a striking peak of all trends at the peak molting hour (Figure 7A’’–E’’). This analysis suggests that the pharyngeal cuticle is likely flooded with low-complexity, intrinsically disordered proteins with phase separation potential during the peak molting hour.
Finally, we tested these predictions by asking whether IDPC-2 can phase separate. Upon cleaving off the MBP affinity tag from the in vitro-expressed proteins, we see that IDPC-2 and the positive control FUS can form phase-separated droplets (Figure 8A and B). In these experiments, we use a molecular crowding reagent (Ficoll) to mimic in vivo molecular crowding (André and Spruijt, 2020). These data support the informatic analyses that predict that many of the proteins incorporated into the cuticle may be capable of phase separation.
Figure 8.
Cuticle proteins can likely phase separate and are enriched with protofilament but not amyloidogenic sequence.
(A) In vitro purified maltose-binding protein (MBP) control (15% Ficoll) or fusions with the FUS-positive control (5% Ficoll) and IDPC-2 (15% Ficoll) phase separate into spheres upon cleaving off the MBP tag with TEV protease, while the MBP-only negative control does not. The inset is a magnification of the corresponding area. The scales for all insets and larger images are respectively the same. (B) Quantification of the fold change in optical density (OD; 395 nm) of indicated samples after 1 hr of treatment with TEV relative to the OD without the addition of TEV.
The pharynx secretome is not enriched with amyloidogenic proteins
We investigated the propensity of pharynx secretome proteins to form filaments. We first used the LARKS algorithm that predicts kinked b-structure, which can drive proto-filament assembly and reversible fiber formation (Hughes et al., 2018). Indeed, we find a significant enrichment in LARKS scores within the pharynx secretome (Figure 8C). This prediction is corroborated by the LLPhyScore predictor of kinked b-structure (Figure 8D). We also investigated whether the pharynx secretome is enriched in amyloidogenic proteins. Both the Budapest (Keresztes et al., 2021) and AmyloGram (Burdukiewicz et al., 2017) machine-learning predictors, as well as the structure-based PATH predictor (Wojciechowski and Kotulska, 2020), fail to show any enrichment within the pharynx secretome of amyloidogenic proteins (Figure 8E–G).
We further probed the ability of the pharynx secretome to form amyloid fibers using CR dye. CR has long been used as a diagnostic tool to identify rigid amyloid fibrils because of its special property of emitting apple green birefringence upon binding the ordered fibril array in the presence of polarized white light (Divry, M, 1927). This is in sharp contrast to the colorless birefringence of the crystalizing compounds (Figure 8H). While CR specifically stains the pharynx cuticle, we found that CR-stained cuticles do not emit apple green birefringence (n > 30) (Figure 8I and J). We are confident that our imaging system is capable of detecting CR-derived apple green birefringence because of a serendipitous observation. We found that when CR is co-incubated with a small molecule (called wact-190) that forms crystals in the pharyngeal cuticle (Kamal et al., 2019), the resulting crystals exhibit apple green birefringence (Figure 8K). We infer that this happens because CR likely becomes incorporated into a regular array, that is, the wact-190 crystal. Together, these results indicate that it is unlikely that the cuticle harbors rigid amyloid fibrils, which is consistent with both the flexible nature of the pharynx cuticle (Huang et al., 2008; Avery, 1993) and the absence of any detectable amyloid-like fibers in previous transmission electron micrographs of the pharynx cuticle (Wright and Thomson, 1981; White et al., 1986). We conclude that the pharynx secretome is likely enriched in proteins with intrinsic disorder, phase separation capability, and proto-filament formation capability, but not enriched with proteins that form rigid amyloid fibrils.
The transcripts encoding secreted IDR protein families peak in expression in overlapping waves during cuticle construction
Given the enrichment in low-complexity sequence within the pharynx secretome, we were curious to know whether it has any global bias in amino acid residue distribution relative to other protein sets. We found a significant enrichment of nine residues with a strong bias against charged and hydrophobic residues (at least p<2E-05; Figure 9A). Upon considering relative abundance of amino acid residues as a function of time, we see that proteins rich in cysteine, proline, and glutamine peak in expression during new cuticle construction (Figure 9B).
Figure 9.
Properties of the low-complexity protein families that are likely secreted into the developing cuticle.
(A) Average percent amino acid composition of the proteins within the indicated tissue type. The percentages along a single row sum to 100. The color scale indicates the range of values within a single column so as to compare the relative abundance of the indicated residue among the different protein sets. The collagens are not included in the color scale comparison. SS, secreted proteins based on harboring a signal sequence; BWM, body wall muscles. All of the mean residue percentages from the set of proteins secreted from the pharynx cells are significantly different compared to that of the remaining proteome (Student’s
Figure 9—figure supplement 1.
Consensus sequence for select low-complexity families.
Sequence logos based on full-length protein sequence alignments (Wozniak and Kotulska, 2015) of the APPGs (12 sequences), IDPAs (3 sequences), IDPBs (5 sequences), and IDPCs (7 sequences). Sequences were aligned with CLUSTAL W, and Sequence Logos were generated using WebLogo 3.7.4 (Sali and Blundell, 1993; Chaudhury et al., 2010). Details of the sequences used are given in Supplementary file 1.
We used the Clustal Omega clustering tool (Sievers et al., 2011) to determine whether there were families of proteins with similar sequence within the 106 proteins that lacked domains within the pharynx secretome. We found six distinct families of low-complexity proteins through this analysis (Figure 9C, Figure 9—source data 1). Members of each family share an enrichment of particular residues (Figure 9D), contain regions of high percentage positional sequence identity (Figure 9E, Figure 9—figure supplement 1), and are expressed at similar times as one another (Figures 4A and 9F). These six families include three new families of IDR-rich proteins, which we have named IDPA, IDPB, and IDPC, a subgroup of APPGs (George-Raizen et al., 2014; Figure 9E), and the relatively short NSPBs and FIPRs about which little is known. See Supplementary file 1 for all newly named genes presented in this study and Supplementary file 2 for all members of the six families described here. Systematic searches relying on positional alignment reveal no obvious homologs of these six families in any group beyond Nematoda (WormBase). Furthermore, a comparison of the consensus sequence from these families (Figure 9E, Figure 9—figure supplement 1) to the cuticle proteins of other Ecdysozoans (Willis, 2010) reveals no obvious similarity in the pattern or amino acid sequence biases.
The transcription of the six families of low-complexity proteins peaks in expression in successive overlapping waves, with five of the waves concentrated around the peak molting hour (Figure 9F). The combined use of the three different predictors of phase separation suggests that the IDPAs, IDPBs, IDPCs, and the APPGs may be able to phase separate (Figure 9F). The FIPRs and NSBPs are also likely to phase separate but fail to score high with the SpotDisorder algorithm because of their small size. The IDPAs and IDPBs are predicted to form protofilaments (as measured by LARKS), the IDPAs and APPGs score especially high with the prion sequence evaluator (PLAAC), and five members of the APPGs (ABU-6, ABU-7, ABU-8, ABU-15, and PQN-54) are predicted to be amyloidogenic (as measured by AmyloGram and PATH) (Figure 9F). These results further support the idea that a large proportion of the proteins secreted by the pharynx during cuticle construction are IDR-rich with phase-separating capability.
Epithelial and transdifferentiated cells secrete abundant products during the molt
We sought increased spatial resolution of peak gene expression that is associated with pharyngeal cuticle construction over the course of the temporal map. We therefore returned to the Cao et al. single-cell sequencing dataset (Cao et al., 2017; Packer et al., 2019) to systematically visualize the expression patterns of pharynx secretome components. Cao et al., 2017 and Packer et al., 2019 identified 1675 sequenced cells that belong to the pharynx. When grouped according to similar expression profiles, the pharynx cells form subclusters on a Uniform Manifold Approximation and Projection (UMAP) created by Packer et al. (see https://cello.shinyapps.io/celegans_L2/) that represent cells of a similar type (Packer et al., 2019; Figure 10A). Based on the expression of some characterized reporter transgenes and their single-cell sequence analysis of the embryo, Packer et al. made tentative cell assignments for most subclusters of the L2 pharynx (see Supplemental Table 12 in Packer et al., 2019).
Figure 10.
Expression of pharynx-enriched genes in distinct cell types.
(A) A UMAP of 1675 pharynx cells modified with permission from Packer et al., 2019's online tool. The clusters are numbered according to Packer et al., 2019. The cell type identities are partially based on those from Packer et al., 2019 (see Figure 10—figure supplements 1 and 2 for details). Due to space constraints, three cluster groups from the map are shown as insets. (B, C) The expression level of the pharynx-enriched gene set in the indicated tissue type. The graph notation and the order of genes in rows is preserved from Figure 4A. Genes encoding a signal peptide are shown in (B) and those without a signal sequence are shown in (C). The mc1, pm3-4, and pm5-8 values represent the average gene expression of the cells within the respective clusters (26, 16, and 22). The values corresponding to the other cell types represent the highest average from among the group of clusters that constitute that cell type. For example, the arcade cells are represented by clusters 6, 20, and 25, but the expression level from each of these clusters is distinguished by time, not space, and averaging signal from all three would dilute the expression level that represents that cell type. All members of the six low-complexity families are indicated on the left of (B) and the color code is the same as that present in Figure 4A. (D–F) The average transcript level of all genes within the indicated cell type as a function of binned time. Open white circles represent a significantly greater value (p<0.01) compared to the bin 2 hr previous.
Figure 10—figure supplement 1.
Identity assignment of the pharynx UMAP clusters.
(A) The cluster map and cluster numbers were derived from Packer et al., 2019 using the single-cell L2 sequencing data from Cao et al., 2017. Note the cropped Y-axis for reasons of graphical clarity. (B) Published gene expression patterns from transgenic reporters were used to assign identities to each of the clusters. Several of our assignments are different from that of Packer et al., 2019. See Figure 10—figure supplement 2 for the corresponding UMAP plots of the listed genes.
Figure 10—figure supplement 2.
UMAP plots of the gold standard genes used to assign identity to the pharynx UMAP reference cluster.
The UMAP plots of the genes listed in Figure 10—figure supplement 1 were extracted from the Packer et al. dataset (see https://cello.shinyapps.io/celegans_L2/) and show the published markers used to assign identity to the clusters. Genes are arranged alphabetically, and the cells in which GFP reporters serve as markers are indicated. For reasons of graphical clarity, the gland clusters are not shown.
We searched the literature for additional GFP reporter transgenes that are expressed in the postembryonic pharynx to help refine the identities of many of the L2 pharynx subclusters (Figure 10—figure supplements 1 and 2). We then transformed the Cao and Packer et al. L2 pharynx subcluster data into transcript summaries (see ‘Materials and methods’) and examined the expression level of oscillating pharynx-enriched transcripts in each of the subclusters (Figure 10B and C; see Figure 1 for the relative location of each cell type).
During hours 3, 4, and 5, abundant products are secreted by the e epithelial cells, the mc3 marginal cells and presumptive pm6 and 7 transdifferentiated cells (see below). The identity of these transcripts (see Figure 4 and Figure 4—source data 1) suggests that the cells are accumulating stores for the catabolism of the old cuticle and construction of the new one at the onset of the molt. Despite being confident in our assignment of cluster 11 as pm1 (Figure 10—figure supplements 1 and 2), the expression profile of cluster 11 is more like the arcade, e epithelial cells, and mc3 marginal cells than muscle, suggesting that pm1 may also play a role in the catabolism of the old cuticle. This is consistent with the correlation between the pharynx UMAP plot for ABU-14 and what we observe in animals with fluorescently tagged ABU-14 (Figure 6A and A’).
During hours 5 and 6 (which is the peak molting hour), the arcade and e epithelial cells produce abundant secreted components, consistent with the construction of a new buccal cuticle (Figure 10B and D). The mc1 and mc2 marginal cells also secrete abundant product (Figure 10B and E), again consistent with the construction of the channel cuticles and sieve (see Figures 1, 6A and A’).
Conspicuously absent from the expression profiles of confidently assigned subclusters is abundant secretion from the cells that surround the grinder in the posterior bulb (i.e., pm6 and pm7). Subcluster 22, which is confidently identified as pm5, pm6, pm7, and pm8 muscle, express only low levels of secreted proteins during the peak molting hour. Previous work has shown that the pm6 and pm7 cells transdifferentiate from muscle into highly secretory cells during the molting period to build a larger grinder (Sparacio et al., 2020). Based on the expression of a combination of markers (Figure 10—figure supplements 1 and 2) and the abundant expression of secreted products, we infer that subclusters 1 and 5 represent transdifferentiated pm6 and pm7 that secrete many of the same components used in the anterior pharynx epithelia to build the grinder (Figure 10B and F). We find that the IDPAs and IDPBs are expressed in the early transdifferentiating pm6 and pm7 cells (Figure 10B and Supplementary file 2), and therefore likely contribute to grinder formation. This prediction is consistent with our finding that disruption of IDPA-3, which localizes to the grinder (Figure 6B and B’), results in obvious grinder defects (Figure 5D). This prediction is also supported by the exclusive localization of tagged IDPB-3 to the grinder and pm6 cells (Figure 6C and C’). Finally,
During the peak molting hour 6, IDPCs and the APPGs are expressed in most cells that contribute to the pharyngeal cuticle. Again, Kohara’s mRNA in situ database confirms this interpretation with robust and specific pharynx expression patterns for
During hours 7 and 8, NSPB and FIPR expression is more restricted to the arcade, e epithelial cells, and the mc1 cells (Figure 10B and Supplementary file 2). Tagged NSPB-12 supports this prediction (Figure 6E and E’). Tagged FIPR-4, while localizing to the anterior cuticle, is also present in the posterior cuticle, suggesting that secreted FIPR-4 may be able to diffuse extensively (Figure 6F and F’). Cytoplasmic components involved in muscle development peak in expression during hours 7 and 8 (Figure 10C and F).
The number of genes expressed from the gland cells is not obviously enriched in any one temporal interval (Figure 10B and C), yet the overall abundance of gland transcripts peak in hour 5 (Figure 4D). This apparent contradiction is due to the two most abundantly expressed genes from the gland,
Discussion
A model of pharyngeal cuticle construction
Here, we have mined published resources to bioinformatically reconstruct the
During hours 4, 5, and 6, anabolic enzymes and constructive components peak in expression. These include the characterized chitin synthase CHS-2 (Zhang et al., 2005), putative chitosan synthases LGX-1 and CHTS-1 that deacetylates chitin to produce chitosan (Heustis et al., 2012), and putative chitin binders and cross-linkers CHTB-1, CHTB-2, and CHTB-3. In this interval, components implicated in amyloid metabolism also peak in expression. These include a predicted amyloid chaperone LRX-1 (Cam et al., 2004), two predicted amyloid-chitin linkers LRPC-1 and PQN-74 (Brodeur et al., 2012), and a predicted amyloid precursor protein interactor FEH-1 (McLoughlin and Miller, 2008).
During hours 5 and 6, a massive increase in gene expression of the pharynx secretome occurs. The period coincides with the upregulation of secreted intrinsically disordered proteins from the pharynx epithelium and includes successive waves of peak transcript expression encoding four of the intrinsically disordered families, IDPA, IDPB, IDPC, and APPG members that have been previously implicated in cuticle development (George-Raizen et al., 2014).
During hours 5 and 6, the gene products that peak in expression are rich in PPIs compared to the proteins secreted by other tissues. The protein interactors within the pharynx secretome network are highly enriched in low-complexity sequences predicted to phase separate.
Finally, during hours 7 and 8, genes that encode muscle contraction components are upregulated, which likely corresponds to a period of tissue growth at the tail end of molting. We also see the peak expression of the low-complexity families NSPB and FIPR, which are likely added to the cuticle in its final phase of maturation. Together, these observations illustrate the utility of the spatiotemporal map in revealing the logic by which a cuticle is assembled.
The pharynx cuticle is unlikely to harbor amyloid fibrils
Despite the pharynx secretome not being
The idea that the pharyngeal cuticle contains a non-rigid network of IDRs is appealing because the pharyngeal cuticle must be sufficiently flexible to accommodate pharynx movements along the dorsal–ventral (Huang et al., 2008) and anterior–posterior (Avery, 1993) axes. Indeed, others have suggested that IDR-rich proteins within chitin-based cuticles might add elastic properties to what might otherwise be an inflexible chitin-based material (Andersen, 2011). An elastic cuticle might also aid in returning the open and extended lumen (which results from pharynx muscle contraction) to the relaxed ground state position.
Potential contributions of IDPs to the cycles of cuticle formation and destruction
A key feature of phase-separating IDRs is their potential to reversibly transition between different states of matter depending on local conditions and post-translational modifications (Murray et al., 2017; Deiana et al., 2019), including liquids and gel-like biomaterials. The pharyngeal cuticle must soften, be shed, and be reconstructed about every 8 hr during larval development (Lazetic and Fay, 2017). The notion that a network of IDR-rich proteins is not locked into a rigid state but may instead be regulated to increase or decrease intermolecular interactions and change material properties as needed during the molting cycle is an appealing idea that requires further investigation.
Both the APPGs and the IDPBs are highly enriched with cysteines and contribute heavily to an increase in the relative abundance of cysteines that is likely deposited into the developing cuticle as the animal prepares to molt. Other work has shown that the
The spatiotemporal map suggests that many different types of IDPs likely contribute to the pharyngeal cuticle. Previous studies have shown that coexisting condensed protein phases, each with distinct protein compositions, can yield complex biomaterials with layers and other non-uniform properties (Mountain and Keating, 2020; Lu and Spruijt, 2020; Lin et al., 2018). The distinct compositions of the six families uncovered by the spatiotemporal map are suggestive of the potential immiscibility of their condensed phases and of physical mechanisms for building the cuticle, particularly when combined with varying temporal expression, similar to what is observed during cuticle formation of the mussel byssus (Jehle et al., 2020). What is becoming clearer is how evolution has repeatedly capitalized on biomolecular condensates to make complex protective structures.
The molecular composition of cuticles may be evolutionarily plastic
The extent to which the blueprint of
The spatiotemporal map is a foundation for future investigation
The spatiotemporal map provides a starting point to investigate many important questions. First, what is the mechanism by which the temporal unfurling of gene expression is coordinated? While the global oscillatory pattern of
Second, how are catabolic and anabolic processes separated and regulated? The process of molting leaves animals vulnerable and must occur rapidly. In that light, it is perhaps not surprising that we observe a temporal overlap of expression of catabolic and anabolic components. Previous work on the ultrastructure of the grinder cuticle and molt indicates that dense core vesicles (DCVs) lie in wait until the new cuticle is assembled, at which point the DCVs likely fuse with the plasma membrane and dump their contents (Sparacio et al., 2020). Based on the timing of the peak expression of secreted components with respect to the timing of the molt itself, we surmise that (1) there is a temporal lag between the period of peak expression for a given gene and when protein abundance peaks, and (2) unknown mechanisms regulate the timing at which catabolic and anabolic components, perhaps within distinct DCVs, are released into the ECM. In this way, it might be possible to have temporal overlap in the peak expression of genes that encode catabolic and anabolic components. Exactly how the secretion of catabolic and anabolic components is regulated remains to be determined.
Finally, how are patterns within the pharyngeal cuticle established? Cuticle lumen shape and size are likely patterned by the underlying cells, but this simply extends the question. How is the patterning of the electron-dense cuticle ribbing established? Is the information that governs pattern of the flaps, which is seemingly independent of the shape of nearby cells, contained within the flaps’ protein components? Do the successive waves of expression of low-complexity protein families contribute to the layering of the cuticle seen in the electron micrograph cross sections? How might coexisting condensed phases of these proteins establish layering and other complexities of the cuticle structure? The spatiotemporal map of pharyngeal cuticle construction presented here may serve as the foundation for answering these and other questions in the future.
Materials and methods
Methods
Synchronized populations of worms were obtained by first washing off a population of worms rich with gravid adults on plates with M9 buffer, collecting the sample in 15 mL conical tubes, and centrifuging the samples at 800 ×
NQ824
IDPA-3, IDPB-3, FIPR-4, and NSPB-12 were tagged C-terminally with mNeonGreen. The mNeonGreen coding sequence was PCR-amplified from the
Pulse-chase analyses
Synchronized wildtype L1 worms are plated on 10 cm plates at 7000 L1s/plate seeded with OP50
Generating
Dye staining of wildtype and
Congo Red (CR) staining
Synchronized wildtype adult worms were washed and incubated with 0.02% CR from a 1% stock (w/v, dissolved in DMSO; Fisher chemical C580-25; CAS 573-58-0) in 500 µL of liquid NGM for 3 hr in the dark. Worms are then prepped for microscopic analysis as described above.
Thioflavin S (ThS) staining
Synchronized wildtype adult worms were washed and incubated with 0.1% ThS from a 10% stock (w/v, dissolved in DMSO; ThS; SIGMA, T1892-25G) in 500 µL of liquid NGM for 3 hr in the dark. Worms are then prepped for microscopic analysis as described above. The concentration chosen for ThS staining of
Eosin Y (EY) staining
EY staining was performed as described (Heustis et al., 2012). Briefly, synchronized wildtype adult worms were washed and incubated with 0.15 mg/mL from a 5 mg/mL stock (dissolved in 70% ethanol; Eosin Y; Sigma-Aldrich, E4009) in 500 µL of liquid NGM for 3 hr in the dark. Worms are then prepped for microscopic analysis as described above. Note that eosin Y stock should be stored at –20°C and before its use it should be incubated at 55°C for ~2 min and vigorously vortexed to ensure its solvation.
Calcofluor white (CFW) staining
Synchronized wildtype adult worms were washed and incubated with 0.005% CFW from a 1% stock (w/v, dissolved in DMSO; Fluorescent Brightener 28, Sigma-Aldrich, CAS 4404-43-7) in 500 µL NGM for 3 hr in the dark. Worms are then prepped for microscopic analysis as described above. Note that the CFW stock should be placed in boiling water for ~2 min and then vigorously vortexed to ensure solvation of the dye.
Calculations of low-complexity and intrinsic disorder
LCRs in the amino acid sequences of each protein within the
LLPhyScore calculations
The LLPhyScore phase separation score of each protein was calculated using the LLPhyScore algorithm (Cai et al., 2022). The LLPhyScore algorithm is a machine learning-based interpretable predictive algorithm that is based on the idea that a combination of multiple different physical interactions drives protein liquid–liquid phase separation. A protein’s LLPhyScore is a weighted combination of eight sub-scores, each representing one physical feature that is inferred from the input sequence. These physical features include protein–water interactions, hydrogen bonds, pi–pi interactions, disorder, kinked-beta structure, and electrostatics. The scores are optimized via training with 500+ experimentally known phase-separating protein sequences against selected negative sequences. More details about this algorithm can be found in the manuscript in preparation.
AmyloGram and path analyses
AmyloGram (Burdukiewicz et al., 2017) is a method based on machine learning, trained on hexapeptides experimentally tested for their amyloidogenic propensities (Wozniak and Kotulska, 2015). Amino acids are represented by the alphabet that best encoded amyloidogenicity of peptides modeled by n-grams, and it was optimized by a random forest classifier. Classification of a protein amyloidogenicity included calculating its profile with a hexapeptide window shifting along the protein chain. Proteins with amyloid propensity were identified on the basis of an appearance of at least one amyloidogenic fragment. To avoid an excessive number of false positives, non-default specificity values were used: 0.95 and 0.99.
PATH (Wojciechowski and Kotulska, 2020) uses molecular modeling and machine learning. It is a computational pipeline based on Python and bash scripts, using Modeller (Sali and Blundell, 1993) and PyRosetta (Chaudhury et al., 2010). A potentially amyloidogenic query sequence of a hexapeptide was threaded on seven representative amyloid templates. Comparative structure modeling provided evaluation of the models with statistics and physics-based functions. Next, the scores were used by the logistic regression classifier. The analyses with PATH were carried out in two stages. The first scan along the protein chain was done by AmyloGram with the specificity threshold at 0.99, which was then followed by structural modeling and classification using PATH. The second stage was only applied to amyloid-positive regions found by AmyloGram.
LARKS analyses
LARKS predictions were done on a proteome downloaded from WormBase on October 18, 2021. Sequences not completely comprised of the 20 canonical amino acids were rejected from analysis. Each protein from the filtered proteome set of 20,042 proteins was then submitted for LARKS predictions. First, the sequence was separated into a series of overlapping hexapeptide segments (each segment overlapped with five residues from the segment before it; a 150 amino acid sequence contains 145 hexapeptides). The sidechains for each residue in a hexapeptide are computationally grafted onto a fibril model for each of three different LARKS structures (FUS-SYSGYS, FUS-STGGYG, and hnRNPA1-GYNGFG; PDB IDs: 6BWZ, 6BZP, and 6BXX). Energy minimization is done using a Rosetta energy score as a readout, and if the final energy is below a backbone-dependent threshold, then hexapeptide segment is considered a LARKS. Proteins’ LARKS content was determined by the number of favorable LARKS segments divided by the length of the protein.
In vitro expression and analysis of IDPs
Expression vectors and constructs
All protein expression vectors generated for this work were derivatives of the pMBP-FUS-FL-WT (a gift from Nicolas Fawzi [Addgene plasmid # 98651; http://n2t.net/addgene:98651; RRID:Addgene_98651;
Burke et al., 2015], which was modified to remove the FUS1 coding region and to have two cloning sites BamHI and NotI) for facile cloning of new proteins in phase with the HIS-tagged Maltose Binding Protein (MBP) at the N-terminus followed by a TEV protease cleavage site (TEVcs) to generate pPRRH1197. The coding region of proteins of interest (minus signal sequences) was codon optimized for expression in
Protein preparation and purification
Proteins were expressed in
Phase separation assays
Proteins were incubated in 2.5 mM Tris pH 7.5, 150 mM NaCl, 2 mM DTT with either 5% Ficoll (Sigma, F2637) for MBP::FUS1 or 15% Ficoll for MBP::IDPC-2 for 1 hr at 30°C with or without TEV protease (10 units in a 50 μL reaction). The optimal percent Ficoll was determined empirically. Turbidity was measure at 395 nm with a Clariostar plate reader (Mandel). 10 μL of each reaction was spotted onto slides with coverslips then condensates visualized with DIC using a Leica DMRA2 microscope at ×63 magnification.
Protein sequence analysis and logo generation
We used Clustal Omega (Sievers et al., 2011) to align the 110 low-complexity protein sequences and generate a percent identity matrix based on the multiple sequence alignment. For those low-complexity proteins with a predicted signal peptide, the first 20 amino acids were removed from the protein sequence before alignment.
To generate sequence logos, full-length protein sequences from each of the low-complexity protein families identified by the percent identity matrix were aligned using ClustalW (Thompson et al., 1994). Sequence logos were constructed based on these alignments using WebLogo 3.7.4; (https://weblogo.berkeley.edu/; Crooks et al., 2004; Schneider and Stephens, 1990). Amino acid residues were colored according to their chemical properties: polar (G,S,T,Y,C) in green, neutral (Q,N) in purple, basic (K,R,H) in blue, acidic (D,E) in red, and hydrophobic (A,V,L,I,P,W,F,M) in black. The height of the symbol within each stack indicates the relative frequency of that amino acid in that position. Stack widths are scaled by the fraction of symbols in that position (positions with many gaps are narrow). Details of protein sequences used can be found in Figure 4—source data 1.
Statistics and graphs
Except where indicated, statistical differences were measured using a two-tailed Student’s
Materials availability statement
The
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022, Kamal, Tokmakjian, Knox et al. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
How the cuticles of the roughly 4.5 million species of ecdysozoan animals are constructed is not well understood. Here, we systematically mine gene expression datasets to uncover the spatiotemporal blueprint for how the chitin-based pharyngeal cuticle of the nematode
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer