Background
A major constraint to the livestock industry is the variety of helminth (worm) species that can infect animals and inflict significant losses in terms of feed intake and conversion efficiency, milk and meat yields, wool production and fertility, etc. Helminths account for > 55% of all farm animal disease which is reflected in a global market of $1.9 billion for veterinary drugs with activity against endoparasites [1, 2]. Whilst several helminth species are endemic in the UK, the prevalence of the rumen fluke, Calicophoron daubneyi, in particular has increased considerably in the last 10–15 years [3].
C. daubneyi is a member of the family paramphistomidae, a group of flukes generally found in the forestomachs of ruminants, that exhibit a conical shape quite unlike the flattened morphology of other trematodes [4]. Infection occurs when grazing livestock ingest metacercariae that are found encysted on vegetation. Once in the duodenum, newly excysted juvenile (NEJ) flukes emerge and migrate into the intestinal submucosa where they can cause considerable pathology if present in large numbers [5]. If large numbers of metacercariae are ingested (e.g. > 5000 cysts per day over a number of weeks), this can lead to a massive build-up of NEJs in the duodenum which results in acute, clinical paramphistomosis [6]. This is characterised by haemorrhage of damaged regions of the small intestine, causing significant blood loss and hypoalbuminaemia whilst severe cases can be exacerbated by secondary bacterial infection and may be fatal [7, 8].
After feeding on host tissue in the duodenum, the immature flukes begin their migration upwards along the digestive tract to the rumen where they attach to the papillae via their muscular posterior acetabula [9] and begin to produce eggs at around 8–10 weeks post-infection [10]. The mechanism by which mature rumen fluke acquire nutrition from their host remains unclear but may be linked to the differing pathogenicity displayed by different paramphistome species: chronic paramphistomosis caused by mature rumen flukes is considered a production-limiting disease in many tropical and sub-tropical regions where species such as Paramphistomum cervi dominate [11, 12]. In contrast, chronic infection due to C. daubneyi, the predominant species in Europe [13,14,15], is not currently associated with clinical disease or impacts on production [16].
The attachment of C. daubneyi to the rumen epithelium brings the parasite into intimate contact with host tissues but also with the plethora of anaerobic bacteria, protozoa, fungi and methanogenic archaea that constitute the rumen microbiome, residing in the main fermentative compartment of the forestomach. These microbes play a crucial role in breaking down indigestible plant-based material, converting complex carbohydrates into volatile fatty acids that serve as a primary source of energy for the host. However, as a by-product, hydrogen is released and utilised by rumen archaea as an electron donor to reduce carbon dioxide producing methane which is a major greenhouse gas (GHG) released from ruminants [17, 18]. The discovery that extracellular vesicles (EVs) secreted by adult C. daubneyi can alter rumen bacterial species diversity via direct antimicrobial activity [19] demonstrates that paramphistomes can indeed interact with/modulate the rumen microbiome and perhaps utilise them as a food source.
Although there are over 70 recognised species of rumen fluke [20], nucleotide sequence data that would facilitate molecular-level understanding of their feeding mechanisms and host-/microbiome interactions are lacking. To date, transcriptome datasets have only been generated for P. cervi (adults; [21]) and C. daubneyi (4 intra-mammalian life-cycle stages; [22, 23]) whilst no genome sequence information exists for any paramphistome species. Here, using PacBio long-range sequencing, we present a high-quality genome assembly for C. daubneyi and identify key gene families that have undergone specific expansion in this species. We propose that, together with unique morphological adaptations, the molecules encoded by these genes, confer unique functionality for feeding, reproduction and microbial recognition/defence that allow C. daubneyi to survive in a hostile and microorganism-rich host environment. These data will have far-reaching implications for our understanding of host-helminth-microbiome interactions and how these may influence enteric GHG emissions, particularly methane, by livestock [24].
Results
The C. daubneyi genome assembly
The summary statistics of the C. daubneyi genome assembly are shown in Table 1. The final C. daubneyi assembly comprised a 1.76-Gb genome sequence which is considerably larger than those of other trematodes such as C. sinensis (0.54 Gb), O. viverrini (0.60 Gb), P. westermani (0.92 Gb) and F. hepatica (1.14 / 1.27 Gb) [25,26,27,28,29]. The GC content was 43.14% which was comparable to other trematodes. The completeness of the assembly was assessed by BUSCO, using eukaryote lineage data, and revealed that 78.1% completeness and 2.0% duplication level were achieved. This suggests that the C. daubneyi genome is more complete than those described for related species; e.g. C. sinensis (70.2–73.2% complete; 0.7–1.3% duplicated), O. viverrini (73.1% complete; 1.3% duplicated), P. westermani (68.6–70.1% complete; 0.7–1.9% duplicated) and F. hepatica (66.5–71.1% complete; 0.9–1.7% duplicated) [30].
[IMAGE OMITTED: SEE PDF]
A considerable proportion of repetitive content (36.3%) was identified in the C. daubneyi genome which is average when compared to the levels observed in other trematode species where repeating content ranges from 28.9% in C. sinensis to 57.1% in F. hepatica [25, 27]. A total of 17,639 protein-encoding genes (and 19,329 putative transcript isoforms) were predicted from the C. daubneyi genome based on previously published RNA evidence [23]. Analysis of the gene models using the Funannotate annotate module resulted in 18,712 EggNog mapper annotations, 15,182 PFAM annotations and 502 MEROPS annotations. To assess the completeness of the resulting gene predictions, the predicted protein sequences were submitted to BUSCO to identify single-copy core orthologs. Based on this analysis, the predicted proteome of C. daubneyi was estimated to be 93.7% complete. Whilst this is higher than the BUSCO genome score, it may reflect a greater gene prediction accuracy when using proteins on account of redundancy and more conserved amino acid sequence.
Phylogenetic and biochemical relationships within the platyhelminthes
Phylogenetic analysis was performed using 180 shared single-copy orthologues from C. daubneyi and 16 other trematode/cestode/monogenean species and the free-living turbellarian Schmidtea mediterranea selected to represent the main phylogenetic groups and with protein BUSCO scores > 80%. Human sequences were used as an outgroup. Briefly, the longest isoform protein sets were concatenated into a single alignment and submitted to modeltest-ng to obtain the best-fit model for amino acid substitution. The resulting model (LG + I + G4) was provided to raxml-ng along with the concatenated single alignment to generate the final alignment. In the resulting phylogenetic tree, S. mediterranea and monogenean G. salaris both segregated into monophyletic groups, albeit represented here with a single species, leaving the cestoda and trematoda as sister-groups (Fig. 1). Within the trematoda clade, C. daubneyi segregated from the non-schistosome trematodes (likely representing a paramphistomidae monophyletic group) after the split from the schistosomes.
[IMAGE OMITTED: SEE PDF]
In the context of understanding how rumen fluke have adapted to survive in the rumen, and how they acquire nutrition, metabolic pathways were reconstructed for C. daubneyi and 19 other platyhelminth species based on the proportion of complete KEGG modules. Using this approach, the free-living S. mediterranea segregated from the parasitic species, whilst the monogeneans formed a sister group with the trematodes. Overall, the biochemical pathways used by C. daubneyi were similar to other non-schistosome species, particularly O. viverrini, O. felineus and P. westermani (Fig. 2; Additional file 1).
[IMAGE OMITTED: SEE PDF]
A number of gene families have undergone expansion in C. daubneyi
Although the C. daubneyi genome showed a similar level of conservation of biochemical (KEGG) pathways as other trematodes, a number of gene families have undergone significant expansion/contraction. Initial analysis using Cafe5 revealed 1800 significantly changed orthogroups (out of ~ 18,000 orthogroups containing at least 2 genes; p < 0.05). In contrast, GLM analysis identified 137 and 38 significantly altered orthogroups, using standard GLM parameters or a GLM/zero-inflated model, respectively. Thirteen orthogroup expansions shared between the Cafe5 analysis and the GLM-zero-inflated model are shown in Table 2.
[IMAGE OMITTED: SEE PDF]
These included OG0000234, comprised of 19 genes encoding members of the LicD transferase family (IPR007074) which are involved with lipopolysaccharide modification in bacteria [31]; OG0000284 representing 16 genes encoding diacylglycerol O-acyltransferases (IPR007130) that participate in triacylglycerol synthesis [32]; OG0000541, comprised of 11 genes encoding alpha-(1,6)-fucosyltransferases (IPR045573) which participate in core fucosylation of N-linked glycans [33]; OG0000554 representing 14 genes encoding calmodulin-like proteins (IPR002048) which contribute to calcium signalling pathways and regulate growth in F. hepatica [34]; OG0000797, comprising 11 genes encoding enoyl-Acyl carrier protein reductases (IPR002347) which participate in the fatty acid elongation cycle [35]; OG0000828, representing 13 genes encoding the oxygen carrier myoglobin (IPR011406) [36]; OG0001071, comprising 6 genes encoding multiple inositol polyphosphate phosphatases (IPR) which may regulate cellular phosphate levels [37]; OG0001742 and OG0001766, representing 10 and 7 genes encoding uncharacterised proteins respectively; OG0002059, comprising 6 genes encoding ADP-ribose glycohydrolase (IPR005502) which perform ADP-ribosylations [38]; OG0002976, representing 11 genes encoding peptidoglycan-recognition proteins (PGRPs; IPR015510) which are pattern-recognition receptors (PRRs) with important roles in innate immunity [39]; OG0005388, containing 6 genes encoding Von Willebrand factor A-like proteins (IPR013694) which may inhibit plasma proteases or help stabilise the extracellular matrix [40] and OG0005485, representing 5 genes encoding nicotinamide riboside kinases (IPR027417) which participate in cofactor biosynthesis [41].
C. daubneyi has an expanded repertoire of pattern-recognition receptors and putative antimicrobial molecules
The expansion of the PGRP gene family in the rumen fluke genome was of particular interest given the intimate contact of C. daubneyi with the array of bacteria in host rumen fluid. Accordingly, other PRR genes or those with putative antibacterial activity, whose products were previously identified within the secretome of C. daubneyi [23] were identified via BLAST analysis (Fig. 3). Using this approach, a total of 23 sequences encoding PGRPs were confirmed (representing 18 genes and 5 isoforms) (Additional file 2). Phylogenetic analysis showed that these sequences separated into 4 distinct clades (Additional file 3). The majority (15 genes) represented short-form PGRPs, encoding proteins of ~ 200 amino acids in length (~ 20 kDa).
[IMAGE OMITTED: SEE PDF]
These contained the characteristic PGRP/N-acetylmuromoyl-L-alanine amidase domain (IPR015510/IPR002502), within which conserved residues comprising Zn2+-binding (His55, His164 and His172), substrate binding (His56, Thr57, Asp86, Tyr90, Arg104, Ala111, His112, Asn117, His164, Thr170, Asn171 and Cys172) and amidase catalytic sites (His55, Tyr90, His164, Thr170, Cys172) were found. The remaining 8 genes encoded long form PGRPs ~ 700–1200 amino acids in length (~ 86–138 kDa) which contained a single ~ 22 amino acid predicted transmembrane helix downstream of the PGRP/N-acetylmuromoyl-L-alanine amidase domain. Furthermore, 13 C. daubneyi PGRPs possessed a predicted N-terminal signal peptide for secretion whilst 10 did not.
DM9 domain-containing proteins (DM9CPs) are a recently described family of invertebrate PRRs [42]. BLAST analysis identified a total of 22 sequences (representing 20 genes and 2 isoforms) encoding DM9CPs in the C. daubneyi genome. Phylogenetic analysis showed that these sequences segregated into 4 distinct clades (Additional file 3). The largest protein was 399 amino acids in length (~ 44 kDa) and contained 2 DM9 repeat domains (IPR006616) whilst the majority were around 150–230 amino acids (~ 16–25 kDa) and contained a single DM9 repeat domain. None of the encoded C. daubneyi DM9CPs contained a predicted N-terminal signal peptide.
Saposin-like proteins (SAPLIPs) are members of the amoebapore superfamily of lipid-interacting molecules with antibacterial activities [43]. Focusing on proteins only bearing saposin B type domains (IPR008139; typical of SAPLIPs involved in defence), we identified 30 sequences encoding SAPLIPs (representing 26 genes and 4 isoforms) in the C. daubneyi genome. Ten of these showed particular sequence identity with NK lysins (antibacterial proteins that are also amoebapore superfamily members) following reciprocal BLAST analysis. Five major SAPLIP clades were evident following phylogenetic analysis of the amino acid sequences (Additional file 3). Whilst the majority of the C. daubneyi SAPLIP genes encoded proteins of ~ 100 amino acids in length (typically ~ 12 kDa) bearing a single saposin B type domain, one 193 amino acid protein (~ 21 kDa) possessed 2 saposin B type domains whilst 2 SAPLIPs (254 and 441 amino acids in length; 28 and 50 kDa, respectively) contained 3 saposin B domains. All but two of the C. daubneyi SAPLIPs contained a predicted N-terminal signal peptide.
BLAST analysis of 33 platyhelminth genomes representing the major taxonomic groups, and different niches with the mammalian host, suggests that all three gene families (i.e. PGRPs, DM9CPs and SAPLIPs) have specifically expanded in C. daubneyi. For example, the PGRPs were absent from the cestodes and monogeneans and, with the exception of the free-living planarian S. mediterranea, were restricted to the non-schistosome trematodes with 23 PGRPs identified in C. daubneyi, 4 in F. hepatica, 2 in E. caproni and 1 in O. viverrini, O. felineus and C. sinensis (Fig. 3). Whilst the DM9CPs displayed a wider taxonomic distribution, the parasitic trematodes possessed 2–6 putative DM9CPs whilst 22 were identified in C. daubneyi. Similarly, C. daubneyi has at least twice as many SAPLIP genes as the other trematodes for which 2–12 putative SAPLIPs were identified. Although the quality of the various genome assemblies may influence the number of genes reported by homology searches, it is unlikely that this alone accounts for the difference in gene counts between C. daubneyi and the other platyhelminth species since species with high BUSCO scores consistently yielded few BLAST hits (Fig. 3).
Changes in gene transcription were followed across the four intra-mammalian developmental stages of C. daubneyi, namely the newly excysted juvenile (NEJ) flukes that establish infection in the duodenal submucosa, the immature flukes that feed on host blood/tissue in the intestine, the newly migrated (NM) flukes that have recently arrived in the rumen following their migration from the duodenum and the mature adult flukes that are well-established in the rumen. For this, transcripts per million (TPM) values derived from RNA-seq libraries, in biological triplicate for each stage [23], were mapped to the gene models (Fig. 4). The PGRP, SAPLIP and DM9CP gene families all showed developmental regulation with the highest expression levels generally seen in the infective NEJ stage and in the NM flukes that establish chronic infection within the rumen. Sub-groups of the DM9CPs showed increasing (and overlapping) expression with fluke development, with a clear peak at the NM stage, whilst expression of almost all family members had declined in the adult flukes. The expression of the PGRPs and SAPLIPs were broadly similar with sub-groups showing specific upregulation at the NEJ and immature stages with most family members showing clear upregulation in the NM flukes. In contrast to the DM9CPs, expression of numerous PGRP and SAPLIP genes remained high in the adult flukes.
[IMAGE OMITTED: SEE PDF]
Recombinant C. daubneyi PGRP binds to bacteria via interaction with peptidoglycans and induces agglutination
A full-length C. daubneyi PGRP (lacking the predicted N-terminal signal peptide) initially identified within the secretome of adult fluke [23] was expressed in Escherichia coli as a ~ 18.7 kDa recombinant protein fused to a C-terminal His6-tag for purification. The recombinant PGRP protein was isolated to homogeneity from E. coli cell lysates using Ni-chelate affinity chromatography (Additional file 4). The binding of increasing concentrations of recombinant C. daubneyi PGRP to model gram negative (E. coli) and gram positive (Bacillus subtilis) bacterial species was analysed by western blot. PGRP bound to both species in a concentration-dependent manner, with bands of ~ 18 kDa observed with increasing intensity from 0.5 to 2 µg (Fig. 5A).
[IMAGE OMITTED: SEE PDF]
Further experiments showed that 2 µg of PGRP bound to both species after only 10 min incubation in vitro (Fig. 5B). To determine whether C. daubneyi PGRP could interact with biologically relevant species, the binding experiments were also conducted using key anaerobic bacteria found in rumen fluid; namely Pseudobutyrivibrio ruminis and Butyrivibrio spp. (gram negative) and Streptococcus lutetiensis and Propionibacterium australiense (gram positive). As seen with the model species, C. daubneyi PGRP bound to the surface of all rumen anaerobes tested (Fig. 5C).
Given the binding of C. daubneyi PGRP to the surface of bacteria, its specific interaction with peptidoglycan from Staphylococcus aureus was determined (Fig. 5D). Recombinant C. daubneyi PGRP bound to peptidoglycan in a concentration-dependent manner. Absorbance readings were significantly higher (p < 0.05 or p < 0.001) when recombinant PGRP was added to peptidoglycan-coated plates compared with uncoated plates, demonstrating the specificity of the interaction. The specific interaction between C. daubneyi PGRP and peptidoglycan was additionally supported by a trypsinised PGRP and an un-related recombinant protein, F. hepatica thioredoxin glutathione reductase (TGR; expressed/purified in the same manner as PGRP) that displayed 16 and 6% of the peptidoglycan binding shown by PGRP respectively (Fig. 5E). To investigate the functional effect of the binding of C. daubneyi PGRP to the bacterial cell surface, E. coli and B. subtilis were stained with acridine orange and incubated with PGRP, with or without ZnSO4, and observed using fluorescence microscopy (Fig. 5F). PGRP induced agglutination of both species in the presence of ZnSO4. This effect was also observed for E. coli when incubated with PGRP alone, but to a lesser extent. No signs of agglutination were observed when the bacterial cells were incubated with PBS or ZnSO4. Although inhibition of growth was observed in the gentamycin control, C. daubneyi PGRP showed no direct antimicrobial activity when tested against an ovine ruminal isolate of S. lutetiensis during MIC assays.
The C. daubneyi genome suggests an atypical eggshell protein crosslinking chemistry
When examining the eggshell-producing vitellaria of C. daubneyi, it appears similar to that of other trematodes; specifically, it is arranged in follicles that occupy much of the lateral margins of the fluke. At the ultrastructural level (Fig. 6A), each vitelline follicle is seen to comprise a group of vitelline cells at various stages of development, ranging from undifferentiated stem cells, featuring scant ribosome-rich cytoplasm and euchromatic nuclei, to fully mature vitelline cells. The latter are situated centrally in each follicle and contain numerous peripherally located clusters of dense eggshell protein globules, large cytoplasmic reserves of glycogen, and clusters of heterophagosomes (‘yolk globules’), often located near the nucleus, which is relatively electron dense. Intermediate type vitelline cells exhibit, initially, accumulation of eggshell protein globules in clusters (corresponding to Golgi bodies) within the cytoplasm (It1 cells), followed by marked cell swelling as glycogen reserves amass in the cytoplasm (It2 cells).
[IMAGE OMITTED: SEE PDF]
Nurse cells, the nuclei of which are often located near the periphery of the follicle, send out numerous dendritic extensions of cytoplasm which surround the developing vitelline cells, making intimate contact with their limiting membrane by means of junctional complexes. Within these cytoplasmic extensions, dense mitochondria and elongated dense lysosomes are present. However, when tissue sections were stained with Mallory’s Trichrome (which stains keratin and collagen), the eggshell protein globule clusters, and the eggshell of whole eggs within the uterus (and those shed by the flukes) of C. daubneyi stained a deep red colour whilst those of F. hepatica were a golden brown (Additional file 5) suggesting differences in eggshell biochemistry between the two species.
Trematode eggshells are typically sclerotised by crosslink formation between L-DOPA residues on neighbouring vitelline proteins (reviewed by [44]). There are two schools of thought about DOPA-mediated cross-linking. One posits that DOPA is oxidised to quinone which is attacked by nucleophiles such as lysine and histidine to produce DOPA-Lys/His adducts [45]. This pathway has been confirmed as the mechanism of squid beak hardening [46]. The other, proposes that cross-links arise as bi- and tri-DOPA-Fe3+ complexes and has been best characterised in Mytilus byssus [47]. As DOPA formation prior to protein secretion is necessary in both pathways, it is reasonable to expect a protein-directed tyrosyl hydroxylase. BLAST analysis, using Schistosoma mansoni tyrosine hydroxylase as query, identified three putative C. daubneyi aromatic amino acid hydroxylases; however, backBLAST indicated these were either phenylalanine or tryptophan hydroxylases. In support of this, amino acid analysis found no evidence of L-DOPA in either C. daubneyi whole eggs or eggshell fragments whilst F. hepatica eggshells contained up to 5.1% L-DOPA residues (Fig. 6B; Additional file 6). Six vitelline protein genes were identified in the C. daubneyi genome. These typically encoded proteins ~ 29 kDa containing a conserved trematode eggshell synthesis (TES) domain (IPR012615) although one had a predicted mass of 46.4 kDa due to an additional 133 amino acid C-terminal cysteine-, proline- and tyrosine-rich extensin-like region. Three vitelline proteins (including the atypical 46.4 kDa form) were identified within C. daubneyi eggshells by mass spectrometry (Additional file 7). Primary sequence analysis showed that the C. daubneyi vitelline proteins contained ~ 7% of both tyrosine and cysteine residues and that several tyrosines known to be converted to L-DOPA in F. hepatica had been substituted for cysteine (Fig. 7). In contrast, vitelline proteins from other trematodes lack cysteine and contain ~ 12–15% tyrosine residues.
[IMAGE OMITTED: SEE PDF]
Tyrosinases are often associated with the oxidation of DOPA to DOPAquinone [48]. A single tyrosinase gene was identified in the C. daubneyi genome and whilst the functional tyrosinase domain (IPR002227) is present, the enzyme is absent from both of the transcriptome databases generated for adult C. daubneyi [22, 23] and that of P. cervi [21] and thus is not expressed by the egg-producing stage of paramphistomes. Indeed, the enzyme was not detected during mass spectrometry analysis of C. daubneyi eggshell extracts but was found in parallel proteomics analysis of F. hepatica eggshells (Additional file 7). Similarly, no staining was observed when adult C. daubneyi were incubated with catechol, which visualises tyrosinase as an intense red-brown stain within the vitellaria of F. hepatica (Additional file 5; [49]). Taken together, the data suggest that C. daubneyi uses different eggshell protein crosslinking chemistry compared to other trematodes.
The C. daubneyi eggshell is resistant to adverse biochemical conditions
Experiments were conducted to determine whether the biochemical differences between C. daubneyi and F. hepatica eggshells would influence their stability in varied environmental conditions. Initially, whole eggs were collected from both species and exposed to different pH values for 1 or 4 h in vitro and observed by light microscopy. Incubation at pH 4.0 for 4 h led to degradation of the outer layer of the F. hepatica eggshell whilst C. daubneyi eggs remained intact (Additional file 8). Similar disintegration of the F. hepatica eggshell was evident when they were exposed to pH 9.0 for 1 h, and after 4 h incubation very few intact eggs remained. In contrast, C. daubneyi eggs remained intact following incubation at pH 9.0, at either time point.
Sodium hypochlorite (NaClO) has been described as a “de-tanning” agent of dityrosine cross-linked trematode eggs and has been shown to cause severe ultrastructural defects to eggs from S. mansoni and F. gigantica [50, 51]. Here, whole F. hepatica and C. daubneyi eggs were incubated in increasing concentrations of NaCIO for 1 h and their structural integrity was assessed by light microscopy. No change was observed when either sample was incubated in 0.25% NaCIO solution. However, when incubated in 0.5% NaClO, the F. hepatica eggs completely disintegrated whilst the C. daubneyi eggs remained intact (Additional file 8). At higher (0.75% and 1.00%) NaClO concentrations, the F. hepatica were again completely disintegrated with no visible fragments remaining. In contrast, whilst the C. daubneyi eggs were disrupted at these higher concentrations, numerous eggshell fragments were still visible. Collectively, these observations demonstrate that C. daubneyi eggs are more resilient than F. hepatica eggs when exposed to NaClO and have greater stability across a wider pH range.
C. daubneyi displays specialised gastrodermal cell morphology and a different repertoire of secreted peptidases compared to haematophagous species
Histological examination of adult C. daubneyi, that were fixed immediately after removal from the host rumen, revealed the presence of ciliates within the oral cavity/foregut. However, recognisable structure is rapidly lost as these move deeper within the gut caeca (Additional file 9), consistent with extracellular digestion. A total of 423 peptidases were identified in the C. daubneyi genome, of which 126 encoded an N-terminal signal peptide for secretion via the classical ER/Golgi pathway. These were dominated by the C1A papain-like cysteine peptidases (47 sequences) which comprised 43 cathepsins B (representing 38 genes and 5 isoforms), four putative cathepsin L/F genes and a single dipeptidylpeptidase (cathepsin C) gene. Other abundant groups of secreted peptidases included the A1A pepsin-like aspartic peptidases (15 genes and 5 isoforms) and the C13 legumain-like asparaginyl endopeptidases (5 genes and 2 isoforms) (Fig. 8A).
[IMAGE OMITTED: SEE PDF]
The cells lining the C. daubneyi gut exhibit ultrastructural features associated with secretory and absorptive activity, whilst the apex of each cell bears numerous microvillae/lamellae, each approximately 50 nm thick, which are densely packed and extend far into the gut lumen (Fig. 8B). The limiting membranes of these bear a dense glycocalyx from which fibrous strands emanate, often connecting adjoining lamellae.
The apical cytoplasm of the gastrodermal cells features Golgi bodies, each with 4–5 elongated sacs per Golgi stack, and small dense secretory vesicles are associated with the maturing face. These dense vesicles apparently join to give rise to larger spherical to elongated secretory bodies below the apical membrane. These discharge their contents at the base of the microvilli/lamellae, as evidenced by the appearance of temporary depressions at the cell surface. Predominant in the cytoplasm of the basal and perinuclear regions in each gastrodermal cell are free ribosomes and cisternae of granular endoplasmic reticulum. Mitochondria, though numerous, are small, inconspicuous and of moderate electron density. The basal plasma membranes of the gastrodermal cells are extensively infolded, forming lamellae that extend to the perinuclear and apical cytoplasm.
The extensive lamellae/microvillae that extend from the apical surface of the gastrodermal cells of C. daubneyi would provide a huge surface area for absorption of molecules, such as volatile fatty acids (VFAs), that are abundant in rumen fluid. In mammalian gastrointestinal epithelial cells, these are imported via transport proteins expressed on the plasma membrane that include the H+-coupled MCT1 (SLC16A1) and MCT4 (SLC16A3) transporters and the Na+-coupled SMCT1 (SLC5A8) and SMCT2 (SLC5A12) transporters [52]. Whilst BLAST searches using human VFA transporters as queries identified 12 and 6 genes in C. daubneyi encoding putative H+-coupled (IPR050327; proton-linked MCT) and Na+-coupled (IPR051163; sodium:solute symporter) transporters respectively, similar genes were also identified in F. hepatica, an obligate blood-feeder.
Discussion
Clinical paramphistomosis, associated with large numbers of immature C. daubneyi in the duodenal submucosa, can cause significant damage to host tissue [5] and may result in fatal disease outbreaks in both cattle and sheep [6, 8, 53]. In contrast, chronic infection caused by adult flukes residing within the rumen is seemingly well-tolerated in terms of animal health and productivity [9, 16, 54] even with very heavy parasite burdens (e.g. > 1000 flukes per cow; [16]) which are frequently observed. The rumen niche is biochemically unique compared with the sites typically occupied by other trematode species (e.g. bile duct, blood vessels, lung tissue). Moreover, rumen fluid is particularly microbiome-rich containing a vast array of bacteria, fungi, ciliates and viruses [17]. However, it remains unclear how C. daubneyi interacts with, and modulates, these microorganisms with which it shares the rumen epithelial microenvironment. To begin to unravel such interactions, and other poorly understood areas of rumen fluke biology, we present a 1.76-Gb draft genome for C. daubneyi, the first to be described for any paramphistome species. The genomic data, integrated with microscopical, biochemical and proteomics investigations reveal a number of novel molecular and morphological adaptations for microbial recognition/defence, feeding and reproduction that enable C. daubneyi to thrive in such a hostile host environment.
The BUSCO scores of the final C. daubneyi 1.76 Gb genome demonstrate that our assembly is amongst the largest and highest quality reported for any trematode species to date. This is likely due to the use of the PacBio sequencing platform for both the genome sequencing and supporting RNA evidence [23] which generates much longer, better quality reads than other sequencing technologies. The availability of the genome resources allowed the first phylogenomic-level analysis to include a paramphistome to be performed using 180 shared single-copy orthologues from C. daubneyi and 17 other flatworms. In the resulting tree, the phylogenetic relationships of free-living flatworms, monogeneans, cestodes and trematodes were in agreement with a previous genome-wide phylogeny of 25 platyhelminth species [55] with the addition of C. daubneyi (representing the paramphistomidae) which segregated from other sub-order Plagiorchiida trematodes, such as F. hepatica, after the split from the schistosome lineage. Whilst C. daubneyi showed similar coverage of KEGG metabolic pathways as phylogenetically related species, a number of gene families, with putative roles in bacterial recognition/defence, have undergone significant expansion. Given, the microbial milieu in which C. daubneyi reside, an expansion of gene families with such roles is perhaps not surprising. Despite this, little is known with regard to the impact of rumen fluke on the host microbiome with a solitary investigation into rumen fluke EVs [19] providing our only insight to date.
Peptidoglycan recognition proteins (PGRPs) are a highly conserved family of pattern recognition receptors (PRR) first described in the silkworm, Bombyx mori [56]. They exist in short (secretory) and long (transmembrane/secretory) forms and are considered key components of the innate immune system as they recognise pathogen-associated molecular patterns (PAMPs), such as peptidoglycan (PGN), on bacterial cell walls [39]. PGRPs are classified according to their biochemical activity. Catalytic PGRPs possess amidase activity and can cleave the amide bond between N-acetylmuramic acid and L-alanine of PGN [57]. This can lead to loss of cell wall integrity and may be responsible for the direct bactericidal effects of some insect and mammalian PGRPs [39, 58]. In contrast, non-catalytic PGRPs (typically seen in invertebrates) lack enzymatic activity but, after binding to PGN, activate the Imd, Toll and prophenoloxidase (PPO) pathways, the three major immune effector mechanisms in insects [58,59,60]. The C. daubneyi genome contains 18 PGRP genes with an additional 5 isoforms, which in comparison to other parasitic flatworm species (with 0–4 PGRP genes), represents a major expansion of this family. We previously identified two short form PGRPs in the secretome of adult C. daubneyi using mass spectrometry [23], the most abundant of which was produced as an ~ 18.7 kDa recombinant protein in E. coli. The recombinant PGRP bound rapidly to the surface of gram-positive and gram-negative aerobic bacteria, and to several obligate anaerobes typically found in the rumen microbiome, via specific interaction with purified cell wall peptidoglycans. Moreover, the recombinant PGRP had a moderate agglutinating effect on bacteria which was enhanced in the presence of Zn2+ ions, a co-factor of catalytic PGRPs [61]. Conceivably, this action could confer protection against colonisation of the fluke tissues by pathogenic bacteria but may also facilitate ingestion of bacterial cells, in clusters, as a source of nutrition (see below). Although the recombinant PGRP showed no direct antimicrobial activity against S. lutetiensis during MIC assays, it is possible that C. daubneyi PGRPs work synergistically with other secreted molecules to achieve an antimicrobial effect.
Comparative genome analysis also revealed that the C. daubneyi saposin-like protein (SAPLIP) family has undergone significant, and specific, expansion (26 genes and 4 additional isoforms) compared to other flatworm species (with 0–12 genes). SAPLIPs are members of the amoebapore superfamily of lipid-binding molecules that interact with biological membranes in one of three ways: binding, perturbation without permeabilization, or permeabilization [62]. In blood-feeding trematodes, such as F. hepatica and C. sinensis, SAPLIPs have been shown to induce lysis of erythrocytes and release of haemoglobin which can be degraded by secreted peptidases [63,64,65]. However, many SAPLIPs are important effectors of the innate immune system, with roles in pathogen recognition, membrane permeabilization and cellular lysis [62]. Indeed, many of the C. daubneyi SAPLIP genes showed particular sequence identity with NK lysins which are SAPLIP family members, typically characterised from vertebrates, that show antimicrobial activity against a broad range of pathogens including fungi, protozoa and bacteria (reviewed by [66]). Since adult paramphistomes do not appear to be haematophagous (see below), it is likely that the large SAPLIP family of C. daubneyi has evolved as a means of defence against the vast numbers, and great diversity, of microorganisms present within the rumen environment. As SAPLIPs are also present in C. daubneyi soluble secretions [23] and EVs (Morphew & Robinson, unpublished data), it is conceivable that they work in concert with the PGRPs to disrupt the bacterial cell wall. In this scenario, the putative amidase activity of the PGRPs would first degrade the outer peptidoglycan layer and thus allow the SAPLIPs to access, and lyse, the underlying phospholipid membrane. Continued investigation of the function of C. daubneyi secretory molecules will help answer one of four key questions posed recently by Rooney et al. [67] in understanding the mechanisms of helminth-microbiome interactions.
A family of DM9 domain-containing proteins (DM9CPs) have also undergone expansion in C. daubneyi (20 genes and 2 additional isoforms) compared with other flatworm species (0–9 genes). The DM9CPs are mainly found in arthropods and platyhelminths [68], and typically possess two or more DM9 domains which contain a 60–75 amino acid motif first identified in the fruit fly, Drosophila melanogaster [42, 69]. Although not well characterised, recent studies have shown that oyster and crab-derived DM9CPs have roles in innate immunity and can induce agglutination of bacterial and fungal cells and enhance the encapsulation activity of haemocytes. DM9CPs can also bind to a variety of PAMPs such as mannose, lipopolysaccharide and peptidoglycan and are thought to act as multipotent PRRs [70,71,72]. DM9CPs were identified within the protein cargo of EVs secreted by C. daubneyi [19], S. mansoni [73] and F. hepatica where they were specifically found on the EV surface [74]. Although the function of the C. daubneyi DM9CPs has yet to be determined, an orthologue from F. gigantica displayed agglutination activity against gram-negative and gram-positive bacteria [75]. As for the PGRPs and SAPLIPs, it is likely that the unique microbial-associated selective pressures exerted on C. daubneyi have driven the expansion of a DM9CP family with sensory/protective functions. Gene expression analysis suggests that the PGRPs, SAPLIPs and, to a lesser extent, the DM9CPs, cluster according to those upregulated in the NEJ and immature fluke stages (which inhabit the duodenum) and those upregulated in the newly migrated and adult flukes (which inhabit the rumen). Since the composition of the host microbiome varies greatly between these locations [76], it appears that the gene expansions have created large families of secretory molecules conferring (potentially synergistic) recognition of/defence against a wide range of bacteria concomitant with fluke development and migration.
Adult C. daubneyi attach to the rumen epithelium via a muscular posterior sucker. This arrangement ensures that the oral sucker, and thus the opening to the blind-ended gut, is free to move within the rumen fluid [9]. The general absence of red blood cells in the gut of paramphistomes [77,78,79] and the lack of significant pathology within the rumen epithelium suggests that adult rumen flukes are not obligate blood-feeders [53, 80]. The detection of cellular debris, bacteria and ciliates, derived from rumen fluid, within the pharynx and oesophagus of C. daubneyi suggests that they feed by direct digestion of the rumen microbial community [80]. In support of this, our microscopical observations of rumen fluke tissue sections suggest that adult C. daubneyi are capable of extracellular digestion of rumen microorganisms. Indeed, UK-sourced paramphistomes, identified as Paramphistomum hiberniae in the 1950s (which may have been misidentified C. daubneyi specimens; [20]), could be maintained in rumen fluid alone for up to 9 days in vitro [81]. However, C. daubneyi had no effect on ciliate numbers when co-cultured in vitro [22].
Ultrastructural examination of the gastrodermal cells that line the C. daubneyi gut revealed features associated with secretory and absorptive activity. Transcriptome and secretome data demonstrate that whilst C. daubneyi adults secrete a very different repertoire of digestive peptidases compared to blood-feeding adult F. hepatica (i.e. cathepsins B and aspartic peptidases rather than cathepsins L1/2/5; [23, 82], they are not particularly abundant. This is in contrast to liver fluke where cathepsins L comprise around 80% of the total protein secreted by the adult stage [82, 83] and may be linked to the abundance of free amino acids in rumen fluid for direct uptake [84]. Collectively, these observations suggest that extracellular digestion may not be the sole source of nutrient acquisition by C. daubneyi residing within the rumen. The dense lamellae/microvillae that extend from the apical surface of C. daubneyi gastrodermal cells would provide a huge surface area for absorption of molecules from the rumen fluid. Indeed, the ruminant host primarily utilises volatile fatty acids (VFAs) such as acetate, propionate and butyrate that are produced during microbial fermentation in the rumen as a source of energy [85]. In mammalian gastrointestinal epithelial cells, these are typically imported via transport proteins expressed on the plasma membrane that include the H+-coupled MCT1 (SLC16A1) and MCT4 (SLC16A3) transporters and the Na+-coupled SMCT1 (SLC5A8) and SMCT2 (SLC5A12) transporters [52]. Whilst putative homologues of these VFA transporters were identified within the C. daubneyi genome, similar genes were also observed in F. hepatica demonstrating that they are not exclusive to rumen-dwelling paramphistomes. However, the gastrodermal cell lamellae/microvillae of C. daubneyi are around three times longer than those of F. hepatica [77] which would yield a much larger surface area for expression of VFA transporters and VFA uptake from rumen fluid.
Despite the apparent differences in the feeding mechanism used by C. daubneyi compared with other trematodes, all share the same nutritional requirement—to support the production of vast numbers of eggs; e.g. F. hepatica produces an average of 25,000 eggs per fluke, per day [86]. Trematode eggshells are comprised of vitelline proteins, released from protein globule clusters in mature vitelline cells, that coalesce around the ovum [44]. The vitelline proteins form a tough protective layer around the developing embryo in a process that has been known as quinone tanning, during which tyrosine residues present in the vitelline proteins are initially modified by the addition of a hydroxyl group to form L-DOPA (3,4-dihydroxyphenyl-l-alanine), in a step catalysed by tyrosine hydroxylase. The L-DOPA residues are presumed to be oxidised by tyrosinase/phenol oxidases to form DOPA-Lys and/or DOPA-His crosslinks between neighbouring vitelline proteins which serves to harden and stabilise the eggshell [46, 87, 88]. This scenario, however, is being questioned by the finding that in Mytilus byssus most of the DOPA is cross-linked by bis- and tris-coordination of Fe3+ [47]. Given that trematodes like Fasciola spp. and Schistosoma spp. consume blood with iron-rich haemoglobin, they may rely on it as well [89]. At the ultrastructural level, C. daubneyi vitelline follicles appear similar to that of other trematodes and contain vitelline cells at progressive stages of development that culminate in mature cells bearing numerous eggshell protein globule clusters within their peripheral cytoplasm. However, our data suggest that C. daubneyi uses atypical eggshell protein crosslinking chemistry. Firstly, whilst amino acid analysis revealed up to 5.1% L-DOPA residues in F. hepatica eggshell proteins, L-DOPA was not detected in C. daubneyi eggshells, likely due to the lack of a tyrosine hydroxylase gene which catalyses the conversion of tyrosine to L-DOPA. Indeed, alignment of six conceptually translated C. daubneyi vitelline protein genes showed that several tyrosine residues, known to be converted to L-DOPA in F. hepatica [90], were substituted for cysteine which is not found in the vitelline proteins of other trematodes. Secondly, although a single gene encoding the putative eggshell crosslinking enzyme, tyrosinase, was identified in the C. daubneyi genome, RNA-seq data showed that it is not expressed by egg-producing adult flukes. In addition, the enzyme was not detected during MS/MS analysis of rumen fluke eggshell extracts. Previous histochemical studies of other paramphistome species have suggested that their eggshells are keratinised and are stabilised by disulphide (Cys-Cys) bonds rather than via dityrosine crosslinks [91]. Cysteine-rich eggshell membrane proteins are common in avian and reptilian eggshells and are disulfide crosslinked in the mature membranes [92]. Despite their occurrence in C. daubneyi vitelline protein primary sequences (~ 7%), cysteines were not detected in their eggshells by direct biochemical analysis. Thiol cysteines do not typically survive the acid hydrolysis extraction procedure used during amino acid analysis, but disulphide cysteines do, so the absence of cysteines suggests that disulphide crosslinks may not occur in C. daubneyi eggshells.
Mass spectrometry analysis of C. daubneyi eggshells led to the identification of an uncharacterised 113 kDa secretory protein with an internal region showing ~ 37% sequence identity with a proline-rich extensin-like protein. Moreover, one of the rumen fluke vitelline proteins (also confirmed within the eggshell by mass spectrometry) also possessed a C-terminal extension with ~ 30% sequence identity with an extensin-like protein. Extensins are important structural components of plant cell walls which form intramolecular isodityrosine crosslinks and intermolecular di-isodityrosine crosslinks, catalysed by extensin peroxidases, at Tyr-Xaa-Tyr motifs [93, 94]. The vitelline protein C-terminal region contains 4 Tyr-Xaa-Tyr motifs whilst a thioredoxin domain-containing protein 17, with putative peroxidase activity [95] was detected in eggshell protein extracts by mass spectrometry. Although requiring experimental validation, peroxidase-mediated di-isodityrosine crosslinks between adjacent vitelline proteins may stabilise the eggshell and contribute to the enhanced stability of C. daubneyi eggs. Furthermore, the large number of ferritins identified during MS/MS analysis of C. daubneyi eggshell extracts suggests that catechol-iron coordination could play a role in eggshell hardening as suggested for S. mansoni [96]. Due to the heteropolymeric nature of trematode eggshells, it is perhaps likely that multiple intermolecular bonds participate in this process.
Conclusions
Our analysis of the C. daubneyi draft genome has allowed us to investigate key aspects of paramphistome biology like never before. One of the defining features of C. daubneyi is its final niche within its mammalian host—the rumen. The rumen maintains a highly complex ecosystem of microorganisms that break down plant-based feed thus increasing nutrient availability to the host [17]. In particular, enteric fermentation by methanogenic archaea within the rumen is the leading cause of methane (CH4) emissions from ruminant livestock which contributes substantially to global greenhouse gas (GHG) emissions (reviewed by [24]). Our genome-led discovery of a family of C. daubneyi-derived PGRPs that can bind to the bacterial cell surface and induce agglutination represents a novel route for helminth-microbe interaction. Further characterisation of the putative antibacterial effects of PGRPs, and the similarly expanded SAPLIP family, will lead to a greater understanding of how paramphistomes modulate microbial communities within the rumen and perhaps influence enteric CH4 emissions from livestock [97]. This is not without precedent as the considerable increases in CH4 emissions yield seen with Haemonchus contortus and Trichostrongylus colubriformis infections in sheep were also associated with changes in rumen bacterial diversity—characterised by suppression of species that regulate ruminal homeostasis and expansion of methanogenic archaea [98].
In addition to direct antimicrobial effects, molecules secreted by C. daubneyi may also influence how host cells sense, and respond to, bacterial PAMPs. Detection of PAMPs by PRRs, such as toll-like receptors (TLRs) expressed by host epithelial cells and/or innate immune cells leads to inflammatory responses, the resolution of which results in tolerance to commensal microbes or clearance of pathogens [89]. We have found that C. daubneyi secretes an expanded repertoire of putative PRRs (PGRPs and DM9CPs) with confirmed/predicted ability to bind bacterial ligands such as peptidoglycan and lipopolysaccharide (LPS). In addition, our previous mass spectrometry analysis revealed that helminth defence molecule (HDM) and fatty-acid binding proteins (FABPs) were amongst the most abundant proteins in the rumen fluke secretome [22, 23]. Strikingly, orthologues of both molecules block TLR4 signalling and protected mice against LPS-induced inflammation by significantly reducing levels of TNFα and IL-1β [99,100,101,102]. Thus, it is conceivable that the array of PRRs secreted by C. daubneyi will regulate inflammatory responses within the rumen, resulting in altered microbial diversity. Although adult C. daubneyi are not overtly pathogenic, such dysregulation of the rumen microbiome can influence ruminal fermentation processes, which are critical for the animal’s nutrient absorption and energy production, thereby reducing feed utilisation efficiency. This disruption may have indirect, yet significant, implications for animal health and productivity.
By reporting the draft genome for C. daubneyi, we have created a platform for comparative genome-level studies with other rumen-dwelling paramphistomes such as P. cervi, Gastrothylax crumenifer, Cotylophoron cotylophorum and Fischoederius elongates which are considered production-limiting infections of livestock in tropical/sub-tropical regions [11, 12, 103]. The continued generation, and exploitation, of multi-omics resources for paramphistomes should lead to the discovery of novel targets for molecular diagnostic tests for pre-patent infection (associated with the most damaging pathology). This would be a major advance on current microscopical methods and would help avoid economic losses associated with paramphistomosis, especially in low/middle-income countries.
Methods
Parasite material and DNA sequencing
Adult C. daubneyi were collected from the rumen of a naturally infected cow in a local abattoir (Dungannon, Northern Ireland). Flukes were washed several times in warm (39 °C) sterile PBS, snap-frozen in liquid nitrogen and stored at − 80 °C until use. High molecular weight genomic DNA was isolated using the Qiagen Genomic Tip 100/G kit and quantified/quality-controlled using a Qubit fluorometer (ThermoFisher) and agarose gel electrophoresis. Hi-Fi libraries were prepared using the SMRTbell Express Template Prep Kit 2.0 (PacBio). Five micrograms genomic DNA (from a single fluke) was sheared to an appropriate size (15–20 kb) with the Megaruptor 3 (Digenode). This material was made free of single-stranded ends, damage repaired and end tailed. This was ligated to an overhang adapter to create the SMRT bell library. Non-circular molecules were removed in an enzyme clean up step. The DNA was size selected using a 0.75% agarose cassette and s1 marker using the Sage Blue Pippin system and sequenced using Sequel Polymerase 2.2 on the PacBio Sequel 2e platform using 30-h movies and adaptative loading.
Genome assembly
PacBio HiFi reads were generated by SMRTlink 10 [104] and subsequently assembled by HiFiASM [105] using default settings. The genome assembly was screened for contamination using Blobtools2 [106] using default settings. The completeness of the assembly was assessed by BUSCO [107] using Eukaryota Odb10 [108]. Prior to gene prediction, repeat models were generated using Red [109], which were subsequently used as BLASTX queries against the UniProt/Swiss-Prot database of curated proteins. In order to improve the sensitivity of the gene prediction process, sequences that significantly matched with any protein were removed from the repeat models using ProtExcluder [110]. Finally, filtered repeat models were applied to the genome, using RepeatMasker [111].
Gene prediction and annotation
Genes were predicted using the Funannotate pipeline which uses both protein and RNA evidence to train a suite of ab initio predictor tools: AUGUSTUS [112], Snap [113], GeneMark [114], GlimmerHMM [115] and PASA [116]. Here, the UniProt/Swiss-Prot curated protein dataset was used as protein evidence, and our previous long-read Isoseq and short-read RNA-seq data for C. daubneyi [23] was provided as transcript evidence. The protein data was aligned to the genome using Diamond BLASTX [117] to identify strong matches to coding sequences. RNA-seq data were aligned to the genome using the splice-aware short-read alignment tool, HISAT2 [118], with a maximum intron size of 300 kb. Aligned data were used to generate a reference-guided transcriptome assembly with Trinity [119] that was post-processed with Stringtie [120] and PASA. The above steps were carried out using the Funannotate train1 module, and the resulting filtered, high-quality set of transcript and protein alignments was used in the subsequent training of the gene predictors. As an additional option, the module was instructed to carry out optimisation of AUGUSTUS gene models. Predictions from each tool were collated and filtered by the Funannotate evidence modeller (EVM) to yield a final set of genes. Gene models were refined (splitting/fusing of gene models, and prediction of putative transcript isoforms) and updated to add predicted UTR sequences, based on RNA evidence. Genes were annotated using the Funannotate annotate module, which uses the Eggnog-mapper [121], UniProt MEROPS and PFAM to generate functional predictions where possible. Finally, to correlate gene expression with parasite development, RNA-seq read count data from 4 intra-mammalian life-cycle stages of C. daubneyi (NEJs, immature flukes, newly migrated flukes and adults; identified by gross morphology, histological examination and location within the mammalian host [23]) was mapped to the gene models and expressed as transcripts per million (TPM) as previously described [23].
Phylogenetic analysis
Phylogenetic tree generation was performed using a range of platyhelminth genomes and Homo sapiens as an outgroup. Single-copy orthologue groups (Additional file 10) were identified between all species using Orthofinder (v2.5.4) [122]. Protein sequences within single-copy orthogroups were aligned using maaft (v7.487) [123], followed by alignment trimming with gblocks (0.91b) [124]. All alignments were concatenated and provided to modeltest-ng-static (v0.1.7) [125] for amino-acid substitution model testing. Finally, the alignment and selected model were provided to RaxML-ng (v0.6.0) [126] for tree inference. RaxML-ng was performed using 10 randomised parsimony starting trees and 1000 bootstrap replicates.
Comparative genomics of platyhelminth species
A number of the best-quality platyhelminth genomes were selected for comparison with the C. daubneyi genome assembly using a similar approach to Coghlan et al. [55]. Species, representing the major clades, were selected based on BUSCO (v5.2.2) [107] proteome score (> 80% completeness, eukaryota_odb10). Longest iso-forms were chosen as a representative for each gene, which prevents false positives when studying gene expansion/contraction. Orthogroups were identified across all species using Orthofinder, with newick tree generation again described as above. Changes in gene family numbers were assessed two ways: (1) using CAFE (v5) [127], and (2) GLM analysis using Maaslin2 (v1.6.0) [128] using the “ZINB” option, a zero inflation model to take into account the high number of zeros for some species in orthogroups. A final set of expanded or contracted gene families were taken as those that were significant in both analyses. C. daubenyi genes were annotated with gene ontology (GO) terms using eggnog (v5.0) [129].
Identification, and phylogenetic analysis, of C. daubneyi gene families
C. daubneyi gene families and specific homologues of interest were identified using BlastP (1.0 × 10−6 e-value cut-off) run locally within BioEdit [130] using query sequences from other trematodes (e.g. F. hepatica, Opisthorchis viverrini, Clonorchis sinensis and Schistosoma spp.). TPM values for each gene across 4 intra-mammalian lifecycle stages, assigned using previous RNA-seq data [23], were used to produce heatmaps of gene expression via the heatmapper.ca expression tool [131]. Neighbour-Joining phylogenetic trees were constructed for selected protein sequences using MEGAX [132]. The bootstrap consensus tree inferred from 1000 replicates was taken to represent the evolutionary history of the taxa analysed [133]. Branches corresponding to partitions reproduced in less than 50% bootstrap replicates were collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) were shown next to the branches. The evolutionary distances were computed using the Poisson correction method [134] and are in the units of the number of amino acid substitutions per site.
Expression and purification of recombinant C. daubneyi PGRP
Recombinant C. daubneyi PGRP was produced in Escherichia coli by Genscript (NJ, USA). Briefly the protein coding sequence of transcript Cdaub_10767, the most abundant PGRP found in the C. daubneyi secretome [23], was codon-optimised for E. coli, chemically synthesised, and subcloned into the pET30a expression vector. E. coli BL21 Star (DE3) cells were transformed with the construct and bacterial cultures (1 L) were grown in LB media supplemented with kanamycin (100 mg/ml) at 37 °C until OD600 0.6–0.8 and expression of the recombinant PGRP was induced by the addition of 0.5 mM IPTG for 16 h at 15 °C. Soluble His-tagged recombinant PGRP was purified from cell lysates by Ni–NTA affinity chromatography, buffer exchanged into 50 mM Tris–HCl, 500 mM NaCl, 10% v/v glycerol (pH 8.0) and stored in aliquots at − 20 °C until use.
Binding of recombinant PGRP to bacteria
A range of model facultatively anaerobic bacteria (E. coli and Bacillus subtilis; Blades Biological Ltd, UK) and strict rumen anaerobes (Pseudobutyrivibrio ruminis, Butyrivibrio spp., S. lutetiensis and Propionibacterium australiense) were used to assess the binding activity of the recombinant C. daubneyi PGRP. The ruminal bacteria were isolated via the dilution-to-extinction method [135] in an anaerobic cabinet (Don Whitley Scientific, UK) using ruminal fluid from lambs fed a 50:50 grass silage-to-concentrate diet. Bacterial cultures (200 µl) in the mid-log phase (OD600 ~ 0.6) were mixed with recombinant PGRP (0–2 µg) and incubated for different time points at room temperature (18–21 °C). The cells were harvested by centrifugation at 5000 rpm for 10 min and washed three times in PBS. The final cell pellets were resuspended in LDS loading buffer and run on reducing 4–12% NuPAGE Bis–Tris gels (Life Technologies) and transferred to nitrocellulose membranes (GE Healthcare). Blots were probed with 1.0 µg/ml of an affinity-purified peptide antibody raised in rabbits against an antigenic peptide (CEQWGAQAPRTAHTK) of C. daubneyi PGRP (Genscript, NJ, USA) overnight at 4 °C. After washing, an appropriate alkaline phosphatase-conjugated IgG secondary antibody was applied to the membranes for 1 h at room temperature before detection using the BCIP/NBT substrate (Sigma-Aldrich).
Peptidoglycan binding assay
Microtiter plates (96 well; Nunc) were coated with peptidoglycan from Staphylococcus aureus (10 µg/well; InvivoGen, USA) overnight at 4 °C. After blocking excess binding sites with 1% BSA (w/v) in PBS, recombinant C. daubneyi PGRP (0–5 µg/well) were added to the plate which was then incubated for 15 min at room temperature (18–21 °C). Binding of PGRP to peptidoglycan was detected by the addition of affinity-purified rabbit anti-CdPGRP (2 µg/ml in PBS) for 1 h at 37 °C, followed by alkaline phosphatase-conjugated goat anti-rabbit IgG (Sigma; 1:10,000 dilution PBS). Binding of secondary antibody was visualised by the addition of p-nitrophenol phosphate (Sigma; 80 µl/well) and measuring the absorbance at 405 nm. Binding specificity was demonstrated by adding recombinant PGRP to peptidoglycan-coated/uncoated plates or by adding trypsinised PGRP or a His-tagged recombinant F. hepatica thioredoxin glutathione reductase (TGR; R. Morphew) to the plates. TGR was detected by the addition of rabbit anti-FhTGR antibody (1:2000 dilution) as described above.
Minimum inhibitory concentration
Minimum inhibitory concentration (MIC) assays for antimicrobials were performed according to the standard broth microdilution method [136], with minor modifications as described below. The assays utilised 96-well microtiter plates (Nunclon Delta-Treated, Flat-Bottom Microplate, ThermoFisher, UK) and were conducted in a Whitley A35 Workstation anaerobic chamber to ensure strict anaerobic conditions. S. lutetiensis, isolated from ovine ruminal fluid, was cultured without agitation overnight in Hungate tubes using a basal medium supplemented with 4 g/L glucose [137] at 39 °C. The bacterial inoculum was adjusted to approximately 1 × 105 CFU/mL. Gentamicin sulphate (potency: ≥ 590 IU, Sigma-Alrich, UK) was prepared according to the manufacturer’s instructions and used as positive control. MIC values were assessed using 12 two-fold serial dilutions, starting at 0.025 mg/ml for gentamicin. The recombinant C. daubneyi PGRP was tested with and without 5 µM ZnSO4 (Sigma-Aldrich, UK) with a maximum concentration of 0.94 mg/ml. Each MIC experiment was conducted with two biological replicates, each comprising three technical replicates. The 96-well plates were incubated at 39 °C for 24 h. Post-incubation, bacterial viability was determined by the addition of 30 µL of a 0.1% (w/v) sterile resazurin solution (Sigma-Aldrich, UK) to each well, followed by an additional 30-min incubation under the same conditions [138]. The MIC was defined as the lowest concentration of antimicrobial at which the resazurin remained purple, indicating it was not reduced, thus signifying the inhibition of bacterial growth.
Bacterial agglutination assay
E. coli and B. subtilis cultures in the mid-log phase (OD600 ~ 0.6) were collected by centrifugation (2500 × g for 8 min) and stained for 2 min in acridine orange (1% w/v; Sigma-Alrich, UK). After three washes with PBS, the cells were incubated in a 96-well plate with recombinant C. daubneyi PGRP (200 µg/ml) with and without 5 µM ZnSO4 for 1 h at room temperature. The cells, in biological triplicate, were observed using an EVOS Floid imaging system (ThermoFisher, UK).
Isolation and extraction of fluke eggshells
Whole C. daubneyi and F. hepatica eggs were collected following in vitro culture of adult flukes in RPMI-1640 medium containing 0.1% w/v glucose, 100 U penicillin and 100 μg/ml streptomycin (Sigma-Aldrich), at 1 worm/ml for 5 h at 37 °C. Eggs were washed three times with PBS and processed to produce pure eggshell fragments using the method described by Waite and Rice-Ficht [87]. Briefly, approximately 250 µl of C. daubneyi or F. hepatica eggs (packed volume) were homogenised in 5% acetic acid with 8 M urea and incubated with gentle rotation for 6 h at room temperature. The sample was spun at 2000 × g for 5 min and the pellet was resuspended in 5% v/v acetic with 2 M urea and 0.01% w/v pepsin (Sigma-Alrich, UK) and digested overnight. Insoluble residue was collected by centrifugation as above and washed twice with 0.1 M borate buffer (pH 8.0) containing 2 M urea. Trypsin (0.01% w/v) was added and digestion was carried out overnight at room temperature. The final insoluble pellet, containing eggshell fragments, was collected by centrifugation and washed four times with deionised water.
Mass spectrometry analysis of fluke eggshells
Purified eggshell fragments from C. daubneyi and F. hepatica (in biological triplicate) were solubilised in NuPage LDS loading buffer with 50 mM dithiothreitol as reducing agent (Life Technologies, UK) at 95 °C for 5 min and run on 4–12% NuPAGE Bis–Tris gels (Life Technologies). Gels were lightly stained with coomassie blue, destained with 45% v/v methanol and 10% v/v acetic acid and the dominant ~ 29 kDa band was excised for mass spectrometry analysis. The gel pieces were digested with 100 ng/μl sequencing grade trypsin (Promega, UK) overnight at 37 °C. Tryptic peptides were dried in a vacuum centrifuge and reconstituted with 10 µL of 0.1% TFA before analysis by liquid chromatography-tandem mass spectrometry (LC–MS/MS) using a LTQ OrbiTrap Velos Pro (Thermo Scientific) as previously described [23].
All MS/MS samples were analysed using MASCOT (Matrix Science, London, UK; version 2.6.2). MASCOT was set up to search the C. daubneyi predicted proteome database (version 1.0, 13,292 entries) assuming the digestion enzyme strict trypsin. MASCOT was searched with a fragment ion mass tolerance of 0.060 Da and a parent ion tolerance of 10.0 PPM. Carbamidomethyl of cysteine was specified as a fixed modification whilst Gln- > pyro-Glu of the N-terminus, deamidation of asparagine and glutamine, oxidation of methionine, dioxidation of methionine and acetyl of the n-terminus were specified as variable modifications. Scaffold (version Scaffold_4.4.5, Proteome Software Inc., Portland, OR, USA) was used to validate MS/MS-based peptide and protein identifications. Peptide identifications were accepted if they could be established at greater than 95.0% probability by the Scaffold Local FDR algorithm. Protein identifications were accepted if they could be established at greater than 99.0% probability and contained at least 2 identified peptides. Protein probabilities were assigned by the Protein Prophet algorithm [139]. Proteins that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony. Proteins sharing significant peptide evidence were grouped into clusters. Additionally, a label-free quantitative analysis (based on the exponentially modified protein abundance index; emPAI) performed in Scaffold was used as a measure of protein abundance. Only proteins with at least two unique peptides, that were present in all three biological replicates, were retained.
Amino acid analysis of fluke eggshells
Whole eggs and eggshell fragments were hydrolysed under vacuum in sealed ampules with 100 µl of 6 M HCl and 5 µl of phenol for 24 h at 110 °C. The resulting amino acid constituents were analysed with a Hitachi L-8900 Amino Acid Analyzer calibrated with an amino acid standard solution (Sigma-Aldrich) and 3,4-dihydroxyphenylalanine (DOPA; Sigma-Aldrich). Each sample was run in triplicate.
Egg stability assays
Approximately 200 C. daubneyi and F. hepatica eggs were suspended in increasing sodium hypochlorite concentrations (0–1%) and incubated, in biological triplicate, for 1 h at room temperature (18–21 °C). Eggs were similarly incubated in buffers with varying pH: 0.1 M sodium acetate (pH 4.0), PBS (pH 7.0) or 0.1 M sodium borate (pH 9.0) at room temperature and observed at 0-, 1- and 4-h incubations according to Afifi and Ali [51].
Histochemistry and electron microscopy
For histological examination, adult C. daubneyi were immediately fixed in 10% w/v neutral buffered formalin for 24 h after their removal from the rumen according to Hanna et al. [140]. Flukes were then dehydrated in ethanol, cleared in Clearene (Surgipath, UK), and embedded in wax following conventional procedures. Sections (5 µm) were cut, stained with haematoxylin and eosin or Mallory’s Trichrome (Sigma-Aldrich, UK) and viewed with a Leica DM LN2 microscope with a Nikon Coolpix 5000 camera system. Whole mounted flukes were stained in 0.1% w/v catechol (Sigma-Aldrich, UK) as previously described [49].
For transmission electron microscopy (TEM), adult C. daubneyi were fixed in 4% w/v glutaraldehyde in 0.1 M sodium cacodylate buffer (pH 7.4) containing 3% w/v sucrose for 1 h at room temperature. After fixation, the material was washed in 0.1 M sodium cacodylate buffer (pH 7.4) containing 3% w/v sucrose overnight. This was followed by post-fixation for 1 h in 1% v/v aqueous osmium tetroxide (Agar scientific, UK), dehydration through an ethanol series and embedding in Agar 100 epoxy resin. Ultrathin sections were mounted on uncoated copper grids, double-stained with 1% w/v uranyl acetate and 1% w/v lead citrate, and viewed using a JEOL JEM-1400 transmission electron microscope with an AMT Activue XR16 digital camera system, operating at an accelerating voltage of 80 kV.
Data availability
The datasets supporting the conclusions of this article are available from the European Nucleotide Archive under accessions PRJEB76148 [141] (genomic read data) and PRJEB28150 [23, 142] (transcriptomic read data). The MS proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD053790 [143]. Unprocessed blots are provided in Additional File 11.
Charlier J, Rinaldi L, Musella V, Ploeger HW, Chartier C, Vineer HR, et al. Initial assessment of the economic burden of major parasitic helminth infections to the ruminant livestock industry in Europe. Prev Vet Med. 2020;182: 105103.
Selzer PM, Epe C. Antiparasitics in animal health: quo vadis? Trends Parasitol. 2021;37(1):77–89.
Fenemore C, Floyd T, Mitchell S. Rumen fluke in Great Britain. J Comp Pathol. 2021;184:31–6.
Huson KM, Oliver NA, Robinson MW. Paramphistomosis of ruminants: an emerging parasitic disease in Europe. Trends Parasitol. 2017;33(11):836–44.
Devos J, Vassiloglou B, Amenna-Bernard N, Marcotty T. Paramphistomosis in sheep; natural infection of lambs by Calicophoron daubneyi. Rev Med Vet. 2013;164(11):528–35.
O’Shaughnessy J, Garcia-Campos A, McAloon CG, Fagan S, de Waal T, McElroy M, et al. Epidemiological investigation of a severe rumen fluke outbreak on an Irish dairy farm. Parasitol. 2018;145(7):948–52.
Horak IG. Paramphistomiasis of domestic ruminants. Adv Parasitol. 1971;9:33–72.
Millar M, Colloff A, Scholes S. Disease associated with immature paramphistome infection. Vet Rec. 2012;171(20):509–10.
Atcheson E, Skuce PJ, Oliver NA, McNeilly TN, Robinson MW. Calicophoron daubneyi—the path toward understanding its pathogenicity and host interactions. Front Vet Sci. 2020;7:606.
Sanabria RE, Romero JR. Review and update of paramphistomosis. Helminthologia. 2008;45:64–8.
Anuracpreeda P, Wanichanon C, Sobhon P. Paramphistomum cervi: antigenic profile of adults as recognized by infected cattle sera. Exp Parasitol. 2008;118(2):203–7.
Pfukenyi DM, Mukaratirwa S. Amphistome infections in domestic and wild ruminants in East and Southern Africa: a review. Onderstepoort J Vet Res. 2018;85(1):1–3.
Gordon DK, Roberts LC, Lean N, Zadoks RN, Sargison ND, Skuce PJ. Identification of the rumen fluke, Calicophoron daubneyi, in GB livestock: possible implications for liver fluke diagnosis. Vet Parasitol. 2013;195(1–2):65–71.
Wiedermann S, Harl J, Fuehrer HP, Mayr S, Schmid J, Hinney B, et al. DNA barcoding of rumen flukes (Paramphistomidae) from bovines in Germany and Austria. Parasitol Res. 2021;120:4061–6.
Červená B, Anettová L, Nosková E, Pafčo B, Pšenková I, Javorská K, et al. The winner takes it all: dominance of Calicophoron daubneyi (Digenea: Paramphistomidae) among flukes in Central European beef cattle. Parasitol. 2022;149(5):612–21.
Atcheson E, Lagan B, McCormick R, Edgar H, Hanna RE, Rutherford NH, et al. The effect of naturally acquired rumen fluke infection on animal health and production in dairy and beef cattle in the UK. Front Vet Sci. 2022;9: 968753.
Huws SA, Creevey CJ, Oyama LB, Mizrahi I, Denman SE, Popova M, et al. Addressing global ruminant agricultural challenges through understanding the rumen microbiome: past, present, and future. Front Microbiol. 2018;9:2161.
Mizrahi I, Wallace RJ, Moraïs S. The rumen microbiome: balancing food security and environmental impacts. Nat Rev Microbiol. 2021;19(9):553–66.
Allen NR, Taylor-Mew AR, Wilkinson TJ, Huws S, Phillips H, Morphew RM, et al. Modulation of rumen microbes through extracellular vesicle released by the rumen fluke Calicophoron daubneyi. Front Cell Infect Microbiol. 2021;11: 661830.
Mitchell G, Zadoks RN, Skuce PJ. A universal approach to molecular identification of rumen fluke species across hosts, continents, and sample types. Front Vet Sci. 2021;7: 605259.
Choudhary V, Garg S, Chourasia R, Hasnani JJ, Patel PV, Shah TM, et al. Transcriptome analysis of the adult rumen fluke Paramphistomum cervi following next generation sequencing. Gene. 2015;570(1):64–70.
Huson KM, Morphew RM, Allen NR, Hegarty MJ, Worgan HJ, Girdwood SE, et al. Polyomic tools for an emerging livestock parasite, the rumen fluke Calicophoron daubneyi; identifying shifts in rumen functionality. Parasit Vectors. 2018;11:1–22.
Huson KM, Atcheson E, Oliver NA, Best P, Barley JP, Hanna RE, et al. Transcriptome and secretome analysis of intra-mammalian life-stages of Calicophoron daubneyi reveals adaptation to a unique host environment. Mol Cell Proteomics. 2021;20:100055.
Kyriazakis I, Arndt C, Aubry A, Charlier J, Ezenwa VO, Godber OF, et al. Improve animal health to reduce livestock emissions: quantifying an open goal. Proc Biol Sci. 2027;2024(291):20240675.
Wang X, Chen W, Huang Y, Sun J, Men J, Liu H, et al. The draft genome of the carcinogenic human liver fluke Clonorchis sinensis. Genome Biol. 2011;12:1–4.
Young ND, Nagarajan N, Lin SJ, Korhonen PK, Jex AR, Hall RS, et al. The Opisthorchis viverrini genome provides insights into life in the bile duct. Nat Commun. 2014;5(1):4378.
Cwiklinski K, Dalton JP, Dufresne PJ, La Course J, Williams DJ, Hodgkinson J, et al. The Fasciola hepatica genome: gene duplication and polymorphism reveals adaptation to the host environment and the capacity for rapid evolution. Genome Biol. 2015;16:1–3.
Oey H, Zakrzewski M, Narain K, Devi KR, Agatsuma T, Nawaratna S, et al. Whole-genome sequence of the oriental lung fluke Paragonimus westermani. Gigascience. 2019;8(1):giy146.
McNulty SN, Tort JF, Rinaldi G, Fischer K, Rosa BA, Smircich P, et al. Genomes of Fasciola hepatica from the Americas reveal colonization with Neorickettsia endobacteria related to the agents of Potomac horse and human Sennetsu fevers. PLoS Genet. 2017;13(1): e1006537.
Wormbase Parasite. https://parasite.wormbase.org/species.html#Platyhelminthes. Accessed 19 Sep 2024.
Lysenko E, Richards JC, Cox AD, Stewart A, Martin A, Kapoor M, et al. The position of phosphorylcholine on the lipopolysaccharide of Haemophilus influenzae affects binding and sensitivity to C-reactive protein-mediated killing. Mol Microbiol. 2000;35(1):234–45.
Brandt C, McFie PJ, Stone SJ. Diacylglycerol acyltransferase-2 and monoacylglycerol acyltransferase-2 are ubiquitinated proteins that are degraded by the 26S proteasome. Biochem J. 2016;473(20):3621–37.
Yang Q, Wang LX. Mammalian α-1, 6-fucosyltransferase (FUT8) is the sole enzyme responsible for the N-acetylglucosaminyltransferase I-independent core fucosylation of high-mannose N-glycans. J Biol Chem. 2016;291(21):11064–71.
McCammick EM, McVeigh P, McCusker P, Timson DJ, Morphew RM, Brophy PM, et al. Calmodulin disruption impacts growth and motility in juvenile liver fluke. Parasit Vectors. 2016;9:1–3.
Jörnvall H, Persson B, Krook M, Atrian S, Gonzalez-Duarte R, Jeffery J, et al. Short-chain dehydrogenases/reductases (SDR). Biochem. 1995;34(18):6003–13.
Kiger L, Rashid AK, Griffon N, Haque M, Moens L, Gibson QH, et al. Trematode hemoglobins show exceptionally high oxygen affinity. Biophys J. 1998;75(2):990–8.
Rigden DJ. The histidine phosphatase superfamily: structure and function. Biochem J. 2008;409(2):333–48.
Kasamatsu A, Nakao M, Smith BC, Comstock LR, Ono T, Kato J, et al. Hydrolysis of O-acetyl-ADP-ribose isomers by ADP-ribosylhydrolase 3. J Biol Chem. 2011;286(24):21110–7.
Wolf AJ, Underhill DM. Peptidoglycan recognition by the innate immune system. Nat Rev Immunol. 2018;18(4):243–54.
Himmelfarb M, Klopocki E, Grube S, Staub E, Klaman I, Hinzmann B, et al. ITIH5, a novel member of the inter-α-trypsin inhibitor heavy chain family is downregulated in breast cancer. Cancer Lett. 2004;204(1):69–77.
Tempel W, Rabeh WM, Bogan KL, Belenky P, Wojcik M, Seidle HF, et al. Nicotinamide riboside kinase structures reveal new pathways to NAD+. PLoS Biol. 2007;5(10): e263.
Jiang S, Wang L, Huang M, Jia Z, Weinert T, Warkentin E, et al. DM9 domain containing protein functions as a pattern recognition receptor with broad microbial recognition spectrum. Front Immunol. 2017;8:1607.
Zhai Y, Saier MH Jr. The amoebapore superfamily. Biochim Biophys Acta Rev Biomembr. 2000;1469(2):87–99.
Hanna REB, Fairweather I, Robinson MW. The reproductive system of Fasciola hepatica. In: Dalton JP, editor. Fasciolosis. 2nd ed. Wallingford, Oxfordshire: CAB International; 2022:112–44.
Smyth JD. The Physiology of trematodes. Edinburgh and London: Oliver and Boyd; 1966.
Miserez A, Rubin D, Waite JH. Cross-linking chemistry of squid beak. J Biol Chem. 2010;285(49):38115–24.
Harrington MJ, Masic A, Holten-Andersen N, Waite JH, Fratzl P. Iron-clad fibers: a metal-based biological strategy for hard flexible coatings. Science. 2010;328(5975):216–20.
Rubin DJ, Miserez A, Waite JH. Diverse strategies of protein sclerotization in marine invertebrates: structure–property relationships in natural biomaterials. Adv Insect Physiol. 2010;38:75–133.
Robinson MW, Colhoun LM, Fairweather I, Brennan GP, Waite JH. Development of the vitellaria of the liver fluke, Fasciola hepatica in the rat host. Parasitol. 2001;123(5):509–18.
Taha HA, El-Shaikh KA, Al-Sadi MM. Effect of sodium hypochlorite on Fasciola gigantica eggs and the intermediate host, Lymnaea natalensis: a scanning electron microscopy study. J Taibah Univ Med Sci. 2014;8(2):75–83.
Afifi R, Ali M. The effect of a bleaching solution on Schistosoma mansoni eggs: a scanning ultrastructural study. Egypt Acad J Biol Sci E Med Entomol Parasitol. 2017;9(1):21–32.
Sivaprakasam S, Bhutia YD, Yang S, Ganapathy V. Short-chain fatty acid transporters: role in colonic homeostasis. Compr Physiol. 2017;8(1):299.
Mason C, Stevenson H, Cox A, Dick I. Disease associated with immature paramphistome infection in sheep. Vet Rec. 2012;170(13):343.
Sargison N, Francis E, Davison C, Barend M, Handel I, Mazeri S. Observations on the biology, epidemiology and economic relevance of rumen flukes (Paramphistomidae) in cattle kept in a temperate environment. Vet Parasitol. 2016;219:7–16.
Coghlan A, Tyagi R, Cotton JA, Holroyd N, Rosa BA, Tsai IJ, et al. Comparative genomics of the major parasitic worms. Nat Genet. 2019;51:163–74.
Yoshida H, Kinoshita K, Ashida M. Purification of a peptidoglycan recognition protein from hemolymph of the silkworm, Bombyx mori. J Biol Chem. 1996;271(23):13854–60.
Montaño AM, Tsujino F, Takahata N, Satta Y. Evolutionary origin of peptidoglycan recognition proteins in vertebrate innate immune system. BMC Evol Biol. 2011;11:1.
Kordaczuk J, Sułek M, Wojda I. General overview on the role of peptidoglycan recognition proteins in insect immunity. Acta Biochim Pol. 2020;67(3):319–26.
Wang Q, Ren M, Liu X, Xia H, Chen K. Peptidoglycan recognition proteins in insect immunity. Mol Immunol. 2019;106:69–76.
Orlans J, Vincent-Monegat C, Rahioui I, Sivignon C, Butryn A, Soulère L, et al. PGRP-LB: an inside view into the mechanism of the amidase reaction. Int J Mol Sci. 2021;22(9): 4957.
Wang M, Liu LH, Wang S, Li X, Lu X, Gupta D, et al. Human peptidoglycan recognition proteins require zinc to kill both gram-positive and gram-negative bacteria and are synergistic with antibacterial peptides. J Immunol. 2007;178(5):3116–25.
Bruhn H. A short guided tour through functional and structural features of saposin-like proteins. Biochem J. 2005;389(2):249–57.
Cabán-Hernández K, Espino AM. Differential expression and localization of saposin-like protein 2 of Fasciola hepatica. Acta Trop. 2013;128(3):591–7.
Lee JY, Cho PY, Kim TY, Kang SY, Song KY, Hong SJ. Hemolytic activity and developmental expression of pore-forming peptide, clonorin. Biochem Biophys Res Commun. 2002;296(5):1238–44.
Lowther J, Robinson MW, Donnelly SM, Xu W, Stack CM, Matthews JM, et al. The importance of pH in regulating the function of the Fasciola hepatica cathepsin L1 cysteine protease. PLoS Negl Trop Dis. 2009;3(1): e369.
Kim WH, Lillehoj HS, Gay CG. Using genomics to identify novel antimicrobials. Rev Sci Tech Off Int Epizoot. 2016;35(1):95–103.
Rooney J, Northcote HM, Williams TL, Cortés A, Cantacessi C, Morphew RM. Parasitic helminths and the host microbiome–a missing ‘extracellular vesicle-sized’ link? Trends Parasitol. 2022;38(9):737–47.
Chertemps T, Mitri C, Perrot S, Sautereau J, Jacques JC, Thiery I, et al. Anopheles gambiae PRS1 modulates Plasmodium development at both midgut and salivary gland steps. PLoS ONE. 2010;5(7): e11538.
Jia Z, Jiang S, Wang M, Wang X, Liu Y, Lv Z, et al. Identification of a novel pattern recognition receptor DM9 domain containing protein 4 as a marker for pro-hemocyte of Pacific oyster Crassostrea gigas. Front Immunol. 2021;11: 603270.
Liu Y, Wang W, Zhao Q, Yuan P, Li J, Song X, et al. A DM9-containing protein from oyster Crassostrea gigas (CgDM9CP-3) mediating immune recognition and encapsulation. Dev Comp Immunol. 2021;116: 103937.
Li Y, Yang W, Sun J, Lian X, Li X, Zhao X, et al. A DM9-containing protein from crab Eriocheir sinensis functions as a novel multipotent pattern recognition receptor. Fish Shellfish Immunol. 2024;145: 109356.
Hatakeyama T, Kojima F, Ohkawachi I, Sawai H, Unno H. Carbohydrate-binding ability of a recombinant protein containing the DM9 motif from Drosophila melanogaster. J Biochem. 2024;175(6):659–69.
Kifle DW, Pearson MS, Becker L, Pickering D, Loukas A, Sotillo J. Proteomic analysis of two populations of Schistosoma mansoni-derived extracellular vesicles: 15k pellet and 120k pellet vesicles. Mol Biochem Parasitol. 2020;236: 111264.
de la Torre-Escudero E, Gerlach JQ, Bennett AP, Cwiklinski K, Jewhurst HL, Huson KM, et al. Surface molecules of extracellular vesicles secreted by the helminth pathogen Fasciola hepatica direct their internalisation by host cells. PLoS Negl Trop Dis. 2019;13(1): e0007087.
Phadungsil W, Grams R. Agglutination activity of Fasciola gigantica DM9-1, a mannose-binding lectin. Korean J Parasitol. 2021;59(2): 173.
Akonyani ZP, Song F, Li Y, Qiqige S, Wu J. Comparative analysis of the microbiota between rumen and duodenum of twin lambs based on diets of ceratoides or Alfalfa. Pol J Microbio. 2021;70(2):175–87.
Oliver N. The development of the rumen fluke, Calicophoron daubneyi, within the mammalian host. Doctoral dissertation. Belfast: Queen's University Belfast; 2021.
Dunn TS, Hanna RE, Nizami WA. Ultrastructural and histochemical observations on the foregut and gut caeca of Gigantocotyle explanatum, Gastrothylax crumenifer and Srivastavaia indica (Trematoda: Paramphistomidae). Int J Parasitol. 1987;17(6):1141–52.
Panyarachun B, Ngamniyom A, Sobhon P, Anuracpreeda P. Morphology and histology of the adult Paramphistomum gracile Fischoeder, 1901. J Vet Sci. 2013;14(4): 425.
Fuertes M, Pérez V, Benavides J, González-Lanza MC, Mezo M, González-Warleta M, et al. Pathological changes in cattle naturally infected by Calicophoron daubneyi adult flukes. Vet Parasitol. 2015;209(3–4):188–96.
Willmott SM, Pester FR. On the transference of adult Paramphistomum hiberniae Willmott, 1950 from one definitive host to another of the same species. J Helminthol. 1951;25(1–2):93–6.
Robinson MW, Menon R, Donnelly SM, Dalton JP, Ranganathan S. An integrated transcriptomics and proteomics analysis of the secretome of the helminth pathogen Fasciola hepatica: proteins associated with invasion and infection of the mammalian host. Mol Cell Proteomics. 2009;8(8):1891–907.
Morphew RM, Wright HA, LaCourse EJ, Porter J, Barrett J, Woods DJ, et al. Towards delineating functions within the Fasciola secreted cathepsin l protease family by integrating in vivo based sub-proteomics and phylogenetics. PLoS Negl Trop Dis. 2011;5(1): e937.
Jovanović M, Čuperlović M. Nutritive value of rumen contents for monogastric animals. Anim Feed Sci Technol. 1977;2(4):351–60.
McDonald P, Edwards RA, Greenhalgh JFD, Morgan CA, Sinclair LA, Wilkinson TG. Animal Nutrition. 7th ed. London: Pearson; 2011.
Happich FA, Boray JC. Quantitative diagnosis of chronic fasciolosis: 1. Comparative studies on quantitative faecal examinations for chronic Fasciola hepatica infection in sheep. Aust Vet J. 1969;45(7):326–8.
Waite JH, Rice-Ficht AC. Presclerotized eggshell protein from the liver fluke Fasciola hepatica. Biochem. 1987;26(24):7819–25.
Fitzpatrick JM, Hirai Y, Hirai H, Hoffmann KF. Schistosome egg production is dependent upon the activities of two developmentally regulated tyrosinases. FASEB J. 2007;21(3):823–35.
Garcia M, Bradford BJ, Nagaraja TG. Invited review: ruminal microbes, microbial products, and systemic inflammation. Prof Anim Sci. 2017;33(6):635–50.
Rice-Ficht AC, Dusek KA, Kochevar GJ, Waite JH. Eggshell precursor proteins of Fasciola hepatica, I. Structure and expression of vitelline protein B. Mol Biochem Parasitol. 1992;54(2):129–41.
Gupta BC, Parshad VR, Guraya SS. Histochemical studies on eggshell formation in Paramphistomum cervi (Digenea: Paramphistomatidae). J Helminthol. 1987;61(1):59–64.
Kodali VK, Gannon SA, Paramasivam S, Raje S, Polenova T, Thorpe C. A novel disulfide-rich protein motif from avian eggshell membranes. PLoS ONE. 2011;6(3): e18187.
Held MA, Tan L, Kamyab A, Hare M, Shpak E, Kieliszewski MJ. Di-isodityrosine is the intermolecular cross-link of isodityrosine-rich extensin analogs cross-linked in vitro. J Biol Chem. 2004;279(53):55474–82.
Mishler-Elmore JW, Zhou Y, Sukul A, Oblak M, Tan L, Faik A, et al. Extensins: self-assembly, crosslinking, and the role of peroxidases. Front Plant Sci. 2021;12: 664738.
Jeong W, Yoon HW, Lee SR, Rhee SG. Identification and characterization of TRP14, a thioredoxin-related protein of 14 kDa: new insights into the specificity of thioredoxin function. J Biol Chem. 2004;279(5):3142–50.
Jones MK, McManus DP, Sivadorai P, Glanfield A, Moertel L, Belli SI, et al. Tracking the fate of iron in early development of human blood flukes. Int J Biochem Cell Biol. 2007;39(9):1646–58.
Rutherford NH, Barley J, Robinson MW, Gordon AW, Yan T, Aubry A. Effect of rumen fluke on feed digestibility and methane emissions in sheep. Parasitol Res. 2022;121(11):3347–50.
Corrêa PS, Mendes LW, Lemos LN, Sampaio AC, Issakowicz J, McManus CM, et al. The effect of Haemonchus contortus and Trichostrongylus colubriforms infection on the ruminal microbiome of lambs. Exp Parasitol. 2021;231: 108175.
Martin I, Cabán-Hernández K, Figueroa-Santiago O, Espino AM. Fasciola hepatica fatty acid binding protein inhibits TLR4 activation and suppresses the inflammatory cytokines induced by lipopolysaccharide in vitro and in vivo. J Immunol. 2015;194(8):3924–36.
Ramos-Benítez MJ, Ruiz-Jiménez C, Aguayo V, Espino AM. Recombinant Fasciola hepatica fatty acid binding protein suppresses toll-like receptor stimulation in response to multiple bacterial ligands. Sci Rep. 2017;7(1):5455.
Robinson MW, Donnelly S, Hutchinson AT, To J, Taylor NL, Norton RS, et al. A family of helminth molecules that modulate innate cell responses via molecular mimicry of host antimicrobial peptides. PLoS Pathog. 2011;7(5): e1002042.
Robinson MW, Alvarado R, To J, Hutchinson AT, Dowdell SN, Lund M, et al. A helminth cathelicidin-like protein suppresses antigen processing and presentation in macrophages via inhibition of lysosomal vATPase. FASEB J. 2012;26(11):4614–27.
Tandon V, Roy B, Shylla JA, Ghatani S. Amphistomes. Adv Exp Med Biol. 2019;1154:255–77.
SMRTlink 10. https://www.pacb.com/support/software-downloads/. Accessed 1 June 2021.
Cheng H, Concepcion GT, Feng X, Zhang H, Li H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021;18(2):170–5.
Blobtools2. https://github.com/blobtoolkit/blobtools2. Accessed 1 June 2021.
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–2.
Eukaryota Odb10. https://busco.ezlab.org/frames/euka.htm. Accessed 1 June 2021.
Girgis HZ. Red: an intelligent, rapid, accurate tool for detecting repeats de-novo on the genomic scale. BMC Bioinformatics. 2015;16:1–9.
Campbell MS, Law M, Holt C, Stein JC, Moghe GD, Hufnagel DE, et al. MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations. Plant Physiol. 2014;164(2):513–24.
RepeatMasker. http://www.repeatmasker.org. Accessed 1 June 2021.
Stanke M, Keller O, Gunduz I, Hayes A, Waack S, Morgenstern B. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 2006;34(suppl_2):W435-9.
Korf I. Gene finding in novel genomes. BMC Bioinformatics. 2004;5:1–9.
Lomsadze A, Ter-Hovhannisyan V, Chernoff YO, Borodovsky M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 2005;33(20):6494–506.
Majoros WH, Pertea M, Salzberg SL. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics. 2004;20(16):2878–9.
Haas BJ, Delcher AL, Mount SM, Wortman JR, Smith RK Jr, Hannick LI, et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 2003;31(19):5654–66.
Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12(1):59–60.
Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12(4):357–60.
Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, et al. De novo transcript sequence reconstruction from RNA-seq using the trinity platform for reference generation and analysis. Nat Protoc. 2013;8(8):1494–512.
Pertea M, Pertea GM, Antonescu CM, Chang TC, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33(3):290–5.
Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, Von Mering C, et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol Biol Evol. 2017;34(8):2115–22.
Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20:1–4.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.
Talavera G, Castresana J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. 2007;56(4):564–77.
Darriba D, Posada D, Kozlov AM, Stamatakis A, Morel B, Flouri T. ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models. Mol Biol Evol. 2020;37(1):291–4.
Kozlov AM, Darriba D, Flouri T, Morel B, Stamatakis A. RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics. 2019;35(21):4453–5.
Mendes FK, Vanderpool D, Fulton B, Hahn MW. CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics. 2020;36(22–23):5516–8.
Mallick H, Rahnavard A, McIver LJ, Ma S, Zhang Y, Nguyen LH, et al. Multivariable association discovery in population-scale meta-omics studies. PLoS Comput Biol. 2021;17(11): e1009442.
Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47(D1):D309–14.
Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucleic Acids Symp Ser. 1999;41(41):95–8.
Babicki S, Arndt D, Marcu A, Liang Y, Grant JR, Maciejewski A, et al. Heatmapper: web-enabled heat mapping for all. Nucleic Acids Res. 2016;44(W1):W147–53.
Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol. 2018;35(6):1547–9.
Felsenstein J. Phylogenies and the comparative method. Am Nat. 1985;125(1):1–5.
Zuckerkandl E, Pauling L. Molecules as documents of evolutionary history. J Theor Biol. 1965;8(2):357–66.
Kenters N, Henderson G, Jeyanathan J, Kittelmann S, Janssen PH. Isolation of previously uncultured rumen bacteria by dilution to extinction using a new liquid culture medium. J Microbiol Methods. 2011;84(1):52–60.
Clinical and Laboratory Standards Institute. Performance standards for antimicrobial susceptibility testing. 33rd ed. In: CLSI document M100. Wayne: Clinical and Laboratory Standards Institute; 2023.
Mantovani HC, Kam DK, Ha JK, Russell JB. The antibacterial activity and sensitivity of Streptococcus bovis strains isolated from the rumen of cattle. FEMS Microbiol Ecol. 2001;37(3):223–9.
Ricciardi A, Ianniello RG, Tramutola A, Parente E, Zotta T. Rapid detection assay for oxygen consumption in the Lactobacillus casei group. Ann Microbiol. 2014;64:1861–4.
Keller A, Nesvizhskii AI, Kolker E, Aebersold R. Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal Chem. 2002;74(20):5383–92.
Hanna RE, Gordon AW, Moffett D, Edgar HW, Oliver LF, McConnell S, et al. Fasciola hepatica: comparative effects of host resistance and parasite intra-specific interactions on size and reproductive histology in flukes from rats infected with isolates differing in triclabendazole sensitivity. Vet Parasitol. 2011;178(3–4):251–63.
The Calicophoron daubneyi (rumen fluke) draft genome assembly. European nucleotide archive. 2024. https://identifiers.org/ena.embl:PRJEB76148.
Generating transcriptome resources for the rumen fluke, Calicophoron daubneyi. European Nucleotide Archive https://identifiers.org/ena.embl:PRJEB28150. 2020.
Robinson, MW. Proteomics analysis of trematode eggshells. PRIDE. 2024. https://identifiers.org/pride.project:PXD053790.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Background
The rumen fluke, Calicophoron daubneyi, is the major paramphistome species infecting ruminants within Europe. Adult flukes reside within the rumen where they are in direct contact with a unique collection of microorganisms. Here, we report a 1.76-Gb draft genome for C. daubneyi, the first for any paramphistome species.
Results
Several gene families have undergone specific expansion in C. daubneyi, including the peptidoglycan-recognition proteins (PGRPs) and DM9 domain-containing proteins, which function as pattern-recognition receptors, as well as the saposin-like proteins with putative antibacterial properties, and are upregulated upon arrival of the fluke in the microbe-rich rumen. We describe the first characterisation of a helminth PGRP and show that a recombinant C. daubneyi PGRP binds to the surface of bacteria, including obligate anaerobes from the rumen, via specific interaction with cell wall peptidoglycan. We reveal that C. daubneyi eggshell proteins lack L-DOPA typically required for eggshell crosslinking in trematodes and propose that C. daubneyi employs atypical eggshell crosslinking chemistry that produces eggs with greater stability. Finally, although extracellular digestion of rumen ciliates occurs within the C. daubneyi gut, unique ultrastructural and biochemical adaptations of the gastrodermal cells suggest that adult flukes also acquire nutrients via uptake of volatile fatty acids from rumen fluid.
Conclusions
Our findings suggest that unique selective pressures, associated with inhabiting a host environment so rich in microbial diversity, have driven the evolution of molecular and morphological adaptations that enable C. daubneyi to defend itself against microorganisms, feed and reproduce within the rumen.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer