1. Introduction
Unrivaled in their adaptability and ubiquity, insects serve as a cornerstone of Earth’s biodiversity and play a crucial role in ecosystems as decomposers, pollinators, and food. Beyond their ecological significance, certain insects synthesize silk proteins to construct various marvelous structures, including a cocoon, nest, and egg case [1]. Silkworm (Bombyx mori) has the unique ability to spin silk, a sustainable biomaterial captivating human attention for millennia due to its distinctive physical and chemical properties [2].
Insects are renowned for their silk production, with moths and butterflies being the most common silk-producing insects. Other fascinating orders such as Trichoptera and Hymenoptera including bees, wasps, and ants are also known for their silk-spinning abilities [1,3,4]. Silk is an impressive and versatile biopolymer, primarily composed of proteins interwoven with metal ions, and other inorganic and organic substances. These components vary across species, resulting in silk with distinct characteristics. Silkworm cocoon silk is predominantly made up of sericin and fibroin including FibH, FibL, and P25/fibrohexamerin [5,6]. The oriental moth cocoon is composed of the most abundant asparagine-rich proteins, fibroin, mineral crystals, and organic substances [7]. The caddisfly silk contains FibH, FibL, endoplasmin, serpin, chymotrypsin-like protease, two mucin-like proteins, pseudofibrin, and three additional unannotated proteins without P25 [8].
The FibH is a fundamental component of insect silk, nearly universally present in the silk of Lepidoptera and Trichoptera. The FibH of Bombyx mori (BmorFibH) is encoded by the fibroin heavy chain gene nearly 17 kb in length, with a molecular weight of 350 kDa [5]. Based on sequence characteristics, BmorFibH is divided into N- and C-terminal non-repetitive domains (NTDs and CTDs, respectively), enclosing a repetitive core (RC). The NTD and CTD are involved in the folding, assembly, and secretion of silk proteins, and extremely conserved across silkworm strains [9,10,11,12,13,14]. The RC is composed of many tandem repetitive sequences [13,15,16]. Under the synergistic effect of many factors such as pH, metal ions, and shear force, the RC forms the crystalline region, which confers physical properties to the silk fiber [17,18,19,20]. Despite having the same repetitive motifs, such as hexapeptide GAGAGS and GAGAGY, the number and distribution of these motifs vary, leading to significant differences in the FibH sequences across silkworm strains [13,16].
The FibH is a pivotal determinant in the mechanical properties of insect silk fibers [21,22]. Deciphering and expanding the repertoire of the FibH gene promise to shed light on the intricate molecular evolution of insect silk [23]. This knowledge is critical for the synthetic replication of silk’s unique properties [24]. Furthermore, understanding the diversity of FibH can shed light on the pathways by which nature fine-tunes these proteins for a specific function. This revelation could spark innovative methodologies in the design of silk-based materials, enabling tailoring the properties of silk fiber to a specific application, such as achieving superior mechanical properties and flexibility. To date, the complete FibH sequences of about 280 mulberry silkworms (Bombyx mori and Bombyx mandarina) and six other insect species have been deciphered, revealing a vast and unexplored landscape of FibH diversity [13,16]. However, the complete FibH sequences of countless insects remain unknown, presenting a formidable challenge to our quest for a holistic grasp of silk’s evolutionary history and its functional diversity across the insect kingdom.
In this study, we undertook a comprehensive analysis of publicly available Lepidoptera and Trichoptera genomic data, focusing on extracting the complete FibH sequence through a homology analysis. We systematically examined the characteristics of all the FibH sequences, including unique amino acid composition, repetitive motifs, and evolutionary relationships. Our work can substantially enrich our grasp of the insect silk diversity. Moreover, it will establish a solid theoretical framework for the innovation and synthesis of novel artificial silk, thereby opening new avenues in the field of biomaterial science.
2. Results
2.1. FibH Gene Copy Number Variation Occurs Only in Skipper (Hesperiidae)
Using publicly available genomic data, we extracted 101 complete FibH gene sequences from Lepidoptera, involving 40 families. Limited by the available assembled genomic data, we only extracted 13 complete FibH gene sequences from four families of Trichoptera. The FibH gene of Lepidoptera is 8040 (AperFibH)-30,526 bp (HcomFibH 2) long. The FibH gene of Trichoptera varied between 16,133 bp (OalbFibH) and 28,991 bp (CdisFibH). All insect FibH genes consist of two exons separated by a single intron. Exon 1 is relatively short, ranging from 39 to 45 bp, while exon 2 is between 7 and 29 kb and varies greatly among different species. The introns of all FibH genes range from 73 to 5159 bp, and their guanine–cytosine (GC) content is around 50% (Figure 1A and Figure S1).
The FibH in Trichoptera and moths is a single-copy gene. Interestingly, there is copy number variation (CNV) in the FibH gene of skippers. Both Thymelicus acteon and Thymelicus sylvestris have three copies of the FibH gene, and they are located on chromosome 20. The lengths of the TactFibH (TactFibH 1, 2, and 3) and TsylFibH (TsylFibH 1, 2, and 3) genes are 21,450 bp, 27,758 bp, 27,295 bp, 15,511 bp, 23,197 bp, and 23,172 bp, respectively. Hesperia comma and Carterocephalus palaemon each have two FibH genes located on chromosomes 22 and 17, respectively. The HcomFibH 1 and 2, as well as CpalFibH 1 and 2, have 14,044, 30,526, 19,690, and 14,355 base pairs, respectively (Figure 1B, Table 1).
2.2. FibH Differs in Length, Amino Acid Composition, and Properties among Insects
The protein sequences of all FibH genes have been deduced. The length and molecular weight of FibH in moths vary significantly, ranging from 2603 (IaveFibH) to 8773 (TacrFibH) amino acids (with an average of 5006), with 216.05948–746.16907 kDa. Butterflies have FibH of equal length to moths, between 3178 and 8754 (average: 5420). Skippers with multiple copies have their FibH distributed between 4284 (HcomFibH 1) and 8754 (TactFibH 2) (Table 1). In contrast, the FibH of Trichoptera is generally longer and has a larger molecular weight than that of the Lepidoptera, with 5344–9027 amino acids (average: 7650), and 531.07887–912.29248 kDa. Ala, Gly, Ser, Pro, Gln, and Asn are the most abundant in Lepidoptera FibH, while Trichoptera FibH is rich in Gly, Ser, and Arg, with only less Ala (Figure 2A–C, Table 1). Based on the complexity of the sequence and domain, the FibH is divided into the conserved NTD, CTD, and highly variable repetitive core (RC). The NTD of BmorFibH contains 151 amino acids, with the first 14 amino acids encoded by exon 1, and the first 18 amino acids being a signal peptide, which is the same as that in skippers (Table 1). Gly 1, Ile 8, and Gly 52 are three conserved sites that are present in all NTDs. In the NTD of Trichoptera FibH, the sixth and seventh amino acids are LL (LeuLeu). The CTD of BmorFibH contains 58 amino acids, and the 20th residue (Cys-c20) from the C-terminus forms a disulfide bond with FibL, which is also conserved and present in the CTD of non-Saturniidae FibH (Figure S2).
To investigate the effects of amino acid composition on the properties of FibH, the isoelectric points (pI) and grand average of hydropathicity (GRAVY) of all FibHs were analyzed. The pI of moth FibH is distributed between 3.2 and 12.3, with 85.9155% being less than 7; the GRAVY scores range from −1.415 to 0.557, with hydrophilic FibH constituting 57.7465%. For butterfly FibH, the pI is distributed between 3.8 and 9.19, with 76.6667% of the pI values being less than 7; the GRAVY scores are between −1.256 and 0.183, with hydrophilic FibH making up 86.6667%. Compared to moths and butterflies, the pI and GRAVY of Trichoptera (caddisfly) FibH show more pronounced differences. Except for OalbFibH, which has a pI less than 7, the pI of other Trichoptera FibH ranges from 10.33 to 11.99. All Trichoptera FibH are hydrophilic proteins with an average hydrophobicity ranging from −0.827 to −0.048, which is likely closely related to their living environments (Figure 2D). In skippers, different copies of FibH exhibit varying properties. The pI values for both HcomFibH 1 and HcomFibH 2 are above 8, with GRAVY scores of −0.288 and −1.029, respectively. Conversely, the pI of CpalFibH 1 and CpalFibH 2 are both less than 7, and their GRAVY scores are −0.977 and −0.395, respectively. Interestingly, the pI values for TactFibH 1 and TsyFibH 1 are 4.89 and 5.29, respectively, with GRAVY scores of −0.323 and −0.104. The pI values for the other copies are all around 8, and both TactFibH 3 and TsyFibH 3 are highly hydrophilic (Table 1). The different chemical properties of the various FibH copies suggest that the mechanical properties of this skipper silk may also differ.
2.3. Insect FibH Has Developed Unique Motifs during Evolution
The fibroin heavy chain is divided into the NTD, CTD, and repetitive region (RC), which constitutes approximately 85% of FibH and is composed of numerous motifs. FibH is the core protein of insect silk, and its evolutionary direction is consistent with that of insect silk fibers. To elucidate the evolutionary history of FibH, the phylogenetic relationships of all FibH were analyzed. In the phylogenetic tree, the Trichoptera (caddisfly) FibH forms a separate branch and possesses typical motifs distinct from those of the Lepidoptera (moth and butterfly) (Figure 3 and Figure S3). The amino acids Tyr, Val, Arg, and Trp are unique to the majority of Trichoptera FibH. The phylogenetic tree indicates that MminFibH and PmusFibH are ancestral to other Lepidoptera FibH, from which many branches have evolved. There are differences in the typical motifs among different Lepidoptera insects. PolyAla, polySer, and the hexapeptide GAGSGA are ubiquitously present in Lepidoptera FibH and represent peptides that distinguish Lepidoptera FibH motifs from those of Trichoptera FibH. It is well known that GAGSGA is a typical motif of the BmorFibH [13,15]. In fact, apart from the silkworm, other species in the Bombycidae family, such as TvarFibH, and in the Notodontidae and Psychidae family, such as FfurFibH and LferFibH, also contain numerous GAGSGA motifs (Figure 3, Table S1).
Butterfly FibH is believed to have evolved from moth FibH, which is consistent with the conclusion that moths are the ancestors of butterflies. The FibH of skippers (a subfamily of butterflies) evolved from that of Papilionoidea, and multiple copies of FibH have evolved from single-copy FibH, such as PmalFibH and EtagFibH (Figure S3). Interestingly, there are significant differences in the motif of different FibH copies. While they possess different motifs, all contain the tripeptide QGP. TactFibH 1 and 2 and TsylFibH 2 have similar motifs that include polyAla and GPQ. TactFibH 3 and TsylFibH 3 share the same typical motif, QGPQGPYGPQ. Other FibH have different motifs, and all contain the tripeptide QGP (Figure 3, Table S1). These results suggest that the evolution of skipper FibH is not merely the result of a simple gene duplication event.
3. Discussion
In this study, we obtained and characterized 114 FibH sequences, providing the most systematic description of insect FibH to date. There are four significant findings. Firstly, we have completed the FibH sequences of many insects, such as bagworms (Luffia ferchaultella). Second, it was the first time that copy number variation events of the FibH gene were found in insects, occurring only in skippers. Third, we revealed that the FibH of skippers evolved from that of Papilionoidea. Lastly, we identified GAGAGS, polyAla, or polySer as the predominant motifs in Lepidopteran FibH, whereas amino acids Tyr, Val, Arg, and Trp are unique to the majority of Trichoptera FibH.
Lepidoptera comprises approximately 130 families, with a genome of about 2279 insect species from about 46 families being sequenced. However, many insect genomes are still incomplete with only contigs; thus, we were only able to extract the FibH gene from 40 families. In contrast, there is even less whole-genome data available for the Trichoptera. Scaling up the sequencing of Lepidoptera and Trichoptera, accelerating insect whole-genome assembly, and improving genome quality will further expedite research on the FibH gene, providing a more comprehensive and in-depth understanding of insect silk proteins.
The complexity and functionality of spider silk originate from the diversity of spidroins, including MaSp, MiSp, and PySp, among others. These spidroins have various variants, each with a unique structure, which imparts different physical and chemical properties to spider silk. Additionally, multiple spidroin variants can enhance silk protein synthesis efficiency, which is particularly important in extreme environments or rapid spider web construction [25,26,27,28,29]. Previously, there have been no reports on copy number variations of the insect FibH gene. In this study, we first discovered that some butterfly species have 2–3 copies of the FibH gene, with each copy having different isoelectric points and hydrophobicity, suggesting unique mechanical properties of these butterfly silks (Table 1). Silkworm silk exhibits different mechanical properties at each instar, primarily due to the varying proportions of three silk fibroins (FibH, FibL, and P25) in different instar silks. In butterflies, the relationship between an increased FibH copy number and the silk protein yield, synthesis rate, and mechanical properties of silk fibers still needs experimental verification.
BmFibH is synthesized by the silkworm posterior silk gland (PSG). The FibH forms a silk protein complex with FibL and P25 in a 6:6:1 ratio, and its secondary structure is mainly a random coil or α-helix in the PSG cell. When the silk protein is secreted into the silk gland lumen, during the transportation process from the PSG to the anterior silk gland (ASG), FibH undergoes changes in physical and chemical environments, causing its secondary structure to transition from a random coil or α-helix to an antiparallel β-sheet [17,18,19,20]. Near the spinneret of ASG, a highly ordered anisotropic herringbone pattern gradually forms, preparing for spinning silk [30]. To understand the structural differences of FibH across insects, we used Alphafold 2 to predict the three-dimensional structure of FibH and MaSp 1. BmorFibH, LhesMaSp 1, and MiSp (A6YP79.1.A) exhibit similar structures characterized by numerous relatively uniform helices, which may be related to strong selection pressures acting on silkworms and spiders. AalFibH (moth) and OalbFibH (caddisfly) share a similar structure with fewer non-uniform helices, and their NTD contains both β-sheets and α-helices. However, the NTD of PbarFibH (butterfly) also contains β-sheets and α-helices, but no uniform helical structure (Figure S4). It is worth mentioning that these structures are only predictions and need to be further experimentally verified.
Insect FibH is composed of numerous repetitive motifs (Figure 3, Table S1). The type, number, and arrangement of motifs determine the mechanical properties of the silk fibers. Spider dragline silk is stronger than silkworm silk, mainly due to the high uniformity of repetitive sequences and polyAla in MaSp. The FibH of bagworms contains both polyAla and GAGAGS motifs, making the mechanical properties of bagworm silk comparable to that of spider silk [31]. The most abundant motifs in the BmorFibH are hexapeptide GAGAGS and GAGAGY [13]. Yoko Takasu et al. [24] assembled these two motifs into a highly ordered artificial FibH and completely replaced the original FibH gene. The resulting homozygous individuals were unable to spin silk, suggesting that the design of silk proteins must follow natural principles. Artificial intelligence, with its powerful learning and computational capabilities, has been used for protein structure prediction and the development of new Cas proteins, among other applications [32,33,34,35]. It is also an inevitable trend to predict the structure of silk proteins, design silk proteins, and customize artificial silk [36,37,38]. A large database of silk protein sequences and mechanical properties’ data of silk fibers is fundamental for artificial intelligence to design silk proteins. Currently, there are about 500 silk protein sequences available, and many insect genomes are accessible [13,16,27]. However, what is most lacking at present are data on the mechanical properties of silk fibers. Collecting numerous silk fibers and measuring their mechanical properties to clarify the relationship between mechanical performance and the FibH sequence will help deep learning to accelerate the development and utilization of novel silk fibers.
All in all, our study develops a profound understanding of insect FibH, providing a broader perspective on the possible utility of these proteins in bioengineering and other apposite applications.
4. Materials and Methods
4.1. Extract the Complete FibH Gene Sequence from Database
Search in NCBI (
4.2. Data Analysis and Graphing
The theoretical isoelectric point (pI) and grand average of hydropathicity (GRAVY) of FibH were calculated by protparam (
4.3. Three-Dimensional Structure Prediction of FibH
The 3D structure of FibH was predicted using online Alphafold 2 [41] (
Conceptualization, W.L.; methodology, W.L.; validation, T.Z. and W.L.; formal analysis, T.Z. and Y.G.; investigation, T.Z. and W.L.; resources, T.Z.; writing—original draft, T.Z.; writing—review and editing, W.L., S.M., Z.Z., and D.Y.; visualization, T.Z., Z.Z., and W.L.; supervision, W.L.; project administration, W.L.; funding acquisition, W.L. All authors have read and agreed to the published version of the manuscript.
Not applicable.
Not applicable.
Data is contained within the article and
The authors declare no conflicts of interest.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 1. The overview and copy number variation of the insect FibH gene. (A) The FibH genes of Trichoptera, moths, and butterflies. (B) The copy number variation of the FibH gene in skippers.
Figure 2. The length, molecular weight, composition, and properties of FibH. (A) The length and molecular weight of FibH. (B) The amino acid composition of each insect FibH. The red, black, and orange colors represent the FibH of Trichoptera, moth, and butterfly, respectively. (C) The amino acid composition of moth, butterfly, and Trichoptera FibH. (D) The theoretical isoelectric point (pI) and grand average of hydropathicity (GRAVY) of FibH.
Figure 3. The phylogenetic tree of insect FibH and their motifs. d1 represents the proportion of all motifs to the total sequence length, with a median of 0.89; d2 indicates the proportion of the most prevalent motif (the typical motif) to the total sequence length, with a median of 0.50; d3 denotes the number of amino acid residues in the typical motif, with a median of 35.64. The amino acid sequence on the right represents the motif sequence with the highest percentage in each insect FibH. The red, black, and orange colors in the phylogenetic tree represent the FibH of Trichoptera, moth, and butterfly, respectively.
The skipper FibH genes with multiple copies and their encoded protein.
Gene | Exon 1 (bp) | Intron (bp) | CDS (bp) | Protein (aa) | Signal Peptide (aa) | PI | Grand Average of Hydropathicity (GRAVY) | Top 5 Amino Acids |
---|---|---|---|---|---|---|---|---|
BmorFibH | 42 | 971 | 15,792 | 5263 | 18 | 4.39 | 0.216 | Gly, Ala, Ser, Tyr, Val |
HcomFibH 1 | 42 | 1189 | 12,855 | 4284 | 18 | 9.19 | −0.288 | Ala, Gly, Pro, Gln, Ser |
HcomFibH 2 | 42 | 4906 | 25,620 | 8539 | 18 | 8.85 | −1.029 | Gly, Pro, Gln, Ala, Ser |
TactFibH 1 | 42 | 1338 | 20,112 | 6703 | 18 | 4.89 | −0.323 | Gly, Ala, Pro, Ser, Gln |
TactFibH 2 | 42 | 1493 | 26,265 | 8754 | 18 | 7.87 | −0.153 | Gly, Ala, Pro, Ser, Gln |
TactFibH 3 | 42 | 2878 | 24,417 | 8138 | 18 | 7.85 | −1.256 | Gly, Pro, Gln, Ala, Ser |
TsylFibH 1 | 42 | 1648 | 13,863 | 4620 | 18 | 5.29 | −0.104 | Gly, Ala, Ser, Pro, Gln |
TsylFibH 2 | 42 | 2491 | 20,706 | 6901 | 18 | 9.14 | −0.155 | Ala, Gly, Pro, Ser, Gln |
TsylFibH 3 | 42 | 2718 | 20,454 | 6817 | 18 | 8.14 | −1.178 | Gly, Gln, Pro, Ala, Ser |
CpalFibH 1 | 42 | 3349 | 16,341 | 5446 | 18 | 6.74 | −0.977 | Gly, Pro, Gln, Ala, Ser |
CpalFibH 2 | 42 | 933 | 13,422 | 4473 | 18 | 5.02 | −0.395 | Ala, Gly, Ser, Pro, Tyr |
Supplementary Materials
The following supporting information can be downloaded at:
References
1. Sutherland, T.D.; Young, J.H.; Weisman, S.; Hayashi, C.Y.; Merritt, D.J. Insect Silk: One Name, Many Materials. Annu. Rev. Entomol.; 2010; 55, pp. 171-188. [DOI: https://dx.doi.org/10.1146/annurev-ento-112408-085401] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19728833]
2. Xia, Q.; Li, S.; Feng, Q. Advances in Silkworm Studies Accelerated by the Genome Sequencing of Bombyx mori. Annu. Rev. Entomol.; 2014; 59, pp. 513-536. [DOI: https://dx.doi.org/10.1146/annurev-ento-011613-161940] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24160415]
3. Sutherland, T.D.; Weisman, S.; Trueman, H.E.; Sriskantha, A.; Trueman, J.W.H.; Haritos, V.S. Conservation of Essential Design Features in Coiled Coil Silks. Mol. Biol. Evol.; 2007; 24, pp. 2424-2432. [DOI: https://dx.doi.org/10.1093/molbev/msm171] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17703050]
4. Shi, J.; Lua, S.; Du, N.; Liu, X.; Song, J. Identification, recombinant production and structural characterization of four silk proteins from the Asiatic honeybee Apis cerana. Biomaterials; 2008; 29, pp. 2820-2828. [DOI: https://dx.doi.org/10.1016/j.biomaterials.2008.03.020] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18394700]
5. Inoue, S.; Tanaka, K.; Arisaka, F.; Kimura, S.; Ohtomo, K.; Mizuno, S. Silk fibroin of Bombyx mori is secreted, assembling a high molecular mass elementary unit consisting of H-chain, L-chain, and P25, with a 6:6:1 molar ratio. J. Biol. Chem.; 2000; 275, pp. 40517-40528. [DOI: https://dx.doi.org/10.1074/jbc.M006897200] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/10986287]
6. Peng, Z.; Yang, X.; Liu, C.; Dong, Z.; Wang, F.; Wang, X.; Hu, W.; Zhang, X.; Zhao, P.; Xia, Q. Structural and Mechanical Properties of Silk from Different Instars of Bombyx mori. Biomacromolecules; 2019; 20, pp. 1203-1216. [DOI: https://dx.doi.org/10.1021/acs.biomac.8b01576] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30702870]
7. Qin, L.; Li, J.; Guo, K.; Lu, M.; Zhang, Y.; Zhang, X.; Zeng, Y.; Wang, X.; Xia, Q.; Zhao, P. et al. Insights into the structure and composition of mineralized hard cocoons constructed by the oriental moth, Monema (Cnidocampa) flavescens Walker. Insect Biochem. Mol. Biol.; 2022; 151, 103878. [DOI: https://dx.doi.org/10.1016/j.ibmb.2022.103878] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36410578]
8. Rouhová, L.; Sehadová, H.; Pauchová, L.; Hradilová, M.; Žurovcová, M.; Šerý, M.; Rindoš, M.; Žurovec, M. Using the multi-omics approach to reveal the silk composition in Plectrocnemia conspersa. Front. Mol. Biosci.; 2022; 9, 945239. [DOI: https://dx.doi.org/10.3389/fmolb.2022.945239] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36060257]
9. Zhou, C.Z.; Confalonieri, F.; Jacquet, M.; Perasso, R.; Li, Z.G.; Janin, J. Silk fibroin: Structural implications of a remarkable amino acid sequence. Proteins; 2001; 44, pp. 119-122. [DOI: https://dx.doi.org/10.1002/prot.1078]
10. He, Y.-X.; Zhang, N.-N.; Li, W.-F.; Jia, N.; Chen, B.-Y.; Zhou, K.; Zhang, J.; Chen, Y.; Zhou, C.-Z. N-Terminal Domain of Bombyx mori Fibroin Mediates the Assembly of Silk in Response to pH Decrease. J. Mol. Biol.; 2012; 418, pp. 197-207. [DOI: https://dx.doi.org/10.1016/j.jmb.2012.02.040]
11. Hu, W.; Lu, W.; Wei, L.; Zhang, Y.; Xia, Q. Molecular nature of dominant naked pupa mutation reveals novel insights into silk production in Bombyx mori. Insect Biochem. Mol. Biol.; 2019; 109, pp. 52-62. [DOI: https://dx.doi.org/10.1016/j.ibmb.2019.04.006] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30954682]
12. Hao, Z.; Long, D.; Zhang, Y.; Umuhoza, D.; Dai, J.; Xu, Z.; Zhang, G.; Meng, W.; Xiang, Z.; Zhao, A. New insight into the mechanism of in vivo fibroin self-assembly and secretion in the silkworm, Bombyx mori. Int. J. Biol. Macromol.; 2021; 169, pp. 473-479. [DOI: https://dx.doi.org/10.1016/j.ijbiomac.2020.12.132] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33358779]
13. Lu, W.; Ma, S.; Sun, L.; Zhang, T.; Wang, X.; Feng, M.; Wang, A.; Shi, R.; Jia, L.; Xia, Q. Combined CRISPR toolkits reveal the domestication landscape and function of the ultra-long and highly repetitive silk genes. Acta Biomater.; 2023; 158, pp. 190-202. [DOI: https://dx.doi.org/10.1016/j.actbio.2022.12.063] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36603730]
14. Moreno-Tortolero, R.O.; Luo, Y.; Parmeggiani, F.; Skaer, N.; Walker, R.; Serpell, L.; Holland, C.; Davis, S.A. Silk Road Revealed: Mechanism of silk fibre formation in Bombyx mori. bioRxiv; 2023; [DOI: https://dx.doi.org/10.1101/2023.06.02.543394]
15. Zhou, C.Z.; Confalonieri, F.; Medina, N.; Zivanovic, Y.; Esnault, C.; Yang, T.; Jacquet, M.; Janin, J.; Duguet, M.; Perasso, R. et al. Fine organization of Bombyx mori fibroin heavy chain gene. Nucleic Acids Res.; 2000; 28, pp. 2413-2419. [DOI: https://dx.doi.org/10.1093/nar/28.12.2413] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/10871375]
16. Lu, W.; Zhang, T.; Zhang, Q.; Zhang, N.; Jia, L.; Ma, S.; Xia, Q. FibH Gene Complete Sequences (FibHome) Revealed Silkworm Pedigree. Insects; 2023; 14, 244. [DOI: https://dx.doi.org/10.3390/insects14030244] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36975929]
17. Zhou, L.; Chen, X.; Shao, Z.; Huang, Y.; Knight, D.P. Effect of Metallic Ions on Silk Formation in the Mulberry Silkworm, Bombyx mori. J. Phys. Chem. B; 2005; 109, pp. 16937-16945. [DOI: https://dx.doi.org/10.1021/jp050883m] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/16853155]
18. Greving, I.; Cai, M.; Vollrath, F.; Schniepp, H.C. Shear-Induced Self-Assembly of Native Silk Proteins into Fibrils Studied by Atomic Force Microscopy. Biomacromolecules; 2012; 13, pp. 676-682. [DOI: https://dx.doi.org/10.1021/bm201509b] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/22352290]
19. Hu, L.; Han, Y.; Ling, S.; Huang, Y.; Yao, J.; Shao, Z.; Chen, X. Direct Observation of Native Silk Fibroin Conformation in Silk Gland of Bombyx mori Silkworm. ACS Biomater. Sci. Eng.; 2020; 6, pp. 1874-1879. [DOI: https://dx.doi.org/10.1021/acsbiomaterials.9b01586]
20. Suzuki, Y.; Morie, S.; Okamura, H.; Asakura, T.; Naito, A. Real-Time Monitoring of the Structural Transition of Bombyx mori Liquid Silk under Pressure by Solid-State NMR. J. Am. Chem. Soc.; 2023; 145, pp. 22925-22933. [DOI: https://dx.doi.org/10.1021/jacs.3c04361]
21. Ma, S.; Shi, R.; Wang, X.; Liu, Y.; Chang, J.; Gao, J.; Lu, W.; Zhang, J.; Zhao, P.; Xia, Q. Genome editing of BmFib-H gene provides an empty Bombyx mori silk gland for a highly efficient bioreactor. Sci. Rep.; 2014; 4, 6867. [DOI: https://dx.doi.org/10.1038/srep06867]
22. Malay, A.D.; Sato, R.; Yazawa, K.; Watanabe, H.; Ifuku, N.; Masunaga, H.; Hikima, T.; Guan, J.; Mandal, B.B.; Damrongsakkul, S. et al. Relationships between physical properties and sequence in silkworm silks. Sci. Rep.; 2016; 6, 27573. [DOI: https://dx.doi.org/10.1038/srep27573] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/27279149]
23. Frandsen, P.B.; Hotaling, S.; Powell, A.; Heckenhauer, J.; Kawahara, A.Y.; Baker, R.H.; Hayashi, C.Y.; Ríos-Touma, B.; Holzenthal, R.; Pauls, S.U. et al. Allelic resolution of insect and spider silk genes reveals hidden genetic diversity. Proc. Natl. Acad. Sci. USA; 2023; 120, e2221528120. [DOI: https://dx.doi.org/10.1073/pnas.2221528120] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37094147]
24. Takasu, Y.; Yamada, N.; Kojima, K.; Iga, M.; Yukuhiro, F.; Iizuka, T.; Yoshioka, T. Fibroin heavy chain gene replacement with a highly ordered synthetic repeat sequence in Bombyx mori. Insect Biochem. Mol. Biol.; 2023; 161, 104002. [DOI: https://dx.doi.org/10.1016/j.ibmb.2023.104002] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37657611]
25. Malay, A.D.; Craig, H.C.; Chen, J.; Oktaviani, N.A.; Numata, K. Complexity of Spider Dragline Silk. Biomacromolecules; 2022; 23, pp. 1827-1840. [DOI: https://dx.doi.org/10.1021/acs.biomac.1c01682] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35378031]
26. Nakamura, H.; Kono, N.; Mori, M.; Masunaga, H.; Numata, K.; Arakawa, K. Composition of Minor Ampullate Silk Makes Its Properties Different from Those of Major Ampullate Silk. Biomacromolecules; 2023; 24, pp. 2042-2051. [DOI: https://dx.doi.org/10.1021/acs.biomac.2c01474] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37002945]
27. Lu, W.; Shi, R.; Li, X.; Ma, S.; Yang, D.; Shang, D.; Xia, Q. A review on complete silk gene sequencing and de novo assembly of artificial silk. Int. J. Biol. Macromol.; 2024; 264, 130444. [DOI: https://dx.doi.org/10.1016/j.ijbiomac.2024.130444] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/38417762]
28. Kono, N.; Ohtoshi, R.; Malay, A.D.; Mori, M.; Masunaga, H.; Yoshida, Y.; Nakamura, H.; Numata, K.; Arakawa, K. Darwin’s bark spider shares a spidroin repertoire with Caerostris extrusa but achieves extraordinary silk toughness through gene expression. Open Biol.; 2021; 11, 210242. [DOI: https://dx.doi.org/10.1098/rsob.210242] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34932907]
29. Kono, N.; Nakamura, H.; Mori, M.; Yoshida, Y.; Ohtoshi, R.; Malay, A.D.; Moran, D.A.P.; Tomita, M.; Numata, K.; Arakawa, K. Multicomponent nature underlies the extraordinary mechanical properties of spider dragline silk. Proc. Natl. Acad. Sci. USA; 2021; 118, e2107065118. [DOI: https://dx.doi.org/10.1073/pnas.2107065118]
30. Song, K.; Wang, Y.; Dong, W.; Li, Z.; Xia, Q.; Zhu, P.; He, H. Decoding silkworm spinning programmed by pH and metal ions. Sci. Bull.; 2024; 69, pp. 792-802. [DOI: https://dx.doi.org/10.1016/j.scib.2023.12.050]
31. Yoshioka, T.; Tsubota, T.; Tashiro, K.; Jouraku, A.; Kameda, T. A study of the extraordinarily strong and tough silk produced by bagworms. Nat. Commun.; 2019; 10, 1469. [DOI: https://dx.doi.org/10.1038/s41467-019-09350-3]
32. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A. et al. Highly accurate protein structure prediction with AlphaFold. Nature; 2021; 596, pp. 583-589. [DOI: https://dx.doi.org/10.1038/s41586-021-03819-2]
33. Peng, D.; Zheng, L.; Liu, D.; Han, C.; Wang, X.; Yang, Y.; Song, L.; Zhao, M.; Wei, Y.; Li, J. et al. Large-language models facilitate discovery of the molecular signatures regulating sleep and activity. Nat. Commun.; 2024; 15, 3685. [DOI: https://dx.doi.org/10.1038/s41467-024-48005-w]
34. Lin, Z.; Akin, H.; Rao, R.; Hie, B.; Zhu, Z.; Lu, W.; Smetanin, N.; Verkuil, R.; Kabeli, O.; Shmueli, Y. et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science; 2023; 379, pp. 1123-1130. [DOI: https://dx.doi.org/10.1126/science.ade2574] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36927031]
35. Ruffolo, J.A.; Nayfach, S.; Gallagher, J.; Bhatnagar, A.; Beazer, J.; Hussain, R.; Russ, J.; Yip, J.; Hill, E.; Pacesa, M. et al. Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences. bioRxiv; 2024; [DOI: https://dx.doi.org/10.1101/2024.04.22.590591]
36. Jiang, M.; Shu, T.; Ye, C.; Ren, J.; Ling, S. Predicting the conformations of the silk protein through deep learning. Analyst; 2021; 146, pp. 2490-2498. [DOI: https://dx.doi.org/10.1039/D1AN00290B]
37. Kim, Y.; Yoon, T.; Park, W.B.; Na, S. Predicting mechanical properties of silk from its amino acid sequences via machine learning. J. Mech. Behav. Biomed. Mater.; 2023; 140, 105739. [DOI: https://dx.doi.org/10.1016/j.jmbbm.2023.105739]
38. Fazio, V.; Pugno, N.M.; Giustolisi, O.; Puglisi, G. Hierarchical physically based machine learning in material science: The case study of spider silk. arXiv; 2023; [DOI: https://dx.doi.org/10.48550/ARXIV.2307.12945]
39. Almagro Armenteros, J.J.; Tsirigos, K.D.; Sønderby, C.K.; Petersen, T.N.; Winther, O.; Brunak, S.; Von Heijne, G.; Nielsen, H. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol.; 2019; 37, pp. 420-423. [DOI: https://dx.doi.org/10.1038/s41587-019-0036-z]
40. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; Von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol.; 2020; 37, pp. 1530-1534. [DOI: https://dx.doi.org/10.1093/molbev/msaa015]
41. Mirdita, M.; Schütze, K.; Moriwaki, Y.; Heo, L.; Ovchinnikov, S.; Steinegger, M. ColabFold: Making protein folding accessible to all. Nat. Methods; 2022; 19, pp. 679-682. [DOI: https://dx.doi.org/10.1038/s41592-022-01488-1] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35637307]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
The FibH gene, crucial for silk spinning in insects, encodes a protein that significantly influences silk fiber mechanics. Due to its large size and repetitive sequences, limited known sequences of insect FibH impede comprehensive understanding. Here, we analyzed 114 complete FibH gene sequences from Lepidoptera (71 moths, 24 butterflies) and 13 Trichoptera, revealing single-copy FibH in most species, with 2–3 copies in Hesperinae and Heteropterinae (subfamily of skippers). All FibH genes are structured with two exons and one intron (39–45 bp), with the second exon being notably longer. Moths exhibit higher GC content in FibH compared to butterflies and Trichoptera. The FibH composition varies among species, with moths and butterflies favoring Ala, Gly, Ser, Pro, Gln, and Asn, while Trichoptera FibH is enriched in Gly, Ser, and Arg, and has less Ala. Unique to Trichoptera FibH are Tyr, Val, Arg, and Trp, whereas Lepidoptera FibH is marked by polyAla (polyalanine), polySer (polyserine), and the hexapeptide GAGSGA. A phylogenetic analysis suggests that Lepidoptera FibH evolved from Trichoptera, with skipper FibH evolving from Papilionoidea. This study substantially expands the FibH repertoire, providing a foundation for the development of artificial silk.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Integrative Science Center of Germplasm Creation in Western China (CHONGQING) Science City, Biological Science Research Center, Southwest University, Chongqing 400715, China;
2 College of Sericulture, Textile and Biomass Sciences, Southwest University, Chongqing 400715, China;