1. Introduction
The family Parvoviridae encompasses small non-enveloped and negative single-stranded DNA viruses, including many human and animal pathogens, that infect a wide variety of species [1]. Porcine parvoviruses (PPV) are important pathogens that cause reproductive failure in swine, resulting in enormous losses in the pig industry worldwide [2]. The Parvoviridae family consists of three subfamilies: Parvovirinae, Hamaparvovirinae, and Densovirinae. Viruses infecting vertebrates and vertebrate cell cultures are assigned to the subfamily Parvovirinae, and Bocaparvovirus, Copiparvovirus, and Tetraparvovirus contain viruses that infect pigs. Currently, eight species in four genera of parvovirus have been described to infect swine. Over the past few years, several new parvoviruses have been discovered in pigs.
Amino acids are encoded in the form of triplet codons. An amino acid can be encoded by one or more (not more than six) triplet codons. Synonymous codons are not randomly selected within or between genomes, which is called codon bias [3,4], Codon bias is affected by many factors, including natural selection, mutation pressure, genetic drift, protein structure and characteristics, tRNA abundance, GC content, etc. [5,6,7,8,9,10]. The non-random use of synonymous codons is called “codon usage bias” [11]. It can reveal the basic characteristics of molecular evolution [12]. Additionally, the use of viral codons may be affected by the host [13,14]. Similar codon usage patterns between virus and host will severely hinder host translation. In addition, codon bias can also affect protein function and translation efficiency to affect viral protein synthesis efficiency, replication adaptability, virulence, and even virus survival [14,15,16]. Considering the dependence of the virus on host cell mechanisms, codon usage patterns can help us understand molecular evolution and expression levels and help vaccine design [17]. Recombination analysis of other pathogens with other DNA [18] and RNA [19] viruses, including recombination outages and evolutionary rates.
Until now, the influence of codon usage bias on the genetic evolution of PPV has not been elucidated in detail. We conducted a comprehensive analysis of the evolutionary process, genetic evolution, and codon usage patterns based on the updated global-scale PPV sequence data set. Our research not only provides detailed information about the codon usage in PPV (PPV2-NS1, PPV3-VP1, PPV4-CP, PPV5-NS1, PPV6-NS1, and PPV7-NS1) genes, but it also demonstrates PPV evolution, species spread, and host adaptation to provide a novel approach.
2. Materials and Methods
2.1. Genome Alignment and Phylogenetic Analysis
The nucleotide and deduced amino acid (AA) sequences of PPVs were assembled using the SeqMan program of DNAstar software version 7.0 (DNASTAR Inc., Madison, WI, USA) to determine sequence homology and genetic variations. The phylogenetic analysis is based on the nucleotide sequence by the distance-based neighbor-joining method using the iTOL v6 with bootstrap analysis of 1000 replicates (
2.2. Phylogeographic Model
jModelTest software selected the best evolution model, and BEAST v1.10.4 was used under the GTR + I + G model of nucleotide substitution with a relaxed clock. The PPV gene was used for phylogenetic and phylodynamic analysis of PPV, and its high variability compared with other genes and having the most available sequences. Markov chain Monte Carlo (MCMC) (set to perform 80 million computations) methods were used to analyze a maximum clade credibility tree. The BEAUti software (version 1.10.4) (
2.3. Codon Usage Index
2.3.1. Nucleotide Composition
The nucleotide composition of the PPVs (PPV2-NS1, PPV3-VP1, PPV4-CP, PPV5-NS1, PPV6-NS1, and PPV7-NS1), including the content of each nucleotide (A%, T%, C%, and G%), total GC, and the AT content of each sequence, is calculated using Bioedit (version 7.0.9.1). The frequency of the third nucleotide (A3s, C3s, T3s, and G3s) was counted using Codon W (version 1.4.2) (
2.3.2. Effective Number of Codons (ENC) Analysis
The effective number of codons (ENC) describes the degree of deviation of codon usage from random selection. It is not a comparison of the frequency of a particular codon with other codons, but it reflects the unbalanced use of synonymous codons in the codon family degree of preference. Calculated based only on codon usage data, it has nothing to do with gene length and amino acid (aa) composition [22] (detailed introduction in Supplementary Materials).
2.3.3. Relative Synonymous Codon Usage Analysis (RSCU)
Relative synonymous codon usage (RSCU) is used to check the usage of synonymous codons to assess the deviation of synonymous codon usage and to highlight the relative frequency of synonymous codons without being affected by the confounding of the amino acid composition of different gene products [23]. In research, RSCU values of highly expressed genes are usually used to establish a reference table (
2.3.4. Relative Codon Deoptimization Index (RCDI)
The relative codon deoptimization index (RCDI) is a measure of comparison with the general codon distribution. The translation rate of viral genes can be estimated, especially the translation rate of the entire genome; the higher the similarity between the virus and the host gene (close to the RCDI value of 1), the higher the translation rate [24]. The RCDI value is calculated by CAIcal (
2.3.5. Codon Adaptation Index (CAI) Analysis
The CAI value ranges from 0 to 1. The higher the CAI value, the higher the similarity to the host [25] (detailed introduction in Supplementary Materials).
2.3.6. Frequency of Optimal Codons (FOP) Analysis
The optimal codon for an amino acid is defined as the codon with the largest number of tRNA genes with its anticodon [23]. Therefore, the FOP metric is a weighted average of the best codon RSCU, where the weight is the corresponding amino acid frequency (detailed introduction in Supplementary Materials).
2.3.7. Codon Bias Index (CBI) Analysis
Codon bias index (CBI) reflects the composition of high-expression superior codons in a specific gene. For the target host’s own genes, this index has a good correlation with the ENC value, which more clearly reflects the expression of foreign genes in the target host. The value of CBI ranges from 0 to 1. When translating amino acids, the number of synonymous codons is the same, and the CBI is 1. When an amino acid only has one corresponding codon, that is, the codon preference is maximized, then the CBI is 0. It is also possible that the number of occurrences of the optimal codon is less than the average number of usages, the CBI is a negative value [26] (detailed introduction in Supplementary Materials).
2.3.8. ENC–GC3s Drawing Analysis
By exploring the functional relationship between ENC (ordinate) and GC3s (abscissa), it is tested whether there are other factors involved in the formation of codon usage patterns in addition to mutation pressure. If there is no natural selection, codon preference is only subject to mutation pressure. In an ideal state, the point corresponding to the gene will fall on the desired curve or a position closer to the curve. When the formation plays an important role, the ENC value is located or distributed around the expected curve; however, when the use of codons is affected by natural selection, mutation pressure, and other factors, the ENC value is much lower than the expected curve [27], indicating that other factors are also involved in codon use preference.
2.3.9. Neutrality Plot Analysis (G12s/GC3s)
The linear relationship between GC12 and GC3 by neutrality graph analysis, which is to study the relative contribution of mutation and natural selection pressure to codon usage bias. If the slope of the equation is close to 1, it means that sudden change pressure is the dominant factor and has a strong dependence [10]. Conversely, values close to the abscissa or ordinate indicate that natural selection is dominant.
2.4. Statistical Analysis
The scores of CAI, RSCU, RCDI, FOP, and CBI are not strictly normally distributed; the pedigrees have unequal variances, and their scores have statistically significant differences. Significant relationships are shown in box plots, where p ≤ 0.001 is extremely significant (****).
3. Results
3.1. Phylogenetic Analysis of PPVS
To establish the genetic relationships of strains of PPV, a phylogenetic analysis of PPVs was performed based on the nucleotide sequences of the structural protein using a distance-based neighbor-joining method with 1000 bootstrap replicates in iTOL v6 (Figure 1).
3.2. Evolutionary Tree Construction Based on the Bayesian Markov Chain Method
The Bayesian Markov chain method was used to determine the mutation rate of the PPV strain’s structural protein gene codons, and the BEAST results were analyzed by Trace. The results showed that the mutation rates of the amino acid codons encoded by the PPV strain’s structural protein gene were different, and the mutation rates of the three codons were 1.0214, 0.8584, and 1.1203 (Figure 2A). Therefore, the third codon has the highest mutation rate. Because the third codon has the highest mutation rate and the codon has degeneracy, some mutations will not change the amino acid encoding the protein, which makes the homology between PPV virus strains very high. These values indicate that during this period, the structural protein gene has a base mutation, which may be related to the recent high detection rate of the PPV virus. The geographical distribution of the PPV virus is steadily increasing. In the skyline chart, it can be seen that the effective outbreak of PPV virus has declined since 2007, but it also increased in 2015 (Figure 2B,C).
3.3. Nucleotide Bias of PPV Genotypes
Among PPV antigenic variants, Table 1 shows that nucleotide A (0.272 ± 0.005 in all) is the most frequent of all the strains, with PPV2, PPV3, PPV4, PPV5, PPV6, and PPV7 showing percentages of 0.231 ± 0.002, 0.231 ± 0.002, 0.342 ± 0.001, 0.345 ± 0.002, 0.235 ± 0.001, and 0.265 ± 0.005 (mean ± SD), respectively. in terms of synonymous codons, while the mean value of T3s (0.347 ± 0.152 in all) is the highest at the third codon position, followed by A3s (0.329 ± 0.129 in all), C3s (0.299 ± 0.123 in all), and G3s (0.284 ± 0.078 in all), the third position of synonymous codons shows dissimilar composition patterns among these PPV sub-types. Furthermore, the GC content varied among different synonymous codon positions and was also different among PPV sub-types. While the value of GC3S in PPV2 and PPV7 is higher than GC1S and GC2S, the GC3S of other sub-types is lower than GC1S. In particular, regardless of the position of the synonymous codon, PPV5 showed the lowest GC content. PPV7 has the highest GC12s values relative to other sub-types PPV2, PPV3, PPV4, PPV5, and PPV6.
3.4. Codon Bias Measures
The ENC values were 51.607 ± 0.891, 57.433 ± 0.192, 44.843 ± 0.270, 5.469 ± 1.007, 48.045 ± 0.325, and 45.812 ± 0.812 for PPV2, PPV3, PPV4, PPV5, PPV6, and PPV7, respectively. Although all ENC values are higher than 35, indicating that the codon biases of all sub-types are low, the ENC values of PPV2 and PPV3 can reach 51.607 (SD ± 0.891) and 57.433 (SD ± 0.192), the codons of PPV2 and PPV3 are lower and the endogenous genes corresponding to various sub-types of PPV tend to be less expressed (Table 1).
We compared the RSCU values among different PPV sub-types. RSCU analysis shows that some codons may be regarded as overrepresented (RSCU > 1.6) or be said to be underrepresented (RSCU < 0.6) in different PPV sub-types (Supplementary Figure S1). Codons AGA(Arg) (2.279 ± 1.312 in all) is overrepresented in all six sub-types, and GTC(Val) (0.564 ± 0.374 in all), TCG(Ser) (0.395 ± 0.236 in all), GCG(Ala) (0.553 ± 0.211 in all), CGT(Arg) (0.439 ± 0.414 in all), and CGA(Arg) (0.528 ± 0.225 in all) are overrepresented. More specifically, codon AGA(Arg) is most overrepresented in PPV4, PPV5, and PPV7, while it is low in PPV2, PPV3, and PPV6, and codon TCG(Ser) is underrepresented in PPV3, PPV4, PPV5, PPV6, and PPV7, while it is low in PPV2; codon GCG(Ala) is underrepresented in PPV2, PPV4, PPV5, and PPV7, while it is low in PPV3 and PPV6; codon CGT(Arg) is underrepresented in PPV2, PPV3, PPV4, PPV5, and PPV7, while it is low in PPV6; codon CGA(Arg) is underrepresented in PPV2, PPV3, PPV4, PPV5, and PPV6, while it is low in PPV7. Surprisingly, the AGA(Arg) was 4.097 ± 0.129 and 3.812 ± 0.107 in PPV4 and PPV5, respectively, indicating an extremely preferential use of this codon (Supplementary Table S7).
3.5. Codon Usage Comparison between Virus and Host
CAI is a measure of the codon usage adaptability of the most commonly used synonymous codons in the reference genome, and it is usually used to predict gene expression efficiency and the expression level of foreign genes. Regarding the CAI value, PPV3 and PPV7 have the highest CAI values, which proves that their codon bias is very strong relative to other sub-types (Figure 3A). The relative codon deoptimization index (RCDI) is a measure of comparison with the general codon distribution. The translation rate of viral genes can be estimated, especially the translation rate of the entire genome. Among them, PPV4 has the highest RCDI value, indicating that the higher the similarity with the host gene, the higher the translation rate (Figure 3B). The optimal codon refers to the most frequently used codon in the highly expressed genes of a species. The results show that PPV7 has the highest FOP value compared to other sub-types, indicating that it has optimal codon usage frequency (Figure 3C). PPV7 was a new PPV sub-type in recent years. The CAI, RCDI, and FOP values show that it has strong expression of foreign genes and optimal codon usage compared to other PPV sub-types.
3.6. CBI–ENC Correlation Analysis
The CBI value reflects the composition of a gene with high expression of superior codons, but it has a good correlation with the ENC value. The CBI–ENC correlation analysis shows that the CBI value is indeed correlated with the ENC value, but only PPV2 and PPV7 have a positive CBI value, and PPV7 is the highest, indicating that the synonymous codons occur the same number of times when translating amino acids; the CBI value of other PPV3, PPV4, PPV5, and PPV6 sub-types are negative, indicating that the number of occurrences of optimal codons is lower than the average number of usages (Supplementary Figure S2).
3.7. ENC–GC3s Drawing Analysis
Use the ENC–GC3s diagram to explore the influence of GC3s on codon preference. As shown in Figure 4, the distributions of the ENC–GC3s diagrams of the seven PPV sub-types are relatively similar, and all genes fall under the expected curve or just on the expected curve. This means that indicating that mutation pressure is less suitable for codon usage patterns, but other factors, especially natural selection, play an important role in codon usage preference.
3.8. Neutrality Plot Analysis
Neutral analysis based on GC12 and GC3 can quantitatively evaluate the impact of pressure mutations and natural selection. We found that the coefficients between GC12s and GC3s are positively correlated. This phenomenon suggests that the role of natural selection may be important in shaping PPV codon usage, especially for PPV3 (slope of regression line 0.0976, R2 = 0.0634), PPV5 (slope of regression line 0.1518, R2 = 0.178), PPV6 (slope of regression line 0.1365, R2 = 0.167), and PPV7 (slope of regression line 0.0633, R2 = 0.0993) (Figure 5). This indicates that mutational pressure has a certain impact on the codon usage preference of PPV genes, and natural selection plays a very important or even leading role in the codon usage pattern.
4. Discussion
Several studies to determine the effects of PPV2–PPV7 on pig health have focused on concurrent infections between PPV and other pathogens (mainly PRRSV and PCV2). Because PPV2–PPV6 have never been cultured and challenged in vitro, their importance must be inferred or speculated based on testing at the DNA level. Previous phylogenetic studies of strains on a global scale have shown that they can be divided into seven different sub-types. Therefore, studying the transmission mode and gene codon bias of different sub-types of PPV will provide more objective data and help to better understand the ecological and genetic evolutionary dynamics of these viruses. In order to explore the evolutionary characteristics and host adaptability of different PPV sub-types, a codon usage analysis was performed here. Based on the phylogenetic analysis of MCC, we identified six main PPV sub-types (PPV2, PPV3, PPV4, PPV5, PPV6, and PPV7) and determined the evolutionary dynamics of PPV for the first time.
Bayesian system dynamics models are used in research environments and are rarely used for routine monitoring of field data to support disease prevention and control. Due to the complexity and scale of the data analyzed, this implementation is challenging. Bayesian inference (BI) is a statistical method based on using evolutionary models of sequence evolution to reconstruct a phylogenetic tree. The resulting tree not only reflects the best estimate of the phylogenetic relationship but also provides definite support for branching [28]. Many infectious viral diseases use Bayesian system dynamics models. For example, the study simulated the evolutionary dynamics of PRRSV [29,30]. The data we selected include all the published structural protein sequences of various sub-types of PPV by NCBI. The skyline chart shows that the effective outbreak of PPV virus has declined since 2007 and increased in 2015. The chart can also predict the trend of dynamic phylogeny, indicating the trend of PPV outbreak, and provide a theoretical basis for clinical prevention and control.
Regarding the codon usage pattern of the PRRSV ORF5 gene, the changes in different lineages of PRRSV reflect the evolutionary changes made for survival and adaptation to the host, confirming that the attenuation of continuous passage of PRRSV in cells has a certain relationship with codon preference, so it indicates that codon preference has a certain impact on the survival, host adaptation, and virulence of PRRSV virus [31]. Previous studies have confirmed that the codon usage preference of the PCV2 clade is mainly affected by natural selection, and its evolutionary model seems to be spread across hosts [32]. It was then determined that H7N9 has a low codon bias, which is mainly driven by natural selection [33]. It proved that codons are very important for adapting to the host. In this study, based on a comprehensive analysis of CAI, FOP, and RCID, the new PPV7 sub-type has the highest CAI and FOP values compared to other sub-types, and the RCID value of PPV7 is not the highest, but it remains above the average level. Because previous studies have shown that the CAI value is high, indicating that the virus has strong adaptability [34], and inferred that it may be related to the virus’s virulence to the host [33], it has important guiding significance for studying whether the new PPV7 sub-type has higher infectivity and pathogenicity. Using the ENC–GC3s diagram to explore the codon preference of GC3s shows that the distribution of the ENC–GC3s diagrams of the seven PPV sub-types is relatively similar, indicating that mutation pressure is less suitable for codon usage patterns. The CBI-ENC correlation analysis shows that the CBI value is indeed related to the ENC value, but only the CBI values of PPV2 and PPV7 are positive, and PPV7 is the highest, indicating that the number of synonymous codons are the same with translating amino acid groups; the CBI values of other sub-types PPV3, PPV4, PPV5, and PPV6 sub-types are negative, indicating that the number of occurrences of the best codons is lower than the average number of usages. Neutral analysis based on GC12 and GC3 can quantitatively evaluate the impact of stress mutations and natural selection. We have determined that mutation stress has a certain impact on the codon usage preference of PPV genes, and natural selection plays a very important and leading role in codon usage patterns. To sum up, PPV is still widely distributed in China, and the new type of PPV7 is determined by the analysis of the bias in cryptography to be able to better adapt to the host and thus better spread.
5. Conclusions
In conclusion, we have determined that the codon usage preferences of PPV sub-types were mainly affected by natural selection and played an important role in the evolution process. Importantly, the main evolutionary pattern of PPV that appears to be fixed through host transmission. Our research is an effective model for the evolution, origin, and codon bias of various viruses, as well as predicting the trend of dynamic phylogeny, providing a theoretical basis for prevention in clinical practice.
Conceptualization, C.X., H.L., X.X. and N.J.; methodology, C.X.; software, C.X.; validation, H.L. and X.X.; formal analysis, C.X.; investigation, X.Z., Z.H., Y.X. and H.Z.; resources, H.L., C.X. and Y.Z.; data curation, Y.T., Y.Z. and P.Z.; writing—original draft preparation, H.L. and C.X.; writing—review and editing, H.L. and C.X.; visualization, C.X.; supervision, H.L.; project administration, H.L.; funding acquisition, H.L. and N.J. All authors have read and agreed to the published version of the manuscript.
The National Program for Key Basic Research Projects of China (grant number 2018YFD0500104 and 2018YFD0500803) by Chinese Academy of Agricultural Sciences, Changchun Veterinary Research Institute.
In this study, the pig clinical samples used were obtained with the written of the farmer, and experiments were carried out in strict accordance with the Animal Ethics Procedures and Guidelines of the People’s Republic of China.
Not applicable.
The data presented in this study are available in the article and in its online
The authors declare no conflict of interest.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 1. PPV phylogenetic tree based on the nucleotide sequence of all PPV sub-types.
Figure 2. PPV virus structural protein codon mutation rate and skyline diagram (A,B). The codon mutation rate of PPV virus structural protein gene was estimated by Bayesian Markov chain method. The codon mutation rate is the result of BEAST running using Trace analysis. (C) The dynamic study of the genetic diversity of PPV virus structural protein genes by Bayesian skyline diagram. The thick solid line is the median estimate, and the dashed line represents the 95% confidence interval. The abscissa is time, and the ordinate is the effective population size.
Figure 3. (A): CAI scatter plots of structural protein gene from different sub-types of all PPV strains. The asterisk indicates a significant difference between labeled groups. (B): RCDI scatter plots of structural protein gene from different sub-types of all PPV strains. The asterisk indicates a significant difference between labeled groups. (C): FOP scatter plots of structural protein gene from different sub-types of all PPV strains. The asterisk indicates a significant difference between labeled groups (p < 0.001 ****).
Figure 4. The relationship between ENC and GC3 of each sub-type of PPV. (A): PPV2. (B): PPV3. (C): PPV4. (D): PPV5. (E): PPV6. (F): PPV7.
Properties of structural protein genes from PPV strains analyzed in this study (mean value ± SD).
Categories | PPV2 | PPV3 | PPV4 | PPV5 | PPV6 | PPV7 | ALL |
---|---|---|---|---|---|---|---|
%A | 0.231 ± 0.002 | 0.231 ± 0.002 | 0.342 ± 0.001 | 0.345 ± 0.002 | 0.235 ± 0.001 | 0.265 ± 0.005 | 0.272 ± 0.005 |
%C | 0.282 ± 0.002 | 0.282 ± 0.002 | 0.201 ± 0.001 | 0.170 ± 0.001 | 0.210 ± 0.001 | 0.274 ± 0.002 | 0.239 ± 0.004 |
%T | 0.189 ± 0.002 | 0.189 ± 0.002 | 0.248 ± 0.001 | 0.270 ± 0.001 | 0.260 ± 0.001 | 0.155 ± 0.002 | 0.224 ± 0.005 |
%G | 0.298 ± 0.003 | 0.298 ± 0.002 | 0.202 ± 0.001 | 0.212 ± 0.001 | 0.295 ± 0.004 | 0.305 ± 0.006 | 0.264 ± 0.004 |
A3S | 0.212 ± 0.009 | 0.247 ± 0.004 | 0.506 ± 0.004 | 0.522 ± 0.017 | 0.199 ± 0.003 | 0.333 ± 0.017 | 0.329 ± 0.129 |
C3S | 0.440 ± 0.008 | 0.266 ± 0.003 | 0.179 ± 0.002 | 0.194 ± 0.015 | 0.213 ± 0.004 | 0.473 ± 0.009 | 0.299 ± 0.123 |
T3S | 0.202 ± 0.007 | 0.411 ± 0.003 | 0.396 ± 0.003 | 0.434 ± 0.010 | 0.532 ± 0.004 | 0.128 ± 0.004 | 0.347 ± 0.152 |
G3S | 0.380 ± 0.008 | 0.303 ± 0.003 | 0.183 ± 0.003 | 0.150 ± 0.010 | 0.315 ± 0.004 | 0.332 ± 0.023 | 0.284 ± 0.078 |
%G+C | 0.580 ± 0.003 | 0.507 ± 0.001 | 0.410 ± 0.001 | 0.387 ± 0.003 | 0.469 ± 0.001 | 0.580 ± 0.006 | 0.494 ± 0.074 |
GC1S | 0.587 ± 0.003 | 0.593 ± 0.001 | 0.479 ± 0.001 | 0.483 ± 0.002 | 0.540 ± 0.001 | 0.621 ± 0.006 | 0.552 ± 0.060 |
GC2S | 0.473 ± 0.002 | 0.455 ± 0.002 | 0.440 ± 0.002 | 0.379 ± 0.002 | 0.434 ± 0.001 | 0.470 ± 0.003 | 0.444 ± 0.030 |
GC3S | 0.681 ± 0.010 | 0.472 ± 0.003 | 0.311 ± 0.003 | 0.296 ± 0.003 | 0.433 ± 0.003 | 0.648 ± 0.014 | 0.478 ± 0.150 |
GC12S | 0.530 ± 0.002 | 0.524 ± 0.001 | 0.460 ± 0.001 | 0.431 ± 0.001 | 0.487 ± 0.001 | 0.545 ± 0.003 | 0.500 ± 0.040 |
ENC | 51.607 ± 0.891 | 57.433 ± 0.192 | 44.843 ± 0.270 | 45.469 ± 1.007 | 48.045 ± 0.325 | 45.812 ± 0.812 | 48.319 ± 4.073 |
CBI | 0.036 ± 0.012 | −0.027 ± 0.007 | −0.035 ± 0.008 | −0.024 ± 0.005 | −0.053 ± 0.004 | 0.013 ± 0.173 | −0.015 ± 0.068 |
Supplementary Materials
The following supporting information can be downloaded at:
References
1. He, J.; Shehu, A.; Haspel, N.; Chen, B. The 6th Computational Structural Bioinformatics Workshop. BMC Struct. Biol.; 2013; 13, (Suppl. 1), pp. S1-S11. [DOI: https://dx.doi.org/10.1186/1472-6807-13-S1-I1] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24564893]
2. Hueffer, K.; Parrish, C.R. Parvovirus Host Range, Cell Tropism and Evolution. Curr. Opin. Microbiol.; 2003; 6, pp. 392-398. [DOI: https://dx.doi.org/10.1016/S1369-5274(03)00083-3]
3. Dittmar, K.A.; Goodenbour, J.M.; Pan, T. Tissue-Specific Differences in Human Transfer RNA Expression. PLoS Genet.; 2006; 2, E221. [DOI: https://dx.doi.org/10.1371/journal.pgen.0020221]
4. Marin, A.; Bertranpetit, J.; Oliver, J.L.; Medina, J.R. Variation in G + C-Content and Codon Choice: Differences among Synonymous Codon Groups in Vertebrate Genes. Nucleic Acids Res.; 1989; 17, pp. 6181-6189. [DOI: https://dx.doi.org/10.1093/nar/17.15.6181]
5. Hershberg, R.; Petrov, D.A. Selection on Codon Bias. Annu. Rev. Genet.; 2008; 42, pp. 287-299. [DOI: https://dx.doi.org/10.1146/annurev.genet.42.110807.091442]
6. Hu, J.; Zhao, X.; Zhang, Z.; Yu, J. Compositional Dynamics of Guanine and Cytosine Content in Prokaryotic Genomes. Res. Microbiol.; 2007; 158, pp. 363-370. [DOI: https://dx.doi.org/10.1016/j.resmic.2007.02.007]
7. Ikemura, T. Correlation between the Abundance of Escherichia Coli Transfer RNAs and the Occurrence of the Respective Codons in Its Protein Genes: A Proposal for A Synonymous Codon Choice That Is Optimal for the E. Coli Translational System. J. Mol. Biol.; 1981; 151, pp. 389-409. [DOI: https://dx.doi.org/10.1016/0022-2836(81)90003-6]
8. Li, G.; Wang, H.; Wang, S.; Xing, G.; Zhang, C.; Zhang, W.; Liu, J.; Zhang, J.; Su, S.; Zhou, J. Insights into the Genetic and Host Adaptability of Emerging Porcine Circovirus 3. Virulence; 2018; 9, pp. 1301-1313. [DOI: https://dx.doi.org/10.1080/21505594.2018.1492863]
9. Plotkin, J.B.; Kudla, G. Synonymous but not the Same: The Causes and Consequences of Codon Bias. Nat. Rev. Genet.; 2011; 12, pp. 32-42. [DOI: https://dx.doi.org/10.1038/nrg2899] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/21102527]
10. Sueoka, N. Directional Mutation Pressure and Neutral Molecular Evolution. Proc. Natl. Acad. Sci. USA; 1988; 85, pp. 2653-2657. [DOI: https://dx.doi.org/10.1073/pnas.85.8.2653]
11. Grantham, R.; Gautier, C.; Gouy, M.; Mercier, R.; Pave, A. Codon Catalog Usage and the Genome Hypothesis. Nucleic Acids Res.; 1980; 8, pp. R49-R62. [DOI: https://dx.doi.org/10.1093/nar/8.1.197-c]
12. Sharp, P.M.; Matassi, G. Codon Usage and Genome Evolution. Curr. Opin. Genet. Dev.; 1994; 4, pp. 851-860. [DOI: https://dx.doi.org/10.1016/0959-437X(94)90070-1]
13. Wu, H.; Bao, Z.; Mou, C.; Chen, Z.; Zhao, J. Comprehensive Analysis of Codon Usage on Porcine Astrovirus. Viruses; 2020; 12, 991. [DOI: https://dx.doi.org/10.3390/v12090991]
14. Wu, X.M.; Wu, S.F.; Ren, D.M.; Zhu, Y.P.; He, F.C. The Analysis Method and Progress in the Study of Codon Bias. Yi Chuan Hered.; 2007; 29, pp. 420-426. [DOI: https://dx.doi.org/10.1360/yc-007-0420]
15. Chaney, J.L.; Clark, P.L. Roles for Synonymous Codon Usage in Protein Biogenesis. Annu. Rev. Biophys.; 2015; 44, pp. 143-166. [DOI: https://dx.doi.org/10.1146/annurev-biophys-060414-034333] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25747594]
16. Chen, F.; Wu, P.; Deng, S.; Zhang, H.; Hou, Y.; Hu, Z.; Zhang, J.; Chen, X.; Yang, J.R. Dissimilation of Synonymous Codon Usage Bias in Virus-Host Coevolution due to Translational Selection. Nat. Ecol. Evol.; 2020; 4, pp. 589-600. [DOI: https://dx.doi.org/10.1038/s41559-020-1124-7] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32123323]
17. Nasrullah, I.; Butt, A.M.; Tahir, S.; Idrees, M.; Tong, Y. Genomic Analysis of Codon Usage Shows Influence of Mutation Pressure, Natural Selection, and Host Features on Marburg Virus Evolution. BMC Evol. Biol.; 2015; 15, 174. [DOI: https://dx.doi.org/10.1186/s12862-015-0456-4] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26306510]
18. He, W.T.; Bollen, N.; Xu, Y.; Zhao, J.; Dellicour, S.; Yan, Z.; Gong, W.; Zhang, C.; Zhang, L.; Lu, M. et al. Phylogeography Reveals Association between Swine Trade and the Spread of Porcine Epidemic Diarrhea Virus in China and across the World. Mol. Biol. Evol.; 2021; [DOI: https://dx.doi.org/10.1093/molbev/msab364]
19. He, W.T.; Ji, X.; He, W.; Dellicour, S.; Wang, S.; Li, G.; Zhang, L.; Gilbert, M.; Zhu, H.; Xing, G. et al. Genomic Epidemiology, Evolution, and Transmission Dynamics of Porcine Deltacoronavirus. Mol. Biol. Evol.; 2020; 37, pp. 2641-2654. [DOI: https://dx.doi.org/10.1093/molbev/msaa117]
20. Drummond, A.J.; Rambaut, A. Beast: Bayesian Evolutionary Analysis by Sampling Trees. BMC Evol. Biol.; 2007; 7, 214. [DOI: https://dx.doi.org/10.1186/1471-2148-7-214]
21. Puigbo, P.; Bravo, I.G.; Garcia-Vallve, S. Caical: A Combined Set of Tools to Assess Codon Usage Adaptation. Biol. Direct; 2008; 3, 38. [DOI: https://dx.doi.org/10.1186/1745-6150-3-38]
22. Wright, F. The ‘Effective Number of Codons’ Used in A Gene. Gene; 1990; 87, pp. 23-29. [DOI: https://dx.doi.org/10.1016/0378-1119(90)90491-9]
23. Sharp, P.M.; Tuohy, T.M.; Mosurski, K.R. Codon Usage in Yeast: Cluster Analysis Clearly Differentiates Highly and Lowly Expressed Genes. Nucleic Acids Res.; 1986; 14, pp. 5125-5143. [DOI: https://dx.doi.org/10.1093/nar/14.13.5125] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/3526280]
24. Mueller, S.; Papamichail, D.; Coleman, J.R.; Skiena, S.; Wimmer, E. Reduction of the Rate of Poliovirus Protein Synthesis through Large-Scale Codon Deoptimization Causes Attenuation of Viral Virulence by Lowering Specific Infectivity. J. Virol.; 2006; 80, pp. 9687-9696. [DOI: https://dx.doi.org/10.1128/JVI.00738-06] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/16973573]
25. Sharp, P.M.; Li, W.H. The Codon Adaptation Index—A Measure of Directional Synonymous Codon Usage Bias, and Its Potential Applications. Nucleic Acids Res.; 1987; 15, pp. 1281-1295. [DOI: https://dx.doi.org/10.1093/nar/15.3.1281]
26. Morton, B.R. Chloroplast DNA Codon Use: Evidence for Selection at the PSB a Locus Based on TRNA Availability. J. Mol. Evol.; 1993; 37, pp. 273-280. [DOI: https://dx.doi.org/10.1007/BF00175504]
27. Majeed, A.; Kaur, H.; Bhardwaj, P. Selection Constraints Determine Preference For A/U-Ending Codons in Taxus Contorta. Genome; 2020; 63, pp. 215-224. [DOI: https://dx.doi.org/10.1139/gen-2019-0165]
28. Boon, L.; Ugarte-Berzal, E.; Vandooren, J.; Opdenakker, G. Glycosylation of Matrix Metalloproteases and Tissue Inhibitors: Present State, Challenges and Opportunities. Biochem. J.; 2016; 473, pp. 1471-1482. [DOI: https://dx.doi.org/10.1042/BJ20151154]
29. Alkhamis, M.A.; Perez, A.M.; Murtaugh, M.P.; Wang, X.; Morrison, R.B. Applications of Bayesian Phylodynamic Methods in a Recent U.S. Porcine Reproductive and Respiratory Syndrome Virus Outbreak. Front. Microbiol.; 2016; 7, 67. [DOI: https://dx.doi.org/10.3389/fmicb.2016.00067]
30. Chaikhumwang, P.; Tantituvanont, A.; Tripipat, T.; Tipsombatboon, P.; Piriyapongsa, J.; Nilubol, D. Dynamics and Evolution of Highly Pathogenic Porcine Reproductive and Respiratory Syndrome Virus Following Its Introduction into a Herd Concurrently Infected with Both Types 1 And 2. Infect. Genet. Evol.; 2015; 30, pp. 164-174. [DOI: https://dx.doi.org/10.1016/j.meegid.2014.12.025] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25557456]
31. Wu, W.; Ge, X.; Zhang, Y.; Han, J.; Guo, X.; Zhou, L.; Yang, H. Evolutionary Patterns of Codon Usage in Major Lineages of Porcine Reproductive and Respiratory Syndrome Virus in China. Viruses; 2021; 13, 1044. [DOI: https://dx.doi.org/10.3390/v13061044]
32. He, W.; Zhao, J.; Xing, G.; Li, G.; Wang, R.; Wang, Z.; Zhang, C.; Franzo, G.; Su, S.; Zhou, J. Genetic Analysis and Evolutionary Changes of Porcine Circovirus 2. Mol. Phylogenet. Evol.; 2019; 139, 106520. [DOI: https://dx.doi.org/10.1016/j.ympev.2019.106520] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31152778]
33. Sun, J.M.; Zhao, W.; Wang, R.Y.; Zhang, W.Y.; Li, G.R.; Lu, M.; Shao, Y.K.; Yang, Y.C.; Wang, N.N.; Gao, Q. et al. Analysis of the Codon Usage Pattern of Ha and Na Genes of H7n9 Influenza a Virus. Int. J. Mol. Sci.; 2020; 21, 7129. [DOI: https://dx.doi.org/10.3390/ijms21197129]
34. Li, G.; Wang, R.; Zhang, C.; Wang, S.; He, W.; Zhang, J.; Liu, J.; Cai, Y.; Zhou, J.; Su, S. Genetic and Evolutionary Analysis of Emerging H3n2 Canine Influenza Virus. Emerg. Microbes Infect.; 2018; 7, 73. [DOI: https://dx.doi.org/10.1038/s41426-018-0079-0] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29691381]
35. Wong, E.H.; Smith, D.K.; Rabadan, R.; Peiris, M.; Poon, L.L. Codon usage bias and the evolution of influenza A viruses. Codon Usage Biases of Influenza Virus. BMC Evol. Biol.; 2010; 10, 253. [DOI: https://dx.doi.org/10.1186/1471-2148-10-253] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/20723216]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Porcine parvovirus (PPV) is the main pathogen of reproductive disorders. In recent years, a new type of porcine parvovirus has been discovered and named porcine parvovirus 2 to 7 (PPV2–PPV7), and it is associated with porcine circovirus type 2 in pigs. Codon usage patterns and their effects on the evolution and host adaptation of different PPV sub-types are still largely unknown. Here, we define six main sub-types based on the Bayesian method of structural proteins of each sub-type of PPV, including PPV2, PPV3, PPV4, PPV5, PPV6, and PPV7, which show different degrees of codon usage preferences. The effective number of codons (ENC) indicates that all PPV sub-types have low codon bias. According to the codon adaptation index (CAI), PPV3 and PPV7 have the highest similarity with the host, which is related to the main popular tendency of the host in the field; according to the frequency of optimal codons (FOP), PPV7 has the highest frequency of optimal codons, indicating the most frequently used codons in its genes; and according to the relative codon deoptimization index (RCDI), PPV3 has a higher degree. Therefore, it is determined that mutational stress has a certain impact on the codon usage preference of PPV genes, and natural selection plays a very decisive and dominant role in the codon usage pattern. Our research provides a new perspective on the evolution of porcine parvovirus (PPV) and may help provide a new method for future research on the origin, evolutionary model, and host adaptation of PPV.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Changchun 130117, China;
2 Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Changchun 130117, China;
3 College of Veterinary Medicine, Jilin University, Changchun 130062, China;
4 Institute of Specialty, Chinese Academy of Agricultural Sciences, Changchun 130117, China;
5 Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Changchun 130117, China;