1. Introduction
Horses occupy a distinctive role in modern society, having transitioned significantly from their historical function as traditional working animals to their current roles in sports and recreation. Among contemporary equestrian activities, horse racing stands out as a globally popular sport, attracting widespread attention and participation [1]. Horse racing, typically involving jockeys competing on designated tracks, represents a major industry in at least 71 countries, with approximately 500,000 racehorses actively competing and offering over EUR 3.3 billion in prize money globally [2]. Internationally renowned events such as the Melbourne Cup, Kentucky Derby, and Japan Cup are highly celebrated [3,4,5]. Additionally, global horse racing betting generates annual revenues exceeding 115 billion EUR, making substantial contributions to local economic development, job creation, and the enrichment of recreational activities [5]. To achieve outstanding performance in specific types of competitions, various horse breeds have been gradually developed through selective breeding to meet the evolving demands of racing and athletic performance.
After more than 200 years of rigorous selective breeding, Thoroughbreds have become the most valuable and widely regarded elite racehorse breed globally. They are renowned for their exceptional athletic performance. In their early development, the enthusiasm for horse racing among British aristocrats significantly contributed to the refinement of this breed [6]. Thoroughbreds are celebrated for their outstanding athletic phenotype, excelling particularly in speed-based races due to their finely honed genetic and physical traits [7]. The Mongolian horse is one of the oldest existing horse breeds, renowned for its exceptional endurance, ability to thrive on coarse feed, and resistance to diseases [8]. As an important indigenous horse breed resource in China, Mongolian horses are frequently utilized as foundation stock for horse improvement programs. Among these, the Xilingol horse is a new breed developed through 35 years of targeted breeding, with the Mongolian horse serving as the maternal breed [9]. Since 1995, Thoroughbreds have been incorporated into the Xilingol horse breeding program for hybrid vigor. After nearly 30 years of meticulous breeding, significant progress has been achieved. The resulting offspring have been provisionally named the “Grassland-Thoroughbred” and are colloquially referred to as “Sunflower Horses” due to the distinctive sunflower-shaped markings on their hindquarters. As the genetic improvement of horses continues to advance, research is expanding from traditional traits such as body type and speed to genomic-level exploration. Genomic variations are increasingly recognized as crucial factors determining horses’ environmental adaptability and athletic ability.
In recent years, with the advent of genome resequencing technology, it has become possible to identify selection traits across the entire genome [10]. This innovative sequencing approach enables the detection of complete genomic information in individuals, leading to significant advancements in the study of racehorse performance. Racing ability is a complex trait determined by the interaction of multiple genes. For instance, the MYPN gene, which encodes myopalladin and is mainly expressed in skeletal and cardiac muscle, plays a crucial role in regulating actin organization and has undergone positive selection in horses [11,12]. The athletic potential of racehorses depends not only on favorable environmental conditions but also on optimal DNA variation combinations at loci that significantly influence athletic ability. A genome-wide SNP association study revealed that a sequence variation (g.66493737C>T) in the MSTN (myostatin) gene is the most effective indicator for predicting the optimal race distance for Thoroughbreds [13]. Additionally, Studies have also found that young Spanish trotters carrying the GG genotype of the PDK4 gene and the AA genotype of the DMRT3 gene perform exceptionally well in short-distance races, while horses with the AA genotype of DMRT3 and the CC genotype of CKM excel in long-distance races. In adult horses, the AA genotype of DMRT3 can enhance both speed and endurance [14]. SNPs and haplotypes in the EDN3 gene region are significantly associated with racing performance. The T-allele is more frequent in racing breeds, while the CC genotype predominates in smaller horses and draft horses [15]. These findings indicate that genome resequencing is a comprehensive and reliable method for exploring important regions and functional genes associated with athletic traits. It provides a robust theoretical foundation and technical support for genetic improvement of racehorse performance.
This study systematically analyses the genetic variation between the racehorse and non-racehorse populations of Grassland-Thoroughbreds, with a particular focus on identifying SNP loci significantly associated with athletic performance. The findings offer valuable references for the genetic improvement and scientifically informed breeding of racehorse breeds. By optimizing breeding strategies, it is possible not only to significantly enhance the athletic performance of racehorses but also to promote the high-quality development of China’s horse racing industry. Additionally, this research contributes to strengthening the genetic diversity of Chinese horse breeds, contributing to the protection and sustainable utilization of equine resources.
2. Materials and Methods
2.1. Sample Collection
The Grassland-Thoroughbreds used in this study are offspring derived from multiple generations of crossbreeding between Xilingol horses and Thoroughbreds. These samples were obtained from Inner Mongolia Grassland-Thoroughbred Breeding Company Limited, which maintains complete pedigree records and breeding lineage information for the hybrid horses (Supplementary Figure S1). Based on the pedigree data, individuals with close genetic relationships were excluded to reduce genetic relatedness among samples. The experimental subjects consisted of two groups of Grassland-Thoroughbreds: a non-racing group (NR group, n = 30), whose members had no professional training or competition experience; and a racing group (RH group, n = 30), composed of individuals with consistent competitive performance. The performance of the RH group horses was evaluated through multiple regional competitions, and high-level racehorses achieving excellent rankings in multiple races were ultimately selected (Supplementary Table S1). Blood samples were collected from the jugular vein of each horse by professional technicians and placed into anticoagulant tubes containing EDTA. The samples were then aliquoted, rapidly frozen in liquid nitrogen, and stored at −80 °C to ensure sample integrity and stability for subsequent experiments.
2.2. Library Construction and Sequencing
Genomic DNA was first extracted from all blood samples using the CTAB method and subsequently used for DNA library preparation [16]. Sequencing libraries were then prepared following the manufacturer’s instructions using the NEBNext® Ultra™ DNA Library Prep Kit for Illumina (NEB, San Diego, CA, USA), with unique index adapters ligated to each sample [17]. The purified libraries were assessed for quality and fragment size distribution using the Agilent 5400 system (Agilent, Santa Clara, CA, USA) to ensure they met the sequencing criteria. Libraries that passed quality control were subjected to paired-end sequencing (PE150) on the Illumina HiSeq 4000 platform (Novogene Co., Ltd., Beijing, China), generating high-quality sequencing data.
2.3. Data Quality Control
Firstly, the raw sequencing data were subjected to quality control using fastp (version 0.20.0) to ensure reliable reads for subsequent analyses. The quality control criteria included (1) removing reads with ≥5% unidentified nucleotides (N); (2) removing reads with >20% bases having a Phred quality score less than 15; (3) removing reads with adapter sequences, allowing ≤3% mismatches; and (4) removing reads shorter than 15 bases [18]. After cleaning, the processed data were evaluated using the FastQC (version 0.11.5) tool to verify their quality and suitability for subsequent analyses [19].
2.4. Variant Detection
The clean data were initially aligned to the horse reference genome (
To compare Indels between racing and non-racing horses and identify unique mutations, the raw Indels were obtained using the SelectVariants module. Quality control conditions for Indels were defined as follows: QD < 2.0, QUAL < 30.0, FS > 200.0, ReadPosRankSum < −20.0.
2.5. Principal Component Analysis
Based on the identified SNP data, PCA was conducted using PLINK (version 1.9) software to investigate the genetic structure of the racing and non-racing horse populations. The distribution of the first two principal components (PCs) was visualized using the ggplot2 package in R (version 4.4.1). The PIHAT (Proportion of Identical by Descent) values were estimated using the --genome command in PLINK software to calculate the homozygosity proportion between horses. Closely related individuals were removed (PIHAT value > 0.3).
2.6. Whole-Genome Selection Signal Analysis
Combining multiple methods of analysis can enhance the accuracy and sensitivity of genetic selection signals, partially addressing the limitations of small sample sizes. When multiple analysis methods indicate the same genetic variation signals, the authenticity of these signals can be confirmed with greater confidence [22,23]. Based on different principles, this study employed four selection signal detection methods to screen and analyze target regions and candidate genes.
Due to the susceptibility of single-point SNP scanning methods to interference from factors such as genetic drift, we employed a sliding window calculation strategy to enhance sensitivity to target signals and reduce the probability of false positives [24]. To detect genomic regions under positive selection between racing and non-racing horses, four test methods were used to analyze selection features. First, Fst (Fixation index) is an indicator of genetic differentiation between two populations and is widely applied to identify selection signals across the genome [25,26]. Second, the π ratio assesses selective pressure in genomic regions by comparing the nucleotide diversity (π) ratio between racing and non-racing populations to scan the genome for positively selected regions. Fst and π ratio values were computed using VCFtools (version 0.1.15) [27], with parameters set as follows: -fst-window-size (--window-pi) 100,000 and -fst-window-step (--window-pi-step) 10,000, followed by the calculation of the π ratio [28]. In both analyses, the top 5% of sites were extracted as candidate SNPs. Combining Fst and π ratio screenings allows for the identification of stronger selection signals and the further targeting of candidate genes.
Additionally, the Integrated Haplotype Score (iHS) and Cross-population Extended Haplotype Homozygosity (XP-EHH) were employed to identify positive selection signals within populations through haplotype structure analysis and are widely applied in livestock selection studies [29,30]. Under selection, the frequency of preferred alleles increases rapidly, forming long haplotype regions with low polymorphism, which are significantly distinct from regions with high polymorphism and low linkage disequilibrium in non-selected areas [31]. In this study, BEAGLE (version 5.1) was used with default settings to impute missing alleles for all individuals of Grassland-Thoroughbred and to infer haplotype phases [32]. The selscan (version 1.2.0) tool was employed to analyze selection signals by calculating the iHS (selscan --ihs) and XP-EHH (selscan --xpehh) values between two populations. Additionally, the norm module of selscan was further employed to standardize iHS (norm --ihs --bp-win --winsize 100000) and XP-EHH (norm --xpehh --bp-win --winsize 100000) scores. This study further calculated the average |iHS| and XP-EHH scores for non-overlapping windows of 100 kb across the entire genome of autosomal chromosomes [33,34]. We adopted the top 5% regions as the screening criterion, following a common empirical approach in selection signal studies.
2.7. Detection and Annotation of Candidate Genes
In this study, genome data were analyzed using the Fst, π ratio, iHS, and XP-EHH methods; SNPs falling within the top 5% of each selection metric were designated as “outlier loci”. Candidate regions associated with racehorse selection were defined as those identified by at least three of the four methods. Subsequently, genes containing these outlier loci were identified based on the annotation information of the EquCab 3.0 reference genome. Finally, a Venn diagram was constructed to visualize the overlap among the candidate gene sets identified by the Fst, π ratio, iHS, and XP-EHH methods.
2.8. Functional Enrichment Analysis
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses provide critical insights into the regulatory mechanisms of genes associated with traits like equine athletic performance, offering valuable directions for future research. To elucidate gene functions and their regulatory mechanisms, we performed functional enrichment analyses. Using the online tool DAVID (
3. Results
3.1. Sequencing and Detection of SNPs and Indels
This study conducted high-depth genome re-sequencing on 60 horses, with an average individual genome sequencing depth of 25.63X. Alignment to the reference genome achieved 99.19% coverage (Supplementary Table S2). A total of 23,929,853 SNPs were identified (Figure 1A), the majority of which were located in intergenic regions and intronic regions, with relatively few in exonic regions. Analysis revealed 161,824 missense mutations and 116,897 synonymous mutations in protein-coding genes. The quality of the SNPs was assessed by calculating the transition-to-transversion ratio (Ti/Tv), which yielded a Ti/Tv ratio of 2.06 (Supplementary Table S3). Following stringent filtering, a total of 6,361,002 high-quality SNPs were retained. Genome-wide distribution analysis showed that SNP density was highest on Chr 1 and 3, and lowest on Chr 31 (Figure 1B). Additionally, this study identified 768,590 Indels, with the number of deletions (n = 434,710) slightly higher than insertions (n = 333,880) (Figure 1C). These Indels were distributed across the entire genome but exhibited uneven distribution, with the highest number of Indels located on Chr 1 (n = 55,984). Functional annotation using the ANNOVAR (
3.2. Population Structure Analysis
PCA revealed that the first two principal components partially explained the genetic variation observed between the groups. Despite some overlap in the PCA distributions of racehorses and non-racehorses, a general trend of separation was observed, suggesting differences in genetic structure between the two groups (Figure 1D).
3.3. Genome-Wide Selective Signature Detection
Based on the SNP dataset, selection signals associated with traits influencing racing performance were analyzed using four approaches: Fst, π ratio, iHS, and XP-EHH. After filtering for the top 5% overlapping windows, a set of candidate genes were identified (Figure 2B). Specifically, Fst and π ratio analyses identified 11,294 regions (Fst = 0.037, π ratio = 1.24), involving 2352 and 2498 genes, respectively. The iHS and XP-EHH methods, after normalization, identified 1139 regions (|iHS| = 2.86, XP-EHH = 0.99), involving 1164 and 1035 genes, respectively. Using Venn diagram analysis, genes identified shared by at least three of the four methods as candidate genes, resulting in a final set of 215 candidate genes (Figure 2A) (Supplementary Table S4).
For the analysis of the Indels dataset, the Fst method was applied to identify the top 1% of Indels loci, resulting in 2256 candidate regions (Fst = 0.06) encompassing 661 genes (Figure 3A). The 215 candidate genes derived from the SNP analysis were further integrated with the 661 candidate genes from the Indels analysis (Supplementary Table S5), yielding 84 overlapping genes. These findings suggest that certain genes may play pivotal roles in the selection process underlying racing performance traits, as identified across two independent datasets and analytical methods.
3.4. Candidate Gene Enrichment Analysis
Functional enrichment analysis of candidate genes identified from the SNP and Indels datasets was performed using GO and KEGG to elucidate their potential biological functions. Multiple categories were statistically significant (p < 0.05). GO enrichment analysis of SNP-derived candidate genes revealed significant enrichment in four biological processes, four cellular components, and six molecular functions. KEGG enrichment identified four critical pathways (Figure 2C). The GO terms were predominantly associated with ion transport, synaptic function, signal transduction, and regulation (Supplementary Table S6), including copper ion transmembrane transport (GO:0035434), dendrite membrane (GO:0032590), protein binding (GO:0005515), and acetylcholine receptor binding (GO:0033130). KEGG pathways were primarily linked to Neuroactive ligand-receptor interaction (ecb04080), Motor proteins (ecb04814), Insulin signaling pathway (ecb04910), and Chronic myeloid leukemia (ecb05220). These pathways influence athletic performance by affecting neural signaling, muscle movement mechanisms, metabolic regulation, and cellular adaptation (Supplementary Table S7).
GO and KEGG enrichment analyses of Indels-derived candidate genes were conducted to investigate the functional impact of mutations in racing and non-racing populations. Approximately 60% of the candidate genes were successfully mapped to GO terms, with the majority enriched in biological processes (Figure 3B). Specifically, 26 biological processes, 13 cellular components, and 14 molecular functions were identified (p < 0.05). GO enrichment highlighted functions related to immunity, energy metabolism, and neural regulation (Supplementary Table S8), including ATP binding (GO:0005524), Actin binding (GO:0003779), and Lipid catabolic process (GO:0016042). These functions collectively provide the energy required for physical activity, enhancing endurance and exercise efficiency. KEGG enrichment analysis (Supplementary Table S9) identified 40 key pathways, predominantly related to disease and immune regulation. Pathways closely associated with athletic performance were primarily focused on Motor proteins (ecb04814), the Cytoskeleton in muscle cells (ecb04820), the FoxO signaling pathway (ecb04068), and Fat digestion and absorption (ecb04975). These pathways play crucial roles in energy production, vascular function, and tissue maintenance, all of which are essential for sustaining physical activity and enhancing athletic performance.
The PPI network suggests that there may be cooperative regulatory functions among these genes (Figure 4). Notably, genes such as FOXO1, PPARGC1A, and PRKG1 are located in the network, and these genes play potential key roles in biological processes related to energy metabolism, oxidative stress response, and muscle function. This network provides a preliminary interaction-based perspective on the molecular mechanisms underlying athletic performance in Grassland-Thoroughbreds.
4. Discussion
Recent advancements in high-throughput sequencing technologies have enabled the identification of numerous functional genes associated with economically important traits in livestock through selection signal analysis, offering valuable genetic resources for molecular breeding. This study represents the first genomic analysis of locally bred racing breeds, achieving a sequencing coverage of 99.19%, which ensures the accuracy of data analysis. The average sequencing depth reached 25.63, further enhancing the reliability of variant annotation, mutation site identification, and genome structure research. The calculated TS/TV ratio was 2.06, comparable to the ratios observed in human and cattle re-sequencing studies, which were 2.1 and 2.2, respectively. This underscores the high data quality and conserved genome structure, providing a reliable foundation for detecting selection signals [36,37]. The findings of this study not only provide insights for predicting traits related to athletic performance but also establish a scientific basis for breed improvement. These results support the selection and breeding of individuals with superior athletic performance and provide robust data support for the genomic analysis of Grassland-Thoroughbred.
Artificial selection induces significant genetic differentiation in specific genomic regions, influencing racehorse performance. However, racing performance is influenced not only by genetic factors but also by environmental conditions and complex gene interactions [38]. Despite the complexity of this process, the detection of selection signals effectively identifies genes or regulatory genomic elements associated with evolutionary adaptation and essential physiological traits [39,40,41]. Skeletal muscle, a highly plastic tissue, responds to exercise and training stimuli by enhancing athletic performance through adjustments in muscle mass, changes in muscle fiber type composition, and optimization of mitochondrial function. This study identified several key muscle function-related genes through selection analysis. KEGG enrichment highlighted two athletic-performance-associated genes within the insulin signaling pathway: PPARGC1A and FOXO1. Among these, PPARGC1A is a widely recognized gene pivotal to equine athletic performance [42]. PPARGC1A encodes PGC-1α in skeletal muscle, which regulates glucose transport during exercise and plays an essential role in promoting mitochondrial biogenesis and adaptation to aerobic training [43,44]. When intense exercise increases energy demands, PGC-1α is activated and collaborates with other transcription factors to regulate the expression of genes involved in insulin signaling, angiogenesis, lipid metabolism, and carbohydrate metabolism [45,46]. Studies indicate that during the recovery phase following exercise, PPARGC1A expression and PGC-1α protein levels increase significantly, potentially promoting mitochondrial biogenesis [47,48,49]. Additionally, experiments have demonstrated that in untrained muscle, PPARGC1A expression significantly increases following single-leg knee-extension exercises, likely regulated by systemic factors via its canonical promoter. In contrast, trained muscle exhibits a more sustained expression of PPARGC1A [49,50,51]. The Gly482Ser polymorphism is regarded as a functional variant of PPARGC1A, potentially influencing the differentiation between power- and endurance-oriented athletes. Compared with carriers of the Gly482 allele, individuals with the Ser482 allele generally exhibit lower low-density lipoprotein levels and higher cholesterol levels [52]. The Ser allele frequency is lower among elite endurance and power athletes [53,54]. However, other studies suggest that the Ser allele might benefit power-based sports performance [55]. Regardless of the type of exercise, the Gly allele is generally considered advantageous for endurance sports [56,57]. These findings provide valuable insights into equine sports genetics, suggesting that PPARGC1A polymorphisms may serve as targets for optimizing equine athletic performance through gene editing or selective breeding. PPARGC1A may enhance the aerobic metabolic capacity of muscle cells in Grassland-Thoroughbreds by promoting mitochondrial biogenesis, enabling them to produce and utilize energy more efficiently during prolonged high-intensity exercise. This knowledge lays a scientific foundation for improving racehorse breeding and enhancing athletic outcomes. FOXO1 (Forkhead box O1) is a conserved transcription factor across species. Research has demonstrated that FOXO1 synergizes with PGC-1α, playing a critical role in insulin-mediated glucose and lipid metabolism, while also regulating mitochondrial oxidative stress, autophagy, and overall metabolism [58,59,60,61,62]. Exercise induces an increase in FOXO1 phosphorylation, enhancing its activity and potentially triggering cardiac hypertrophy, a process demonstrated to exert protective effects on the heart [63,64]. Moreover, FOXO1 acts as a metabolic switch, promoting a shift towards lipid metabolism [65]. Studies indicate that the mRNA levels of FOXO1 in muscle are significantly higher in athletes compared with sedentary individuals, highlighting its importance in energy adaptation during physical activity [66]. FOXO1 is therefore a critical component of the adaptive metabolic response to exercise [63]. Consistently, this study revealed that PPARGC1A and FOXO1 exhibited similar expression patterns in the skeletal muscle of Thoroughbreds, further supporting their roles in energy metabolism and exercise adaptation [67]. FOXO1 may help Grassland-Thoroughbreds achieve dynamic energy metabolic balance during exercise by regulating glucose and lipid metabolism throughout racing activities. Additionally, the SGCD gene encodes δ-sarcoglycan, a protein widely expressed in cardiac and skeletal muscles. It is a vital component of the dystrophin-associated glycoprotein complex [68]. Loss of SGCD function leads to muscle dysfunction and may progress to cardiomyopathy characterized by localized fibrosis [69]. δ-sarcoglycan deficiency, caused by mutations in the SGCD gene, results in muscle cell damage and degeneration, impairing regenerative capacity and compromising the structural integrity of muscle fiber types [70]. Studies in Drosophila models have demonstrated that SGCD deficiency causes muscle tearing and reduced climbing ability [71]. In mice, the absence of δ-sarcoglycan leads to mitochondrial swelling, disrupted cristae structure, loss of membrane integrity, and diminished Ca2+ retention capacity. These alterations are accompanied by reduced oxidative capacity and significantly decreased exercise endurance [72]. SGCD may play an important role in maintaining the structural integrity of skeletal and cardiac muscle cells in Grassland-Thoroughbreds and function to prevent muscle damage and degeneration during exercise, thereby ensuring efficient muscle contraction.
The athletic performance of horses is not only constrained by physiological factors but also by psychological and motivational aspects. Genes involved in neurobiological functions frequently appear in transcriptomic and genomic studies investigating equine performance [39]. Among these, the FOXP2 (Forkhead Box P Subfamily 2) is a highly conserved transcription factor in vertebrates that regulates the development of certain brain functions [73]. Mouse studies have demonstrated that reduced levels of FOXP2 lead to impaired synaptic plasticity and diminished learning ability, accompanied by abnormalities in motor coordination and sensory functions. Conversely, restoring FOXP2 expression improves motor coordination and sensory skills [74,75]. FOXP2 may enhance the neural circuits in the brain related to motor control and sensory processing, thereby improving the coordination and agility of Grassland-Thoroughbreds, which is crucial for precise movements and responses during high-speed racing. Similarly, PRKG1 (cGMP-dependent protein kinase 1), which is widely expressed in the nervous system, is considered fundamental to neuroplasticity and learning ability [76]. Under high-intensity interval exercise, PRKG1 expression significantly increases, while the expression of PDE5 (phosphodiesterase 5) decreases. PDE5 phosphorylation activates its enzymatic activity, enhancing its affinity for cGMP within the regulatory domain and reducing intracellular cGMP levels [77]. These findings suggest that neurobiology-related genes, such as FOXP2 and PRKG1, are essential for neural function and behavioral regulation, with potential applications in optimizing equine athletic performance and training adaptability.
Physical exercise significantly elevates the energy demand of skeletal muscle, inducing mitochondrial stress and triggering adaptive responses—hallmarks of exercise training [78]. SLC25A15 (solute carrier family 25 member 15), an essential member of the mitochondrial carrier family, functions as a mitochondrial ornithine carrier and is integral to arginine synthesis. Studies have demonstrated that during training in male mice, the expression level of SLC25A15 consistently increases in subcutaneous fat [79]. Arginine can activate AMPK to promote lipolysis, thereby enhancing metabolic efficiency [80]. Furthermore, SLC25A15 and ornithine contribute to brain energy metabolism homeostasis, cellular ATP transport, and inflammation regulation [81,82]. CKMT2 (mitochondrial creatine kinase), located in the mitochondrial intermembrane space, catalyzes the rapid hydrolysis of ATP to ADP, facilitating ATP synthesis and supporting mitochondrial respiratory chain activity [83]. Exercise training significantly increases CKMT2 expression in mice, improving maximum respiration rate and ADP utilization efficiency. This indicates that training-induced mitochondrial functional adaptation in skeletal muscle depends on the intensity of the metabolic load [84]. TRAP1 (TNF receptor-associated protein 1), a mitochondrial molecular chaperone, regulates the conformation, activity, and stability, thereby modulating cellular metabolism and signaling pathways [85]. TRAP1 directly interacts with components of the mitochondrial respiratory chain, and its expression level is closely linked to cellular respiratory capacity and glycolysis [86]. In studies related to soccer, TRAP1 was found to participate in autophagy–lysosome systems and proteasome-mediated protein degradation mechanisms, correlating closely with improved cardiorespiratory function [87,88]. These genes may play important roles in energy supply, endurance, and recovery capacity during competitions in Grassland-Thoroughbreds.
Indels, a type of biallelic genomic variation, are small-scale and widely distributed across the genome, making them a common form of natural genetic variation. As the second most prevalent genomic variation following SNPs [89], Indels are widely employed in human identity verification due to their relatively low and stable mutation rate, which provides an advantage over SNPs and short tandem repeats [90,91]. In this study, both SNPs and Indels were utilized for Fst analysis, revealing largely consistent selective feature regions identified by the two markers. This consistency aligns with findings in other species; for instance, overlapping selective signals between SNPs and Indels have been reported in humans and chickens [92,93]. In equine studies, previous research has linked Indels to various traits, including optimal racing distances, muscle mass [94,95], and coat color [96]. In this research, Indels were also employed to identify selective features, demonstrating that regions surrounding Indels were more frequently subject to recent selective sweeps. Additionally, key genes including FOXO1, SLC25A15, CKMT2, FOXP2, and TRAP1 were found to be influenced by both Indels and SNPs. These genes are enriched in functional pathways such as the FoxO signaling pathway, nuclear function-related pathways, and ATP-binding pathways, which collectively contribute to metabolic regulation, antioxidation, autophagy, muscle adaptation, and energy supply mechanisms. These pathways play critical roles in optimizing athletic performance, particularly in endurance, explosiveness, and recovery [97,98]. The findings underscore the utility of Indel variants as effective tools for analyzing selective traits, while highlighting the significant biological functions of genes influenced by both Indels and SNPs in athletic performance. Nevertheless, further research is required to elucidate the precise mechanisms of Indels and their potential value in selection analysis. Such research is particularly important for validating their consistency across diverse species and environmental contexts. Expanding knowledge in this area could broaden the applications of Indels in genetic breeding and genomic studies.
This study systematically screened candidate genes related to athletic performance in the Grassland-Thoroughbred population, such as PPARGC1A, FOXO1, and SGCD, through whole-genome resequencing, and revealed their potential roles in muscle function. Nevertheless, due to the lack of measurements of key performance indicators and direct validation of gene functions, the genotype–phenotype association analysis in this study still has certain limitations. The relatively small sample size may affect the statistical power of the results and the generalizability of the conclusions. Future studies should increase the sample size to enhance the representativeness and reliability of the research. Secondly, although efforts were made to exclude close relatives through population structure analysis, potential population stratification may still exist, which could introduce bias into the genetic association analysis. Future work needs to collect more athletic performance phenotype data and further explore the genetic basis underlying phenotypic differences in horses. These factors provide opportunities for further exploration in future research. This study provides preliminary genomic resources and candidate gene targets for breed improvement, laying the foundation for future gene function validation, phenotype association studies, and the development of breeding strategies.
5. Conclusions
This study conducted a comprehensive analysis of genomic variation in the Grassland-Thoroughbred using whole-genome sequencing (WGS) data and provides preliminary evidence for how these variations may impact racing performance. By employing four different selection signal detection methods, the study identified genetic variations associated with athletic performance, which may contribute to the evolution of adaptive traits crucial for success in horse racing. This study highlighted key candidate genes such as PPARGC1A, FOXO1, SGCD, FOXP2, PRKG1, SLC25A15, CKMT2, and TRAP1, which play critical roles in muscle function, metabolism, sensory regulation, and neurobiology. While these genes show potential in shaping the racing phenotype, further functional validation experiments are needed to confirm their specific roles in athletic performance. The results provide valuable initial insights for genomic improvement of racing horse populations and lay the foundation for future exploration of the molecular mechanisms underlying equine athletic performance.
Conceptualization, D.B. and W.D.; formal analysis, W.D., L.S. and X.S.; software, W.G. and T.B.; investigation, Z.L., H.W. and Y.L.; visualization, W.D. and W.G.; writing—original draft preparation, W.D. and W.G.; writing—review and editing, W.D.; resources, D.B. and M.D.; project administration, D.B. and M.D.; funding acquisition, D.B. and M.D. All authors have read and agreed to the published version of the manuscript.
All sampling procedures were approved by the Laboratory Animal Welfare and Ethics Committee of the Inner Mongolia Agricultural University and were conducted in accordance with the prescribed standards. The animal experiments were approved by the Animal Management and Ethics Committee of the College of Animal Science at Inner Mongolia Agricultural University (Permit No: NND2023085), and all experimental procedures were conducted in strict accordance with the relevant laboratory regulations and guidelines. This study adhered to the ARRIVE 2.0 guidelines for reporting animal research.
Not applicable.
Sequence data that support the findings of this study have been deposited in the National Center for Biotechnology Information with the primary accession code PRJNA11630453 and PRJNA1063377.
The authors declare no conflicts of interest.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 1 The quantity and distribution of SNPs, Indels, and population structure: (A) Annotation of genome-wide SNPs according to ANNOVAR. (B) The SNP density across the whole genome was estimated in each 1 Mb genome block. (C) Statistical analysis of insertion and deletion types of Indels. (D) Principal component analysis between racing and non-racing horses.
Figure 2 Multiple selection sweep analysis: (A) Venn diagram showing the SNPs overlap among Fst, π ratio, his, and XP-EHH. (B) Calculation of Fst, π-Ratio, iHS, and XP-EHH values for all autosomes of racing horses using a 100 Kb sliding window with a 10 Kb step size. The green horizontal line indicates the threshold for extracting the top 5% of values for racing horses. (C) GO and KEGG pathway enrichment analysis based on SNP candidate genes. The top 5 enriched terms, sorted by corrected p < 0.05, are shown below.
Figure 3 Candidate gene function enrichment analyses: (A) Manhattan plot of genome-wide putative selection regions between racing and non-racing horses (Indels dataset). The green horizontal line represents the top 1% of Fst values. (B) GO and KEGG pathway enrichment analysis based on Indels candidate genes.
Figure 4 Functional interaction network of candidate genes.
Supplementary Materials
The following supporting information can be downloaded at
1. DuBois, C.; Nakonechny, L.; Derisoud, E.; Merkies, K. Examining Canadian Equine Industry Participants’ Perceptions of Horses and Their Welfare. Animals; 2018; 8, 201. [DOI: https://dx.doi.org/10.3390/ani8110201]
2. Authorities IFoHR. IFHRA Annual Report 2019; International Federation of Horse Racing Authorities: Boulogne, France, 2019.
3. Chien, P.M. Council for Australasian Tourism and Hospitality Education. CAUTHE 2014: Tourism and Hospitality in the Contemporary World: Trends, Changes and Complexity; The University of Queensland: Brisbane, Australia, 2014.
4. Worthington, A.C. National exuberance: A note on the Melbourne Cup effect in Australian stock returns. Econ. Pap. A J. Appl. Econ. Policy; 2007; 26, pp. 170-179. [DOI: https://dx.doi.org/10.1111/j.1759-3441.2007.tb01014.x]
5. Narayan, P.K.; Smyth, R. The race that stops a nation: The demand for the Melbourne Cup. Econ. Rec.; 2004; 80, pp. 193-207. [DOI: https://dx.doi.org/10.1111/j.1475-4932.2004.00172.x]
6. Cassidy, R. The Sport of Kings: Kinship, Class and Thoroughbred Breeding in Newmarket; Cambridge University Press: Cambridge, UK, 2002.
7. Kay, J.; Vamplew, W. Encyclopedia of British horse Racing; Routledge: London, UK, 2012.
8. Davis, M. When Things Get Dark: A Mongolian Winter’s Tale; Macmillan: London, UK, 2010.
9. Qi, B. Xilin Gol League Animal Husbandry Chronicle; Inner Mongolia People’s Publishing House: Hohhot, China, 2002.
10. Sackton, T.B. Studying Natural Selection in the Era of Ubiquitous Genomes. Trends Genet.; 2020; 36, pp. 792-803. [DOI: https://dx.doi.org/10.1016/j.tig.2020.07.008] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32800625]
11. Orlando, L.; Ginolhac, A.; Zhang, G.; Froese, D.; Albrechtsen, A.; Stiller, M.; Schubert, M.; Cappellini, E.; Petersen, B.; Moltke, I. Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature; 2013; 499, pp. 74-78. [DOI: https://dx.doi.org/10.1038/nature12323]
12. Otey, C.A.; Rachlin, A.; Moza, M.; Arneman, D.; Carpen, O. The palladin/myotilin/myopalladin family of actin-associated scaffolds. Int. Rev. Cytol.; 2005; 246, pp. 31-58.
13. Hill, E.W.; McGivney, B.A.; Gu, J.; Whiston, R.; Machugh, D.E. A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses. BMC Genom.; 2010; 11, 552. [DOI: https://dx.doi.org/10.1186/1471-2164-11-552]
14. Negro Rama, S.; Valera, M.; Membrillo, A.; Gómez, M.D.; Solé, M.; Menendez-Buxadera, A.; Anaya, G.; Molina, A. Quantitative analysis of short- and long-distance racing performance in young and adult horses and association analysis with functional candidate genes in Spanish Trotter horses. J. Anim. Breed. Genet.; 2016; 133, pp. 347-356. [DOI: https://dx.doi.org/10.1111/jbg.12208]
15. Jäderkvist Fegraeus, K.; Velie, B.D.; Axelsson, J.; Ang, R.; Hamilton, N.A.; Andersson, L.; Meadows, J.R.S.; Lindgren, G. A potential regulatory region near the EDN3 gene may control both harness racing performance and coat color variation in horses. Physiol. Rep.; 2018; 6, e13700. [DOI: https://dx.doi.org/10.14814/phy2.13700]
16. Thomas, J.C.; Khoury, R.; Neeley, C.K.; Akroush, A.M.; Davies, E.C. A fast CTAB method of human DNA isolation for polymerase chain reaction applications. Biochem. Educ.; 1997; 25, pp. 233-235. [DOI: https://dx.doi.org/10.1016/S0307-4412(97)00122-2]
17. Mazuet, C.; Legeay, C.; Sautereau, J.; Bouchier, C.; Criscuolo, A.; Bouvet, P.; Trehard, H.; Jourdan Da Silva, N.; Popoff, M. Characterization of Clostridium Baratii Type F Strains Responsible for an Outbreak of Botulism Linked to Beef Meat Consumption in France. PLoS Curr.; 2017; 9.
18. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. Fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics; 2018; 34, pp. i884-i890. [DOI: https://dx.doi.org/10.1093/bioinformatics/bty560]
19. Wingett, S.W.; Andrews, S. FastQ Screen: A tool for multi-genome mapping and quality control. F1000Res; 2018; 7, 1338. [DOI: https://dx.doi.org/10.12688/f1000research.15931.1]
20. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics; 2009; 25, pp. 1754-1760. [DOI: https://dx.doi.org/10.1093/bioinformatics/btp324] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19451168]
21. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.
22. Foll, M.; Gaggiotti, O. A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: A Bayesian perspective. Genetics; 2008; 180, pp. 977-993. [DOI: https://dx.doi.org/10.1534/genetics.108.092221] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18780740]
23. Wright, S. The genetical structure of populations. Ann. Eugen.; 1950; 15, pp. 323-354. [DOI: https://dx.doi.org/10.1111/j.1469-1809.1949.tb02451.x]
24. Ma, Y.; Ding, X.; Qanbari, S.; Weigend, S.; Zhang, Q.; Simianer, H. Properties of different selection signature statistics and a new strategy for combining them. Heredity; 2015; 115, pp. 426-436. [DOI: https://dx.doi.org/10.1038/hdy.2015.42]
25. Weir, B.S.; Cockerham, C.C. Estimating F-statistics for the analysis of population structure. Evolution; 1984; 38, pp. 1358-1370. [DOI: https://dx.doi.org/10.2307/2408641]
26. Sabeti, P.C.; Schaffner, S.F.; Fry, B.; Lohmueller, J.; Varilly, P.; Shamovsky, O.; Palma, A.; Mikkelsen, T.S.; Altshuler, D.; Lander, E.S. Positive natural selection in the human lineage. Science; 2006; 312, pp. 1614-1620. [DOI: https://dx.doi.org/10.1126/science.1124309]
27. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.
28. Yang, J.; Li, W.R.; Lv, F.H.; He, S.G.; Tian, S.L.; Peng, W.F.; Sun, Y.W.; Zhao, Y.X.; Tu, X.L.; Zhang, M.
29. Qanbari, S.; Pausch, H.; Jansen, S.; Somel, M.; Strom, T.M.; Fries, R.; Nielsen, R.; Simianer, H. Classic selective sweeps revealed by massive sequencing in cattle. PLoS Genet.; 2014; 10, e1004148. [DOI: https://dx.doi.org/10.1371/journal.pgen.1004148]
30. Randhawa, I.A.; Khatkar, M.S.; Thomson, P.C.; Raadsma, H.W. A meta-assembly of selection signatures in cattle. PLoS ONE; 2016; 11, e0153013. [DOI: https://dx.doi.org/10.1371/journal.pone.0153013]
31. Voight, B.F.; Kudaravalli, S.; Wen, X.; Pritchard, J.K. A map of recent positive selection in the human genome. PLoS Biol.; 2006; 4, e72.
32. Browning, S.R.; Browning, B.L. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am. J. Hum. Genet.; 2007; 81, pp. 1084-1097. [DOI: https://dx.doi.org/10.1086/521987] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17924348]
33. Liu, Y.; Xu, L.; Yang, L.; Zhao, G.; Li, J.; Liu, D.; Li, Y. Discovery of genomic characteristics and selection signatures in southern Chinese local cattle. Front. Genet.; 2020; 11, 533052. [DOI: https://dx.doi.org/10.3389/fgene.2020.533052] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33391332]
34. Szpiech, Z.A.; Hernandez, R.D. Selscan: An efficient multithreaded program to perform EHH-based scans for positive selection. Mol. Biol. Evol.; 2014; 31, pp. 2824-2827. [DOI: https://dx.doi.org/10.1093/molbev/msu211]
35. Dennis, G.; Sherman, B.T., Jr.; Hosack, D.A.; Yang, J.; Gao, W.; Lane, H.C.; Lempicki, R.A. DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol.; 2003; 4, P3. [DOI: https://dx.doi.org/10.1186/gb-2003-4-9-r60]
36. Choi, J.W.; Liao, X.; Stothard, P.; Chung, W.H.; Jeon, H.J.; Miller, S.P.; Choi, S.Y.; Lee, J.K.; Yang, B.; Lee, K.T.
37. Choi, J.W.; Liao, X.; Park, S.; Jeon, H.J.; Chung, W.H.; Stothard, P.; Park, Y.S.; Lee, J.K.; Lee, K.T.; Kim, S.H.
38. Hill, E.W.; Gu, J.; McGivney, B.A.; MacHugh, D.E. Targets of selection in the Thoroughbred genome contain exercise-relevant gene SNPs associated with elite racecourse performance. Anim. Genet.; 2010; 41, (Suppl. S2), pp. 56-63. [DOI: https://dx.doi.org/10.1111/j.1365-2052.2010.02104.x]
39. Han, H.; McGivney, B.A.; Allen, L.; Bai, D.; Corduff, L.R.; Davaakhuu, G.; Davaasambuu, J.; Dorjgotov, D.; Hall, T.J.; Hemmings, A.J.
40. Santos, W.B.; Pereira, C.B.; Maiorano, A.M.; Arce, C.D.S.; Baldassini, W.A.; Pereira, G.L.; Chardulo, L.A.L.; Neto, O.R.M.; Oliveira, H.N.; Curi, R.A. Genomic inbreeding estimation, runs of homozygosity, and heterozygosity-enriched regions uncover signals of selection in the Quarter Horse racing line. J. Anim. Breed. Genet.; 2023; 140, pp. 583-595. [DOI: https://dx.doi.org/10.1111/jbg.12812] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37282810]
41. Han, H.; McGivney, B.A.; Farries, G.; Katz, L.M.; MacHugh, D.E.; Randhawa, I.A.S.; Hill, E.W. Selection in Australian Thoroughbred horses acts on a locus associated with early two-year old speed. PLoS ONE; 2020; 15, e0227212. [DOI: https://dx.doi.org/10.1371/journal.pone.0227212] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32049967]
42. Eivers, S.S.; McGivney, B.A.; Fonseca, R.G.; MacHugh, D.E.; Menson, K.; Park, S.D.; Rivero, J.L.; Taylor, C.T.; Katz, L.M.; Hill, E.W. Alterations in oxidative gene expression in equine skeletal muscle following exercise and training. Physiol. Genom.; 2010; 40, pp. 83-93. [DOI: https://dx.doi.org/10.1152/physiolgenomics.00041.2009] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19861432]
43. Scarpulla, R.C. Transcriptional paradigms in mammalian mitochondrial biogenesis and function. Physiol. Rev.; 2008; 88, pp. 611-638. [DOI: https://dx.doi.org/10.1152/physrev.00025.2007]
44. Lira, V.A.; Benton, C.R.; Yan, Z.; Bonen, A. PGC-1alpha regulation by exercise training and its influences on muscle function and insulin sensitivity. Am. J. Physiol. Endocrinol. Metab.; 2010; 299, pp. E145-E161. [DOI: https://dx.doi.org/10.1152/ajpendo.00755.2009]
45. Lin, J.; Handschin, C.; Spiegelman, B.M. Metabolic control through the PGC-1 family of transcription coactivators. Cell Metab.; 2005; 1, pp. 361-370. [DOI: https://dx.doi.org/10.1016/j.cmet.2005.05.004]
46. Kupr, B.; Handschin, C. Complex Coordination of Cell Plasticity by a PGC-1α-controlled Transcriptional Network in Skeletal Muscle. Front. Physiol.; 2015; 6, 325. [DOI: https://dx.doi.org/10.3389/fphys.2015.00325]
47. Wright, D.C.; Han, D.H.; Garcia-Roves, P.M.; Geiger, P.C.; Jones, T.E.; Holloszy, J.O. Exercise-induced mitochondrial biogenesis begins before the increase in muscle PGC-1alpha expression. J. Biol. Chem.; 2007; 282, pp. 194-199. [DOI: https://dx.doi.org/10.1074/jbc.M606116200]
48. Ikeda, S.; Kizaki, T.; Haga, S.; Ohno, H.; Takemasa, T. Acute exercise induces biphasic increase in respiratory mRNA in skeletal muscle. Biochem. Biophys. Res. Commun.; 2008; 368, pp. 323-328. [DOI: https://dx.doi.org/10.1016/j.bbrc.2008.01.095] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/18249186]
49. Popov, D.V.; Lysenko, E.A.; Makhnovskii, P.A.; Kurochkina, N.S.; Vinogradova, O.L. Regulation of PPARGC1A gene expression in trained and untrained human skeletal muscle. Physiol. Rep.; 2017; 5, e13339. [DOI: https://dx.doi.org/10.14814/phy2.13543] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29233908]
50. Hashimoto, T.; Hussien, R.; Oommen, S.; Gohil, K.; Brooks, G.A. Lactate sensitive transcription factor network in L6 cells: Activation of MCT1 and mitochondrial biogenesis. Faseb. J.; 2007; 21, pp. 2602-2612. [DOI: https://dx.doi.org/10.1096/fj.07-8174com] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17395833]
51. Kitaoka, Y.; Takeda, K.; Tamura, Y.; Hatta, H. Lactate administration increases mRNA expression of PGC-1α and UCP3 in mouse skeletal muscle. Appl. Physiol. Nutr. Metab.; 2016; 41, pp. 695-698. [DOI: https://dx.doi.org/10.1139/apnm-2016-0016]
52. Zhang, S.L.; Lu, W.S.; Yan, L.; Wu, M.C.; Xu, M.T.; Chen, L.H.; Cheng, H. Association between peroxisome proliferator-activated receptor-gamma coactivator-1alpha gene polymorphisms type 2 diabetes in southern Chinese population: Role of altered interaction with myocyte enhancer factor, 2.C. Chin. Med. J.; 2007; 120, pp. 1878-1885. [DOI: https://dx.doi.org/10.1097/00029330-200711010-00005]
53. Lucia, A.; Gómez-Gallego, F.; Barroso, I.; Rabadán, M.; Bandrés, F.; San Juan, A.F.; Chicharro, J.L.; Ekelund, U.; Brage, S.; Earnest, C.P.
54. Eynon, N.; Meckel, Y.; Sagiv, M.; Yamin, C.; Amir, R.; Sagiv, M.; Goldhammer, E.; Duarte, J.A.; Oliveira, J. Do PPARGC1A and PPARalpha polymorphisms influence sprint or endurance phenotypes?. Scand. J. Med. Sci. Sports; 2010; 20, pp. e145-e150. [DOI: https://dx.doi.org/10.1111/j.1600-0838.2009.00930.x]
55. Gineviciene, V.; Jakaitiene, A.; Aksenov, M.O.; Aksenova, A.V.; Druzhevskaya, A.M.; Astratenkova, I.V.; Egorova, E.S.; Gabdrakhmanova, L.J.; Tubelis, L.; Kucinskas, V.
56. Jin, H.-J.; Hwang, I.-W.; Kim, K.-C.; Cho, H.-I.; Park, T.-H.; Shin, Y.-A.; Lee, H.-S.; Hwang, J.-H.; Kim, A.-R.; Lee, K.-H. Is there a relationship between PPARD T294C/PPARGC1A Gly482Ser variations and physical endurance performance in the Korean population?. Genes. Genom.; 2016; 38, pp. 389-395. [DOI: https://dx.doi.org/10.1007/s13258-015-0380-4]
57. Maciejewska, A.; Sawczuk, M.; Cieszczyk, P.; Mozhayskaya, I.A.; Ahmetov, I.I. The PPARGC1Agene Gly482Ser in Polish Russian athletes. J. Sports Sci.; 2012; 30, pp. 101-113. [DOI: https://dx.doi.org/10.1080/02640414.2011.623709]
58. Cheng, Z. FoxOtranscription factors in mitochondrial homeostasis. Biochem. J.; 2022; 479, pp. 525-536. [DOI: https://dx.doi.org/10.1042/BCJ20210777] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35195252]
59. Cheng, Z. The FoxO–autophagy axis in health and disease. Trends Endocrinol. Metab.; 2019; 30, pp. 658-671. [DOI: https://dx.doi.org/10.1016/j.tem.2019.07.009] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31443842]
60. Cheng, Z. FOXO1: Mute for a tuned metabolism?. Trends Endocrinol. Metab.; 2015; 26, pp. 402-403. [DOI: https://dx.doi.org/10.1016/j.tem.2015.06.006] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26170190]
61. Sedding, D.G. FoxO transcription factors in oxidative stress response and ageing--a new fork on the way to longevity?. Biol. Chem.; 2008; 389, pp. 279-283. [DOI: https://dx.doi.org/10.1515/BC.2008.033]
62. Essers, M.A.; de Vries-Smits, L.M.; Barker, N.; Polderman, P.E.; Burgering, B.M.; Korswagen, H.C. Functional interaction between beta-catenin and FOXO in oxidative stress signaling. Science; 2005; 308, pp. 1181-1184. [DOI: https://dx.doi.org/10.1126/science.1109083]
63. Sanchez, A.M.; Candau, R.B.; Bernardi, H. FoxO transcription factors: Their roles in the maintenance of skeletal muscle homeostasis. Cell. Mol. Life Sci.; 2014; 71, pp. 1657-1671. [DOI: https://dx.doi.org/10.1007/s00018-013-1513-z]
64. Weeks, K.L.; Tham, Y.K.; Yildiz, S.G.; Alexander, Y.; Donner, D.G.; Kiriazis, H.; Harmawan, C.A.; Hsu, A.; Bernardo, B.C.; Matsumoto, A.
65. Sanchez, A.M.; Bernardi, H.; Py, G.; Candau, R.B. Autophagy is essential to support skeletal muscle plasticity in response to endurance exercise. Am. J. Physiol.-Regul. Integr. Comp. Physiol.; 2014; 307, pp. R956-R969. [DOI: https://dx.doi.org/10.1152/ajpregu.00187.2014]
66. Koltai, E.; Bori, Z.; Osvath, P.; Ihasz, F.; Peter, S.; Toth, G.; Degens, H.; Rittweger, J.; Boldogh, I.; Radak, Z. Master athletes have higher miR-7, SIRT3 and SOD2 expression in skeletal muscle than age-matched sedentary controls. Redox Biol; 2018; 19, pp. 46-51. [DOI: https://dx.doi.org/10.1016/j.redox.2018.07.022]
67. Eivers, S.S.; McGivney, B.A.; Gu, J.; MacHugh, D.E.; Katz, L.M.; Hill, E.W. PGC-1α encoded by the PPARGC1A gene regulates oxidative energy metabolism in equine skeletal muscle during exercise. Anim. Genet.; 2012; 43, pp. 153-162. [DOI: https://dx.doi.org/10.1111/j.1365-2052.2011.02238.x] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/22404351]
68. Marín-García, J. Cardiomyopathies: A Comparative Analysis of Phenotypes and Genotypes; Elsevier: Amsterdam, The Netherlands, 2014; pp. 363-426.
69. Sabharwal, R.; Chapleau, M.W. Autonomic, locomotor and cardiac abnormalities in a mouse model of muscular dystrophy: Targeting the renin–angiotensin system. Exp. Physiol.; 2014; 99, pp. 627-631. [DOI: https://dx.doi.org/10.1113/expphysiol.2013.074336] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24334334]
70. Palma-Flores, C.; Cano-Martínez, L.J.; Fernández-Valverde, F.; Torres-Pérez, I.; de Los Santos, S.; Hernández-Hernández, J.M.; Hernández-Herrera, A.F.; García, S.; Canto, P.; Zentella-Dehesa, A. Differential histological features myogenic protein levels in distinct muscles of d-sarcoglycan null muscular dystrophy mouse model. J. Mol. Histol.; 2023; 54, pp. 405-413. [DOI: https://dx.doi.org/10.1007/s10735-023-10136-7] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/37358754]
71. Goldstein, J.A.; Kelly, S.M.; LoPresti, P.P.; Heydemann, A.; Earley, J.U.; Ferguson, E.L.; Wolf, M.J.; McNally, E.M. SMAD signaling drives heart and muscle dysfunction in a Drosophila model of muscular dystrophy. Hum. Mol. Genet.; 2011; 20, pp. 894-904. [DOI: https://dx.doi.org/10.1093/hmg/ddq528]
72. Bround, M.J.; Havens, J.R.; York, A.J.; Sargent, M.A.; Karch, J.; Molkentin, J.D. ANT-dependent MPTP underlies necrotic myofiber death in muscular dystrophy. Sci. Adv.; 2023; 9, eadi2767. [DOI: https://dx.doi.org/10.1126/sciadv.adi2767]
73. Bessho, C.; Yamada, S.; Tanida, T.; Tanaka, M. FoxP2 protein decreases at a specific region in the chick midbrain after hatching. Neurosci. Lett.; 2023; 800, 137119. [DOI: https://dx.doi.org/10.1016/j.neulet.2023.137119]
74. Fisher, S.E.; Scharff, C. FOXP2 as a molecular window into speech and language. Trends Genet.; 2009; 25, pp. 166-177. [DOI: https://dx.doi.org/10.1016/j.tig.2009.03.002]
75. Rodríguez-Urgellés, E.; Casas-Torremocha, D.; Sancho-Balsells, A.; Ballasch, I.; García-García, E.; Miquel-Rio, L.; Manasanch, A.; Del Castillo, I.; Chen, W.; Pupak, A.
76. Feil, R.; Hofmann, F.; Kleppisch, T. Function of cGMP-dependent protein kinases in the nervous system. Rev. Neurosci.; 2005; 16, pp. 23-41. [DOI: https://dx.doi.org/10.1515/REVNEURO.2005.16.1.23]
77. Gao, S.; Yao, W.; Zhou, R.; Pei, Z. Exercise training affects calcium ion transport by downregulating the CACNA2D1 protein to reduce hypertension-induced myocardial injury in mice. iScience; 2024; 27, 109351. [DOI: https://dx.doi.org/10.1016/j.isci.2024.109351]
78. Woodhead, J.S.T.; Merry, T.L. Mitochondrial-derived peptides and exercise. Biochim. Biophys. Acta Gen. Subj.; 2021; 1865, 130011. [DOI: https://dx.doi.org/10.1016/j.bbagen.2021.130011]
79. Many, G.M.; Sanford, J.A.; Sagendorf, T.J.; Hou, Z.; Nigro, P.; Whytock, K.L.; Amar, D.; Caputo, T.; Gay, N.R.; Gaul, D.A.
80. Tan, B.; Li, X.; Yin, Y.; Wu, Z.; Liu, C.; Tekwe, C.D.; Wu, G. Regulatory roles for L-arginine in reducing white adipose tissue. Front. Biosci.; 2012; 17, pp. 2237-2246. [DOI: https://dx.doi.org/10.2741/4047] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/22652774]
81. Viegas, C.M.; Zanatta, A.; Knebel, L.A.; Schuck, P.F.; Tonin, A.M.; Ferreira Gda, C.; Amaral, A.U.; Dutra Filho, C.S.; Wannmacher, C.M.; Wajner, M. Experimental evidence that ornithine and homocitrulline disrupt energy metabolism in brain of young rats. Brain Res.; 2009; 1291, pp. 102-112. [DOI: https://dx.doi.org/10.1016/j.brainres.2009.07.021] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19616520]
82. Lee, Y.M.; Choi, D.H.; Cheon, M.W.; Kim, J.G.; Kim, J.S.; Shin, M.G.; Kim, H.R.; Youn, D. Changes in Mitochondria-Related Gene Expression upon Acupuncture at LR3 in the D-Galactosamine-Induced Liver Damage Rat Model. Evid. Based Complement. Alternat. Med.; 2022; 2022, 3294273. [DOI: https://dx.doi.org/10.1155/2022/3294273]
83. Kay, L.; Nicolay, K.; Wieringa, B.; Saks, V.; Wallimann, T. Direct evidence for the control of mitochondrial respiration by mitochondrial creatine kinase in oxidative muscle cells in situ. J. Biol. Chem.; 2000; 275, pp. 6937-6944. [DOI: https://dx.doi.org/10.1074/jbc.275.10.6937]
84. Touron, J.; Perrault, H.; Maisonnave, L.; Patrac, V.; Walrand, S.; Malpuech-Brugere, C.; Pereira, B.; Burelle, Y.; Costes, F.; Richard, R. Effects of exercise-induced metabolic and mechanical loading on skeletal muscle mitochondrial function in male rats. J. Appl. Physiol.; 2022; 133, pp. 611-621. [DOI: https://dx.doi.org/10.1152/japplphysiol.00719.2021]
85. Kang, S.; Kang, B.H. Structure Function Inhibitors of the Mitochondrial Chaperone TRAP1. J. Med. Chem.; 2022; 65, pp. 16155-16172. [DOI: https://dx.doi.org/10.1021/acs.jmedchem.2c01633]
86. Lisanti, S.; Tavecchio, M.; Chae, Y.C.; Liu, Q.; Brice, A.K.; Thakur, M.L.; Languino, L.R.; Altieri, D.C. Deletion of the mitochondrial chaperone TRAP-1 uncovers global reprogramming of metabolic networks. Cell Rep.; 2014; 8, pp. 671-677. [DOI: https://dx.doi.org/10.1016/j.celrep.2014.06.061]
87. Schmidt, J.F.; Andersen, T.R.; Andersen, L.J.; Randers, M.B.; Hornstrup, T.; Hansen, P.R.; Bangsbo, J.; Krustrup, P. Cardiovascular function is better in veteran football players than age-matched untrained elderly healthy men Scand. J. Med. Sci. Sports; 2015; 25, pp. 61-69.
88. Mancini, A.; Vitucci, D.; Randers, M.B.; Schmidt, J.F.; Hagman, M.; Andersen, T.R.; Imperlini, E.; Mandola, A.; Orrù, S.; Krustrup, P.
89. Mills, R.E.; Pittard, W.S.; Mullaney, J.M.; Farooq, U.; Creasy, T.H.; Mahurkar, A.A.; Kemeza, D.M.; Strassler, D.S.; Ponting, C.P.; Webber, C. Natural genetic variation caused by small insertions and deletions in the human genome. Genome Res.; 2011; 21, pp. 830-839. [DOI: https://dx.doi.org/10.1101/gr.115907.110]
90. Chen, X.; Nie, S.; Hu, L.; Fang, Y.; Cui, W.; Xu, H.; Zhao, C.; Zhu, B.F. Forensic efficacy evaluation and genetic structure exploration of the Yunnan Miao group by a multiplex InDel panel. Electrophoresis; 2022; 43, pp. 1765-1773. [DOI: https://dx.doi.org/10.1002/elps.202100387] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35707881]
91. Huang, Y.; Liu, C.; Xiao, C.; Chen, X.; Han, X.; Yi, S.; Huang, D. Mutation analysis of 28 autosomal short tandem repeats in the Chinese Han population. Mol. Biol. Rep.; 2021; 48, pp. 5363-5369. [DOI: https://dx.doi.org/10.1007/s11033-021-06522-7]
92. Chen, C.-H.; Chuang, T.-J.; Liao, B.-Y.; Chen, F.-C. Scanning for the signatures of positive selection for human-specific insertions and deletions. Genome Biol. Evol.; 2009; 1, pp. 415-419. [DOI: https://dx.doi.org/10.1093/gbe/evp041]
93. Boschiero, C.; Moreira, G.C.M.; Gheyas, A.A.; Godoy, T.F.; Gasparin, G.; Mariani, P.; Paduan, M.; Cesar, A.S.M.; Ledur, M.C.; Coutinho, L.L. Genome-wide characterization of genetic variants and putative regions under selection in meat and egg-type chicken lines. BMC Genom.; 2018; 19, 83. [DOI: https://dx.doi.org/10.1186/s12864-018-4444-0] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29370772]
94. Rooney, M.F.; Hill, E.W.; Kelly, V.P.; Porter, R.K. The “speed gene” effect of myostatin arises in Thoroughbred horses due to a promoter proximal SINE insertion. PLoS ONE; 2018; 13, e0205664. [DOI: https://dx.doi.org/10.1371/journal.pone.0205664] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30379863]
95. Petersen, J.L.; Valberg, S.J.; Mickelson, J.R.; McCue, M.E. Haplotype diversity in the equine myostatin gene with focus on variants associated with race distance propensity and muscle fiber type proportions. Anim. Genet.; 2014; 45, pp. 827-835. [DOI: https://dx.doi.org/10.1111/age.12205]
96. Rieder, S.; Taourit, S.; Mariat, D.; Langlois, B.; Guérin, G. Mutations in the agouti (ASIP), the extension (MC1R), and the brown (TYRP1) loci and their association to coat color phenotypes in horses (Equus caballus). Mamm. Genome; 2001; 12, pp. 450-455. [DOI: https://dx.doi.org/10.1007/s003350020017]
97. Biglari, S.; Afousi, A.G.; Mafi, F.; Shabkhiz, F. High-intensity interval training-induced hypertrophy in gastrocnemius muscle via improved IGF-I/Akt/FoxO and myostatin/Smad signaling pathways in rats. Physiol. Int.; 2020; 107, pp. 220-230. [DOI: https://dx.doi.org/10.1556/2060.2020.00020]
98. Korge, P. Factors limiting adenosine triphosphatase function during high intensity exercise. Thermodynamic and regulatory considerations. Sports Med.; 1995; 20, pp. 215-225. [DOI: https://dx.doi.org/10.2165/00007256-199520040-00002] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/8584847]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Speed is not only the primary objective of racehorse breeding but also a crucial indicator for evaluating racehorse performance. This study investigates a newly developed racehorse breed in China. Through whole-genome resequencing, we selected 60 offspring obtained from the crossbreeding of Thoroughbred horses and Xilingol horses for this study. This breed is tentatively named “Grassland-Thoroughbred”, and the samples were divided into two groups based on racing ability: 30 racehorses and 30 non-racehorses. Based on whole-genome sequencing data, the study achieved an average sequencing depth of 25.63×. The analysis revealed strong selection pressure on chromosomes (Chr) 1 and 3. Selection signals were detected using methods such as the nucleotide diversity ratio (π ratio), integrated haplotype score (iHS), fixation index (Fst), and cross-population extended haplotype homozygosity (XP-EHH). Regions ranked in the top 5% by at least three methods were designated as candidate regions. This approach detected 215 candidate genes. Additionally, the Fst method was employed to detect Indels, and the top 1% regions detected were considered candidate regions, covering 661 candidate genes. Functional enrichment analysis of the candidate genes suggests that pathways related to immune regulation, neural signal transmission, muscle contraction, and energy metabolism may significantly influence differences in performance. Among these identified genes, PPARGC1A, FOXO1, SGCD, FOXP2, PRKG1, SLC25A15, CKMT2, and TRAP1 play crucial roles in muscle function, metabolism, sensory perception, and neurobiology, indicating their key significance in shaping racehorse phenotypes. This study not only enhances understanding of the molecular mechanisms underlying racehorse speed but also provides essential theoretical and practical references for the molecular breeding of Grassland-Thoroughbreds.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details

1 Key Laboratory of Equus Germplasm Innovation (Co-Construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Hohhot 010018, China; [email protected] (W.D.); [email protected] (W.G.); [email protected] (T.B.); [email protected] (L.S.); [email protected] (Y.L.); [email protected] (X.S.); [email protected] (Z.L.); [email protected] (H.W.); [email protected] (M.D.), Inner Mongolia Key Laboratory of Equine Science Research and Technology Innovation, Inner Mongolia Agricultural University, Hohhot 010018, China, Equus Research Center, College of Animal Science, Inner Mongolia Agricultural University, Hohhot 010018, China