Introduction
Human ear morphology is a complex trait that is influenced by a multidimensional set of correlated and heritable phenotypes. Studies have estimated the heritability of ear morphology phenotypes to be between 29% and 61% [1], indicating that genetic variants play a significant role in shaping the ear’s anatomy. Understanding the genetic basis of human ear morphology has important implications for multiple scientific disciplines such as human genetics, developmental biology, medicine, evolutionary biology, anthropology, and forensics. This study aims to generate knowledge that can aid in understanding genetic disorders related to ear morphology, improve diagnosis and treatment of ear-related diseases and also help in forensics and anthropology to identify individuals based on their ear morphology.
Previous genome-wide association studies (GWASs) on ear morphology have focused on a limited number of qualitative ear features and have not utilized quantitative ear traits. A GWAS in 5,062 Latin Americans identified significant associations at seven genetic loci for ear lobe size and attachment, folding of antihelix, helix rolling, ear protrusion and antitragus size, suggesting the involvement of the genes EDAR, CART1, SP5, MPRS22, LRBA, LOC153910, and LOC100287225 [1]. A multi-ethnic GWAS meta-analysis on earlobe attachment in 74,660 samples revealed a total of 49 genetic loci [2], further emphasizing the complexity of the genetic architecture of ear morphology.
Due to the complex and multidimensional nature of ear morphology, focusing on a limited number of qualitative ear features, as done in previous GWASs, can result in missing important phenotypic information and genetic associations. Additionally, traditional methods such as human perception [1] and questionnaire-based [2] phenotyping are prone to bias and instability. Therefore, in this study, we developed a fully automated computer pipeline that utilizes a deep learning convolutional neural network (CNN) [3] to quantify a large number of ear phenotypes from high-resolution digital side-face photos. This approach allows for a more comprehensive and accurate assessment of ear morphology and can provide a more complete understanding of the genetic factors underlying this trait.
High correlations between various ear traits may be influenced by genetic variants with multi-trait effects. However, traditional single-trait GWAS, as used in previous ear GWASs [1,2], has limitations in identifying these genetic loci. To overcome these limitations, here we use C-GWAS [4], a recently developed method for combining GWAS summary statistics of multiple potentially related traits, to integrate our GWAS results. C-GWAS has been shown to have increased statistical power compared to traditional methods such as minimal p-values of multiple single-trait GWASs (MinGWAS) and MTAG [5]. Furthermore, it has been successfully applied to the analysis of a large number of facial traits, resulting in the identification of novel genetic loci associated with facial variation that were not identified via traditional methods [4].
In this study, we used a sample of 14,921 multi-ethnic individuals from Europe (N = 4,740), Asia (N = 4,835), and Latin America (N = 5,346) to investigate the genetic factors that contribute to ear morphology. We quantified 136 quantitative ear phenotypes using deep learning analyses of digital face images, and conducted GWASs and GWAS meta-analyses. These results were then combined using a recently developed C-GWAS method [4]. We identified novel genetic loci and confirmed previously established loci that are involved in normal-range quantitative variation of ear morphology in humans. We performed functional follow-up studies in gene-edited mice to further understand the impact of these genetic loci on ear morphology.
Results
Performance of CNN in quantitative ear phenotyping
In this study, we used a deep learning convolution neural network (CNN) approach [3] for comprehensive ear landmarking on high-resolution 2D digital side face photos. We focused on the 17 most anatomically meaningful landmarks (S1A Fig) out of the 55 that could be located by the CNN, and derived 136 pairwise inter-landmark distances as input phenotypes for subsequent genetic analyses. We found that the inter-rater correlations on the left ear were reasonably high (mean r = 0.47, sd = 0.24, S2A Fig) and the correlations between manual- and CNN-derived phenotypes on the same ear were reasonably high too (mean r = 0.34, sd = 0.18, S2B Fig). Utilizing the symmetric nature of the human face, it was obvious that the left and right ear correlations for CNN-derived phenotypes (r = 0.52, sd = 0.10, S2D Fig) were significantly higher, and with a smaller standard deviation, than those from the same human rater (r = 0.44, sd = 0.20, S2C Fig). This demonstrated that the deep learning approach on ear phenotyping that we used in this study on the full set of facial images had at least an equal performance in the accuracy of ear landmarking compared to human perception, while also being more efficient.
Sample characteristics and phenotype results
This study included 14,921 individuals from five different population cohorts from three continents: Europe, Asia, and Latin America (S1B Fig and S1 Table). These cohorts were: two European cohorts (the Rotterdam Study with 3,675 participants from the Netherlands and the TwinsUK study with 1,065 participants from the UK), two East Asian cohorts (the Taizhou Longitudinal Study, TZL, with 2,348 Han Chinese participants from China and the National Survey of Physical Traits study, NSPT, with 2,487 Han Chinese participants from China), and one cohort of mixed ancestry (the CANDELA study with 5,346 Latin Americans, who have an estimated genomic admixture of 48% European, 46% Native American, and 6% African). It is worth noting that, while CANDELA and TZL have been used in previous GWAS studies on qualitative ear features [1,2], quantitative ear phenotypes have not been studied in these cohorts before.
The majority (82–100%) of the 136 ear phenotypes studied were normally distributed, as determined by the Kolmogorov-Smirnov normality test (P > 0.05 after multiple testing correction, as shown in S2 and S3 Tables). The lower proportion of normally distributed ear phenotypes in the Rotterdam Study (RS) can be attributed to a higher proportion of older individuals (with an average age more than 10 years older than in the other cohorts, as seen in S1 Table). The effects of sex (min P = 3.5e-145) and age (min P = 5.9e-99) on ear phenotypes were investigated in detail using individual-level data from RS. These factors together explained up to 16.9% of the variance for certain ear phenotypes (e.g. phenotype L7-L14) (S4 Table). Women had longer distances involving otobasion inferius (L7) compared to men, while men had longer distances involving otobasion superius (L1) compared to women (S3A Fig). Aging was associated with a significant increase in ear length and thinness (S3B Fig). Unsupervised hierarchical clustering of the 136 ear phenotypes resulted in two main clusters, mainly representing vertical and horizontal ear features (S4A Fig). The genetic correlation estimates from genome-wide SNPs were high and similar to phenotypic correlations (S4B Fig), indicating shared genetic contributors across multiple ear phenotypes. The twin-based heritability of the 136 ear phenotypes was estimated in TwinsUK to have a mean of 0.52 (sd = 0.03, 0.49–0.56, S5A Fig). These estimates were generally higher than those obtained from genome-wide SNPs in the same cohort (mean = 0.38, sd = 0.09, 0.20–0.60, S5B Fig). The genetic heritability estimates were largely consistent with those for qualitative ear features reported previously in the CANDELA study [1].
GWASs, Meta-analysis, and C-GWAS
Single-trait GWASs of 136 ear phenotypes were conducted in each of the five cohorts separately (as shown in S1B Fig) and the results were then meta-analyzed, resulting in 136 sets of single-trait summary statistics. The inflation factors in these single-trait GWASs were all within an acceptable range (average λ = 1.04, with 1.02 < λ < 1.06). The minimal p-values of the 136 single-trait GWASs were adjusted empirically using simulations as implemented in C-GWAS, so that the adjusted minimal p-values follow a uniform distribution under the null. These adjusted minimal p-values are abbreviated as "MinGWAS" below. Similarly, the C-GWAS results were also adjusted using the embedded simulations, so that the adjusted p-values also closely follow a uniform distribution under the null (as shown in Fig 1A–1C). This means that MinGWAS and C-GWAS results can be directly compared with each other as well as with those from any standard single-trait GWAS. Therefore, the traditional genome-wide significance threshold (P ≤ 5e-8) can be considered as the study-wide significance threshold in this study. Additional information on the C-GWAS method can be found in other publications [4].
[Figure omitted. See PDF.]
Fig 1. Outcomes from GWAS meta-analysis (MinGWAS) and combined GWAS (C-GWAS) for 136 quantitative ear morphology traits in 5 multi-ethnic cohorts (N = 14,921).
A simulation analysis derived the null distributions of the crude p-values (gray). The p-values corrected using the getCoef function (orange, details of function in Materials and methods) are compared with the uniform distribution (black) for both the C-GWAS results (A) and the MinGWAS result (B). After multiple testing correction, the C-GWAS results (upper part of D) and the minimal p-values from 136 single ear trait GWASs (lower part of D) are plotted using a combined Manhattan plot (D) and a Q-Q plot (C). In the Manhattan plot (D), the study-wide significance threshold (P = 5e-8, all p-values of MinGWAS and C-GWAS have been corrected) is indicated using dashed lines. A total of 16 study-wide significant loci were identified, among which 9 were significant in both C-GWAS and MinGWAS (orange dots and circles with cross), 1 solely was significant in MinGWAS (purple circle with cross), and the remaining 6 were significant only in C-GWAS (red dots and circles with cross). (E) The closest genes to regional lead SNPs are listed (E) (orange for novel). (F) Venn diagram of 15 candidate genes in the 15 loci highlighted by C-GWAS, 10 candidate genes in the 10 loci highlighted by MinGWAS, 50 candidate genes in 50 loci from previous GWASs of categorical ear features, and 243 candidate genes in 243 loci from previous GWASs of facial shape variation. (G) A list of 59 previously established ear-associated SNPs was looked up in the C-GWAS and MinGWAS results. In RS, the ear variance explained by 15 lead SNPs from loci identified by C-GWAS (H) is compared with that explained by 10 lead SNPs from loci identified by MinGWAS (I).
https://doi.org/10.1371/journal.pgen.1010786.g001
MinGWAS and C-GWAS together identified a total of 1,205 ear-associated SNPs at 16 distinct genetic loci that exceeded the study-wide significance (Fig 1D–1E and Table 1). Eight of these loci had not been previously associated with ear features in previous GWASs and were identified for the first time in this study (Fig 1F); four of them were significant in both MinGWAS and C-GWAS, and four were significant solely in C-GWAS (Table 1 and Fig 1F). Notably, all eight novel loci had highly consistent allele effects across all five cohorts, despite their different continental ancestries (S6A–S6H Fig). The most significant novel finding was rs7812632 at 8q24.13 (PC-GWAS = 1.2e-19), where the G-allele mainly led to an increased vertical length of the ear. This SNP was top-associated with L6-L15 (PMinGWAS = 4.7e-10) and showed nominally significant association with L6-L15 in all cohorts, with similar effect sizes (except for TwinsUK due to QC failure). This SNP is located approximately 200kbp upstream of the HAS2 gene, which encodes hyaluronan synthase 2, a protein that plays an important role in embryonic development of branchial arches [6] and cranial neural crest cells (CNCCs) [7]. Variants in or near HAS2 have been previously associated with face morphology [8], body height [9], and male pattern baldness [10]. Additionally, this locus is the only one among all the significant ones we found that shows borderline genome-wide significant association with ear landmark L3 (PL3-L15 = 2.7e-7, S6D Fig), which anatomically approximates Darwin’s Tubercle (OMIM:124300, S7 Fig) [11]. We also tested the association between rs7812632 and Darwin’s Tubercle in CANDELA, as it is the only cohort studied in which this phenotype has been manually obtained. This SNP was nominally associated with Darwin’s Tubercle (P = 0.03) and the A-allele was associated with an increased prevalence.
[Figure omitted. See PDF.]
Table 1. SNPs associated with quantitative ear morphology traits identified by MinGWAS and C-GWAS.
https://doi.org/10.1371/journal.pgen.1010786.t001
The second most significant novel finding was rs57788627 at 4q28.1 (PC-GWAS = 2.5e-15), where the G-allele mainly led to an increased ear width. This SNP was top-associated with L15-L16 (PMinGWAS = 2.3e-9) and showed nominally significant association with L15-L16 in almost all cohorts, with similar effect sizes (4/5, except NSPT). This locus is remarkable as it represents a gene desert and the nearest gene, INTU, is 408kb away from the region’s lead SNP (S6B Fig). A recent face GWAS by Xiong et al. found an association between INTU and human facial variation and also confirmed the enhancer activity of this region through luciferase assay experiments [8]. Among all genes in this region, INTU showed the most preferential expression in CNCCs. Additionally, INTU is involved in both ciliogenesis and convergent extension and plays a role in embryonic development [12]. Recent studies have suggested an important role of Intu in mouse skeletal development [13]. Furthermore, exome sequencing of orofaciodigital syndrome patients with facial, oral, and ear abnormalities revealed disease-associated mutations in INTU [14]. We attained mouse mutants using one-step CRISPR/cas9 mediated gene editing experiments to assess the functional significance of Intu gene expression during ear mouse development (for details, see below).
Among the eight loci that were already reported in the two previous single-trait GWASs of qualitative ear traits, five were significant in our quantitative study in both MinGWAS and C-GWAS, specifically 1p12 TBX15, 2q12.3 EDAR, 2q31.1 SP5, 3q23 MRPS22, and 6q24.2 HIVEP2. One locus, 6q21 PRDM1/ATG5, reached study-wide significance only in MinGWAS, and two, 4q31.3 LRBA and 7q21.3 DLX6, reached study-wide significance only in C-GWAS. The allele effects of all these eight loci were in the same direction in all five cohorts (S6L–S6S Fig). Notably, all loci except 6q21 showed orders of magnitude higher significance in C-GWAS than in MinGWAS (Fig 1B), with rs17034666 at 2q12.3 EDAR being the most extreme example (from PMinGWAS = 1.09e-11 to PC-GWAS = 1.38e-24). The East Asian-specific missense variant of EDAR (EDARV370A, rs3827760) had been previously associated with a wide range of endoderm-derived phenotypes, such as chin protrusion [15], hair shape [16], hair thickness [17], sweat glands [18], size of feminine breasts [18], and shovel incisors characteristic [19,20]. However, this SNP was removed from our C-GWAS due to its very low MAF in European samples. The boost of significance at 2q12.3 EDAR is an example of the increased power of C-GWAS in detecting multi-trait effects. A multivariable fitting analysis in the RS cohort based on individual-level data showed that the lead SNPs from C-GWAS-significant loci explained on average 1.5 times and up to 3.1 times more phenotypic ear variance than the SNPs from MinGWAS-significant loci did (Fig 1H–1I).
Integration with previous literature knowledge
The two previous single-trait GWASs on qualitative ear morphology [1,2] identified 58 autosomal SNPs at 50 loci mainly associated with earlobe features. We looked up these 58 SNPs in three of our cohorts (RS, TwinsUK, and NSPT) not considering CANDELA and TZL because these two cohorts were used in the previous GWASs. For 49 SNPs (nine SNPs were non-polymorphic in at least one dataset and thus got excluded from this analysis), both C-GWAS (P = 1.1e-8) and MinGWAS (P = 2e-5) p-values highly significantly deviated from the null, while C-GWAS p-values obviously deviated further from the null than MinGWAS p-values did (Fig 1G). Under the nominal significance level, C-GWAS re-identified a larger number of the previously established loci than MinGWAS did (16 vs. 13). In addition, all our nominally significant associations involving earlobe landmarks (S5 Table) were consistent with the findings from previous studies of earlobe features, i.e., they were previously associated with earlobe features and the most significant ear phenotype in our study involved earlobe landmarks too. These results further confirm the increased statistical power of C-GWAS compared to the traditionally used MinGWAS.
Both the ear and face belong to craniofacial phenotypes for which shared genetic effects maybe expected. Therefore, we examined the 238 distinct genetic loci previously associated with human facial shape variation [8,15,21–30] in our C-GWAS results for quantitative ear morphology. Two of these face loci, which harbor TBX15 and INTU, respectively, (S6 Table) showed study-wide significant association in our ear C-GWAS. On the other hand, we saw that a substantial proportion of the ear-associated loci identified in the present ear study (6/16) showed genome-wide significant association with facial shape variation in previous face GWASs (Fig 1F). The finding of a larger proportion of ear-associated loci implicated in facial shape than face-associated loci implicated in ear morphology is consistent with the longer span of early development of the face (4th to 9th gestation week) compared to the outer ear (6th and 7th gestation week) [31,32]. Furthermore, more than half (9/16) of our C-GWAS-identified ear loci (1p12 TBX15, 2q12.3 EDAR, 2q31.1 SP5/MYO3B, 3q23 MRPS22, 6q21 ATG5, 8q24.13 HAS2, 9q33.1 ASTN2, 10q22.2 C10orf11, 20q11.22 UQCC1), which were particularly associated with earlobe phenotypes in our study, were previously reported to have association with male pattern baldness in the GWAS catalog [10], and two of our ear loci, including 2q12.3 EDAR and 6q21 ATG5, were previously associated with mono eyebrow [21] (S7 Table). Male pattern baldness and mono eyebrow also reflect surface ectoderm-derived phenotypes. These results suggest that surface ectoderm-derived phenotypes such as facial shape, ear morphology, male pattern baldness, and mono eyebrow share genetic factors, which may be expected, but has not been empirically demonstrated before. These findings provide further evidence of the genetic overlap between different craniofacial phenotypes and highlights the importance of studying multiple traits together to gain a more comprehensive understanding of the genetic basis of craniofacial variation.
Functional annotations
Several lines of evidence support the functional implications of the 16 discovered ear-associated loci, including 8 novel loci. A gene ontology enrichment analysis highlighted 6 ear development-related biological process terms (FDR < 0.01, Tables 1 and S8) including ear development, inner ear development, development and morphogenesis of the skeletal system, and embryonic organ development and morphogenesis. The majority of the lead SNPs (10/16) showed evidence of regulatory activity in the 3DSNP database [33] (S9 Table). Four out of the 16 loci showed positive enhancer activity in transgenic mice supported by the spatial pattern of expression located in ears/branchial arch/craniofacial [34]. The majority (14/16) of nearby genes or 3D interaction genes has been associated with abnormal ear/craniofacial phenotypes in various databases [35]. Most (15/16) of nearby genes or 3D interaction genes were expressed in branchial arch and embryo ectoderm in mice [36,37]. These lines of evidence strengthen the reliability of our novel loci and support that ear and cranial morphogenesis share a substantial proportion of genetic factors. These findings provide further support for the functional relevance of the identified loci and highlight the importance of studying multiple traits together to gain a more comprehensive understanding of the genetic basis of craniofacial development.
Embryonic cranial neural crest cells (CNCCs) are temporary, migratory stem cells that play a crucial role in the formation of the ear during embryonic development. We compared the expression of 16 genes located near the 16 ear-associated SNPs with a random set of 16 genes selected near randomly chosen and frequency-matched 16 SNPs in CNCCs and 49 other cell types for 10000 replicates. Compared with the randomly selected gene sets, this set of 16 targeted genes near the ear-associated SNPs had significantly higher expression in 23 types of cells (P < 0.05, Fig 2A), including those involved in the formation of constituent of ear, such as CNCCs, articular chondrocyte, osteoblast, skin fibroblast, and adipose mesenchymal stem cell. In addition, we performed a heritability enrichment analysis using the S-LDSC method [38] with respect to active regulatory regions based on our C-GWAS results of Europeans (RS and TwinsUK). This analysis showed that the S-LDSC coefficient Z-scores in mesenchymal stem cells, osteoblast primary cells, adult dermal fibroblast primary cells, adipose derived mesenchymal stem cell cultured cells, mesenchymal stem cell derived chondrocyte cultured cells, all ranked within the top 5% in a total of 102 types of cells and CNCCs slightly behind them (ranked 9th) (Fig 2C). These cell types from the heritability enrichment analysis are consistent with those where the 16 genes showed preferential expression. These cell types play important roles in ear morphogenesis, supporting the reliability of our findings. In addition, 14 out of these 16 genes showed preferential expression in CNCCs compared to 49 other cell types (Fig 2B). The genes near the eight newly discovered loci showed preferential expression in CNCC, highly consistent with the preferential expression pattern of the genes near the 8 previously established ear-associated loci. Interestingly, several of these genes (EDAR, INTU, and HAS2) have previously been linked to facial morphology [8], supporting the idea that genetic effects shaping the ear and face originate during early embryogenesis. These findings provide a priority list for future in-vivo studies on genes involved in ear variation and embryo surface ectoderm-derived phenotypes.
[Figure omitted. See PDF.]
Fig 2. Differential expressions of 16 genes near 16 ear-associated loci in 50 cell types.
(A) Boxplots of normalized RNA-seq VST values for the 16 genes (in orange, details of genes shown in lower B plot) and 16 median gene expression value as control (randomly matched 1e-4 times using SNPsnap, in blue). Expression differences between the control genes and the 16 genes were tested in each cell type using the unpaired Wilcoxon rank-sum test. The expression of the 16 genes in CNCCs was also iteratively compared with that in all other cell types using paired Wilcoxon rank-sum test. Statistical significance was indicated: *P < 0.05. (B) Normalized RNA-seq VST values in CNCCs were compared with those in other 49 types of cells (purple), in 20 tissue cell types (blue), in 20 primary cell types (green), and in nine embryonic stem cell types (orange), using one-sample Student’s t-test. Dotted line represents Bonferroni corrected significant threshold (P < 2.27e-3). Significant gene labels are depicted in color, non-significant gene labels in black. (C) Partitioned heritability enrichments based on cell-type-specific regulatory annotations (More details see in Methods). Heritability enrichment Z-scores, as estimated by stratified linkage disequilibrium score regression (S-LDSC) of the C-GWAS summary data for GWASs of 136 ear traits. Trait abbreviations as in S10 Table.
https://doi.org/10.1371/journal.pgen.1010786.g002
Ear effects in Intu and Tbx15 mutant mice
Here, we examined adult ear morphology in Intu and Tbx15 mouse mutants using one-step CRISPR/cas9 mediated gene editing experiments to assess the functional significance of Intu and Tbx15 expression during ear development. Our breeding experiments generated 18 F2 9-weeks sexually mature Intu mice i.e., 10 heterozygous Intu+/-, 8 wild-type WT (Intu+/+), while homozygous loss-of-function of Intu was lethal. Quantitative assessment of mouse ear shape (assessed by PC analysis of 21 three-dimensional landmark coordinates) revealed significant differences (linear regression PPC3 = 1.5e-3) (Fig 3C) between the heterozygous Intu littermates and WT mice. The Intu genotype significantly associated with D6_14 (P = 2.1e-3, Beta = 1.78) and D3_6 (P = 3.3e-3, Beta = 1.74) (Fig 3E and S11 Table) after FDR correction. Notably, in humans, the top INTU variant rs57788627 was nominally significantly associated with ear phenotypes L3-L6 and L3-L7, which respectively corresponds to D6_14 and D3_6 in mice (P = 1.9e-5, Beta = 0.07; P = 8.7e-5, Beta = 0.06) (Fig 3E and S11 Table). Overall, the ears of the heterozygous Intu mutant mice were shorter than those of the WT mice consistently show in the result of PC3 and distance phenotypes (Fig 3D–3F). Furthermore, the heterozygous Intu mutant mice showed a significant trend of reduction in body length and fore and hind limb length (S8 Fig), which was consistent with previous findings [13].
[Figure omitted. See PDF.]
Fig 3. In-vivo mouse models of INTU deficiency.
Intu mutant mice (Intu-+/-, n = 10, 9 weeks) vs. C57BL/6 wild type control mice (WT+/+, n = 8, 9 weeks) were compared for ear and body morphological differences (for details of Tbx15 mice see S9 Fig). (A) Regional association plot for the associated locus nearby INTU, and the other two SNPs at this locus, which have been recently reported the association with face morphology, were also marked. (B) The schematic diagram of the one-step CRISPR/Cas9 technology used in Intu knockout mice (see S10 Fig for details). (C) Example of left profile craniofacial photo of Intu+/- mutant mice with removed hair (left); and 21 craniofacial landmarks pattern in 3D mice image (right). (D) The principal component analysis for 21 landmarks of Intu+/- mutant mice and WT+/+ mice. The left layer shown the detailed contributed proportion of 21 landmarks to the first 5 principal components. The upper left shown the cumulative variance is explained by the first 10 PCs. The significant association between the genotype and PC3 shown in upper right (* P < 0.05, ** P < 0.01, *** P < 0.001). The bottom shown the maximum PC3-, minimum PC3- and mean ear shapes (the labels on the mouse ear were consistent with the FigC). (E) The pattern of genetic association in humans (left) and in mice (right). (F) Effect of Intu knock-out on ear phenotypes in mice (blue for effect of heterozygote mutant and red for wild type). The labels on three ears were consistent with Fig S1 (more details see in Methods).
https://doi.org/10.1371/journal.pgen.1010786.g003
Our breeding experiment generated 37 F2 9-weeks adult Tbx15 mice i.e., 9 homozygous Tbx15-/-, 18 heterozygous Tbx15+/-, 10 wild-type WT (Tbx15+/+). Previous gene editing experiments have already shown that Tbx15-/- mutant mice demonstrate facial variation, loss of weight, shorter limbs and a distinct “droopy ear” feature [39]. In our study, besides the facial differences reported previously, ear differences were obvious. In addition to the distinct “droopy ear” feature, the Tbx15 genotype was significantly associated with the ear landmarks PC1 (P = 3.2e-3) and PC4 (P = 1.9e-3) (S9C Fig). Overall, the ears of the Tbx15 mutant mice were obviously longer and wider than those of the WT mice (S9E and S9F Fig).
Discussion
In this study of the genetic architecture of ear morphology, we developed a CNN-based deep learning model to automatically locate 17 anatomical ear landmarks, allowing us to quantify 136 ear traits in 14,921 individuals from three continents. We then used a recently developed C-GWAS method [4] to combine GWASs of ear traits. These efforts allowed us to identify 16 genetic loci associated with multiple ear traits, including 8 novel and 8 previously reported loci [1,2]. Bioinformatic analysis supports the functional role of the newly identified genes in ear morphogenesis, and gene editing experiments in mice showed that a gene desert near INTU has a functional impact on ear morphology. Our findings also highlight shared genetic contributors between some surface ectoderm-derived appearance phenotypes, with an emphasis on ear and facial morphology, male pattern baldness, and mono eyebrow.
The novel ear-associations at 5 of the 8 loci we identified here i.e., 4q28.1 INTU, 5p15.1 ANKH, 8q24.13 HAS2/HAS2-AS1, 10q22.2 C10orf11, and 20q11.22 UQCC1/GDF5 are supported by other evidence, whereas for 3 novel ear genes (2q13 ACOXL, 9q33.1 ASTN2, and 21q21.3 N6AMT1) no such evidence is currently available. The strong evidence for INTU and HAS2 has been detailed above. The gene ANKH, which regulates inorganic pyrophosphate transport, is located near the lead SNP rs10062331 at 5p15.1. ANKH has been linked to diseases such as Craniometaphyseal Dysplasia (OMIM: 123000) and Chondrocalcinosis 2 (OMIM: 118600), both of which are characterized by abnormal development of bones and connective tissue. ANKH’s association with skeletal disorders suggests it may play a role in ear cartilage development. The lead SNP rs10824309 at 10q22.2 is an intronic variant of C10orf11, also known as LRMDA. This gene has been associated with craniofacial traits such as adolescent idiopathic scoliosis [40], heel bone mineral density [41], and chin shape [21]. This provides additional support for the novel association between 10q22.2 and ear development as craniofacial and ear development both stem from a common origin during early development. The lead SNP rs2378353 located at 20q11.22 is within the intron of UQCC1, which encodes a transmembrane protein involved in the assembly of the ubiquinol-cytochrome c reductase complex. Polymorphisms in this gene have been linked to human height [42] and osteoarthritis [43], and rs2378353 has been found to have a significant impact on the splicing of UQCC1 in muscle-skeletal tissue, as well as its expression in multiple tissues including fibroblasts, muscle-skeletal tissue, and skin [44]. UQCC1 is expressed in the branchial arches and head surface ectoderm during early embryonic development in mice [45]. These findings, combined with the differential expression of UQCC1 in cranial neural crest cells, suggest that rs2378353 may play a role in ear morphogenesis by regulating the expression of UQCC1 during early development. The functional information for the other 3 novel ear-associated loci identified in this study is limited.
Eight of the 16 identified loci we identified here with association with quantitative ear traits have been previously reported to be associated with qualitative ear features (mainly earlobe features) in single-trait GWASs, including 1p12 TBX15 [2], 2q12.3 EDAR [1,2], 2q31.1 SP5/MYO3B [1,2], 3q23 PISRT1/MRPS22 [1,2], 4q31.3 LRBA [1,2], 6q21 PRDM1/ATG5 [2], 6q24.2 LOC153910/HIVEP2 [2], and 7q21.3 DLX6/SEM1 [2]. Two of these (4q31.3 LRBA and 7q21.3 DLX6/SEM1), which were solely identified by C-GWAS in our study, were previously identified in a single GWAS of earlobe attachment involving ~70,000 samples; hence, in a study that was five times larger than our current study. That besides the 5x smaller sample size, our study identified 8 novel loci, half of which were identified only by C-GWAS, confirms the power gain of C-GWAS in detecting small but multi-trait effects, as previously highlighted also for human face shape traits [4].
A significant portion of the 16 ear-related loci discussed here have been shown to have strong ties to human facial structure [8,15,21,22,30]. This indicates that both ear and face morphologies are influenced by common genetic factors, likely stemming from shared developmental processes in the craniofacial area. Additionally, a considerable number of the 16 ear-related loci were also previously linked to male pattern baldness, and two loci were linked to mono-eyebrow [21]. These findings suggest that not only do ears and face share genetic factors, but ears, baldness, and mono-eyebrow also share genetic factors, and it is possible that other craniofacial traits may also be influenced by similar genetic factors. Further investigation is needed to fully understand this.
Our study sheds light on the genetic basis of the Darwin tubercle, a special ear phenotype that is equivalent to the ear tip of other mammals and represents a trace of human evolution [11,46]. Previous genetic studies on this trait have been limited and have not produced significant results. Our findings indicate that the HAS2 gene influences the L3-L15 phenotype, which roughly represents the Darwin tubercle, marking the beginning of understanding this intriguing aspect of human evolution.
This study involved cohorts belonging to three continental populations from Europe, Asia, and America. For detecting cross-population allelic effects, C-GWAS demonstrated noticeably improved power than MinGWAS as evident by boosting the significance of known ear-associated loci while keeping the same study-wide type-I error rate as MinGWAS. However, because our approach required the a priori exclusion of SNPs not polymorphic in one of the studied cohorts, population-specific association signals, could not be detected. Another flip-side caveat of our study, in which we maximized the sample size used for discovery, is that no additional population samples were available for direct replication of our novel findings. However, that half of the loci we identified represent previously known ear loci that we re-discovered with our approach, puts confidence in our approach and thus also in the true-positive status of the newly identified loci. Additional confidence is provided by other evidence we accumulated from the literature for most of the novel loci as well as by direct functional evidence in mice for one novel gene (INTU) and one previously reported gene (TBX15). Nevertheless, future studies in independent population samples from different continental populations are warranted to further verify our novel findings.
Our study represents a landmark in the field of genetics as it is the first to examine the genetic basis of quantitative ear morphology using a deep-learning-based CNN phenotyping approach. By combining the results of 136 single ear trait GWASs conducted in well-sized population samples of different ancestral origins, our C-GWAS method enabled us to identify eight new loci and confirm eight previously reported loci that play a role in ear morphology. These loci impact a wide range of ear phenotypes, demonstrating the increased power of C-GWAS in detecting multi-trait effects. Furthermore, our findings suggest that many facial and ear traits share a substantial proportion of genetic determinants, derived from the surface ectoderm. Our study significantly advances the understanding of the genetics of human ear morphology and provides a list of promising candidate genes for further functional studies.
Materials and methods
Ethics statement
This study includes five cohorts, all cohorts were approved by the Ethics Committee and all participants provided written informed consent to participate in the study. Details in following: The Rotterdam Study (RS) has been approved by the Medical Ethics Committee of the Erasmus MC (registration number MEC 02.1015) and by the Dutch Ministry of Health, Welfare and Sport (Population Screening Act WBO, license number 1071272-159521-PG). The TwinsUK Study (TwinsUK) has been approved by the St. Thomas’ Hospital Local Research Ethics Committee. The Taizhou Longitudinal Study (TZL) has been approved by the Ethics Committee of Human Genetic Resources at the Shanghai institute of life Sciences, Chinese Academy of Sciences (ER-SIBS-261410). The National Survey of Physical Traits Study (NSPT) has been approved by the Ethics Committee of Human Genetic Resources of School of Life Sciences, Fudan University, Shanghai (14117). The Consortium for the Analysis of the Diversity and Evolution of Latin America (CANDELA) Study has been approved by ethical committee at universities in all samples countries: the Universidad Nacional Autonoma de Mexico (Mexico), the Universidad de Antioquia (Colombia), the Universidad Peruana Cayetano Heredia (Peru), the Universidad de Tarapaca (Chile), the Universidade Federal do Rio Grande do Sul (Brazil) as well as at the University College London (UK).
Study cohorts
This study used a total of 14,921 samples that passed quality control from five cohorts of multi-ethnic ancestries from Europe (the Rotterdam Study, RS, N = 3,675; the TwinsUK study, N = 1,065), Asia (the Taizhou Longitudinal Study, TZL, N = 2,348; the National Survey of Physical Traits study, NSPT, N = 2,487), and Latin America (the CANDELA study, N = 5,346). Details about these five cohorts are included below.
The Rotterdam Study
The Rotterdam Study (RS) is a population-based prospective study of individuals aged ≥ 45 years living in a suburb of Rotterdam, the Netherlands. Details regarding the cohort profile have been described elsewhere [47]. The Rotterdam Study has been approved by the Medical Ethics Committee of the Erasmus MC (registration number MEC 02.1015) and by the Dutch Ministry of Health, Welfare and Sport (Population Screening Act WBO, license number 1071272-159521-PG). The Rotterdam Study has been entered into the Netherlands National Trial Register (NTR; www.trialregister.nl) and into the WHO International Clinical Trials Registry Platform (ICTRP; www.who.int/ictrp/network/primary/en/) under shared catalogue number NTR6831. All participants provided written informed consent to participate in the study. A total of 5,604 participants not wearing make-up, cream nor jewelry, were photographed using a Premier 3dMD face3-plus UHD camera (3dMD, Atlanta, Georgia, USA). Each person’s 3D avatar is rendered from three 2D high resolution photos taken from predefined angles, i.e., from front-top, left-down, and right-down directions. Ears in the left-down and right-down photos were manually segmented and chopped for subsequent phenotyping. Genotyping was carried out using the Infinium II HumanHap 550K Genotyping BeadChip version 3 (Illumina, San Diego, California USA). All SNPs were imputed using MACH software (www.sph.umich.edu/csg/abecasis/MaCH/) based on the 1000-Genomes Project reference population information [48]. Genotype and individual quality controls have been described in detail previously [49]. In current study, after all phenotype and genotype quality controls, this study included 3,675 individuals of RS.
TwinsUK Study
The TwinsUK Study (TwinsUK) included 1,153 phenotyped participants (all female and all of European ancestry) within the TwinsUK adult twin registry based at St. Thomas’ Hospital in London. Volunteers gave written informed consent under a protocol reviewed by the St. Thomas’ Hospital Local Research Ethics Committee. Participants were photographed using a Premier 3dMD face3-plus UHD camera (3dMD, Atlanta, Georgia, USA), and 3D avatars were rendered from two 2D photos taken from the left and right directions. Ears in the 2D photos were manually segmented and chopped for subsequent phenotyping. Genotyping of the TwinsUK cohort was done with a combination of Illumina HumanHap300 and HumanHap610Q chips. Intensity data for each of the arrays were pooled separately and genotypes were called with the Illuminus32 calling algorithm, thresholding on a maximum posterior probability of 0.95 as previously described [50]. Imputation was performed using the IMPUTE 2.0 software package using haplotype information from the 1000-Genomes Project (Phase 1, integrated variant set across 1,092 individuals, v2, March 2012). After all phenotype and genotype quality controls, the current study included a total of 9.35 million autosomal SNPs (MAF > 0.01, imputation R2 > 0.3, SNP call rate > 0.97, HWE > 1e-6) and 1,065 individuals of TwinsUK.
Taizhou Longitudinal Study
The Taizhou Longitudinal Study (TZL) includes Han Chinese sampled in Taizhou, Jiangsu province in 2014 [51]. In total, 2,600 individuals were enrolled. The TZL was approved by the Ethics Committee of Human Genetic Resources at the Shanghai institute of life Sciences, Chinese Academy of Sciences (ER-SIBS-261410). All participants had provided written consent. Participants were photographed using a Canon EOS700D camera. Four digital photographs of the face: left side (−90°), frontal (0°)*2, right side (90°) were taken from ∼1.5 meters. Ears were cropped from the left side facial photographs. All samples were genotyped using the Illumina HumanOmniZhongHua-8 chips, which interrogates 894,517 SNPs. Individuals with more than 5% missing data, related individuals, and the ones that failed the X-chromosome sex concordance check or had ethnic information incompatible with their genetic information were excluded. SNPs with more than 2% missing data, with a minor allele frequency smaller than 1%, and the ones that failed the Hardy–Weinberg deviation test (P < 1e-5) were also excluded. After applying these filters, we obtained a dataset of 2,600 samples with 776,213 SNPs. The chip genotype data were firstly phased using SHAPEIT [52]. IMPUTE2 [53] was then used to impute genotypes at non-genotyped SNPs using the 1000 Genomes Phase 3 data as the reference panel. After all phenotype and genotype quality controls, the current study included a total of 7,057,720 imputed and genotyped SNPs and 2,348 individuals of TZL.
The National Survey of Physical Traits Study
The National Survey of Physical Traits Study (NSPT) is part of the National Science & Technology Basic Research Project, which contained four sub-cohorts collected from three different regions of China in different years: Taizhou, Jiangsu province in 2015 and 2019, Zhengzhou, Henan province in 2017, Nanning, Guangxi province in 2018. The NSPT project was approved by the Ethics Committee of Human Genetic Resources of School of Life Sciences, Fudan University, Shanghai (14117). All participants provided written informed consent. Participants were photographed using a Canon EOS700D camera. Ten digital photographs of the face: left side (−90°)*2, left angle (−45°)*2, frontal (0°)*2, right angle (45°)*2, right side (90°)*2 were taken from ∼1.5 meters. Photos in each direction include one with eyes open and one with eyes closed. Ears were cropped from the left side facial photographs with open eyes. DNA was extracted from blood samples using the MagPure Blood DNA KF Kit. The DNA samples were genotyped on the Illumina Infinium Global Screening Array that investigates 707,180 variants which is a fully custom array designed by WeGene (https://www.wegene.com/). Individuals with more than 5% missing data, related individuals, and the ones that failed the X-chromosome sex concordance check or had ethnic information incompatible with their genetic information were excluded. In total, 2,487 individuals were enrolled (15HanTZ: N = 404; 17HanZZ: N = 644; 18HanNN: N = 1,119; 19HanTZ: N = 320). The genotype data were phased using SHAPEIT [52] and imputed to the 1000 Genomes Project Phase 3 reference panel using IMPUTE2 [53]. Variants exclusion criteria included INFO < 0.8, certainty score < 0.9, MAF < 0.02, SNP-wise call rate > 5%, and deviation from Hardy-Weinberg equilibrium (P < 1×10−6). After all phenotype and genotype quality controls, the current study included a total of 8,018,212 imputed and genotyped SNPs and 2,487 individuals of NSPT.
CANDELA Study
The Consortium for the Analysis of the Diversity and Evolution of Latin America (CANDELA) Study [54] consists of 6,630 volunteers recruited in five Latin American countries (Brazil, Colombia, Chile, Mexico and Peru). Ethics approvals were obtained from ethical committee at universities in all samples countries: the Universidad Nacional Autonoma de Mexico (Mexico), the Universidad de Antioquia (Colombia), the Universidad Peruana Cayetano Heredia (Peru), the Universidad de Tarapaca (Chile), the Universidade Federal do Rio Grande do Sul (Brazil) as well as at the University College London (UK). All participants provided written informed consent. Five digital photographs of the face: left side (−90°), left angle (−45°), frontal (0°), right angle (45°), right side (90°) were taken from ~1.5 meters at eye level using a Nikon D90 camera fitted with a Nikkor 50 mm fixed focal length lens. Ears were cropped from the left side facial photographs. DNA samples were genotyped on Illumina’s Omni Express BeadChip. After applying quality control filters 669, 462 SNPs and 6,357 individuals were retained for further analyses (2,922 males, 3,435 females). Average admixture proportions for this sample were estimated as: 48% European, 46% Native American and 6% African, but with substantial inter-individual variation. After all genomic and phenotypic quality controls this study included 6,238 individuals. The genetic PCs were obtained from the LD-pruned dataset of 93,328 SNPs. These PCs were selected by inspecting the proportion of variance explained and checking scatter and scree plots. The final imputed dataset used in the GWAS analyses included genotypes for 9,143,600 SNPs using the 1000-Genomes Phase I reference panel. After all phenotype and genotype quality controls, the current study included a total of 9,143,600 imputed and genotyped SNPs and 5,346 individuals of CANDELA.
Deep learning based ear phenotyping
Our computer pipeline for quantitative ear phenotyping, which includes automated ear detection, segmentation, landmarking, and phenotype acquisition is available at https://github.com/Fun-Gene/EarPhenotyping. In all cohorts, we used the same pipeline to minimize potential variation caused by different methodology. Specifically, we used the previously established two-stage ear landmark detector consisting of two CNNs deep-learnt from ~15,000 labeled ear images [3]. Both CNNs had the same architecture consisting of alternating 3 convolutional layers and 3 max-pooling layers followed by 3 fully connected layers. The 1st CNN detects various orientations and sizes and rectifies all coordinates at a coarse-grained level. The 2nd CNN was trained in a more controlled scenario for accurate landmarking of 55 ear landmarks. The performance of this two-stage landmark detector has been verified in various scenarios of difficulty that represents the state-of-the-art in ear landmark detection. In our application, we double checked all resultant landmarks from all images by eye-balling and removed those low-quality images that failed in CNN-based landmarking. After obtaining the coordinates of 55 landmarks, Generalized Procrustes Analysis (GPA) was used to remove affine variations due to shifting, rotation, and scaling. We then focused on the 17 most anatomical meaningful landmarks covering the entire ear (S1A Fig and S12 Table) and from those, calculated 136 inter-landmark distances as ear phenotypes in the genetic studies. Outliers greater than 3 standard deviations were removed and Z-transformed values were used in all subsequent analyses.
For quality control purposes, a trained rater carefully labeled the 17 landmarks on both the left and the right ear in 50 randomly selected and shuffled images. Pearson’s correlations were calculated between the left and the right ear phenotypes from this rater, which were compared with the left-right correlations from the CNN approach. Within the same ear (the left side only), Pearson’s correlations between the rater and the CNN approach were calculated.
Genetic correlation, heritability, GWASs, meta-analyses
Genetic correlations were estimated from all SNPs using GEMMA [55] based on a multivariate linear mixed model. Unsupervised hierarchical clustering analysis is conducted using 1-abs (correlation) as a dissimilarity matrix, and each iteration is updated using the Lance-Williams formula [56]. Twin heritability was estimated using phenotype correlations in monozygotic (MZ) and dizygotic twins (DZ), h2 = 2(r(MZ)-r(DZ)). SNP-based heritability was estimated from all GWAS SNPs using the restricted maximum likelihood estimation in GCTA [57].
GWASs were independently carried out in each cohort. GWASs in unrelated individuals (RS, TZL, NSPT, and CANDELA) were performed using linear models assuming an additive allele effect adjusted for covariates sex, age, BMI, and top 5–10 genomic PCs using PLINK 1.9 [58]. GWASs in TwinsUK (females only) were performed using GEMMA [55], which implements an LME model with an empirical genetic relatedness matrix to account for cryptic pedigree and population structure. All cohorts were aligned according to the genome-build GRCh37.p13. Meta-analyses of all cohorts were conducted using the inverse variance fixed-effect model in PLINK 1.9. All statistical analyses were conducted using the R Environment for Statistical Computing (version 3.5.2) unless otherwise specified.
C-GWAS
In the application of C-GWAS analysis on 136 ear meta-analyses, we focused on SNPs with an observable frequency (MAF > 0.01) in three different continental groups (European, East Asian, and Latin American). The C-GWAS is an R library that is freely available at https://github.com/Fun-Gene/CGWAS. In the current application, default parameters were used. The summary statistics of the 136 meta-analyses were used as the input of C-GWAS. Details of C-GWAS method has been described previously [4]. In brief, the null hypothesis (H0) for a SNP under testing is the absence of any allelic effect on all traits, and the alternative hypothesis (H1) is that its allelic effects deviate from 0 for at least one of the multiple traits. C-GWAS incorporate two different tests originated from either the effect based inversed covariance weighting or the truncated Wald test to maximize statistical power. All resultant P-values from C-GWAS are adjusted using the getCoef function implemented in C-GWAS, which performs simulations (n simulations = 1e8) to guarantee that the null distribution of C-GWAS follows the uniform distribution in all quantiles. This simulation analysis was also applied for adjusting for minimal p-values of meta-analysis of 136 traits, and the adjusted minimal P-values are abbreviated as MinGWAS. Therefore, C-GWAS and from MinGWAS are directly comparable with each other and with any standard single-trait GWAS, so that the traditional genome-wide significance threshold of 5e-8 corresponds to our study-wide significance threshold. C-GWAS completed the analysis of 136 traits and 4,803,785 SNPs within 2 hours with 16 threads in parallel and peak memory usage of 32 GB.
Post-GWAS analyses
SNPs and nearby genes were annotated using ANNOVAR [59]. Enrichment analysis of biological processes, molecular functions, and cellular components were conducted using Metascape [60] based on the Gene Ontology (GO) database. Regulatory activities of associated SNPs and 3D interacting genes were explored using the 3DSNP database [33]. Enhancer activities and embryonic expression patterns at the associated loci were examined in transgenic mice in the Vista Enhancer Browser database [34]. Potential functional links between nearby genes, 3D interacting genes and ear/craniofacial features were examined in the Harmonizome database [35] and gene expression patterns in the branchial arch and embryonic ectoderm were examined in the MGI database [36,37]. Gene expressions in cranial neural crest cells (CNCCs) were compared between genes of interest and background genes over the genome using RNA-seq data from Prescott et al. [61], GTEx [44], and ENCODE [62]. We attained all annotated nearest genes of the 1e-4 sets of matched SNPs for 16 lead SNPs using SNPsnap [63], the 16 genes with the median expression value separately in 50 cell types were as the correspondent control genes. Partitioned heritability enrichments based on CNCCs [61], 4-stages of embryonic craniofacial tissues [64] and 97 cell types form Roadmap Epigenomics resource [65,66]. The annotations for CNCCs, embryonic craniofacial tissues and other cell types refer the previous study [67] using S-LDSC [38]. For CNCCs, we downloaded and processed H3K27ac and ATAC data, and all data from different replicates. Peaks were called using MACS2 [68]. We iteratively obtained and combined the most reliable peaks based on a 50% peak overlap rate for all replicates from of H3K27ac and ATAC data using BEDtools. [69]. Based on the combined peaks, we used ROSE [70] to infer enhancers including super-enhancers and annotated all enhancer regions (S14 Table). For embryonic craniofacial tissues, we combined all regions with the following annotations from the 25-state chromHMM model: ‘Enh’, ‘TxReg’, ‘PromD1’, ‘PromD2’, ‘PromU’ and ‘TssA’. For other cell types, we combined all regions with the following annotations from the 15-state chromHMM model: ‘1_TssA’, ‘2_TssAFlnk’, ‘7_Enh’ and ‘6_EnhG’. Each annotation was individually added to the baseline LD model [38]. Also, we use the C-GWAS summary data for 136 meta-analysis in European populations including RS and TwinsUK. Pleiotropic associations were looked up in GWAScatalog based on the region (basepair position of lead SNP +- 500kb) and annotated gene [71].
CRISPR-Cas9-mediated gene editing in mouse
We generated C57BL/6J Intu knockout mice using the one-step CRISPR/Cas9 method [72] and tightly followed the steps described in a previous study of Tbx15 and Pax1 genes [39]. In brief, fertilized eggs obtained from super-ovulated females (four weeks) mated with males (seven-eight weeks) were microinjected with mixtures of Cas9 mRNA and sgRNA (S10 Fig). The injected eggs were cultured to day two and transferred to female mice. We obtained only positive heterozygote mutants because homozygote mutants of Intu were fatal. Compared with the intact allele (402kb), the mutant allele had a reduced sequence (251kb) with a removal of exons from 2 to 15 (S10 Fig). Mice were raised in a pathogen-free environment and bred according to SPF animal breeding standards. After eight months breeding experiments, we obtained a total of 19 F2 sexually mature mice (9-weeks), including 8 wild-type and 11 heterozygote mutant mice. For Tbx15, we used previously generated mice [39], including 10 wild-type, 18 heterozygote, and 10 homozygote mutant mice. The use of laboratory animals (SYXK 2019–0022) was licensed by the Beijing Municipal Science and Technology Commission.
Mouse pinna phenotyping
F2 sexually mature mice were sacrificed by cervical dislocation. Hair was removed with a razor and Weiting depilatory cream. A HandySCAN BLACK scanner was used to obtain 3D models (resolution 0.3 mm, accuracy 0.03 mm). The coordinates of 9 anatomical landmarks (3 facial and 6 ear landmarks) and 12 pseudo-landmarks were obtained using Geomagic Wrap (Fig 2C and S13 Table). The 12 pseudo-landmarks were equally spaced from the four partial curves of the mouse auricle contour (including L1-L4, L4-L6, L6-L5, and L5-L3) for better covering the contour of the auricle. The R package ‘geomorph’ were used for series of analysis (https://github.com/geomorphR/geomorph). GPA was used to remove the effects of translation, rotation and scaling. After superimposition of the GPA-adjusted coordinates, only the shape component remained in the aligned specimens. Mice with the outlier in PC1 were excluded then the GPA and PCA were redone. The first 10 PCs were used to represent dominant but different dimensional variations of all landmarks. The ear shape visualization of shape variation based on the maximum and minimum PC compare to mean ear shape. The inter-landmark also distances also were visualized (S13 Table). Z-transformed variables were used for association analysis. The effect of per mutant allele on ear shape was tested using linear regression with covariate sex, weight and body length. Multiple testing was corrected using FDR method.
Supporting information
S1 Fig. Study design.
(A) The location of selected 17 ear landmarks. (B) Design of the current study.
https://doi.org/10.1371/journal.pgen.1010786.s001
(TIF)
S2 Fig. Pearson’s correlation coefficients for 136 ear phenotypes derived from different methods (auto pipeline and manual-landmarking) and different ears (left and right ears).
(A) correlation between left ear phenotypes by manual-landmarking (expert 1 vs. expert 2). (B) correlation between right ear phenotypes by manual-landmarking and auto-landmarking (expert 1 vs. auto). (C) correlation between phenotypes from left and right ears by manual-landmarking. (D) correlation between phenotypes from left and right ears by auto-landmarking.
https://doi.org/10.1371/journal.pgen.1010786.s002
(TIF)
S3 Fig. Effects of sex (left) and age (right) on 136 ear phenotypes in RS.
Please note the different figure legends in these two figures.
https://doi.org/10.1371/journal.pgen.1010786.s003
(TIF)
S4 Fig. Two distinct clusters of 136 ear phenotypes derived from unsupervised hierarchical clustering.
(A) Two clusters for 136 phenotypes. (B) Phenotypic (right up) and genetic correlation matrix (left down) within and between the cluster.
https://doi.org/10.1371/journal.pgen.1010786.s004
(TIF)
S5 Fig. Heritability of 136 ear phenotypes estimated in TwinsUK.
(A) Twin heritability. (B) SNP-based heritability.
https://doi.org/10.1371/journal.pgen.1010786.s005
(TIF)
S6 Fig. 16 ear-associated loci we identified in our current MinGWAS and C-GWAS.
The first 8 (A-H) were novel loci, others 8 were previously reported loci (I-P). Each figure includes three figures, LocusZoom (up) shows regional association plots for the top-associated ear phenotype (p values in CGWAS, except for the 6q21 PRDM1/ATG5, which solely identified by meta-analysis) with candidate genes aligned below according to the chromosomal positions (GRCh37.p13) followed by the linkage disequilibrium (LD) patterns (r2) of European. Ear map (left lower) shows the association (p values in Meta-analysis) between all ear phenotypes (P < 1e-3) and top-SNP identified in our analysis. Effect plot (right lower) shows effect sizes for the effect allele of top-SNPs from the association with top-associated ear phenotype in all 5 GWASs and meta-analysis.
https://doi.org/10.1371/journal.pgen.1010786.s006
(DOCX)
S7 Fig. Various degrees of Darwin’s tubercle.
https://doi.org/10.1371/journal.pgen.1010786.s007
(TIF)
S8 Fig. Comparison of 4 phenotypes of Intu+/- mutant mice and WT+/+ mice including weight, body length, forelimb length and posterior limb length (* P < 0.05, ** P < 0.01, *** P < 0.001).
https://doi.org/10.1371/journal.pgen.1010786.s008
(TIF)
S9 Fig. In-vivo mouse models of TBX15 deficiency.
Homozygous Tbx15-/- mutant mice (N = 9, 9 weeks), heterozygous Tbx15+/- (N = 18, 9 weeks) and C57BL/6 WT+/+ control mice (N = 10, 9 weeks) were compared for ear and body morphological differences. (A) The schematic diagram of the one-step CRISPR/Cas9 technology used in Tbx15 knockout mice. (B) Example of left profile craniofacial photo of Tbx15-/- mutant mice with removal hair. (C) The principal component analysis for 21 landmarks of Tbx15-/- mutant mice, heterozygous Tbx15+/- and WT+/+ mice. The upper layer shown the detailed contribution proportion of 21 landmarks to the first 10 principal components. The middle layer shown the screenplot of first 10 PCs, the significant association between the genotype and PCs which including PC1 and PC4 (* P < 0.05, ** P < 0.01, *** P < 0.001). The bottom shown the maximum PC1-, minimum PC1-,maximum PC4-, minimum PC4-, and mean ear shapes. (D) The pattern of genetic association in humans (left) and in mice (right). (E) Effect of Tbx15 knock-out on ear phenotypes in mice (blue for effect of heterozygote mutant and red for wildtype).
https://doi.org/10.1371/journal.pgen.1010786.s009
(TIF)
S10 Fig. The INTU knockout mice using one-step CRISPR/Cas9 technology.
(A) The schematic diagram. (B) the sequence of gRNA. (C) Genotype was identified by PCR (Positive 251kb (loss-function), negative 402kb (wild type), heterozygous including 251kb and 402kb).
https://doi.org/10.1371/journal.pgen.1010786.s010
(TIF)
S1 Table. Characteristics of 5 cohorts.
https://doi.org/10.1371/journal.pgen.1010786.s011
(XLSX)
S2 Table. Characteristics of 136 ear phenotypes in 5 cohorts.
https://doi.org/10.1371/journal.pgen.1010786.s012
(XLSX)
S3 Table. Kolmogorov-Smirnov normality tests of 136 ear phenotypes in 5 cohorts.
https://doi.org/10.1371/journal.pgen.1010786.s013
(XLSX)
S4 Table. The effects of age and sex on 136 ear phenotypes in the RS cohort.
https://doi.org/10.1371/journal.pgen.1010786.s014
(XLSX)
S5 Table. Replication of previously ear-associated SNPs in MinGWAS and C-GWAS.
https://doi.org/10.1371/journal.pgen.1010786.s015
(XLSX)
S6 Table. Replication of previously face-associated SNPs in MinGWAS and C-GWAS.
https://doi.org/10.1371/journal.pgen.1010786.s016
(XLSX)
S7 Table. GWAS Catalog entries for significant SNPs in MinGWAS and C-GWAS.
https://doi.org/10.1371/journal.pgen.1010786.s017
(XLSX)
S8 Table. Enrichment of 31 genes in the 16 ear-associated loci.
https://doi.org/10.1371/journal.pgen.1010786.s018
(XLSX)
S9 Table. The evidence of the 16 ear-associated lead SNPs or nearby genes in four databases including 3DSNP, VISTA, Harmonizome, and MGI.
https://doi.org/10.1371/journal.pgen.1010786.s019
(XLSX)
S10 Table. Partitioned heritability enrichments based on cell-type-specific regulatory annotations.
https://doi.org/10.1371/journal.pgen.1010786.s020
(XLSX)
S11 Table. Effects of sex and genotypes on 28 ear phenotypes in mice.
https://doi.org/10.1371/journal.pgen.1010786.s021
(XLSX)
S12 Table. Definition of 17 human ear landmarks.
https://doi.org/10.1371/journal.pgen.1010786.s022
(XLSX)
S13 Table. Definition of 21 ear landmarks in mice.
https://doi.org/10.1371/journal.pgen.1010786.s023
(XLSX)
S14 Table. Enhancer regions estimated based on the H3K27ac and ATAC-seq using ROSE.
https://doi.org/10.1371/journal.pgen.1010786.s024
(XLSX)
Acknowledgments
The authors thank all sample donors for their contribution to this project. We would like to express our gratitude to Winston Rojas-Montoya for his invaluable support at the Universidad de Antioquia during the difficult period following the unfortunate passing of Gabriel Bedoya.
Citation: Li Y, Xiong Z, Zhang M, Hysi PG, Qian Y, Adhikari K, et al. (2023) Combined genome-wide association study of 136 quantitative ear morphology traits in multiple populations reveal 8 novel loci. PLoS Genet 19(7): e1010786. https://doi.org/10.1371/journal.pgen.1010786
1. Adhikari K, Reales G, Smith AJ, Konka E, Palmen J, Quinto-Sanchez M, et al. A genome-wide association study identifies multiple loci for variation in human ear morphology. Nat Commun. 2015;6:7500. Epub 2015/06/25. pmid:26105758.
2. Shaffer JR, Li J, Lee MK, Roosenboom J, Orlova E, Adhikari K, et al. Multiethnic GWAS Reveals Polygenic Architecture of Earlobe Attachment. Am J Hum Genet. 2017;101(6):913–24. Epub 2017/12/05. pmid:29198719.
3. Hansley EE, Segundo MP, Sarkar S. Employing fusion of learned and handcrafted features for unconstrained ear recognition. Iet Biometrics. 2018;7(3):215–23.
4. Xiong Z, Gao X, Chen Y, Feng Z, Pan S, Lu H, et al. Combining genome-wide association studies highlight novel loci involved in human facial variation. Nat Commun. 2022;13(1):7832. Epub 2022/12/21. pmid:36539420.
5. Turley P, Walters RK, Maghzian O, Okbay A, Lee JJ, Fontana MA, et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat Genet. 2018;50(2):229–37. Epub 2018/01/03. pmid:29292387.
6. Klewer SE, Yatskievych T, Pogreba K, Stevens MV, Antin PB, Camenisch TD. Has2 expression in heart forming regions is independent of BMP signaling. Gene Expr Patterns. 2006;6(5):462–70. Epub 2006/02/07. pmid:16458617.
7. Casini P, Nardi I, Ori M. Hyaluronan is required for cranial neural crest cells migration and craniofacial development. Dev Dyn. 2012;241(2):294–302. Epub 2011/12/21. pmid:22184056.
8. Xiong Z, Dankova G, Howe LJ, Lee MK, Hysi PG, de Jong MA, et al. Novel genetic loci affecting facial shape variation in humans. Elife. 2019;8. Epub 2019/11/26. pmid:31763980.
9. Kichaev G, Bhatia G, Loh PR, Gazal S, Burch K, Freund MK, et al. Leveraging Polygenic Functional Enrichment to Improve GWAS Power. Am J Hum Genet. 2019;104(1):65–75. Epub 2019/01/01. pmid:30595370.
10. Yap CX, Sidorenko J, Wu Y, Kemper KE, Yang J, Wray NR, et al. Dissection of genetic variation and evidence for pleiotropy in male pattern baldness. Nat Commun. 2018;9(1):5407. Epub 2018/12/24. pmid:30573740.
11. X W, H X, Z C. Anthropometric Methods (Chinese Version)2010.
12. Park TJ, Haigo SL, Wallingford JB. Ciliogenesis defects in embryos lacking inturned or fuzzy function are associated with failure of planar cell polarity and Hedgehog signaling. Nat Genet. 2006;38(3):303–11. Epub 2006/02/24. pmid:16493421.
13. Chang R, Petersen JR, Niswander LA, Liu A. A hypomorphic allele reveals an important role of inturned in mouse skeletal development. Dev Dyn. 2015;244(6):736–47. Epub 2015/03/17. pmid:25774014.
14. Bruel AL, Franco B, Duffourd Y, Thevenon J, Jego L, Lopez E, et al. Fifteen years of research on oral-facial-digital syndromes: from 1 to 16 causal genes. J Med Genet. 2017;54(6):371–80. Epub 2017/03/16. pmid:28289185.
15. Adhikari K, Fuentes-Guajardo M, Quinto-Sanchez M, Mendoza-Revilla J, Camilo Chacon-Duque J, Acuna-Alonzo V, et al. A genome-wide association scan implicates DCHS2, RUNX2, GLI3, PAX1 and EDAR in human facial variation. Nat Commun. 2016;7:11616. Epub 2016/05/20. pmid:27193062.
16. Tan J, Yang Y, Tang K, Sabeti PC, Jin L, Wang S. The adaptive variant EDARV370A is associated with straight hair in East Asians. Hum Genet. 2013;132(10):1187–91. Epub 2013/06/25. pmid:23793515.
17. Fujimoto A, Kimura R, Ohashi J, Omi K, Yuliwulandari R, Batubara L, et al. A scan for genetic determinants of human hair morphology: EDAR is associated with Asian hair thickness. Hum Mol Genet. 2008;17(6):835–43. Epub 2007/12/11. pmid:18065779.
18. Kamberov YG, Wang S, Tan J, Gerbault P, Wark A, Tan L, et al. Modeling recent human evolution in mice by expression of a selected EDAR variant. Cell. 2013;152(4):691–702. Epub 2013/02/19. pmid:23415220.
19. Kimura R, Yamaguchi T, Takeda M, Kondo O, Toma T, Haneji K, et al. A common variation in EDAR is a genetic determinant of shovel-shaped incisors. Am J Hum Genet. 2009;85(4):528–35. Epub 2009/10/07. pmid:19804850.
20. Tan J, Peng Q, Li J, Guan Y, Zhang L, Jiao Y, et al. Characteristics of dental morphology in the Xinjiang Uyghurs and correlation with the EDARV370A variant. Sci China Life Sci. 2014;57(5):510–8. Epub 2014/04/23. pmid:24752358.
21. Pickrell JK, Berisa T, Liu JZ, Segurel L, Tung JY, Hinds DA. Detection and interpretation of shared genetic influences on 42 human traits. Nat Genet. 2016;48(7):709–17. Epub 2016/05/18. pmid:27182965.
22. Claes P, Roosenboom J, White JD, Swigut T, Sero D, Li J, et al. Genome-wide mapping of global-to-local genetic effects on human facial shape. Nat Genet. 2018;50(3):414–23. Epub 2018/02/21. pmid:29459680.
23. Liu F, van der Lijn F, Schurmann C, Zhu G, Chakravarty MM, Hysi PG, et al. A genome-wide association study identifies five loci influencing facial morphology in Europeans. PLoS Genet. 2012;8(9):e1002932. Epub 2012/10/03. pmid:23028347 financial or other direct benefits. All other authors have declared that no competing interests exist.
24. Paternoster L, Zhurov AI, Toma AM, Kemp JP, St Pourcain B, Timpson NJ, et al. Genome-wide association study of three-dimensional facial morphology identifies a variant in PAX3 associated with nasion position. Am J Hum Genet. 2012;90(3):478–85. Epub 2012/02/22. pmid:22341974.
25. Cha S, Lim JE, Park AY, Do JH, Lee SW, Shin C, et al. Identification of five novel genetic loci related to facial morphology by genome-wide association studies. BMC Genomics. 2018;19(1):481. Epub 2018/06/21. pmid:29921221.
26. Shaffer JR, Orlova E, Lee MK, Leslie EJ, Raffensperger ZD, Heike CL, et al. Genome-Wide Association Study Reveals Multiple Loci Influencing Normal Human Facial Morphology. PLoS Genet. 2016;12(8):e1006149. Epub 2016/08/26. pmid:27560520.
27. Lee MK, Shaffer JR, Leslie EJ, Orlova E, Carlson JC, Feingold E, et al. Genome-wide association study of facial morphology reveals novel associations with FREM1 and PARK2. PLoS One. 2017;12(4):e0176566. Epub 2017/04/26. pmid:28441456.
28. Cole JB, Manyama M, Kimwaga E, Mathayo J, Larson JR, Liberton DK, et al. Genomewide Association Study of African Children Identifies Association of SCHIP1 and PDE8A with Facial Size and Shape. PLoS Genet. 2016;12(8):e1006174. Epub 2016/08/26. pmid:27560698.
29. Crouch DJM, Winney B, Koppen WP, Christmas WJ, Hutnik K, Day T, et al. Genetics of the human face: Identification of large-effect single gene variants. Proc Natl Acad Sci U S A. 2018;115(4):E676–E85. Epub 2018/01/06. pmid:29301965.
30. White JD, Indencleef K, Naqvi S, Eller RJ, Hoskens H, Roosenboom J, et al. Insights into the genetic architecture of the human face. Nat Genet. 2021;53(1):45–53. Epub 2020/12/09. pmid:33288918.
31. Anthwal N, Thompson H. The development of the mammalian outer and middle ear. J Anat. 2016;228(2):217–32. Epub 2015/08/01. pmid:26227955.
32. Som PM, Naidich TP. Illustrated review of the embryology and development of the facial region, part 1: Early face and lateral nasal cavities. AJNR Am J Neuroradiol. 2013;34(12):2233–40. Epub 2013/03/16. pmid:23493891.
33. Lu Y, Quan C, Chen H, Bo X, Zhang C. 3DSNP: a database for linking human noncoding SNPs to their three-dimensional interacting genes. Nucleic Acids Res. 2017;45(D1):D643–D9. Epub 2016/10/30. pmid:27789693.
34. Visel A, Minovitsky S, Dubchak I, Pennacchio LA. VISTA Enhancer Browser—a database of tissue-specific human enhancers. Nucleic Acids Res. 2007;35(Database issue):D88–92. Epub 2006/11/30. pmid:17130149.
35. Rouillard AD, Gundersen GW, Fernandez NF, Wang Z, Monteiro CD, McDermott MG, et al. The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database (Oxford). 2016;2016. Epub 2016/07/05. pmid:27374120.
36. Smith CM, Hayamizu TF, Finger JH, Bello SM, McCright IJ, Xu J, et al. The mouse Gene Expression Database (GXD): 2019 update. Nucleic Acids Res. 2019;47(D1):D774–D9. Epub 2018/10/20. pmid:30335138.
37. Bult CJ, Blake JA, Smith CL, Kadin JA, Richardson JE, Mouse Genome Database G. Mouse Genome Database (MGD) 2019. Nucleic Acids Res. 2019;47(D1):D801–D6. Epub 2018/11/09. pmid:30407599.
38. Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh PR, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet. 2015;47(11):1228–35. Epub 2015/09/29. pmid:26414678.
39. Qian Y, Xiong Z, Li Y, Liu F. The effects of Tbx15 and Pax1 on facial and other physical morphology in mice. 2021.
40. Liu J, Zhou Y, Liu S, Song X, Yang XZ, Fan Y, et al. The coexistence of copy number variations (CNVs) and single nucleotide polymorphisms (SNPs) at a locus can result in distorted calculations of the significance in associating SNPs to disease. Hum Genet. 2018;137(6–7):553–67. Epub 2018/07/19. pmid:30019117.
41. Kim SK. Identification of 613 new loci associated with heel bone mineral density and a polygenic risk score for bone mineral density, osteoporosis and fracture. PLoS One. 2018;13(7):e0200785. Epub 2018/07/27. pmid:30048462.
42. He M, Xu M, Zhang B, Liang J, Chen P, Lee JY, et al. Meta-analysis of genome-wide association studies of adult height in East Asians identifies 17 novel loci. Hum Mol Genet. 2015;24(6):1791–800. Epub 2014/11/28. pmid:25429064.
43. Tachmazidou I, Hatzikotoulas K, Southam L, Esparza-Gordillo J, Haberland V, Zheng J, et al. Identification of new therapeutic targets for osteoarthritis through genome-wide analyses of UK Biobank data. Nat Genet. 2019;51(2):230–6. Epub 2019/01/22. pmid:30664745.
44. Consortium GT. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45(6):580–5. Epub 2013/05/30. pmid:23715323.
45. Brunskill EW, Potter AS, Distasio A, Dexheimer P, Plassard A, Aronow BJ, et al. A gene expression atlas of early craniofacial development. Dev Biol. 2014;391(2):133–46. Epub 2014/05/02. pmid:24780627.
46. Millard DR, Pickard RE. Darwin’s tubercle belongs to Woolner. Arch Otolaryngol. 1970;91(4):334–5. Epub 1970/04/01. pmid:4909009.
47. Ikram MA, Brusselle GGO, Murad SD, van Duijn CM, Franco OH, Goedegebure A, et al. The Rotterdam Study: 2018 update on objectives, design and main results. Eur J Epidemiol. 2017;32(9):807–50. Epub 2017/10/25. pmid:29064009.
48. Genomes Project C, Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491(7422):56–65. Epub 2012/11/07. pmid:23128226.
49. Lango Allen H, Estrada K, Lettre G, Berndt SI, Weedon MN, Rivadeneira F, et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature. 2010;467(7317):832–8. Epub 2010/10/01. pmid:20881960.
50. Small KS, Hedman AK, Grundberg E, Nica AC, Thorleifsson G, Kong A, et al. Identification of an imprinted master trans regulator at the KLF14 locus related to multiple metabolic phenotypes. Nat Genet. 2011;43(6):561–4. Epub 2011/05/17. pmid:21572415.
51. Wang X, Lu M, Qian J, Yang Y, Li S, Lu D, et al. Rationales, design and recruitment of the Taizhou Longitudinal Study. BMC Public Health. 2009;9:223. Epub 2009/07/11. pmid:19589173.
52. O’Connell J, Gurdasani D, Delaneau O, Pirastu N, Ulivi S, Cocca M, et al. A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet. 2014;10(4):e1004234. Epub 2014/04/20. pmid:24743097.
53. Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 2009;5(6):e1000529. Epub 2009/06/23. pmid:19543373.
54. Ruiz-Linares A, Adhikari K, Acuna-Alonzo V, Quinto-Sanchez M, Jaramillo C, Arias W, et al. Admixture in Latin America: geographic structure, phenotypic diversity and self-perception of ancestry based on 7,342 individuals. PLoS Genet. 2014;10(9):e1004572. Epub 2014/09/26. pmid:25254375.
55. Zhou X, Stephens M. Efficient multivariate linear mixed model algorithms for genome-wide association studies. Nat Methods. 2014;11(4):407–9. Epub 2014/02/18. pmid:24531419.
56. Jr J. Taylor & Francis Online:: Hierarchical Grouping to Optimize an Objective Function—Journal of the American Statistical Association—Volume 58, Issue 301. Journal of the American Statistical Association.
57. Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet. 2011;88(1):76–82. Epub 2010/12/21. pmid:21167468.
58. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. Epub 2007/08/19. pmid:17701901.
59. Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164. Epub 2010/07/06. pmid:20601685.
60. Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10(1):1523. Epub 2019/04/05. pmid:30944313.
61. Prescott SL, Srinivasan R, Marchetto MC, Grishina I, Narvaiza I, Selleri L, et al. Enhancer divergence and cis-regulatory evolution in the human and chimp neural crest. Cell. 2015;163(1):68–83. Epub 2015/09/15. pmid:26365491.
62. Consortium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74. Epub 2012/09/08. pmid:22955616.
63. Pers TH, Timshel P, Hirschhorn JN. SNPsnap: a Web-based tool for identification and annotation of matched SNPs. Bioinformatics. 2015;31(3):418–20. Epub 2014/10/16. pmid:25316677.
64. Wilderman A, VanOudenhove J, Kron J, Noonan JP, Cotney J. High-Resolution Epigenomic Atlas of Human Embryonic Craniofacial Development. Cell Rep. 2018;23(5):1581–97. Epub 2018/05/03. pmid:29719267.
65. Roadmap Epigenomics C, Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518(7539):317–30. Epub 2015/02/20. pmid:25693563.
66. Luo Y, Hitz BC, Gabdank I, Hilton JA, Kagda MS, Lam B, et al. New developments on the Encyclopedia of DNA Elements (ENCODE) data portal. Nucleic Acids Res. 2020;48(D1):D882–D9. Epub 2019/11/13. pmid:31713622.
67. Naqvi S, Sleyp Y, Hoskens H, Indencleef K, Spence JP, Bruffaerts R, et al. Shared heritability of human face and brain shape. Nat Genet. 2021;53(6):830–9. Epub 2021/04/07. pmid:33821002.
68. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008;9(9):R137. Epub 2008/09/19. pmid:18798982.
69. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26(6):841–2. Epub 2010/01/30. pmid:20110278.
70. Whyte WA, Orlando DA, Hnisz D, Abraham BJ, Lin CY, Kagey MH, et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013;153(2):307–19. Epub 2013/04/16. pmid:23582322.
71. Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 2014;42(Database issue):D1001–6. Epub 2013/12/10. pmid:24316577.
72. Zuo E, Cai YJ, Li K, Wei Y, Wang BA, Sun Y, et al. One-step generation of complete gene knockout mice and monkeys by CRISPR/Cas9-mediated gene editing with multiple sgRNAs. Cell Res. 2017;27(7):933–45. Epub 2017/06/07. pmid:28585534.
About the Authors:
Yi Li
Contributed equally to this work with: Yi Li, Ziyi Xiong, Manfei Zhang, Pirro G. Hysi, Yu Qian, Kaustubh Adhikari
Roles Data curation, Formal analysis, Investigation, Visualization, Writing – original draft, Writing – review & editing
Affiliations CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, China, CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, China
Ziyi Xiong
Contributed equally to this work with: Yi Li, Ziyi Xiong, Manfei Zhang, Pirro G. Hysi, Yu Qian, Kaustubh Adhikari
Roles Data curation, Formal analysis, Visualization, Writing – original draft, Writing – review & editing
Affiliations Department of Genetic Identification, Erasmus MC, University Medical Center, the Netherlands, Department of Epidemiology, Erasmus MC, University Medical Center, the Netherlands
Manfei Zhang
Contributed equally to this work with: Yi Li, Ziyi Xiong, Manfei Zhang, Pirro G. Hysi, Yu Qian, Kaustubh Adhikari
Roles Data curation, Formal analysis, Funding acquisition, Writing – original draft, Writing – review & editing
Affiliations CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, China, Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China, State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, China
Pirro G. Hysi
Contributed equally to this work with: Yi Li, Ziyi Xiong, Manfei Zhang, Pirro G. Hysi, Yu Qian, Kaustubh Adhikari
Roles Data curation, Formal analysis, Writing – review & editing
Affiliation: Department of Twin Research and Genetic Epidemiology, King’s College London, United Kingdom
Yu Qian
Contributed equally to this work with: Yi Li, Ziyi Xiong, Manfei Zhang, Pirro G. Hysi, Yu Qian, Kaustubh Adhikari
Roles Data curation, Formal analysis, Investigation, Writing – review & editing
Affiliations CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, China, Beijing No.8 High School, Beijing, China
Kaustubh Adhikari
Contributed equally to this work with: Yi Li, Ziyi Xiong, Manfei Zhang, Pirro G. Hysi, Yu Qian, Kaustubh Adhikari
Roles Data curation, Formal analysis, Writing – review & editing
Affiliations Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, United Kingdom, School of Mathematics and Statistics, Faculty of Science, Technology, Engineering and Mathematics, The Open University, United Kingdom
Jun Weng
Roles Data curation, Writing – review & editing
Affiliation: Center for Biometrics and Security Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China
Sijie Wu
Roles Data curation, Writing – review & editing
Affiliations CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, China, State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, China, Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, China
Siyuan Du
Roles Data curation, Writing – review & editing
Affiliations CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, China, Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China, University of Chinese Academy of Sciences, China
Rolando Gonzalez-Jose
Roles Data curation, Writing – review & editing
Affiliation: Instituto Patagonico de Ciencias Sociales y Humanas, Centro Nacional Patagonico, CONICET, Argentina
Lavinia Schuler-Faccini
Roles Data curation, Writing – review & editing
Affiliation: Departamento de Genetica, Universidade Federal do Rio Grande do Sul, Brasil
Maria-Catira Bortolini
Roles Data curation, Writing – review & editing
Affiliation: Departamento de Genetica, Universidade Federal do Rio Grande do Sul, Brasil
Victor Acuna-Alonzo
Roles Data curation, Writing – review & editing
Affiliation: Molecular Genetics Laboratory, National School of Anthropology and History, Mexico
Samuel Canizales-Quinteros
Roles Data curation, Writing – review & editing
Affiliation: Unidad de Genomica de Poblaciones Aplicada a la Salud, Facultad de Quimica, UNAM-Instituto Nacional de Medicina Genomica, Mexico
Carla Gallo
Roles Data curation, Writing – review & editing
Affiliation: Laboratorios de Investigacion y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Peru
Giovanni Poletti
Roles Data curation, Writing – review & editing
Affiliation: Laboratorios de Investigacion y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Peru
Gabriel Bedoya
† Deceased.
Roles Data curation, Writing – review & editing
Affiliation: GENMOL (Genetica Molecular), Universidad de Antioquia, Medellin, Colombia
Francisco Rothhammer
Roles Data curation, Writing – review & editing
Affiliation: Instituto de Alta Investigacion, Universidad de Tarapaca, Chile
Jiucun Wang
Roles Data curation, Writing – review & editing
Affiliations State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, China, Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, China
Jingze Tan
Roles Data curation, Writing – review & editing
Affiliation: Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, China
Ziyu Yuan
Roles Data curation, Writing – review & editing
Affiliation: Fudan-Taizhou Institute of Health Sciences, China
Li Jin
Roles Data curation, Funding acquisition, Writing – review & editing
Affiliations CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, China, State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, China, Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, China, Fudan-Taizhou Institute of Health Sciences, China
André G. Uitterlinden
Roles Data curation, Writing – review & editing
Affiliations Department of Epidemiology, Erasmus MC, University Medical Center, the Netherlands, Department of Internal Medicine, Erasmus MC, University Medical Center, the Netherlands
Mohsen Ghanbari
Roles Data curation, Writing – review & editing
Affiliation: Department of Epidemiology, Erasmus MC, University Medical Center, the Netherlands
M. Arfan Ikram
Roles Data curation, Writing – review & editing
Affiliation: Department of Epidemiology, Erasmus MC, University Medical Center, the Netherlands
Tamar Nijsten
Roles Data curation, Writing – review & editing
Affiliation: Department of Dermatology, Erasmus MC, University Medical Center, the Netherlands
Xiangyu Zhu
Roles Data curation, Writing – review & editing
Affiliations Center for Biometrics and Security Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China, School of Artificial Intelligence, University of Chinese Academy of Sciences, China
Zhen Lei
Roles Data curation, Writing – review & editing
Affiliations Center for Biometrics and Security Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China, School of Artificial Intelligence, University of Chinese Academy of Sciences, China
Peilin Jia
Roles Funding acquisition, Writing – review & editing
Affiliation: CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, China
Andres Ruiz-Linares
Roles Funding acquisition, Supervision, Writing – review & editing
¶‡ AR-L, TDS, SW, MK and FL also contributed equally to this work.
Affiliations Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, United Kingdom, Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, China, Aix-Marseille Universite, CNRS, EFS, ADES, France
Timothy D. Spector
Roles Supervision, Writing – review & editing
¶‡ AR-L, TDS, SW, MK and FL also contributed equally to this work.
Affiliation: Department of Twin Research and Genetic Epidemiology, King’s College London, United Kingdom
Sijia Wang
Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing
* E-mail: [email protected] (SW); [email protected] (FL)
¶‡ AR-L, TDS, SW, MK and FL also contributed equally to this work.
Affiliations CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, China, Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, China
Manfred Kayser
Roles Conceptualization, Supervision, Writing – original draft, Writing – review & editing
¶‡ AR-L, TDS, SW, MK and FL also contributed equally to this work.
Affiliation: Department of Genetic Identification, Erasmus MC, University Medical Center, the Netherlands
Fan Liu
Roles Conceptualization, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing
* E-mail: [email protected] (SW); [email protected] (FL)
¶‡ AR-L, TDS, SW, MK and FL also contributed equally to this work.
Affiliations CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, China, Department of Genetic Identification, Erasmus MC, University Medical Center, the Netherlands
ORCID logo https://orcid.org/0000-0001-9241-8161
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2023 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Human ear morphology, a complex anatomical structure represented by a multidimensional set of correlated and heritable phenotypes, has a poorly understood genetic architecture. In this study, we quantitatively assessed 136 ear morphology traits using deep learning analysis of digital face images in 14,921 individuals from five different cohorts in Europe, Asia, and Latin America. Through GWAS meta-analysis and C-GWASs, a recently introduced method to effectively combine GWASs of many traits, we identified 16 genetic loci involved in various ear phenotypes, eight of which have not been previously associated with human ear features. Our findings suggest that ear morphology shares genetic determinants with other surface ectoderm-derived traits such as facial variation, mono eyebrow, and male pattern baldness. Our results enhance the genetic understanding of human ear morphology and shed light on the shared genetic contributors of different surface ectoderm-derived phenotypes. Additionally, gene editing experiments in mice have demonstrated that knocking out the newly ear-associated gene (Intu) and a previously ear-associated gene (Tbx15) causes deviating mouse ear morphology.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer