1. Introduction
LIM domain only protein 7 (LMO7) is a large protein (1683 residues) that orchestrates many protein–protein interactions. It contains a LIM domain (a unique cysteine-rich zinc-binding domain), a calponin homology (CH) domain, a PDZ domain and an F-box (FBX) domain [1]. Combining distinct functional domains in one protein, the PDZ–LIM proteins have been related to wide-ranging and multicompartmental cell functions during development and homeostasis. For instance, these proteins are known to mediate signaling between the nucleus and the cytoplasm, sequester nuclear factors to the cytoplasm and interact with actin microfilaments [2]. In addition, the PDZ–LIM proteins facilitate the assembly of protein complexes, regulate gene transcription, control mitosis progression, influence spindle assembly checkpoint besides being involved in the ciliary function [3,4,5,6,7]. Regarding its intracellular distribution, the LMO7 protein can be found in the nucleus, perinuclear region, cytoplasm and/or on cell surface, particularly in cell–cell adhesions (where it can interact with nectin and E-cadherin through afadin and alpha-actinin) and focal adhesions [2,8,9,10]. Importantly, misregulation of the PDZ–LIM proteins is involved in cancer [11,12].
LMO7 has been described as an important regulator of skeletal muscle- and heart-relevant genes [8]. Accordingly, LMO7 is expressed in somites and in important structures involved in heart development in different vertebrates, including zebrafish, chicken and mouse embryos [13]. In addition, deletion of Lmo7 in mice causes a muscular dystrophy phenotype similar to Emery–Dreifuss muscular dystrophy (EDMD) [8]. LMO7 binds the emerin protein and is involved in the regulation of the emerin gene [8,14].
The LMO7 role in skeletal muscle structure and function is still controversial. While Lao et al. [15] demonstrated that Lmo7-null mice have normal skeletal muscles regarding morphology, physiological function and regeneration capabilities, Mull et al. [16] reported impaired skeletal muscle function in Lmo7-mutant mice involving growth retardation and decreased fiber size. Assuming that these differences cannot be attributed to the methodology used in these studies, the role of LMO7 in skeletal muscle cells remains to be further elucidated.
Using C2C12 mouse myoblast cell cultures, Dedeic et al. [17] reported that LMO7 is required for myoblast differentiation. Lmo7-downregulated myoblasts exhibited reduced expression levels of Pax3, Pax7, Myf5 and MyoD, whereas Lmo7 overexpression increased MyoD and Myf5 expression. LMO7 was found in the cytoplasm of myotubes and within the nuclei of myoblasts [17]. Importantly, LMO7 was shown to bind to the Pax3, MyoD and Myf5 promoters during myogenesis [17].
The role of LMO7 in skeletal muscles was further analyzed in chicken muscle cell cultures [10]. Knockdown of LMO7 using siRNA induced a decrease in the number of myoblasts and in the myotube size. Activators of the Wnt/beta-catenin pathway (Wnt3a and BIO) were able to revert these effects, suggesting an interaction between the Wnt/beta-catenin and LMO7-mediated signaling pathways.
This work aimed to shed light on LMO7 evolution, alternative transcripts, protein structure and gene regulation through multiple in silico analyses to provide support for future studies in different biological contexts, including skeletal muscles. In addition, we analyzed the spatial distribution of the LMO7 protein in myogenic cells of chicken and zebrafish, two important model organisms used to investigate skeletal muscle development, growth, and repair.
2. Results and Discussion
2.1. Identification of LMO7 Orthologous Genes in Different Vertebrates
To identify the orthologs of the human LMO7 gene in species representative of different vertebrate groups, we started by performing a systematic search in the Ensembl and Gene NCBI databases. A single LMO7 gene was identified in the genome of placental and marsupial mammals (mouse and opossum), sauropsids (chicken, zebra finch, Chinese softshell turtle and anole lizards), amphibian (Xenopus tropicalis) and in the spotted gar, a holostean fish. In contrast, in teleost fishes (zebrafish and catfish), two lmo7 genes (lmo7a and lmo7b) were identified (Table S1).
To understand the evolutionary relationship between the LMO7 sequences identified, we performed a phylogenetic analysis based on the predicted amino acid sequences. The Branchiostoma floridae (cephalochordate) Lmo7 protein was used as an out-group. The UniProt or NCBI IDs of the proteins used to build the LMO7 phylogenetic tree are presented in Table S1. Our results revealed that the LMO7 sequences form two subgroups. One encompasses the tetrapod LMO7 and Lmo7a of holostean and teleost fishes while the other includes the lmo7b teleost protein (Figure 1). The common ancestry of the different LMO7 orthologs is further evidenced by the presence of syntenic gene sets around the LMO7 genes, as displayed in Table S2.
Our data on placental mammals, chicken and Xenopus are in accordance with a previous study but differ regarding zebrafish, given that only a single lmo7 gene was described previously in this species [13]. The presence of two lmo7 genes in the genome of teleost fishes is expected given that an additional round of whole genome duplication occurred at the base of the teleost fish lineage [18,19]. Although additional analyses are required to determine which teleost species bear both lmo7 genes or whether there are further unidentified paralogs in specific teleost fishes, our results demonstrate for the first time that lmo7 underwent duplication in teleosts, generating lmo7a and lmo7b.
2.2. Characterization of the LMO7 Transcripts in Human and Vertebrate Model Organisms
Alternative splicing-derived transcripts of LMO7 have been described for human [20] and other placental mammals [2]. However, data on LMO7 variants generated either by alternative splicing or distinct starting sites for transcription or termination are still scarce. Therefore, we analyzed the information about the LMO7 transcripts available in the Ensembl genome browser for humans and the major vertebrate model organisms (mouse, chicken, Xenopus and zebrafish).
Our analysis revealed that, like other genes encoding the PDZ–LIM proteins [21], the LMO7 ortholog genes are targets of alternative splicing given that several different transcripts were identified in all the species evaluated (Tables S3–S7). In human, there are 27 different predicted LMO7 transcripts, nine of which are non-coding. Among them, only three feature the highest Ensembl Transcript Support Level (TSL:1, Table S3). In mice, 16 transcripts were predicted for the Lmo7 gene (eight with TSL:1, Table S4), nine of which are non-coding. In chicken and Xenopus, eight and six protein-coding LMO7/lmo7 transcripts were predicted, respectively, although no Ensembl tags were available to assess the prediction reliability of these transcripts (Tables S5 and S6). In zebrafish, 13 transcripts are predicted for the lmo7a gene (eight are non-coding), while for lmo7b, five transcripts are predicted (Table S7). The exon–intron structure predicted for a well-documented LMO7 transcript of human and mice as well as the longest transcripts found in the other species evaluated are presented in Figure 2.
Taken together, our findings indicate that studies of expression at the mRNA and protein levels are required to validate the bioinformatic predictions about the LMO7 variant transcripts in human and in the main vertebrate model organisms. In addition, as multiple non-coding transcripts were predicted to be generated from the vertebrate LMO7 genes, functional studies to elucidate the role of such transcripts are required. Given that non-coding transcripts can regulate gene expression in different ways [22] and some can generate active micropeptides [23], non-coding LMO7 transcripts can potentially play important roles that have yet to be described.
2.3. Dissecting the Structural Features of the LMO7 Protein
To deepen our comparative analysis, we evaluated the structural features of the LMO7 proteins encoded by the transcripts shown in Figure 2. The human LMO7 (hLMO7; UniProt ID Q8WWI1) contains three well-folded domains, as predicted from the primary structure (The UniProt Consortium, 2021 [24]). Moreover, there is a domain of unknown function (DUF4757) found in two regions: residues 294–382 and residues 650–787, as reported by the PFAM database [25]. There is one three-dimensional structural model for the PDZ domain of LMO7 (PDB 2eaq, residues 1037–1126) (Figure 3A). This X-ray crystallographic structure shows five antiparallel β-strands resembling a β-barrel (also known as the β-finger) next to an α-helical segment. The folding shared by different types of the PDZ domains accounts for their binding to redundant targets. The PDZ domains are widely present in life, from bacteria to mammals, and recognize short linear motifs (SLiMs) in various proteins, contributing to assembly of multicomponent complexes (reviewed in [26]). Close to the N-terminus, there is a calponin homology (CH) domain involved in the interaction with cytoskeletal proteins and signaling. The C-terminal LIM domain is composed of two zinc fingers that function in cytoskeleton reorganization. Collectively, all three domains of LMO7 are involved in protein–protein interactions. However, the predicted domains only cover 16% of the hLMO7 primary structure. Moreover, LMO7 shows nuclear localization, acting on transcriptional activation and differentiation, processes related to proteins containing long regions of intrinsic disorder [27], and none of its globular domains is related to direct nucleic acid binding. Then, we analyzed whether hLMO7 contains intrinsically disordered regions (IDRs), which are highly flexible regions that lack a fixed secondary and/or tertiary structure [28]. Instead, IDRs are best described as an ensemble of conformations, challenging the structure–function relationship since the plasticity of unordered regions enables control of several regulatory pathways [29]. Analysis by means of the charge–hydropathy (CH) plot [30] shows that the chemical composition of the LMO primary structure is similar to that of proteins containing long IDRs (Figure 3C). Prediction of disordered regions by means of a set of seven different algorithms revealed two long C-terminal disordered regions interspaced with the PDZ and LIM domains, respectively, and a short N-terminal disordered region (Figure 4). In addition to PONDR, IUPred, PrDOs and their derivatives, the D2P2 database [31] verifies IDRs by means of the combination with the Spritz algorithms [32] that have been trained on NMR and X-ray crystallography data as well as biophysical characterizations gathered in the Disprot database (available at disprot.org). The results from D2P2 show consensual regions of disorder (75% accordance among the predictors) as follows: residues 321–344 (where the nuclear localization signal is found), 753–929, 938–1042 and 1236–1603. Interestingly, the enrichment in intrinsic disorder is conserved along evolution (Figure 3C and Figure 4). The LMO7 sequences from mouse (Mus musculus), chicken (Gallus gallus) and zebrafish (Danio rerio) shows an enriched disorder content, with the longest disordered segment appearing in the C-terminus (comprising around 400 residues: 1200–1600 for human and mouse, 1100–1500 for chicken and 900–1300 for zebrafish Lmo7b) (Figure 4). To investigate whether the hLMO7 segments could fold into coiled coils, we used ncoils [33] and observed three segments with high propensity: residues 721–776, 1227–1277 and 1347–1384. Coiled coil structures become folded into α-helical segments upon intermolecular interactions and can form supercoils containing up to seven helices wrapped around each other. Since hLMO7 is involved in many protein interactions, coiled coils could orchestrate the assemblage of large protein clusters. The high content of intrinsic disorder in hLMO7 (PONDR-VLXT total score: 52.5%) together with the prediction of coiled coils, which also function as oligomerization domains, prompted us to verify whether hLMO7 presents potential regions that could drive liquid–liquid phase separation (LLPS). Two bioinformatic tools, catGRANULE [34] and PScore [35], predicted a short C-terminal region comprising about 50 residues, with a potential to drive homotypic LLPS (Figure 5). The catGRANULE algorithm predicts condensation based on the following characteristics: primary sequence composition, intrinsic disorder and nucleic acid-binding propensities [34]. According to catGRANULE, hLMO7 showed a strong total score for LLPS (score: 1.024; a score above zero indicates LLPS) (Figure 5). In contrast, the overall PScore, which was developed on the high frequency of planar sp2 pi–pi interactions from LLPS “driver” proteins, was only 3.66, below the minimum threshold of 4, which indicates LLPS [35] (Figure 5). Nonetheless, multiple heterotypic interactions with its molecular partners promoted by its three domains and the IDRs could drive the assemblage of condensates. Consequently, it is essential to assess whether hLMO7 can form biomolecular condensates in vivo.
Regarding other motifs encoded by the hLMO7 sequence, analysis by Phobius [36] predicts a transmembrane motif (TM; residues 32–50) at the extreme N-terminus (Figure 3A). This may contribute to the insertion in the plasma membrane, and perhaps also to insertion in the nuclear membrane, as LMO7 is found in the nuclear membrane during the differentiation of mouse myoblasts [17]. Interestingly, several LIM domain-containing proteins undergo nucleocytoplasmic shuttling under specific conditions. Indeed, LMO7 is a DNA-binding protein that enhances transcription of key genes related to myogenic differentiation [17]. Accordingly, Holaska et al. [8] suggest two nuclear export signals (NES; residues 118–127 and 650–659) in the N-terminus and a C-terminal nuclear localization signal (NLS; residues 1189–11,996) based on bioinformatic prediction tools. However, to the best of our knowledge, the evaluation tools used in their work were not stated. Hence, we also searched for NLS/NES annotated in the database of nuclear transport sequences based on the experimentally proven sequences [37] and found only one motif that may function as a nuclear localization sequence (NLS) in the N-terminus (328-LRKKP-333) for the human protein (Figure 3B). It was reported that LMO7 N-terminus contains a predicted transactivation domain (TAD). Additionally, residues 888–1320 of recombinant hLMO7 seem to be involved in DNA binding, as evidenced by the electrophoresis mobility shift assay (EMSA) [17]. Using the 9aaTAD prediction tool [38], we found that the N-terminus contains three motifs and the C-terminus contains one motif (residues 931–939) with 92% identity to nine-amino-acid transactivation domains (9aaTAD). Interestingly, this propensity for a C-terminal TAD could explain the requirement of residues 888–1320 from hLMO7 to bind myogenic promoters in vitro. In addition, NLSs are found in LMO7 from all the species studied, apart from the b isoform from zebrafish. A perfect match for 9aaTADs was observed for LMO7 from zebrafish (isoform a), chicken and mouse (Figure 3B). Based on a search through the PhosphoSitePlus database [39], we report the post-translational modifications (PTMs) that occur in hLMO7 (Figure 3C). Most phosphorylated residues lie within the IDRs of hLMO7. We speculate that phosphorylation/dephosphorylation of serine 318, serine 322 and tyrosine 323 controls nuclear shuttling.
To identify differences in domain organization between the five protein-coding splicing variants of hLMO7 reported in UniProt (ID Q8WWI1), we performed primary structure alignment by Clustal Omega (Figure 6 and Figure 7A). The alignment showed that isoform 5 does not possess the predicted TM motif, the CH domain or a short-disordered region predicted by seven disorder algorithms, from residues 182 to 285 that precede the NLS (Figure 6 and Figure 7A). Proteins containing a CH domain have been implicated in actin and tubulin binding and are believed to connect cytoskeleton to signaling pathways [40]. Thus, isoform 5 probably lacks the ability to interact with several proteins to transduce signaling via the CH domain.
Isoform 3 lacks residues 359 to 690 predicted to contain two main antiparallel α-helices and four short α-helices together with long disordered regions (Figure 6 and Figure 7B, highlighted in green). Because the function of the region that is missing in isoform 3 is unknown, it is not possible to predict the functional outcome. Interestingly, all five hLMO7 isoforms share the main long disordered regions, the PDZ domain and the predicted NLS (Figure 6 and Figure 7A). Analysis using the I-TASSER structural alignment program (TM-align) [42] showed that the closest structural analog to the hLMO7′s LIM domain is the LIM domain from thyroid receptor interacting protein 6 (TRIP6; PDB 1X61). Superimposition of the 3D structural models from the LIM domain of hLMO7 (obtained by AlphaFold) and TRIP6 (solved by solution nuclear magnetic resonance spectroscopy) showed a root-mean-square deviation (RMSD) of 2.255 Å, indicating a similar fold despite a low level (27%) of sequence identity, as assessed by Clustal Omega. The LIM domain is absent in isoform 4 and is truncated in isoforms 2 and 5 (Figure 6), presumably impacting the protein–protein interactions which the LIM domain orchestrates. Specifically, isoforms 2 and 5 do not contain two key Zn(II) binding sites (Figure 7C, highlighted in blue). Since proper folding requires zinc ion coordination, the absence of binding residues (Figure 6, dark blue rectangles) would abolish LIM-related functions such as recruitment of diverse proteins, including DNA-binding proteins [43].
Additionally, we used AlphaFold and analysis of coevolutionary couplings to obtain structural insights on hLMO7. AlphaFold is a cutting-edge machine-learning approach that predicts protein three-dimensional structural models for the proteome of human and twenty other organisms. Despite this deep-learning method reporting a very low confidence level for the structural definition of IDRs, a significant correlation to disorder predictors has been demonstrated [44]. The predicted Local Distance Difference Test (pLDDT, range from 0 to 100) measures the accuracy of the AlphaFold model to a corresponding structure resolved by structural techniques. Consistent with the analysis by the seven disorder algorithms, hLMO7 is also predicted by AlphaFold to contain multiple domains linked by long disordered regions (Figure 8).
The analysis of evolutionary couplings among the amino acid residues of hLMO7 by the EVcoupling server indicated several significant couplings consistent with globular domains (Figure 8A). Indeed, these match well with the fold predictions by the AlphaFold server and identify the calponin homology, PDZ and LIM domains (Figure 8B). However, it is worth noting that there are many significant evolutionary couplings within the sequence of LMO7 that are in regions with a low score for AlphaFold prediction confidence (less than 70). These regions are predicted to be mostly intrinsically disordered (Figure 4), and we speculate that these intrinsically disordered regions with significant long-range evolutionary couplings might undergo disorder-to-order transitions, depending on the solution composition of specific cellular contexts. It is also interesting that an unknown domain reported by PFAM (DUF4757) exhibits an abundance of evolutionary couplings among residues 634-701 (Figure 8). This domain is also not well-defined in the 3D model of hLMO7 and has intermediate values of the disorder score (mean disorder around 0.5).
2.4. Identification of a Putative Regulator of LMO7 Transcription in the Myogenic Context
Given that LMO7 has been described as having a role during skeletal muscle differentiation of different vertebrate species [8,10], we wondered whether the LMO7 ortholog genes would be regulated by an evolutionarily conserved element (e.g., a promoter or an enhancer). Therefore, we searched for evolutionarily conserved regions (ECRs) in the LMO7 loci of human, mouse, chicken, Xenopus tropicalis and zebrafish (lmo7a loci) using the ECR genome browser.
Our comparative analysis revealed an ECR present in the LMO7 loci of all species analyzed except for zebrafish. In humans, this conserved element (hLMO7 ECR) is found upstream of the first exon of the Lmo7-207 transcript (ENST00000465261.6; see Table S1 for details). Several potential transcription factor binding sites (TFBS) for proteins involved in skeletal myogenesis, such as PAX3, MYOD, MYOGENIN and MEF2, were identified in the human ECR (Figure 9). In addition, this ECR contains LEF1/TCF4 potential binding sites, indicating that this element may be responsive to WNT ligands, which are known to play multiple and pivotal roles in skeletal muscle development [45]. The MultiTF search for conserved TFBSs among human and other species showed that MEF2 and LEF1/TCF4 sites are evolutionarily conserved.
To further evaluate the regulatory potential of the human hLMO7 ECR, its sequence was used to perform a BLAT search against the human genome (assembly GRCh38/hg38) in the UCSC genome browser. Our analysis revealed that the hLMO7 ECR partially overlaps with a predicted LMO7 regulatory element of the GeneHancer promoter/enhancer catalog (ID GH13J075758), a database of human regulatory elements and their inferred target genes [46]. Of interest, the GH13J075758 element was also categorized as a super enhancer related to myogenic differentiation in dbSUPER (ID SE_37705), a database of super enhancers in the mouse and human genomes [47].
Overall, our findings indicated that a conserved regulatory element is found in the LMO7 loci of humans and other vertebrates, that may be involved in regulating gene transcription in the context of skeletal myogenesis. Functional studies are required to confirm the role of the ECR identified here as an LMO7 regulatory element.
2.5. Intracellular Distribution of the LMO7 Protein in Chicken and Zebrafish Embryonic Muscle Cells
Since the PDZ–LIM proteins have been associated with multicompartmental cell functions during development, we decided to explore the cellular distribution of the LMO7 protein during vertebrate skeletal muscle development. To achieve this objective, we analyzed the intracellular localization of LMO7 in two widely used vertebrate animal models for the study of myogenesis. Zebrafish embryos and primary cultures of chicken embryonic muscle cells were labeled for LMO7 and analyzed with a confocal laser microscope. First, we analyzed the localization of Lmo7 in zebrafish somites. Zebrafish embryos are particularly appropriate for studies on skeletal muscle cell development because of their transparency and external development, which allow access to embryos for easy and detailed visualization. Furthermore, in a single zebrafish embryo, it is possible to analyze different differentiation stages of somite progenitor muscle cells. Zebrafish somites form one after another from tissue at the tail end of the embryo so that somites near the tail of the fish are younger and somites near the head are older. Our results showed Lmo7 near the septa and the notochord, and in the cytoplasm of myogenic cells in the somites from the trunk region of prim 25 zebrafish embryos [48] (Figure 10A–C). In somites from the caudal region of prim 25 zebrafish embryos, Lmo7 was found in the cytoplasm and was particularly concentrated in the perinuclear region of myogenic cells (Figure 10D–F). No labeling of Lmo7 was detected within the nuclei of progenitor skeletal muscle cells in zebrafish embryos (Figure 10). The presence of Lmo7 near the septa of zebrafish somites might be related to its role in skeletal muscle cell adhesion to the extracellular matrix (ECM), whereas Lmo7 perinuclear localization might be related to its role in intracellular signaling. Recently, the perinuclear region of eukaryotic cells has been described as a space that concentrates signaling proteins distributed in a 3D network of cytoskeletal filaments and organelles [49].
Next, we analyzed the distribution of LMO7 in chicken myogenic cell cultures under a fluorescence confocal microscope and found LMO7 within the nuclei of mononucleated myoblasts and in the cytoplasm (perinuclear region) of multinucleated myotubes (Figure 11A–P). These results agree with a previous report showing that upon myotube formation in the mouse C2C12 cell line, LMO7 shuttled from the nucleus to the cytoplasm [17]. Furthermore, our data are in accordance with the predicted function of the PDZ–LIM proteins regarding their ability to mediate signals between the nucleus and the cytoplasm [5] since LMO7 was found in the nucleus and in the perinuclear cloud of chicken muscle cells. The presence of LMO7 within the nuclei of chicken myoblasts reinforces the idea that LMO7 has a role in the regulation of gene expression in skeletal muscle cells [8,17]. Since beta-catenin is a major transcriptional regulator in muscle cells [50], we decided to investigate a possible crosstalk between the Wnt/beta-catenin pathway and the LMO7 signaling pathways during chicken myogenesis. To test that, we treated chicken myogenic cells with two activators of the Wnt/beta-catenin pathway, BIO and Wnt3a, and analyzed possible alterations in the intracellular distribution of LMO7. We selected BIO and Wnt3a for these experiments since both molecules have been shown to be robust activators of the canonical Wnt/beta-catenin signaling pathway. BIO is a potent and selective pharmacological inhibitor of glycogen synthase kinase-3β (GSK3-β), and it is well-established that inhibition of GSK3-β allows the nuclear translocation of beta-catenin and the subsequent beta-catenin-dependent regulation of its target genes [51]. Different from BIO, Wnt3a is a member of the canonical Wnt glycoproteins that can bind to its receptor Frizzled (Fz) and coreceptor lipoprotein receptor-related protein (LRP5/6) at the plasma membrane of the target cells and activate the canonical Wnt/beta-catenin signaling pathway. Since these two activators of the Wnt/beta-catenin pathway (BIO and Wnt3a) differ in their mechanisms of action, testing their effects in chicken muscle cells could provide more robust information on the possible interplay between LMO7 and the Wnt/beta-catenin signaling pathways. Interestingly, both BIO and Wnt3a induced an increase in the nuclear labeling of LMO7, which was concentrated in specific compartments within the nuclei of muscle cells (Figure 11, insets M–P). Curiously, LMO7 labeling in fibroblasts was lower than in muscle cells in all the experimental conditions (Figure 11A–P), pointing to a muscle-specific role of LMO7. Quantification of the LMO7 labeling showed an increase in the presence of LMO7 within the nuclei of all the cell phenotypes (myoblasts, myotubes and fibroblasts) after treatment with BIO and Wnt3a (Figure 11Q). We also quantified the amount of LMO7-positive nuclear aggregates in all the experimental conditions and found a significant increase in these aggregates in myoblasts, myotubes and fibroblasts after the activation of the Wnt/beta-catenin pathway (Figure 11R). Since chromatin is not randomly distributed within the interphase nuclei of eukaryotic cells [52], our data suggest that LMO7 could be concentrated at specific nuclear territories with active transcription. Further studies are necessary to identify the specific nuclear domain where LMO7 localizes after the activation of the Wnt signaling pathway in skeletal muscle cells.
Importantly, the LMO7 protein was detected within the nuclei of chicken myoblasts and in the cytoplasm of zebrafish somites (Figure 10 and Figure 11). No labeling of LMO7 was detected within the nuclei of muscle cells in zebrafish embryos (Figure 10). The differences in the intracellular distribution of the LMO7 protein between zebrafish and chicken muscle cells may have different explanations: (i) we analyzed chicken muscle cells grown in vitro as compared with zebrafish embryos grown in vivo. It is possible that the mechanical stress caused by in vitro conditions where the chicken muscle cells were cultivated induced the nuclear translocation of LMO7 and the subsequent activation of target genes. LMO7 has been reported to be associated with focal adhesions (cell–ECM adhesions) in cells by the interaction with p130Cas, a key signaling component of focal adhesions, and that this association allows muscle cells to withstand mechanical stress [9]. Importantly, focal adhesions are rarely seen in vivo, such as in zebrafish embryos, and therefore LMO7 may have a specific role in stress-related responses of muscle cells grown in vitro; and/or (ii) as described above, our results revealed that the Lmo7 teleost genes/proteins are separated into lmo7a/Lmo7a and lmo7b/Lmo7b. We cannot exclude the possibility that the antibody against Lmo7 that we used in the immunofluorescence experiments with zebrafish embryos was able to detect only one zebrafish Lmo7 protein isoform and that the other undetected isoform could have a different intracellular localization (including muscle cell nuclei in zebrafish somites). More experiments are needed to explore this hypothesis.
2.6. A bibliometric Glimpse of All the Published Data on LMO7
Finally, we performed an exploratory analysis of the data retrieved from the PubMed (
3. Materials and Methods
3.1. Bioinformatic Analysis
3.1.1. Identification of the LMO7 Orthologs Genes
Orthologs of the human LMO7 gene were identified by textual searches in the Gene NCBI database (
3.1.2. Phylogenetic Relationships among the LMO7 Proteins
Multiple alignments of the predicted LMO7 protein amino acid sequences were conducted using ClustalW of Molecular Evolutionary Genetics Analysis (MEGA) software version 11 [55]. Phylogenetic trees were built using the neighbor-joining method of the same software. The robustness of the groupings was assessed using 1000 bootstrap resampling.
3.1.3. LMO7 Transcripts in Human and Vertebrate Model Organisms
Information about LMO7 transcripts annotated in human and major vertebrate model organisms (mouse, chicken, Xenopus tropicalis and zebrafish) were obtained in the Ensembl genome browser (
3.1.4. Structural Features of LMO7 by Bioinformatics Tools
The intrinsically disordered regions of LMO7 were analyzed by means of the D2P2 database [31] and seven disorder predictors, PONDR-FIT [56], PONDR-VLXT [57], IUPred-long (long regions of intrinsic disorder) and IUPred-short (short regions of intrinsic disorder) [58], PONDR-VSL2 and PONDR-VL3 [59] and PrDOS [60]. The average disorder profile was obtained by calculating the mean of disorder reports from the seven computational tools. A score > 0.5 refers to amino acid residues in disordered regions whereas scores from 0.2 to 0.5 indicate residues in flexible segments. CatGRANULE [34] and PSCore algorithms predicted liquid–liquid phase separation. Coiled coil regions were predicted by ncoils [33]. The Phobius server was used to identify a transmembrane α-helical motif [36]. Nuclear export sequences (NES) and nuclear localization signals (NLS) were analyzed in [37]. Nine-amino-acid transactivation domains (TADs) were analyzed by means of the 9aaTAD prediction tool (available at
3.1.5. Identification of the LMO7 Putative Regulatory Elements
Comparative genomics is a useful tool to identify gene regulatory elements (e.g., promoters or enhancers) [62]. Therefore, we searched for evolutionarily conserved regions (ECRs) among the LMO7 loci of human, mouse, opossum, chicken, Xenopus and zebrafish using the ECR browser (
3.1.6. Bibliometric Analysis
For the bibliometric evaluation, we performed exploratory analyses of the data retrieved from the articles present in the PubMed database (
3.2. Analysis of Localization of LMO7 in Zebrafish and Chicken Muscles
3.2.1. Antibodies and Probes
Rabbit polyclonal antibody against LMO7 (code # HPA020923) was purchased from Sigma-Aldrich (St. Louis, MO, USA). DNA-binding probe 4,6-diamino-2-phenylindole dihydrochloride (DAPI) and Alexa Fluor 488-goat anti-rabbit IgG antibody were from Molecular Probes (Eugene, OR, USA).
3.2.2. Chicken Embryonic Skeletal Muscle Cell Cultures
The primary cultures of myogenic cells were prepared from breast muscles of 11-day-old chicken embryos as previously described [64]. Chicken embryos were obtained from Granja Tolomei (Rio de Janeiro, Brazil) and handled according to the Institutional Animal Care and Use Committee protocols under number 069/19. Chicken muscle cells were grown in the minimum essential medium (MEM) with 10% horse serum, 0.5% chicken embryo extract, 1% L-glutamine and 1% penicillin/streptomycin (Invitrogen, São Paulo, Brazil) at an initial density of 7.5 × 105 cells/dish at 37 °C in an atmosphere of 5% CO2. For the activation of the Wnt/beta-catenin pathway, 24 h cells were treated for 24 h with 5 µM 6-bromoindirubin-30-oxime (BIO, Sigma-Aldrich) or conditioned media (50% v/v) enriched in Wnt3a (obtained from L-Wnt3a cells, ATCC, Manassas, VA, USA).
3.2.3. Zebrafish Husbandry
Zebrafish (Danio rerio) were maintained in aquaria with a recirculating water system at 28 ± 1 °C on a 14:10 light/dark cycle in a vivarium localized at the Institute of Biomedical Sciences, Federal University of Rio de Janeiro (Rio de Janeiro, Brazil). The animals were handled and the experiments were carried out according to the Institutional Animal Care and Use Committee protocols under number 039/20. Embryos were collected and rinsed with water three times before chorion removal.
3.2.4. Immunofluorescence Microscopy and Digital Image Processing
Dechorionated zebrafish embryos at 24 hpf were fixed in 4% paraformaldehyde in PBS for 1 h at room temperature (RT). The embryos were then permeabilized with 0.5% Triton-X100 in PBS (PBS/T) for 30 min, blocked in 3% BSA-PBS/T for 1 h and incubated overnight at 4 °C with the primary antibody against Lmo7 (diluted 1:100 in a blocking solution). Chicken myogenic cells were fixed in methanol at −20 °C for 5 min followed by permeabilization in PBS/T for 15 min. The cells were blocked in 50 mM ammonium chloride for 15 min and 3% BSA-PBS/T for 30 min and incubated with the primary antibody against Lmo7 (diluted 1:100 in a blocking solution) for 1 h at 37 °C. Then, the embryos and the cells were washed for 30 min with PBS/T and incubated for 2 h at 37 °C with Alexa Fluor-conjugated secondary antibody (diluted 1:200 in a blocking solution). The nuclei were labeled with 0.1 μg/mL of DAPI in 0.9% NaCl. The embryos and the cells were mounted on #1.5 24 × 60 mm glass coverslips (with spacers in the case of embryos) using Prolong Gold (Molecular Probes). The cells and the embryos were examined under a Leica TCS SPE fluorescence confocal laser scanning microscope (Leica, Wetzlar, Germany). The control experiments with only the secondary antibodies showed only faint background staining (data not shown). Quantification of the LMO7 fluorescence labeling were performed using the Fiji software [65] and figure panels were produced with the Adobe Photoshop software (Adobe Systems Inc., San José, CA, USA), where some of the original fluorescence grayscale images were pseudo-colored and superimposed.
3.2.5. Statistical Analysis
Statistical analysis was carried out using the GraphPad Prism software version 8. The results of three independent experiments are expressed as the means ± standard deviation. Statistical analysis of the data related to the quantification of nucleus versus cytoplasm localization of LMO7 and LMO7-positive nuclear aggregates was performed with two-way ANOVA followed by Tukey’s post hoc test. p < 0.05 was considered statistically significant.
G.G. carried out the experiments with zebrafish embryos. K.M.B. carried out the experiments with chicken myogenic cell cultures. M.J.d.A. and M.d.S.A. carried out the protein bioinformatics analysis. L.M.V. carried out the bibliometric analysis. L.E.A. carried out LMO7 orthologs identification, LMO7 transcripts analysis and ECR identification/characterization. C.M. conceived the study, participated in its design, in the analysis of data and coordination of the study. All authors have read and agreed to the published version of the manuscript.
This work was supported by Brazilian grants from Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) and Fundação Carlos Chagas Filho de Apoio à Pesquisa do Estado do Rio de Janeiro (FAPERJ).
The use of chicken embryos and zebrafish embryos was approved by the Ethics Committee for Animal Care and Use in Scientific Research of the Federal University of Rio de Janeiro and received the following approval numbers: 069/19 for chicken embryos and 039/20 for zebrafish embryos.
Not applicable.
The authors confirm that the data supporting the findings of this study are available within the article.
The authors declare that they have no competing interests.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 1. Molecular phylogenetic tree of the LMO7 proteins from representative species of vertebrates. The phylogenetic tree was inferred from amino acid sequences using the Branchiostoma floridae Lmo7 protein as an out-group. Bootstrap values were obtained by the neighbor-joining method (1000 replications) using the MEGA 11 software.
Figure 2. Comparison of the exon–intron organization of the LMO7 transcripts predicted for human and vertebrate model organisms (mouse, chicken, Xenopus and zebrafish). Location of the calponin (CH), PDZ and LIM zinc-binding (LIM) domains is shown for all the species. The CH domain is missing in the chicken LMO7 transcript. The longest transcripts predicted for the two zebrafish lmo7 ortholog genes (lmo7a and lmo7b) are depicted.
Figure 3. LMO7 is a multidomain protein that has a charge–hydropathy characteristic of proteins enriched in intrinsic disorder and contains nuclear localization sequence(s) and putative nine-amino-acid transactivation motifs. (A) Schematic organization of human LMO7 with the domains, motifs and post-translational modifications (PTMs) indicated along sequence. The C-terminal domain mediates interaction with DNA promoters and protein partners. TM, transmembrane α-helix. NLS, nuclear localization sequence. PO4, phosphorylation. Ub, ubiquitination. CH3, methylation. The inset shows the X-ray crystallographic structure of the PDZ domain (PDB 2eaq). (B) LMO7 from different species were analyzed by NLSdb (third column, https://rostlab.org/services/nlsdb/, 20 November 2021) and 9aaTAD (fourth column, https://www.med.muni.cz/9aaTAD/). Perfect matches of 9aaTAD are highlighted in bold whereas other annotated motifs correspond to 92% matches. Corresponding UniProt identifiers (UP ID) are shown in the second column. (C) Predictor of Natural Disordered Regions (PONDR; http://www.pondr.com/) analysis of charge–hydropathy (CH plot). Well-folded proteins indicated as blue circles and disordered proteins as red squares. LMO7 from the different species is located in the plot region of disordered proteins apart from isoform b of zebrafish.
Figure 4. LMO7 is enriched in intrinsic disorder and this feature is conserved across evolution. Analysis of intrinsic disorder by seven disorder predictors along the LMO7 primary structure. Note that the C-terminal protein/DNA-binding region is the longest disordered segment. Data from PONDR-VLXT (light green curve), PONDR-VL3 (green curve) and PONDR-VSL2 (pale orange curve) were provided by http://www.pondr.com/; from PONDR-FIT (blue curve)—by http://original.disprot.org/pondr-fit.php; from IUPred2-long disorder (dark purple curve) and IUPred2-short disorder (purple curve)—by https://iupred2a.elte.hu/plotand; from PrDOS (pink curve)—by http://prdos.hgc.jp/cgi-bin/top.cgi. The average of disorder across all the algorithms was calculated (black curve). Scores higher than 0.5 represent disordered regions, and values between 0.2 and 0.5 indicate flexible segments.
Figure 5. A C-terminal short segment of hLMO7 is predicted to undergo liquid–liquid phase separation. Propensity for hLMO7 LLPS ability along the primary structure calculated by catGRANULE (red curve, http://service.tartaglialab.com/new_submission/catGRANULE) and PScore (blue curve, http://abragam.med.utoronto.ca/~JFKlab/Software/psp.htm).
Figure 6. hLMO7 transcript variants show conserved intrinsic disorder but differ in the LIM domain. Alignment of hLMO7 isoforms 1 to 5 (UniProt IDs Q8WWI1-1; Q8WWI1-2; Q8WWI1-3; Q8WWI1-4; Q8WWI1-5, respectively) obtained by Clustal Omega [41]. The predicted domains, motifs and intrinsically disordered regions (IDR) reported in the Database of Disordered Protein Predictions (D2P2) [31] are marked by rectangles and the sequences indicated above. Isoform 5 lacks the putative transmembrane α-helix (TM, predicted by Phobius at https://phobius.sbc.su.se/) and the calponin homology domain (CH). Isoform 3 lacks residues 356–690. The LIM domain is missing in isoform 4. Isoforms 2 and 5 do not have the key zinc ion coordination motifs (dark blue rectangles). Cysteine and histidine residues that might also be involved in Zn(II) binding are marked in purple and blue, respectively. NLS, nuclear localization signal predicted by NLSdb (https://rostlab.org/services/nlsdb/).
Figure 7. Comparison of domains organization of the five isoforms from human LMO7. (A) Summary of results from the primary structure alignment among the hLMO7 isoforms. √, presence. ×, absence. * lacks Zn(II) binding motifs. TM, transmembrane α-helix. NLS, nuclear localization sequence. CH, calponin homology domain. UD, unknown domain DUF4757. (B) hLMO7 Alpha Fold-predicted structural model showing the region missing in isoform 3 (green) and the LIM domain (white) which is only conserved in isoforms 1 and 3. (C) Superimposition of the LIM domain of hLMO7 (obtained by AlphaFold; white ribbon representation) and the LIM domain of TRIP6 (PDB 1X61; pale green ribbon representation), which is the closest structural analog reported by I-TASSER [42]. Two zinc ions (black spheres) are coordinated in the fingers. Regions highlighted in dark blue are missing in isoforms 2 and 5. Furthermore, isoform 4 does not have the full LIM domain.
Figure 8. Three-dimensional structure model prediction of hLMO7 indicates the presence of well-folded domains and many intrinsic disorder regions with long-range evolutionary couplings. The evolutionary couplings of overlapping regions of LMO7 were calculated by the EVcoupling server (A). The quality scores (Q) for the identification of evolutionary couplings are indicated for each segment and range from 0 (worst) to 10 (best). The pink-shaded squares highlight well-folded domains previously described (residues 61–215, 1037–1125 and 1606–1675) as well as the one identified here (residues 634–701). The 3D structure calculated by the AlphaFold server is indicated in (B). Each globular domain highlighted by EVcoupling is indicated in the 3D model. The full-length protein 3D model is colored by the fold prediction confidence score for each amino acid residue (bottom right legend).
Figure 9. Evolutionarily conserved region (ECR) in the LMO7 loci of human and other vertebrate species with features of a skeletal muscle-specific regulatory element. (A) Conservation profiles of the human LMO7 genomic region (base sequence, on top) in comparison with the mouse (mm10), opossum (momDom5), chicken (Galgal3) and Xenopus (XenTro3) ortholog regions. The 5′–3′ orientation of the human transcripts is denoted by arrow lines. The color codes used to indicate different gene regions are: exons (blue), 5′ UTRs (yellow), introns (pink), repetitive elements (green). Each line represents an alignment, and the vertical height indicates the sequence identity underlying the alignment. (B) The human ECR displays several potential binding sites for key regulators of skeletal myogenesis, such as PAX3, MYOD, MYOGENIN and MEF2. Two sets of the LEF1/TCF4 binding sites are predicted in the human ECR. The asterisk indicates the MEF2 and LEF1/TCF4 binding sites that are conserved in mouse, opossum, chicken and Xenopus. (C) UCSC genome browser display of the BLAT alignment of the human ECR to the human GRCh38/hg38 assembly. The human ECR sequence overlaps with the GH13J075758 promoter/enhancer of the GeneHancer catalog.
Figure 10. LMO7 distribution in zebrafish embryos. Prim 25 zebrafish embryos were double-labeled with an antibody against LMO7 (green) and the nuclear dye DAPI (red) and analyzed under a confocal laser microscope. The images shown in (A’–F’) are higher magnifications of the dashed areas marked in the corresponding panels (A–F). In somites of the trunk region of prim 25 zebrafish embryos, Lmo7 was found near the septa, the notochord and in the cytoplasm of skeletal muscle progenitor cells (A–C and A’–C’), while in the caudal region, it was found in the cytoplasm and the perinuclear region of muscle cells (D–F and D’–F’). Scale bar = 50 µm.
Figure 11. LMO7 intracellular distribution in chicken muscle cells. Chicken myogenic cells were grown for 24 h and treated with DMSO, BIO or Wnt3a for the next 24 h. The cells were double-labeled with an antibody against LMO7 (green) and the nuclear dye DAPI (blue) and analyzed under a confocal laser microscope. Insets (M–P) are higher magnifications of the areas marked in the images C,F,I and L. In the control and DMSO-treated cells, LMO7 is found in the perinuclear region of myotubes and within the nuclei of myoblasts (A–F and M,N). Both BIO and Wnt3a induced an increase in the nuclear labeling of LMO7 (G–L and O,P). Arrows in the insets point to LMO7 localization in myoblasts (Myb), myotubes (Myt) and fibroblasts (Fib). Scale bar for A–L = 20 µm. Quantification of the intracellular localization of LMO7 in the cytoplasm versus the nuclei of cells (Q) and of the fluorescence intensity of LMO7-positive nuclear aggregates (R) is shown. ns = not significant, * = 0.01, ** = 0.07, *** = 0.004, **** = 0.0001.
Figure 12. Frequency of LMO7-containing articles published per year. Data retrieved from the PubMed (https://pubmed.ncbi.nlm.nih.gov/) database using descriptors “LMO7” OR “Lmo7” OR “LMO-7” OR “Lmo-7” OR “Lim domain only protein 7”. The search returned 76 articles as of 10 October 2021 for a period that spanned the years 1998 to 2021. From the total of 76 articles, we analyzed the number of LMO7 articles published per year.
Figure 13. Bibliometric analysis of the co-occurrence relations between scientific terms found in the LMO7 articles. A term map of co-occurrence relations between the scientific terms was created from the bibliometric data retrieved from the titles and abstracts of the published LMO7 articles. The query was performed on October 10, 2021, by using descriptors [lmo7] OR [LMO7] OR [LMO-7] OR [lmo-7] OR [lim domain only protein 7] OR [Lim Domain Only Protein 7]. Five different colors indicate clusters of related terms.
Supplementary Materials
The following are available online at
References
1. Te Velthuis, A.J.; Isogai, T.; Gerrits, L.; Bagowski, C.P. Insights into the molecular evolution of the PDZ/LIM family and identification of a novel conserved protein motif. PLoS ONE; 2007; 2, e189. [DOI: https://dx.doi.org/10.1371/journal.pone.0000189]
2. Ooshio, T.; Irie, K.; Morimoto, K.; Fukuhara, A.; Imai, T.; Takai, Y. Involvement of LMO7 in the association of two cell-cell adhesion molecules, nectin and E-cadherin, through afadin and alpha-actinin in epithelial cells. J. Biol. Chem.; 2004; 279, pp. 31365-31373. [DOI: https://dx.doi.org/10.1074/jbc.M401957200]
3. Velthuis, A.J.; Bagowski, C.P. PDZ and LIM domain-encoding genes: Molecular interactions and their role in development. Sci. World J.; 2007; 7, pp. 1470-1492. [DOI: https://dx.doi.org/10.1100/tsw.2007.232]
4. Friedberg, F. Alternative splicing for members of human mosaic domain superfamilies. I. The CH and LIM domains containing group of proteins. Mol. Biol. Rep.; 2009; 36, pp. 1059-1081. [DOI: https://dx.doi.org/10.1007/s11033-008-9281-9]
5. Krcmery, J.; Camarata, T.; Kulisz, A.; Simon, H.G. Nucleocytoplasmic functions of the PDZ-LIM protein family: New insights into organ development. Bioessays; 2010; 32, pp. 100-108. [DOI: https://dx.doi.org/10.1002/bies.200900148]
6. Castro-Sánchez, S.; Álvarez-Satta, M.; Tohamy, M.A.; Beltran, S.; Derdak, S.; Valverde, D. Whole exome sequencing as a diagnostic tool for patients with ciliopathy-like phenotypes. PLoS ONE; 2017; 12, e0183081. [DOI: https://dx.doi.org/10.1371/journal.pone.0183081] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/28800606]
7. Tzeng, Y.W.; Li, D.Y.; Chen, Y.; Yang, C.H.; Chang, C.Y.; Juang, Y.L. LMO7 exerts an effect on mitosis progression and the spindle assembly checkpoint. Int. J. Biochem. Cell Biol.; 2018; 94, pp. 22-30. [DOI: https://dx.doi.org/10.1016/j.biocel.2017.11.006]
8. Holaska, J.M.; Rais-Bahrami, S.; Wilson, K.L. Lmo7 is an emerin-binding protein that regulates the transcription of emerin and many other muscle-relevant genes. Hum. Mol. Genet.; 2006; 15, pp. 3459-3472. [DOI: https://dx.doi.org/10.1093/hmg/ddl423] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17067998]
9. Wozniak, M.A.; Baker, B.M.; Chen, C.S.; Wilson, K.L. The emerin-binding transcription factor Lmo7 is regulated by association with p130Cas at focal adhesions. Peer J.; 2013; 1, e134. [DOI: https://dx.doi.org/10.7717/peerj.134] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/24010014]
10. Possidonio, A.C.; Soares, C.P.; Fontenele, M.; Morris, E.R.; Mouly, V.; Costa, M.L.; Mermelstein, C. Knockdown of Lmo7 inhibits chick myogenesis. FEBS Lett.; 2016; 590, pp. 317-329. [DOI: https://dx.doi.org/10.1002/1873-3468.12067]
11. Hu, Q.; Guo, C.; Li, Y.; Aronow, B.J.; Zhang, J. LMO7 mediates cell-specific activation of the Rho-myocardin-related transcription factor-serum response factor pathway and plays an important role in breast cancer cell migration. Mol. Cell Biol.; 2011; 31, pp. 3223-3240. [DOI: https://dx.doi.org/10.1128/MCB.01365-10]
12. Liu, X.; Yuan, H.; Zhou, J.; Wang, Q.; Qi, X.; Bernal, C.; Avella, D.; Kaifi, J.T.; Kimchi, E.T.; Timothy, P. et al. LMO7 as an unrecognized factor promoting pancreatic cancer progression and metastasis. Front. Cell Dev. Biol.; 2021; 9, 647387. [DOI: https://dx.doi.org/10.3389/fcell.2021.647387] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33763427]
13. Ott, E.B.; van den Akker, N.M.; Sakalis, P.A.; Gittenberger-de Groot, A.C.; Te Velthuis, A.J.; Bagowski, C.P. The lim domain only protein 7 is important in zebrafish heart development. Dev. Dyn.; 2008; 237, pp. 3940-3952. [DOI: https://dx.doi.org/10.1002/dvdy.21807]
14. Holaska, J.M.; Wilson, K.L. An emerin “proteome”: Purification of distinct emerin-containing complexes from HeLa cells suggests molecular basis for diverse roles including gene regulation, mRNA splicing, signaling, mechanosensing, and nuclear architecture. Biochemistry; 2007; 46, pp. 8897-8908. [DOI: https://dx.doi.org/10.1021/bi602636m]
15. Lao, D.H.; Esparza, M.C.; Bremner, S.N.; Banerjee, I.; Zhang, J.; Veevers, J.; Bradford, W.H.; Gu, Y.; Dalton, N.D.; Knowlton, K.U. et al. Lmo7 is dispensable for skeletal muscle and cardiac function. Am. J. Physiol. Cell Physiol.; 2015; 309, pp. C470-C479. [DOI: https://dx.doi.org/10.1152/ajpcell.00177.2015]
16. Mull, A.; Kim, G.; Holaska, J.M. LMO7-null mice exhibit phenotypes consistent with emery-dreifuss muscular dystrophy. Muscle Nerve; 2015; 51, pp. 222-228. [DOI: https://dx.doi.org/10.1002/mus.24286]
17. Dedeic, Z.; Cetera, M.; Cohen, T.V.; Holaska, J.M. Emerin inhibits Lmo7 binding to the Pax3 and MyoD promoters and expression of myoblast proliferation genes. J. Cell Sci.; 2011; 124, pp. 1691-1702. [DOI: https://dx.doi.org/10.1242/jcs.080259]
18. Braasch, I.; Gehrke, A.R.; Smith, J.J.; Kawasaki, K.; Manousaki, T.; Pasquier, J.; Amores, A.; Desvignes, T.; Batzel, P.; Catchen, J. et al. The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons. Nat. genet.; 2016; 48, pp. 427-437.
19. Glasauer, S.M.; Neuhauss, S.C. Whole-genome duplication in teleost fishes and its evolutionary consequences. Mol. Genet. Genomics.; 2014; 289, pp. 1045-1060. [DOI: https://dx.doi.org/10.1007/s00438-014-0889-2] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/25092473]
20. Putilina, T.; Jaworski, C.; Gentleman, S.; McDonald, B.; Kadiri, M.; Wong, P. Analysis of a human cDNA containing a tissue-specific alternatively spliced LIM domain. Biochem. Biophys. Res. Commun.; 1998; 252, pp. 433-439. [DOI: https://dx.doi.org/10.1006/bbrc.1998.9656] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/9826547]
21. Baralle, F.E.; Giudice, J. Alternative splicing as a regulator of development and tissue identity. Nat. Rev. Mol. Cell Biol.; 2017; 18, pp. 437-451. [DOI: https://dx.doi.org/10.1038/nrm.2017.27]
22. Li, J.; Liu, C. Coding or noncoding, the converging concepts of RNAs. Front. Genet.; 2019; 10, 496. [DOI: https://dx.doi.org/10.3389/fgene.2019.00496]
23. Dhamija, S.; Menon, M.B. Non-coding transcript variants of protein-coding genes—What are they good for?. RNA Biol.; 2018; 15, pp. 1025-1031. [DOI: https://dx.doi.org/10.1080/15476286.2018.1511675] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30146915]
24. The UniProt Consortium. UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res.; 2021; 49, D1.
25. Mistry, J.; Chuguransky, S.; Williams, L.; Qureshi, M.; Salazar, G.A.; Sonnhammer, E.L.L.; Tosatto, S.C.E.; Paladin, L.; Raj, S.; Richardson, L.J. et al. Pfam: The protein families database in 2021. Nucleic Acids Res.; 2021; 49, pp. D412-D419. [DOI: https://dx.doi.org/10.1093/nar/gkaa913]
26. Amacher, J.F.; Brooks, L.; Hampton, T.H.; Madden, D.R. Specificity in PDZ-peptide interaction networks: Computational analysis and review. J. Struct. Biol.; 2020; 4, 100022. [DOI: https://dx.doi.org/10.1016/j.yjsbx.2020.100022]
27. Tompa, P. Intrinsically disordered proteins: A 10-year recap. Trends Biochem. Sci.; 2012; 37, pp. 509-516. [DOI: https://dx.doi.org/10.1016/j.tibs.2012.08.004] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/22989858]
28. Uversky, V.N. Intrinsically disordered proteins from A to Z. Int. J. Biochem. Cell Biol.; 2011; 43, pp. 1090-1103. [DOI: https://dx.doi.org/10.1016/j.biocel.2011.04.001]
29. Wright, P.; Dyson, H. Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol.; 2015; 16, pp. 18-29. [DOI: https://dx.doi.org/10.1038/nrm3920]
30. Uversky, V.N.; Gillespie, J.R.; Fink, A.L. Why are “natively unfolded” proteins unstructured under physiologic conditions?. Proteins; 2000; 41, pp. 415-427. [DOI: https://dx.doi.org/10.1002/1097-0134(20001115)41:3<415::AID-PROT130>3.0.CO;2-7]
31. Oates, M.E.; Romero, P.; Ishida, T.; Ghalwash, M.; Mizianty, M.J.; Xue, B.; Dosztányi, Z.; Uversky, V.N.; Obradovic, Z.; Kurgan, L. et al. D²P²: Database of disordered protein predictions. Nucleic Acids Res.; 2013; 41, pp. D508-D516. [DOI: https://dx.doi.org/10.1093/nar/gks1226]
32. Walsh, I.; Martin, A.J.; Di Domenico, T.; Tosatto, S.C. ESpritz: Accurate and fast prediction of protein disorder. Bioinformatics; 2012; 28, pp. 503-509. [DOI: https://dx.doi.org/10.1093/bioinformatics/btr682]
33. Lupas, A.; Van Dyke, M.; Stock, J. Predicting coiled coils from protein sequences. Science; 1991; 252, pp. 1162-1164. [DOI: https://dx.doi.org/10.1126/science.252.5009.1162] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/2031185]
34. Bolognesi, B.; Gotor, N.L.; Dhar, R.; Cirillo, D.; Baldrighi, M.; Tartaglia, G.G.; Lehner, B. A concentration-dependent liquid phase separation can cause toxicity upon increased protein expression. Cell Rep.; 2016; 16, pp. 222-231. [DOI: https://dx.doi.org/10.1016/j.celrep.2016.05.076]
35. Vernon, R.M.; Chong, P.A.; Tsang, B.; Kim, T.H.; Bah, A.; Farber, P.; Lin, H. Forman-Kay JD. Pi-Pi contacts are an overlooked protein feature relevant to phase separation. Elife; 2018; 7, e31486. [DOI: https://dx.doi.org/10.7554/eLife.31486] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29424691]
36. Käll, L.; Krogh, A.; Sonnhammer, E.L. Advantages of combined transmembrane topology and signal peptide prediction—The Phobius web server. Nucleic Acids Res.; 2007; 35, pp. W429-W432. [DOI: https://dx.doi.org/10.1093/nar/gkm256]
37. Bernhofer, M.; Goldberg, T.; Wolf, S.; Ahmed, M.; Zaugg, J.; Boden, M.; Rost, B. NLSdb-major update for database of nuclear localization signals and nuclear export signals. Nucleic Acids Res.; 2018; 46, pp. D503-D508. [DOI: https://dx.doi.org/10.1093/nar/gkx1021]
38. Piskacek, M.; Havelka, M.; Rezacova, M.; Knight, A. The 9aaTAD transactivation domains: From Gal4 to p53. PLoS ONE; 2016; 11, e0162842.
39. Hornbeck, P.V.; Zhang, B.; Murray, B.; Kornhauser, J.M.; Latham, V.; Skrzypek, E. PhosphoSitePlus, 2014: Mutations, PTMs and recalibrations. Nucleic Acids Res.; 2015; 43, pp. D512-D520. [DOI: https://dx.doi.org/10.1093/nar/gku1267]
40. Bañuelos, S.; Saraste, M.; Djinović Carugo, K. Structural comparisons of calponin homology domains: Implications for actin binding. Structure; 1998; 6, pp. 1419-1431. [DOI: https://dx.doi.org/10.1016/S0969-2126(98)00141-5]
41. Sievers, F.; Wilm, A.; Dineen, D.; Gibson, T.J.; Karplus, K.; Li, W.; Lopez, R.; McWilliam, H.; Remmert, M.; Söding, J. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol.; 2011; 7, 539. [DOI: https://dx.doi.org/10.1038/msb.2011.75]
42. Yang, J.; Zhang, Y. I-TASSER server: New development for protein structure and function predictions. Nucleic Acids Res.; 2015; 43, pp. W174-W181. [DOI: https://dx.doi.org/10.1093/nar/gkv342]
43. Matthews, J.M. LIM Domain Proteins. Reference Module in Life Sciences; Elsevier: Amsterdam, The Netherlands, 2017.
44. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A. et al. Highly accurate protein structure prediction with AlphaFold. Nature; 2021; 596, pp. 583-589. [DOI: https://dx.doi.org/10.1038/s41586-021-03819-2] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34265844]
45. Girardi, F.; Le Grand, F. Wnt signaling in skeletal muscle development and regeneration. Prog. Mol. Biol. Transl. Sci.; 2018; 153, pp. 157-179.
46. Fishilevich, S.; Nudel, R.; Rappaport, N.; Hadar, R.; Plaschkes, I.; Iny, S.; Rosen, T.N.; Kohn, A.; Twik, M.; Safran, M. et al. GeneHancer: Genome-wide integration of enhancers and target genes in GeneCards. Database; 2017; 2017, bax028. [DOI: https://dx.doi.org/10.1093/database/bax028] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/28605766]
47. Khan, A.; Zhang, X. dbSUPER: A database of super-enhancers in mouse and human genome. Nucleic Acids Res.; 2016; 44, pp. D164-D171. [DOI: https://dx.doi.org/10.1093/nar/gkv1002]
48. Kimmel, C.B.; Ballard, W.W.; Kimmel, S.R.; Ullmann, B.; Schilling, T.F. Stages of embryonic development of the zebrafish. Dev. Dyn.; 1995; 203, pp. 253-310. [DOI: https://dx.doi.org/10.1002/aja.1002030302] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/8589427]
49. Do Amaral, M.J.; de Andrade Rosa, I.; Andrade, S.A.; Fang, X.; Andrade, L.R.; Costa, M.L.; Mermelstein, C. The perinuclear region concentrates disordered proteins with predicted phase separation distributed in a 3D network of cytoskeletal filaments and organelles. Biochim. Biophys. Acta Mol. Cell Res.; 2021; 1869, 119161. [DOI: https://dx.doi.org/10.1016/j.bbamcr.2021.119161] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34655689]
50. Suzuki, A.; Scruggs, A.; Iwata, J. The temporal specific role of WNT/β-catenin signaling during myogenesis. J. Nat. Sci.; 2015; 1, e143. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26176019]
51. Meijer, L.; Skaltsounis, A.L.; Magiatis, P.; Polychronopoulos, P.; Knockaert, M.; Leost, M.; Ryan, X.P.; Vonica, C.A.; Brivanlou, A.; Dajani, R. et al. GSK-3-selective inhibitors derived from Tyrian purple indirubins. Chem. Biol.; 2003; 10, pp. 1255-1266. [DOI: https://dx.doi.org/10.1016/j.chembiol.2003.11.010]
52. Pombo, A.; Dillon, N. Three-dimensional genome architecture: Players and mechanisms. Nat. Rev. Mol. Cell Biol.; 2015; 16, pp. 245-257. [DOI: https://dx.doi.org/10.1038/nrm3965]
53. Van Eck, N.J.; Waltman, L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics; 2010; 84, pp. 523-538. [DOI: https://dx.doi.org/10.1007/s11192-009-0146-3]
54. Waltman, L.; Van Eck, N.J.; Noyons, E.C.M. A unified approach to mapping and clustering of bibliometric networks. J. Informetr.; 2010; 4, pp. 629-635. [DOI: https://dx.doi.org/10.1016/j.joi.2010.07.002]
55. Tamura, K.; Stecher, G.; Kumar, S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol. Biol. Evol.; 2021; 38, pp. 3022-3027. [DOI: https://dx.doi.org/10.1093/molbev/msab120] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33892491]
56. Xue, B.; Dunbrack, R.L.; Williams, R.W.; Dunker, A.K.; Uversky, V.N. PONDR-FIT: A meta-predictor of intrinsically disordered amino acids. Biochim. Biophys. Acta.; 2010; 1804, pp. 996-1010. [DOI: https://dx.doi.org/10.1016/j.bbapap.2010.01.011] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/20100603]
57. Romero, P.; Obradovic, Z.; Li, X.; Garner, E.C.; Brown, C.J.; Dunker, A.K. Sequence complexity of disordered protein. Proteins; 2001; 42, pp. 38-48. [DOI: https://dx.doi.org/10.1002/1097-0134(20010101)42:1<38::AID-PROT50>3.0.CO;2-3]
58. Dosztányi, Z. Prediction of protein disorder based on IUPred. Protein Sci.; 2018; 27, pp. 331-340. [DOI: https://dx.doi.org/10.1002/pro.3334]
59. Peng, K.; Vucetic, S.; Radivojac, P.; Brown, C.J.; Dunker, A.K.; Obradovic, Z. Optimizing long intrinsic disorder predictors with protein evolutionary information. J. Bioinform. Comput. Biol.; 2005; 3, pp. 35-60. [DOI: https://dx.doi.org/10.1142/S0219720005000886]
60. Ishida, T.; Kinoshita, K. PrDOS: Prediction of disordered protein regions from amino acid sequence. Nucleic Acids Res.; 2007; 35, pp. W460-W464. [DOI: https://dx.doi.org/10.1093/nar/gkm363] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/17567614]
61. Marks, D.S.; Colwell, L.J.; Sheridan, R.; Hopf, T.A.; Pagnani, A.; Zecchina, R.; Sander, C. Protein 3D structure computed from evolutionary sequence variation. PLoS ONE; 2011; 6, e28766. [DOI: https://dx.doi.org/10.1371/journal.pone.0028766]
62. Loots, G.G.; Ovcharenko, I. Dcode.org anthology of comparative genomic tools. Nucleic Acids Res.; 2005; 33, pp. W56-W64. [DOI: https://dx.doi.org/10.1093/nar/gki355] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/15980535]
63. Ovcharenko, I.; Loots, G.G.; Giardine, B.M.; Hou, M.; Ma, J.; Hardison, R.C.; Stubbs, L.; Miller, W. Mulan: Multiple-sequence local alignment and visualization for studying function and evolution. Genome Res.; 2005; 15, pp. 184-194. [DOI: https://dx.doi.org/10.1101/gr.3007205] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/15590941]
64. Mermelstein, C.S.; Costa, M.L.; Chagas, F.C.; Moura Neto, V. Intermediate filament proteins in TPA-treated skeletal muscle cells in culture. J. Muscle Res. Cell Motil.; 1996; 17, pp. 199-206. [DOI: https://dx.doi.org/10.1007/BF00124242] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/8793722]
65. Schindelin, J.; Arganda-Carreras, I.; Frise, E.; Kaynig, V.; Longair, M.; Pietzsch, T.; Preibisch, S.; Rueden, C.; Saalfeld, S.; Schmid, B. et al. Fiji: An open-source platform for biological-image analysis. Nat. Methods; 2012; 9, pp. 676-682. [DOI: https://dx.doi.org/10.1038/nmeth.2019]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
LMO7 is a multifunctional PDZ–LIM protein that can interact with different molecular partners and is found in several intracellular locations. The aim of this work was to shed light on LMO7 evolution, alternative transcripts, protein structure and gene regulation through multiple in silico analyses. We also explored the intracellular distribution of the LMO7 protein in chicken and zebrafish embryonic skeletal muscle cells by means of confocal fluorescence microscopy. Our results revealed a single LMO7 gene in mammals, sauropsids, Xenopus and in the holostean fish spotted gar while two lmo7 genes (lmo7a and lmo7b) were identified in teleost fishes. In addition, several different transcripts were predicted for LMO7 in human and in major vertebrate model organisms (mouse, chicken, Xenopus and zebrafish). Bioinformatics tools revealed several structural features of the LMO7 protein including intrinsically disordered regions. We found the LMO7 protein in multiple intracellular compartments in chicken and zebrafish skeletal muscle cells, such as membrane adhesion sites and the perinuclear region. Curiously, the LMO7 protein was detected within the nuclei of muscle cells in chicken but not in zebrafish. Our data showed that a conserved regulatory element may be related to muscle-specific LMO7 expression. Our findings uncover new and important information about LMO7 and open new challenges to understanding how the diverse regulation, structure and distribution of this protein are integrated into highly complex vertebrate cellular milieux, such as skeletal muscle cells.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details




1 Instituto de Ciências Biomédicas, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil;
2 Faculdade de Farmácia, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil;
3 Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-901, Brazil;
4 Departamento de Bioquímica e Biologia Tecidual, Universidade de Campinas (UNICAMP), Campinas, São Paulo 13083-872, Brazil;