ARTICLE
Received 24 Mar 2013 | Accepted 19 Aug 2013 | Published 12 Sep 2013
Mutually exclusive splicing is an important mechanism in a wide range of eukaryotic branches to expand proteome diversity, but the extent of its distribution within a single species and its evolutionary conservation is unknown. Here we present a genome-wide analysis of mutually exclusive spliced exons (MXEs) in Drosophila melanogaster at unprecedented depth. Most of the new MXE candidates are supported by evolutionary conservation, transcriptome data analysis and identication of competing RNA secondary structural elements. The enrichment of the genes with MXEs in transmembrane transporters and ion channel activity is consistent with ndings in humans, although the MXEs appeared independently and in non-homologous genes, supporting the idea of a universal benet of adapting ion channel and receptor properties by tandem exon duplications. The comparison of the mutually exclusive spliced exomes within the Drosophila clade shows high numbers of MXE gain and loss events, suggesting a role of these processes in speciation.
DOI: 10.1038/ncomms3460
Expansion of the mutually exclusive spliced exome in Drosophila
Klas Hatje 1 & Martin Kollmar1
1 Department of NMR-based Structural Biology, Group Systems Biology of Motor Proteins, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, 37077 Gttingen, Germany. Correspondence and requests for materials should be addressed to M.K. (email: mailto:[email protected]
Web End [email protected] ).
NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 1
& 2013 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms3460
Alternative processing of primary RNA transcripts is an important driver of increased proteome diversity and regulated gene expression in eukaryotes. Alternative
splicing has been reported for alveolates1,2 and stramenopiles3, green algae4 and plants5, the cryptophyte Guillardia theta and the chlorarachniophyte Bigelowiella natans6, fungi7 and metazoa810, and has therefore been an essential characteristic of the last common ancestor of the eukaryotes. The prevalence of the splice types and the overall number of events strongly differ between branches and species. For instance, intron retention is the preferred type in fungi and plants, whereas most mammalian isoforms are produced by exon skipping. A particularly interesting type of generating alternative transcripts is mutually exclusive splicing, which means that exons of clusters of the internal exons are spliced in a mutually exclusive manner. For example, the Drosophila Down Syndrome Cell Adhesion Molecule (Dscam) gene contains 95 mutually exclusive spliced exons (MXEs) representing the most extensively alternatively spliced gene known11,12. Mutations in MXEs and regions regulating their splicing cause human diseases like the Timothy syndrome13, cardiomyopathy14 or cancer15,16. Mutually exclusive splicing has been shown to be regulated by competing RNA secondary structures11,17,18.
Evidence for alternative splicing has mainly been derived by complementary DNA and transcriptome sequencing. However, isoforms might be expressed infrequently in very few tissues or might have very short half-lives hindering their identication with experimental methods, although huge efforts were undertaken to determine the complete transcriptomes of human and model organisms810,19,20. Computational approaches integrating biological knowledge could ll this gap. The annotation of theD. melanogaster genome is in a particularly advanced state, owing to protein purication21, expressed sequence tag (EST) sequencing22,23, transcriptome sequencing10, DNA microarray studies2426, proteomics studies19 and whole-genome sequencing of closely related Drosophila species2729. Therefore, it provides a good platform for assessing new methods and validating predictions thereof.
Despite the vast amount of these high-throughput data, our understanding of MXE splicing in a genome-wide context is
limited. Little is known about the evolution of the mutually exclusive spliced exome. Here we determine the mutually exclusive spliced exome of D. melanogaster with the help of an in silico prediction pipeline30 and provide evidence for newly predicted MXE candidates by experimental data and evolutionary conservation. We report on the continuous rapid gain and loss of MXEs across 12 Drosophila species.
ResultsCharacteristics of MXEs. MXEs have to fulll the following essential preconditions: They need to be arranged next to each other in clusters, the reading frames must be preserved and the splice site patterns, such as GTAG, GCAG or ATAC, must be compatible for anking constitutive exons and the MXEs. In addition, we expect these exons to have a similar length, if they code for the same region in the tertiary structure of the encoded protein. Thus, length differences are only possible in some loop regions to not disturb the overall protein structure. For the same reason and as MXEs likely evolved from exon duplication events, we expect high sequence similarity between those exons, especially in the slower evolving protein sequences. Higher sequence conservation between MXEs compared with skipped exons has already been observed for human, indicating their unusual biological importance8.
Discovery of MXEs. To assess the predictive power of these criteria, we analysed all annotated internal MXEs of D. melanogaster (Flybase r5.36; Fig. 1). The number of MXEs was evaluated as a function of sequence similarity and maximal length difference, whereas the minimal length of the exons was set to 15 aa (Supplementary Fig. S1). The Drosophila genome contains 60 genes, with 261 annotated internal MXEs of which 251 exons(96.2%) in 55 genes (92%) have length differences of o25 aa (239 have length differences of o10 aa; Supplementary Fig. S2), and 234 exons (89.7%) have similarity scores of 41% within the respective clusters (Supplementary Fig. S3). Using these parameters, we would predict 744 genes to encode 3,583 internal MXEs. However, already at more stringent values false-positive
a b
50,000
45,000
147
40,000
150
35,000
30,000
# Genes
100
50
60
Intron length (nt)
25,000
50
40
20,000
30
15,000
0
20
25
20
10
10,000
15
0
Maximal length difference (aa)
10
15
10
20
5,000
30
5
50
40
70
60
80 Similarity score (%)
0
0 50 100 150 200 450Exon length (aa)
0 100
90
Figure 1 | Assessing annotated and predicted MXEs. (a) Dependence of the number of genes containing internal MXEs on the maximal length difference and similarity between search exon and MXE candidate. The coloured grid denotes the number of genes with MXEs as annotated in FlyBase r5.36that were also predicted by WebScipio. The red and blue lines mark the number of genes containing predicted MXE candidates at the maximal length difference of 20 amino acids and at the minimal similarity score of 15%, respectively. (b) Scatter plot of the internal MXE candidates. Green, annotated in r5.36; red, predicted MXEs.
2 NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2013 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms3460 ARTICLE
candidates are predicted, such as an additional exon candidate for the rst cluster of MXEs in the well-studied muscle myosin heavy chain gene (Mhc; length difference of 1 and score of 10.4%). Therefore, we decided to use relatively stringent parameters for the analysis, a maximum length difference of 20 aa and a similarity score of 15%, to avoid the incorporation of many false positives, while being aware that we will miss some of the most divergent cases (Supplementary Fig. S4). Under these criteria, 43 genes (71.7%) encode 218 annotated internal MXEs (83.5% sensitivity) and additional 201 high-condence MXE candidates were predicted, of which 44 are completely new exons in 40 genes (Fig. 1a and Supplementary Figs S5S7). To exclude that the determined characteristics are Drosophila-specic, we also analysed the annotated mutually exclusive exomes of Homo sapiens (NCBI release 37.3), Caenorhabditis elegans (WormBase release WS230) and Arabidopsis thaliana (TAIR release 167; Supplementary Figs S1S3). At a length difference of 20 aa and a similarity score of 15%, 58% (84 of 144) of human MXEs and 54% (19 of 35) of worm MXEs could be reconstructed, while the At annotation (14 MXEs) does not contain MXEs matching our criteria. This analysis indicates that we have determined species-independent parameters to predict MXE candidates. The high sensitivity of the method implies that most of the new MXE candidates are real exons, which have escaped experimental detection so far.
Very short exons and very long introns increase the chances of predicting false candidates. To exclude potential mispredictions, we analysed the exon lengths of annotated MXEs and the lengths of introns surrounding them (Fig. 1b). Exon lengths of MXEs are at least 15 residues (also found for human and Caenorhabditis MXEs, Supplementary Fig. S8). The introns surrounding annotated MXEs vary from 50 to 50,000 bp (Fig. 1b and Supplementary Fig. S9). Although most introns range up to 5,000 bp, we therefore cannot assume that potential MXE candidates in longer introns are false predictions (Supplementary Fig. S10). MXE candidates, which are also conserved in other arthropods, were found, for example, in very long introns of the nAcRalpha-80B and bruno-3 genes (Supplementary Fig. S9).
Characteristics of MXEs. To identify further parameters characterizing MXEs and to ensure that the predicted MXEs have the same features as the already annotated MXEs, we analysed these exons in comparison with all exons in the genome and the subset of constitutive exons matching the criteria of MXEs. The comparison of the exon and intron lengths did not reveal any distinctive features (Supplementary Figs S11S14). In terms of GC content, the annotated MXEs, which we could not reconstruct, and the constitutive exons, which match the criteria for MXEs, have higher GC contents than the MXEs, which we could reconstruct and which we predict (Supplementary Fig. S15). However, the distribution of the GC content is very broad for all types of exons ranging from 30 to 65%, so that this cannot be taken as a criterion for exclusion. On the basis of the FlyBase annotation, MXEs are found in longer genes, and this is also true for the predicted MXE candidates (Supplementary Fig. S16). The codon usage is almost identical in all types of exons, except for a considerably higher content of alanines (GCC codon) and glutamines (CAA and CAG) in the MXEs, which were annotated in FlyBase but which we could not reconstruct (Supplementary Fig. S17). The 50-splice junctions of constitutive and MXEs are also slightly different, the latter having a higher priority for G in the 1 and a lower priority for GT in the 5 and 6 positions
(Supplementary Figs S18 and S19). Analysis of the start and end phases of the exons showed that the percentage of symmetric exons is a bit higher for the predicted and not annotated MXEs
(51%) compared with that of the already annotated but not predicted MXEs (26%, Supplementary Fig. S20). This might indicate that some of the predicted MXEs might rather be spliced as constitutive or differentially included exons. Altogether, further discriminative features between MXEs and constitutive exons to be included in the search parameters could not be determined.
MXEs versus constitutive and differentially included exons. The number of false positives and true negatives could only be determined if an absolutely correct annotation of all genes were available. Although such a data set is missing, we tried to estimate the number of false-positive predictions by searching for constitutive and differentially included exons that match the criteria of MXEs. Of the 60,401 exons annotated as constitutive or differentially included exons in the D. melanogaster genome, only 169 exons (0.28%) in 46 genes matched these criteria. Several of these exons are even annotated as MXEs in the latest FlyBase release based on RNA-Seq evidence, including a cluster of MXEs in the bTub97EF gene, the Lipophorin receptor 1 gene and the nicotinic Acetylcholine Receptor a 30D gene (Supplementary Fig. S21). This demonstrates that only a minor part of all internal exons matching the characteristics of MXEs is spliced constitutively, and we conclude that most of the new MXE candidates will be spliced in a mutually exclusive way.
The mutually exclusive spliced exome of D. melanogaster. To characterize the mutually exclusive spliced exome, we identied 1,297 exons that are mutually exclusive in annotated isoforms of the same gene (Supplementary Tables S1 and S2). Of these, 291 had similar length and sequence, including 218 internal MXEs. We predicted 539 exons of similar length and sequence that could be spliced in a mutually exclusive way (two times the annotated exons; Fig. 2, Supplementary Figs S22 and S23, and Supplementary Data 1 and 2). Four hundred and nineteen of the MXE candidates were internal, including 218 of the already annotated MXEs. Evidence for the predicted MXE candidates was obtained through additional data (Fig. 2 and Supplementary Data3): (A) Mapping of EST and RNA-Seq data. (B) Conservation of the MXE candidates in other arthropods. For this purpose, we identied the homologues to the D. melanogaster genes in 11 sequenced Drosophila species, as well as in Anopheles gambiae, Aedes aegypti, Atta cephalotes, Apis mellifera, Tribolium castaneum, Pediculus humanus corporis and Daphnia pulex, and predicted MXE candidates in the homologues using the same pipeline as for D. melanogaster. (C) Ab initio prediction of exonic regions in the respective introns using AUGUSTUS31.(D) Identication of competing RNA secondary structures. Of the internal MXEs, 57% were supported by multiple data types, 21% were supported by EST data. Of the 44 newly predicted internal MXEs, 8 were supported by EST and/or RNA-Seq data. Of the annotated and reconstructed internal MXEs, and of the total predicted internal MXEs, 94.5% and 76.6%, respectively, are evolutionarily conserved in at least 1 of the 18 further analysed species. In total, only 120 cases of terminal MXEs have been identied with similar length and sequence. These exons are, however, spliced by alternative cleavage and polyadenylation and/ or alternative promotor usage, and represent only 73 (7.0%) of the annotated 1,036 terminal MXEs. As many of these terminal MXE candidates belong to predicted genes, which are not yet supported by full-length cDNA or RNA-Seq data, some might turn to internal exons if further 50 and 30 exons are identied.
Examples of supported new MXE candidates. The genes containing annotated or predicted MXEs are almost evenly spread on all chromosomes (Fig. 2 and Supplementary Data 3). Seventy-ve
NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 3
& 2013 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms3460
drpr
NrxIV
ssp2
Pka R2
Spn4
CG42345
Galpha49B
Nop60B
itp
CG9850
CG17574
CG30493
G oalpha47A
scb
CG9380
CG30427
Ate1
CG30463
sli
aPKC
bbc
RpS11
whd
Aldh III
Dscam
l 2 01289
Gprk1
CG30438
Eb1
Rya r44F
brp
CG30002
14 3 3zeta
Pfk
Or46a
shn
spin
Pkc53E
mbl
CG3209
3L
CG6673
mtacp1
Eip63E
Lmpt
ect
Rdl
Dscam2
Dhc64C
PGRPLC
nAcRalpha 80B
CG8834
CG10638
CG4300
CG6084
2R
verm
Lasp
pHCl
bru 3
dpr6
ome
ens
Or69a
CG16758
CG7991
eRF1
Snap25
mex1
Mbs
l 3 mbn
pip
CG5946
bt
CG11155
pan
4 CG1674
unc 13
Rala
br
CG1835
Ntf 2
Fas2
msta
Cyp4d1
svr
dnc
CG4004
Flo 2
Mgstl
wupA
para
mmd
Ste CG33237
Ste CG33238
g
up
cac
CG1637
CG7766
CG7267
fs 1 h
sws
CG12541
Ca alpha1T
trol
Actn
slo
CG42235
Esyt2
Atpalpha
CG17838
GluClalpha
gish
Sap47
CG31406
CG14608
CRMP
CG31522
kkv
Sh
Unc 115b
Unc 115a
CG6241
CG14741
Octbeta3R
s
qd
abd
A
vib
X
twin
CG17780
CG5768
ymp
LpR2
LpR1
CG31064
betaTub97EF
3R
Hrb98DE
Ppn
CG42248
CG3600
norpA
SK
CG2082
Jupiter
CG31191
CG34417
mys
tau
heph
ETHR
CG42249
Muc11A
Top1
slgA
AnnIX
Tm2
Tm1
Hrb87F
SelR
mod mdg4
Oamb
fru
CG17472
CadN2
beat IIIc
elB
obst E
jet
25 MXEs5 MXEs
Tsp97E
Ald
CG8671
Ca alpha1D
Mhc
nimC1
Gr28b
nrv2
tkv
CG32982
CG6206
MRP
CG34126
CG43129
Gr39a
CadN
mol
1 MXE
CG4495
nompC
nAcRalpha 30D
CG8086
TepII
Lar
CG3625
l 2 gl
Tsp26A
stai
Sam S
ush
Cda5
CG14010
pgant5
Cf2
CG15279
crol
IA2
Gs1l
tim
2L
dp
Annotated MXEs (%)
MXEs with EST evidence (%)
MXEs with cross-species evidence (%) MXEs overlapping AUGUSTUS prediction (%)
Initial MXEs (%) Internal MXEs (%) Terminal MXEs (%)
Figure 2 | The mutually exclusive exome of Drosophila melanogaster. All genes containing predicted MXEs are listed. The grey bars denote the number of MXEs predicted for each gene and the coloured bars display the evidence by various criteria, in per cent.
per cent of them are named and have at least one functional study linked in FlyBase. As an example, the new gene model of the vibrator (vib) gene coding for a phosphatidylinositol transfer protein contains a cluster of MXEs, which was not known in r5.36 but is supported by EST and RNA-Seq data, and is included in the latest release r5.48 (Fig. 3a; for the complete list of new MXE candidates not included in r5.36 but in r5.48, see Supplementary Fig. S22). Examples of new clusters of MXEs in well-known genes that are not included in r5.48 include the Shaker (Sh) gene, in which the cluster is conserved in all arthropod species analysed and of which the 30-end of the new MXE candidate is supported by RNA-Seq data, and the nicotinic Acetylcholine Receptor a 80B (nAcRalpha-80B) gene, in which the cluster is conserved from Daphnia to mosquitoes and Drosophila but not yet supported by
experimental data (Fig. 3a; for the complete list of new MXE candidates not included in r5.48, see Supplementary Fig. S23). The evolutionary conservation provides high condence to the new MXE candidates. The missing experimental support indicates that the new MXE candidates either represent low-abundance isoforms or variants restricted to very specic cell types and developmental stages, which were not yet covered by the RNASeq read depth and tissue selection8,10. For instance, the expression of an MXE isoform of the human calcium channel CaV2.2 is restricted exclusively to nociceptive neurons of dorsal root ganglia32. However, the MXE expression could also be regulated differently in other individuals than in the sequenced y and in other species. Interindividual variation has been found to be very common in humans, although still less frequent than
4 NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2013 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms3460 ARTICLE
a
b
Vibrator
200 bp (ex.) 1,800 bp (in.)
Cross-species search in Pediculus humanus corporis 200 bp (ex.) 1,100 bp (in.)
10 20
....|....|....|....|....|..exonA NPKFMKDAFKIIIDTLHV-GDAGDSENexonB NPGYMDKNFKIDIYSQHIENDLGTVDNPdcExonA NPLYMKEKFHLTIESHHL-IDDGQNENPdcExonB NPGYMKENFLIMIESFHI-NDSGYQEN
Shaker
500 bp (ex.) 11,100 bp (in.)
CG14608 800 bp (ex.) 3100 bp (in.)
Conserved in dper, dpse, dsec, dsim and dvir.
5
AU
U A
C
CU CU UG C U
A A
2a
1 GCCUU UCA AGGCAUCAUCUUA UAGAUUUA GCAUG UUCCACAUAAUUUUAUU CUUUUA GUC
UAG
CGGAACAG U
CUGUGG AGAACUUUUUAGA GUAU CAGAAGUAU AAAAUAU CAAAAUACU
||||| || |||||| |||| |||||| ||| || |||| |||||| ||||| |||
CC
A A
CG
2b
3
3
2c
UUAUUAC
U C
5
G
U
C U U
U C
U
U A
U A
A U
U U
A C
A U
U
G U G C
AUCUUA A
1 UCU GCCUUUCACUA AUC GAUUUAUGGCAUGCAUU CAC U
AGAACCGGCGGUUA AGGAGAGC CCUAAGUAACCUACUAUAA
||| ||| ||||| || |||| ||||||||| ||| ||| || ||||
AGUC
2a
GU UCAG
2c
Cross-species search in Daphnia pulex
400 bp (ex.) 2,800 bp (in.)
3
UG
3
2b
10 20 30 40
....|....|....|....|....|....|....|....|....exonA GVVLFSSAVYFAEAGSDSSFFKSIPDGFWWAVVTMTTVGYGDMRexonB GVVLFSSAVYFAEAGSENSFFKSIPDAFWWAVVTMTTVGYGDMTDapExonA GVILFSSAVYFAEAGSEYSYFKSIPDAFWWAVVTMTTVGYGDMRDapExonB GVILFSSAVYFAEAGSEVSHFKSIPDAFWWAVVTMTTVGYGDMT
Nicotinic Acetylcholine receptor [afii9825] 80B
400 bp (ex.) 17,800 bp (in.)
Legend
Exon Intron Pos. with Ns in target
Intron?
Sequence shift
Mismatch
Additional codon in target or unmatched query
Cross-species search in Anopheles gambiae 400 bp (ex.) 4,700 bp (in.)
10 20 30 40 50 60
....|....|....|....|....|....|....|....|....|....|....|....|exonA ADGNFEVTLATKATIYSEGLVEWKPPAIYKSSCEIDVEYFPFDEQTCVLKFGSWTYDGFKexonB ADGHYEVTLMTKAIVYNNGLVIWQPPAVYKSSCSIDVEYFPYDVQTCILKLGSWTYDGFKAngExonA ADGNFEVTLATKATIYSEGLVEWKPPAIYKSSCEIDVEYFPFDEQTCVLKFGSWTYDGFKAngExonB ADGHYEVTLMTKATVYNNGMVIWQPPAVYKSSCSIDVEYFPYDVQTCVLKLGSWTYDGFK
Gap Gap
Figure 3 | Examples of new MXE candidates. (a) Exonintron gene structures of example genes containing newly predicted internal MXEs. The isoforms of the vibrator gene are annotated in r5.48 and are supported by cDNA and RNA-Seq, the shaker gene splice variants are not annotated but supported by RNA-Seq, and the MXE candidate of the nicotinic Acetylcholine Receptor a 80B is not experimentally supported yet. (b) Exonintron gene structure of the
CG14608 gene containing two newly predicted internal MXEs that are supported by evidence through competing RNA secondary structures. All transcripts are represented 50 to 30. The colour coding is explained in the legend and applies to all gene structure gures. Coloured big bars represent mutually exclusive exons. The darkest coloured bar is the exon that was included in the query sequence, whereas the lighter coloured bars represent identied
MXEs. The higher the similarity between the candidate and the query exon, the darker the colour of the candidate (100% identity would result in the same colour). The opacity of the colours of each alternative exon corresponds to the alignment score of the alternative exon to the original one.
variation between tissues8, but has not yet been shown for Drosophila.
Alternative splicing of the Mhc and the Dscam genes in arthropods has been shown to be regulated by RNA secondary structures18,33. Docking sites (acceptor sequences) have been identied in the introns in front or behind the cluster of MXEs to which only one of the selector sequences downstream or upstream of each MXE, respectively, can bind at a time, forming conserved base-pairing interactions. Although such sites have only been found for some of the MXE clusters in the 14-3-3z, the Mhc and the Dscam genes, this mechanism might also regulate the splicing of other MXE clusters. We searched for complementing sequences in all predicted clusters of MXEs and found favourable sites in many of the annotated clusters. The CG14608 gene exemplies a predicted cluster of MXEs, for which RNA-Seq evidence is not available, but which is supported by cross-species evidence and by competing RNA secondary structure prediction (Fig. 3b and Supplementary Fig. S24).
Functional analysis of the genes containing MXEs. To analyse the conservation pattern of the genes containing MXEs with respect to their involvement in biological processes, we performed a Gene Ontology (GO) analysis34. Surprisingly, the
genes with annotated and reconstructed MXEs, as well as the genes with predicted but not annotated MXEs, both display strong enrichment in transmembrane transporter and ion channel activity, and plasma membrane localization (Supplementary Figs S25 and S26). It has been shown for human that genes with MXEs are more often involved in regulating highly tissue-specic functions than genes with spliced exons, and that these genes are enriched in cell communication and signal trunsduction8. Tandem exon duplications have occurred, for example, in many of the human, C. elegans and Drosophila ion channels, in both voltage- and calcium-gated types35, and in glycine, glutamate and nicotinic acetylcholine receptors35,36. However, the exons duplicated independently, and not only in homologous but also non-homologous families. It has therefore been suggested that exon duplications provide a general benet of adapting ion channel and receptor functions compared with gene duplications35. Although not shown yet, MXEs in these genes are probably a common property of all metazoans. The GO analysis of the new Drosophila MXE candidates showing a similar enrichment in biological processes as in annotated genes indicates that the recent experimental approaches were not yet exhaustive enough and that our approach is a valuable complement to unveil the mutually exclusive exome of a species.
NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 5
& 2013 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms3460
a
dyak, dere
dmel
67
15
2
5
5
11
9
1
1
1
1
1
1
4
1
1
3
dana
1
2
1
1
3
4
2
2
dmoj
dgri
7
57
3
7
Sophophora
43
5
dsim, dsec
1
9 4
13
56
1
2
9 11
9
119
115
1
1
420
12
13
1
1
2
53
3
4
1
46
1
1
4
dvir
1
1
2
1 2
2
95
1
1
3
164
4
3
dpse, dper
4
1
dvir, dmoj,
6
1
2
dgri
66
dwil
b c
D. willistoniGene: GK24986, FBgn0226945 Polypeptide: GK24986-PA, FBpp0254129
For clarity introns have been scaled down by a factor of 9.15
D. willistoniGene: GK25120, FBgn0227079 Polypeptide: GK25120-PA, FBpp0254263
Cluster conserved in Anopheles gambiae, Apis mellifera, Atta cephalotes, Pediculus humanus corporis, Tribolium castaneum, dgri, dper and dvir.
1,400 bp (ex.) 12,900 bp (in.)
Cluster conserved in Aedes aegypti, Anopheles gambiae, Apis mellifera, Atta cephalotes,
Daphnia pulex, Pediculus humanus corporis, Tribolium castaneum, dgri and dvir.
400 bp (ex.) 1,200 bp (in.)
1 scf2_1100000004909 (58,856 bp)
1 scf2_1100000004909 (6,666 bp)
For clarity introns have been scaled down by a factor of 2.81
D. melanogasterGene: cac, cacophony, FBgn0263111 Polypeptide: cac-PA, FBpp0298319
For clarity introns have been scaled down by a factor of 6.73
Cross-species search in D. melanogaster Gene: Rop, Ras opposite, FBgn0004574 Polypeptide: Rop-PA, FBpp0073119
Also no introns in the Ras opposite genes of dere, dsec and dyak.
1,400 bp (ex.) 9,300 bp (in.)
1,200 bp (ex.) 5,300 bp (in.)
1 (44,010 bp)
200 bp
Cross-species search in Daphnia pulex
Daphnia pulex
1 3L (1,791 bp)
Cross-species search in Pediculus humanus corporis
400 bp (ex.) 1,400 bp (in.)
1,400 bp (ex.) 4,900 bp (in.)
1 gi|145650040|gb|DS235824.1| (6,565 bp)
For clarity introns have been scaled down by a factor of 3.41
1 gi|321454411|gb|GL732739.1| (26,094 bp)
For clarity introns have been scaled down by a factor of 3.56
10 20 30
dwilExonA YHNQSEMYEKSLKYINMGFTGMFSVETVLKIIGFGVKdwilExonB FHDAPPTHSDILTYMNLVFTLFFLMETILKLIAFGCTdmelExon YHNQGDMYEKSLKYINMGFTGMFSVETVLKIIGFGVKDapExonA YYKQSVLYKETLHYMNTAFTALFSIECMLKIISFGVRDapExonB YYEAPPALVDILAYMNLIFTMLFSLECILKLAAFGIK
10 20 30 40 50
dwilExonA --------AIPPRIFEMLQSHKDICRRYVRTCKEINISFLAYEAQ-----------dwilExonB --------VCPEELFNDL--CKSCAARKIKTLKEINIAFLPYECQ-----------dmelExon YAHVFFTEVCPEELFNDL--CKSCAAGKIKTLKEINIAFLPYECQVFSLDSPDTFQPdcExonA --------ACNDELFKEI--SHARVAKFIKTLKEINIAFIPFEEQ-----------PdcExonB --------VCPEELFNEL--CKSCAAKKIKTLKEINIAFLPYESQ-----------
Figure 4 | Evolution of mutually exclusive splicing clusters. (a) The Venn diagrams48 show the number of clusters of MXEs shared between species and subsets of species groups. Species abbreviations are D. simulans (dsim), D. sechellia (dsec), D. melanogaster (dmel), D. yakuba (dyak), D. erecta (dere), D. ananassae (dana), D. pseudoobscura (dpse), D. persimilis (dper), D. willistoni (dwil), which are all part of the Sophophora branch, and D. virilis (dvir),D. mojavensis (dmoj) and D. grimshawi (dgri). (b) The gene structure of the D. melanogaster Cacophony gene is shown in comparison with its homologues in D. willistoni and Daphnia pulex. The D. melanogaster gene contains three clusters of MXEs. A forth cluster is present in D. willistoni, Daphnia and in other insects that had been lost in D. melanogaster. The respective corresponding exons are marked by dotted lines, and their sequences are shown in the alignment. (c) The gene structure of the D. melanogaster Ras opposite gene is shown in comparison with its homologues in D. willistoni and Pediculus humans corporis. The D. melanogaster gene consists of a single exon as do the homologues in dere, dsec and dyak. The introns of this gene must have been lost in the ancestor of the melanogaster subgroup branch. In contrast, the Ras opposite gene in D. willistoni is a multi-exon gene and contains a cluster of MXEs. This cluster of MXEs is also found in Pediculus and most of the other analysed insect species. The sequences of the MXEs as well as the corresponding region in the D. melanogaster protein are shown in the alignment. All transcripts in b and c are represented 50 to 30. Coloured big bars represent mutually exclusive exons. The darkest coloured bar is the exon that was included in the query sequence, whereas the lighter coloured bars represent identied MXEs.
The higher the similarity between the candidate and the query exon, the darker the colour of the candidate (100% identity would result in the same color). The opacity of the colours of each alternative exon corresponds to the alignment score of the alternative exon to the original one.
6 NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2013 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms3460 ARTICLE
56 dsim, dsec
57 dana
95 dpse, dper
66 dwil
7
27
Melanogaster subgroup
12
25
21
11
7
67 dyak, dere
9
20
11
6
47
15 dmel
Obscura group
Sophophora
37
33
186
14
Drosophila
46 dvir
43 dmoj
34
28
13
13
25
53 dgri
9
50 Myr
Figure 5 | The gain and loss of clusters of MXEs. The gain and loss of clusters of MXEs plotted onto the evolutionary tree of the Drosophila species (black and red numbers, respectively). Species abbreviations areD. simulans (dsim), D. sechellia (dsec), D. melanogaster (dmel), D. yakuba (dyak), D. erecta (dere), D. ananassae (dana), D. pseudoobscura (dpse),D. persimilis (dper) and D. willistoni (dwil), which are all part of the Sophophora branch, and D. virilis (dvir), D. mojavensis (dmoj) andD. grimshawi (dgri).
Evolution of the MX spliced exome in 12 Drosophila. It is well known that the clusters of MXEs are highly conserved, for example, in the Drosophila Mhc genes37, whereas some variability has been observed for the Dscam genes11,30. To determine the extent of conservation within the Drosophila MXEs, we compared the data from D. melanogaster (dmel) with the reconstructed corresponding exomes of 11 further Drosophila species (Fig. 4a). In total, 2,640 clusters were identied, most of which are shared among several species, resulting in 770 unique clusters. The genomes of dsim, dsec and dper are less complete than the other assemblies and were, therefore, analysed in groups, resulting in seven Drosophila species or species groups (Fig. 4a). Surprisingly, many of the clusters are unique to one of these groups, such as 164 clusters within the Drosophila subgenus group (dvir, dmoj and dgri) or 95 clusters within the obscura group (dpse and dper). Only 68 clusters are conserved in all 12 species (115 in the seven groups). Thirty-six clusters are missing in only 1 of the species and 16 clusters are absent in any 2 species. The alternative exons of these clusters could have been lost in these species because of a single, independent exon-loss event, or have not been detected. Potential reasons for the latter can be gaps in the assemblies, leading to the absence of entire and partial genes or single exons, and exon sequence divergence, leading to their exclusion under the given cutoff values. However, most clusters are shared by at least two species or species groups, and it is very unlikely that assembly gaps are present in independent genomes at exactly the same region in all the other species. Examples are the Cacophony gene, for which an additional conserved cluster of MXEs was identied in dwil, dgri, dvir and all other arthropods analysed that has, however, been lost in dmel and the other Drosophila species (Fig. 4b), and the Ras opposite (Rop) gene, which is a single-exon gene in dmel, dere, dsec and dyak, but a multi-exon
gene containing a conserved cluster of MXEs in dwil, dgri, dper, dvir and the other arthropods (Fig. 4c). The predicted clusters of MXEs, therefore, represent MXEs of which the alternative exons have been lost in certain species, or exons that have been gained at a certain step in Drosophila evolution. To determine exon gain and loss during the evolution of the Drosophila species, we counted these events based on maximum parsimony requiring the least exon-loss events (Fig. 5). The last common ancestor of the Drosophila species contained at least 186 clusters of MXEs(24.2% of all unique clusters). Four hundred and fty-six clusters(59.2%) are unique to any of the Drosophila species and 111 clusters (14.4%) have been gained in certain branches.
The presence of a large subset of MXEs in at least two Drosophila species, which are not closest relatives, implies frequent MXE gain and loss events during insect evolution. Compared with the conserved set of MXEs, the number of species-specic gains and losses is striking. This pattern suggests that exon duplication leading to MXEs is a very active process that might contribute to speciation. Similarly, changes in the alternative splicing patterns in vertebrates have been found to contribute more to vertebrate speciation and tissue specication than gene expression programmes do38,39. Exon duplication is a very convenient way to increase protein diversity by only modulating a domain or subdomain function. In contrast to gene duplications, the gene dosage is most likely not altered in genes after exon duplication eliminating the need for a relaxed selection against degenerative mutations. Thus, exon duplicates are immediately subject to stabilizing selection and could improve different functions of the original exons.
DiscussionOur analysis of the mutually exclusive exome of D. melanogaster considerably increased the number of mutually exclusive splicing events. Specically, we have identied two times more internal MXE candidates than that already annotated, of which almost 80% are supported by evolutionary conservation or experimental transcript data. This number is surprising given the enormous and long-standing efforts in annotating the D. melanogaster genome. However, annotation is a continuous process and even a recent exhaustive exploration of the developmental transcriptome ofD. melanogaster using RNA-Seq, tiling microarrays and cDNA sequencing failed to detect expression of 12% of the known genes, although the coverage of the genome and transcriptome were 1,200-and 5,900-fold, respectively10. This is consistent with a recent proteomics study showing that MXEs are highly underrepresented in RNA-Seq data19. Because of the tight cut-offs of our analysis, we are sure that many more MXEs can be identied through manual investigation of the unexplored data. Here we provide an important step in completing the D. melanogaster genome annotation and a valuable resource for further studies.
The method has also been applied to the human, C. elegans andA. thaliana genomes for comparison. In all genomes, many more MXE candidates could be identied than the ones annotated, although MXE splicing is not as prevalent as in Drosophila. However, the water ea Daphnia pulex has two to three times more genes with clusters of MXEs than that in D. melanogaster, and we are sure that this particular alternative splice type is even more widespread in other species. Our method provides a straightforward way to analyse other genomes in the future, including resolving articial fusions of tandemly arrayed gene duplicates and candidates for trans-splicing. Given the suggested unusual biological importance of the MXEs in human8 compared with skipped exons, the rapid evolution of the mutually exclusive exomes of the 12 Drosophila species is all the more surprising. It indicates an important role of the corresponding genes and the
NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 7
& 2013 Macmillan Publishers Limited. All rights reserved.
ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/ncomms3460
necessity to conserve the overall shape of the proteins while creating diversity in restricted regions.
Methods
Reconstruction of gene structures. Genome assemblies and annotated proteins for the Drosophila species were obtained from FlyBase40 (r5.36 for D. melanogaster, r 1.2 for D. virilis, r 2.25 for D. pseudoobscura and r 1.3 for all other Drosophila species), for Caenorhabditis elegans from WormBase41 (WS 230), for Arabidopsis thaliana from TAIR42 (v. 10) and for human from GenBank (v. 37.3). EST data were downloaded from GenBank. More details about the data sets are given in Supplementary Table S3. The gene structures for the annotated proteins were reconstructed with Scipio43 using standard parameters, except max_mismatch 7, region_size 20,000 (50,000 for
D. melanogaster), single_target_hits, max_move_exon 10, gap_to_close 0,
blat_oneoff false, blat_score 15, blat_identity 54, exhaust_align_size 20,000
and exhaust_gap_size 50. Scipio starts with a blat_tilesize of 7 and reduces this step
by step to 4, if parts of the protein sequence could not be found. All parameters are less stringent than default parameters to increase the chance to reconstruct all genes automatically. The region_size, which determines the number of up- and downstream nucleotides added to the gene sequence, has been increased to allow searching for 50 and 30 candidates of MXEs.
Predicting MXEs. MXEs were predicted in the reconstructed genes using the algorithm implemented in WebScipio30 with a minor modication favouring GT AG splice junctions over the other possible splice sites (GCAG and GGAG) if several overlapping candidates existed. As initial parameters for MXE candidate predictions in D. melanogaster, we set the length difference to 25 aa, the minimum score to 1 and the minimum exon length to 1. For all other species, the parameters were length difference 20, minimum score 10 and minimum exon length 10.
MXE candidates were searched for all exons in all introns, and up- and downstream regions. Candidates for 50-exons of genes were expected to start with a methionine, and candidates for 30-exons of genes were expected to end with a stop codon.
Obtaining evidence for MXE candidates. Ab initio exon prediction was done with AUGUSTUS using default parameters to nd alternative splice forms and the feature set for D. melanogaster. Cross-species searches and mapping of EST data were done with WebScipio with same parameters as for gene reconstructions, except min_identity 60, max_mismatch 0 (allow any number of mismatches),
gap_to_close 10, min_intron_length 35, blat_tilesize 6 and blat_oneoff
true. MXE candidates in cross-species gene reconstructions were searched with length difference 20, minimum score 15 and minimum exon length 15, for
all exons in all introns but not in up- and downstream regions. Binding windows for competing intron RNA secondary structures were predicted for all candidate clusters of MXEs using the SeqAn44 package. The identied binding windows of all homologous genes were aligned using MUSCLE45 and the RNA secondary structures predicted by RNAalifold (ViennaRNA package)46. The GO enrichment analysis was done with AmiGO47. All data can be searched, ltered and browsed at Kassiopeia (http://www.motorprotein.de/kassiopeia
Web End =www.motorprotein.de/kassiopeia). For upload into the FlyBase genome browser, a GFF le containing the complete gene structures of the genes that include new MXEs, and a GFF le containing only the clusters of MXE candidates are available as Supplementary Data 1 and 2.
References
1. Sorber, K., Dimon, M. T. & DeRisi, J. L. RNA-Seq analysis of splicing in Plasmodium falciparum uncovers new splice junctions, alternative splicing and splicing of antisense transcripts. Nucleic Acids Res. 39, 38203835 (2011).
2. Pszenny, V. et al. Targeted disruption of Toxoplasma gondii serine protease inhibitor 1 increases bradyzoite cyst formation in vitro and parasite tissue burden in mice. Infect. Immun. 80, 11561165 (2012).
3. Shen, D., Ye, W., Dong, S., Wang, Y. & Dou, D. Characterization of intronic structures and alternative splicing in Phytophthora sojae by comparative analysis of expressed sequence tags and genomic sequences. Can. J. Microbiol. 57, 8490 (2011).
4. Labadorf, A. et al. Genome-wide analysis of alternative splicing in Chlamydomonas reinhardtii. BMC Genomics 11, 114 (2010).
5. Marquez, Y., Brown, J. W. S., Simpson, C., Barta, A. & Kalyna, M. Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis. Genome Res. 22, 11841195 (2012).
6. Curtis, B. A. et al. Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs. Nature 492, 5965 (2012).
7. Wang, B. et al. Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing. Nucleic Acids Res. 38, 50755087 (2010).
8. Wang, E. T. et al. Alternative isoform regulation in human tissue transcriptomes. Nature 456, 470476 (2008).
9. Pan, Q., Shai, O., Lee, L. J., Frey, B. J. & Blencowe, B. J. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat. Genet. 40, 14131415 (2008).
10. Graveley, B. R. et al. The developmental transcriptome of Drosophila melanogaster. Nature 471, 473479 (2011).
11. Graveley, B. R. Mutually exclusive splicing of the insect Dscam pre-mRNA directed by competing intronic RNA secondary structures. Cell 123, 6573 (2005).
12. Nilsen, T. W. & Graveley, B. R. Expansion of the eukaryotic proteome by alternative splicing. Nature 463, 457463 (2010).
13. Splawski, I. et al. Ca(V)1.2 calcium channel dysfunction causes a multisystem disorder including arrhythmia and autism. Cell 119, 1931 (2004).
14. Mayr, J. A. et al. Deciency of the mitochondrial phosphate carrier presenting as myopathy and cardiomyopathy in a family with three affected children. Neuromusc. Disord. 21, 803808 (2011).
15. Fraser, S. P. et al. Voltage-gated sodium channel expression and potentiation of human breast cancer metastasis. Clin. Cancer Res. 11, 53815389 (2005).
16. David, C. J., Chen, M., Assanah, M., Canoll, P. & Manley, J. L. HnRNP proteins controlled by c-Myc deregulate pyruvate kinase mRNA splicing in cancer. Nature 463, 364368 (2010).
17. Xing, Y. & Lee, C. Alternative splicing and RNA selection pressure--evolutionary consequences for eukaryotic genomes. Nat. Rev. Genet. 7, 499509 (2006).
18. Yang, Y. et al. RNA secondary structure in mutually exclusive splicing. Nat. Struct. Mol. Biol. 18, 159168 (2011).
19. Ezkurdia, I. et al. Comparative proteomics reveals a signicant bias toward alternative protein isoforms with conserved structure and function. Mol. Biol. Evol. 29, 22652283 (2012).
20. Brunner, E. et al. A high-quality catalog of the Drosophila melanogaster proteome. Nat. Biotechnol. 25, 576583 (2007).
21. Guruharsha, K. G. et al. A protein complex network of Drosophila melanogaster. Cell 147, 690703 (2011).
22. Stapleton, M. et al. The Drosophila Gene Collection: Identication of putative full-length cDNAs for 70% of D. melanogaster genes. Genome Res. 12, 12941300 (2002).
23. Stapleton, M. et al. A Drosophila full-length cDNA resource. Genome. Biol. 3 research0080-0080.8 (2002).
24. Arbeitman, M. N. et al. Gene expression during the life cycle of Drosophila melanogaster. Science 297, 22702275 (2002).
25. Stolc, V. et al. A gene expression map for the euchromatic genome of Drosophila melanogaster. Science 306, 655660 (2004).
26. Manak, J. R. et al. Biological function of unannotated transcription during the early development of Drosophila melanogaster. Nat. Genet. 38, 11511158 (2006).
27. Richards, S. et al. Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene, and cis-element evolution. Genome Res. 15, 118 (2005).
28. Clark, A. G. et al. Evolution of genes and genomes on the Drosophila phylogeny. Nature 450, 203218 (2007).
29. Zhou, Q. & Bachtrog, D. Sex-specic adaptation drives early sex chromosome evolution in Drosophila. Science 337, 341345 (2012).
30. Pillmann, H., Hatje, K., Odronitz, F., Hammesfahr, B. & Kollmar, M. Predicting mutually exclusive spliced exons based on exon length, splice site and reading frame conservation, and exon sequence homology. BMC Bioinformatics 12, 270 (2011).
31. Stanke, M. & Waack, S. Gene prediction with a hidden Markov modeland a new intron submodel. Bioinformatics 19(Suppl 2): ii215ii225 (2003).
32. Bell, T. J., Thaler, C., Castiglioni, A. J., Helton, T. D. & Lipscombe, D. Cell-specic alternative splicing increases calcium channel current density in the pain pathway. Neuron 41, 127138 (2004).
33. May, G. E., Olson, S., McManus, C. J. & Graveley, B. R. Competing RNA secondary structures are required for mutually exclusive splicing of the Dscam exon 6 cluster. RNA. 17, 222229 (2011).
34. Rhee, S. Y., Wood, V., Dolinski, K. & Draghici, S. Use and misuse of the gene ontology annotations. Nat. Rev. Genet. 9, 509515 (2008).
35. Copley, R. R. Evolutionary convergence of alternative splicing in ion channels. Trends Genet. 20, 171176 (2004).
36. Lipscombe, D. Neuronal proteins custom designed by alternative splicing. Curr. Opin. Neurobiol. 15, 358363 (2005).
37. Odronitz, F. & Kollmar, M. Comparative genomic analysis of the arthropod muscle myosin heavy chain genes allows ancestral gene reconstruction and reveals a new type of partially processed pseudogene. BMC Mol. Biol. 9, 21 (2008).
38. Merkin, J., Russell, C., Chen, P. & Burge, C. B. Evolutionary dynamics of gene and isoform regulation in mammalian tissues. Science 338, 15931599 (2012).
39. Barbosa-Morais, N. L. et al. The evolutionary landscape of alternative splicing in vertebrate species. Science 338, 15871593 (2012).
40. Tweedie, S. et al. FlyBase: enhancing Drosophila Gene Ontology annotations. Nucleic Acids Res. 37, D555D559 (2009).
41. Yook, K. et al. WormBase 2012: more genomes, more data, new website. Nucleic Acids Res. 40, D735D741 (2012).
8 NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications
& 2013 Macmillan Publishers Limited. All rights reserved.
NATURE COMMUNICATIONS | DOI: 10.1038/ncomms3460 ARTICLE
42. Lamesch, P. et al. The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res. 40, D1202D1210 (2012).
43. Keller, O., Odronitz, F., Stanke, M., Kollmar, M. & Waack, S. Scipio: using protein sequences to determine the precise exon/intron structures of genes and their orthologs in closely related species. BMC Bioinformatics 9, 278 (2008).
44. Dring, A., Weese, D., Rausch, T. & Reinert, K. SeqAn An efcient, generic C library for sequence analysis. BMC Bioinformatics 9, 11 (2008).
45. Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5, 113 (2004).
46. Lorenz, R. et al. ViennaRNA Package 2.0. Algorithms Mol. Biol. 6, 26 (2011).47. Carbon, S. et al. AmiGO: online access to ontology and annotation data. Bioinformatics 25, 288289 (2009).
48. McCandless, D. Information is Beautiful (Collins, 2010).
Acknowledgements
We thank Dr. Bjrn Hammesfahr and Professor Stephan Waack for suggestions and discussions, and Professor Christian Griesinger for continuous support. This work was supported by the German Research Foundation (KO 2251/6-1 to M.K.).
Author contributions
K.H. wrote software and scripts. K.H. and M.K. performed all data analyses and wrote the manuscript.
Additional information
Accession code: Sequences for spliced exons have been deposited in the Kassiopeia database under accession numbers dmel00001 to dmel00539.
Supplementary Information accompanies this paper at http://www.nature.com/naturecommunications
Web End =http://www.nature.com/ http://www.nature.com/naturecommunications
Web End =naturecommunications
Competing nancial interests: The authors declare no competing nancial interests.
Reprints and permission information is available online at http://npg.nature.com/reprintsandpermissions/
Web End =http://npg.nature.com/ http://npg.nature.com/reprintsandpermissions/
Web End =reprintsandpermissions/
How to cite this article: Hatje, K. et al. Expansion of the mutually exclusive spliced exome in Drosophila. Nat. Commun. 4:2460 doi: 10.1038/ncomms3460 (2013).
NATURE COMMUNICATIONS | 4:2460 | DOI: 10.1038/ncomms3460 | http://www.nature.com/naturecommunications
Web End =www.nature.com/naturecommunications 9
& 2013 Macmillan Publishers Limited. All rights reserved.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright Nature Publishing Group Sep 2013
Abstract
Mutually exclusive splicing is an important mechanism in a wide range of eukaryotic branches to expand proteome diversity, but the extent of its distribution within a single species and its evolutionary conservation is unknown. Here we present a genome-wide analysis of mutually exclusive spliced exons (MXEs) in Drosophila melanogaster at unprecedented depth. Most of the new MXE candidates are supported by evolutionary conservation, transcriptome data analysis and identification of competing RNA secondary structural elements. The enrichment of the genes with MXEs in transmembrane transporters and ion channel activity is consistent with findings in humans, although the MXEs appeared independently and in non-homologous genes, supporting the idea of a universal benefit of adapting ion channel and receptor properties by tandem exon duplications. The comparison of the mutually exclusive spliced exomes within the Drosophila clade shows high numbers of MXE gain and loss events, suggesting a role of these processes in speciation.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer