that, in the majority of cases, do not share sequence homology with each other. These longer transcripts are spliced, capped and polyadenylated, suggesting that they are expressed and potentially functional within cells. Indeed, some lncRNAs do now have proven functions, and where these are known they have been named accordingly; for instance, 'XIST' 'X (inactive)-specific transcript (non-protein coding)' is involved in transcriptionally silencing one of the pair of X chromosomes [33]. There are, however, potentially thousands of lncRNAs, and for the vast majority their function remains unresolved. Where lncRNAs reside on the opposite strand to a protein-coding gene, it is thought that they could potentially regulate the expression of the coding gene [34]. These antisense transcripts are named using the approved HGNC symbol for the protein-coding gene with the suffix '-AS' for 'antisense'; the lncRNA gene on the opposite strand to the BOK gene is 'BOK-AS1' for 'BOK antisense RNA 1 (non-protein coding)'. Likewise, those lncRNA genes that reside entirely within an intron of a protein-coding gene are symbolised by the suffix '-IT' for 'intronic transcript' (eg 'MAGI2-IT1' for 'MAGI2 intronic transcript 1 (non-protein coding)'). There are also lncRNAs that are postulated to function only as transcriptional apparatus for the expression of small ncRNA genes nested within their introns. These 'host genes' are named with the suffix 'HG' (eg 'SNHG1' for 'small nucleolar RNA host gene 1 (non-protein coding)'). A small number of lncRNAs share homology with each other and are named as paralogues (eg TTTY1A and TTTY1B). Many transcripts do not fit any of these scenarios, that is: the function of the mature transcript is unknown; they are not proximal to a protein-coding gene; and they are not a member of a homologous family. Such 'orphan' ncRNA genes were previously all named with the anonymous stem symbol NCRNA# (eg 'NCRNA00029') but, in collaboration with the lncRNA database lncRNAdb (http://www.lncrnadb.org)[35] and the Vertebrate Genome Annotation (VEGA; http://vega.sanger.ac.uk/) team,[36] HGNC has recently decided to name these intergenic lncRNA genes with the symbol 'LINC#' for 'long intergenic non-protein coding RNA #'.
RNA nomenclature across species
Where an equivalent orthologous ncRNA gene can be shown to exist in another species, the human RNA gene nomenclature could be transferred directly to the other species and, indeed, this is already happening for highly conserved classes of small RNAs, such as the microRNAs. For example, mouse Mir100 is orthologous to human MIR100. The nomenclature for other ncRNA classes that have greatly diverged across genomes will need careful annotation and may require species-specific nomenclature.
Conclusion
Recent years have shown us that the human genome does not comprise transcriptional deserts of 'junk' DNA lying between protein-coding genes, but rather that these regions encode thousands of transcribed ncRNAs that may play crucial roles in vital biological processes. As a result, interest in these RNAs is growing quickly, and HGNC aims to keep apace with the discovery of new ncRNA classes to ensure it can provide a robust and systematic nomenclature for these intriguing genes. All human RNA genes named to date can be found at the HGNC RNA webpage (http://www.genenames.org/rna).
Declarations
Acknowledgements
A number of researchers have provided invaluable support in naming various classes of RNAs: Sam Griffiths-Jones with miRNAs, Michel Weber with snoRNAs, Todd Lowe with tRNAs, Peter Stadler with U7 and vault RNAs, Shipra Agrawal with piRNAs and John Mattick with lncRNAs. Particular thanks go to former HGNC team member Kate Sneddon, and also to current colleagues Ruth Seal, Susan Gordon, Michael Lush and Louise Daugherty.
The work of HGNC is supported by National Human Genome Research Institute (NHGRI) grant P41 HG03345 and Wellcome Trust grant 081979/Z/07/Z.
Watanabe H, Fujiyama A, Hattori M, Taylor TD, et al: DNA sequence and comparative analysis of chimpanzee chromosome 22. Nature. 2004, 429: 382-388. 10.1038/nature02564.Google Scholar
Birney E, Stamatoyannopoulos JA, Dutta A, Guigo R, et al: Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007, 447: 799-816. 10.1038/nature05874.Google Scholar
Taft RJ, Pang KC, Mercer TR, Dinger M, et al: Non-coding RNAs: Regulators of disease. J Pathol. 2010, 220: 126-139. 10.1002/path.2638.Google Scholar
Seal RL, Gordon SM, Lush MJ, Wright MW, Bruford EA: genenames.org: The HGNC resources in 2011. Nucleic Acids Res. 2011, D514-D519. 39 Database
Lee RC, Feinbaum RL, Ambros V: The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993, 75: 843-854. 10.1016/0092-8674(93)90529-Y.Google Scholar
Lagos-Quintana M, Rauhut R, Lendeckel W, Tuschl T: Identification of novel genes coding for small expressed RNAs. Science. 2001, 294: 853-858. 10.1126/science.1064921.Google Scholar
Bartel DP: MicroRNAs: Target recognition and regulatory functions. Cell. 2009, 136: 215-233. 10.1016/j.cell.2009.01.002.Google Scholar
Mencia A, Modamio-Hoybjor S, Redshaw N, Morin M, et al: Mutations in the seed region of human miR-96 are responsible for non-syndromic progressive hearing loss. Nat Genet. 2009, 41: 609-613. 10.1038/ng.355.Google Scholar
Kozomara A, Griffiths-Jones S: miRBase: integrating microRNA annotation and deep-sequencing date. Nucleic Acids Res. 2011, D152-D157. 39 Database
Chan PP, Lowe TM: GtRNAdb: A database of transfer RNA genes detected in genomic sequence. Nucleic Acids Res. 2009, D93-D97, PMID: 18984615. 37 Database
Lowe TM, Eddy SR: tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997, 25: 955-964.Google Scholar
Helm M, Brule H, Friede D, Giege R: Search for characteristic structural features of mammalian mitochondrial tRNAs. RNA. 2000, 6: 1356-1379. 10.1017/S1355838200001047.Google Scholar
Putz J, Dupuis B, Sissler M, Florentz C: Mamit-tRNA, a database of mammalian mitochondrial tRNA primary and secondary structures. RNA. 2007, 13: 1184-1190. 10.1261/rna.588407.Google Scholar
Korostelev A, Ermolenko DN, Noller HF: Structural dynamics of the ribosome. Curr Opin Chem Biol. 2008, 12: 674-683. 10.1016/j.cbpa.2008.08.037.Google Scholar
Hillis DM, Dixon MT: Ribosomal DNA: Molecular evolution and phylogenetic inference. Q Rev Biol. 1991, 66: 411-453. 10.1086/417338.Google Scholar
Pruesse E, Quast C, Knittel K, Fuchs BM, et al: SILVA: A comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res. 2007, 35: 7188-7196. 10.1093/nar/gkm864.Google Scholar
Ritchie DB, Schellenberg MJ, MacMillan AM: Spliceosome structure: Piece by piece. Biochim Biophys Acta. 2009, 1789: 624-633. 10.1016/j.bbagrm.2009.08.010.Google Scholar
Will CL, Luhrmann R: Splicing of a rare class of introns by the U12-dependent spliceosome. Biol Chem. 2005, 386: 713-724.Google Scholar
Decatur WA, Fournier MJ: RNA-guided nucleotide modification of ribosomal and other RNAs. J Biol Chem. 2003, 278: 695-698. 10.1074/jbc.R200023200.Google Scholar
Lestrade L, Weber MJ: snoRNA-LBME-db, a comprehensive database of human H/ACA and C/D box snoRNAs. Nucleic Acids Res. 2006, D158-D162. 34 Database
Darzacq X, Jady BE, Verheggen C, Kiss AM, et al: Cajal body-specific small nuclear RNAs: A novel class of 2'-O-methylation and pseudouridylation guide RNAs. EMBO J. 2002, 21: 2746-2756. 10.1093/emboj/21.11.2746.Google Scholar
Lin H: piRNAs in the germ line. Science. 2007, 316: 397-10.1126/science.1137543.Google Scholar
Sai Lakshmi S, Agrawal S: piRNABank: A web resource on classified and clustered Piwi-interacting RNAs. Nucleic Acids Res. 2008, D173-D177. 36 Database
Reiner R, Ben-Asouli Y, Krilovetzky I, Jarrous N: A role for the catalytic ribonucleoprotein RNase P in RNA polymerase III transcription. Genes Dev. 2006, 20: 1621-1635. 10.1101/gad.386706.Google Scholar
Esakova O, Krasilnikov AS: Of proteins and RNA: The RNase P/MRP family. RNA. 2010, 16: 1725-1747. 10.1261/rna.2214510.Google Scholar
Mowry KL, Steitz JA: Identification of the human U7 snRNP as one of several factors involved in the 3' end maturation of histone premessenger RNA's. Science. 1987, 238: 1682-1687. 10.1126/science.2825355.Google Scholar
Marz M, Mosig A, Stadler BM, Stadler PF: U7 snRNAs: A computational survey. Genomics Proteomics Bioinformatics. 2007, 5: 187-195. 10.1016/S1672-0229(08)60006-6.Google Scholar
Stadler PF, Chen JJ, Hackermuller J, Hoffmann S, et al: Evolution of vault RNAs. Mol Biol Evol. 2009, 26: 1975-1991. 10.1093/molbev/msp112.Google Scholar
Steiner E, Holzmann K, Elbling L, Micksche M, et al: Cellular functions of vaults and their involvement in multidrug resistance. Curr Drug Targets. 2006, 7: 923-934. 10.2174/138945006778019345.Google Scholar
Peterlin BM, Price DH: Controlling the elongation phase of transcription with P-TEFb. Mol Cell. 2006, 23: 297-305. 10.1016/j.molcel.2006.06.014.Google Scholar
Feng J, Funk WD, Wang SS, Weinrich SL, et al: The RNA component of human telomerase. Science. 1995, 269: 1236-1241. 10.1126/science.7544491.Google Scholar
Khachane AN, Harrison PM: Mining mammalian transcript data for functional long non-coding RNAs. PLoS One. 2010, 5: e10316-10.1371/journal.pone.0010316.Google Scholar
Brown CJ, Ballabio A, Rupert JL, Lafreniere RG, et al: A gene from the region of the human X inactivation centre is expressed exclusively from the inactive X chromosome. Nature. 1991, 349: 38-44. 10.1038/349038a0.Google Scholar
Faghihi MA, Wahlestedt C: Regulatory roles of natural antisense transcripts. Nat Rev Mol Cell Biol. 2009, 10: 637-643. 10.1038/nrm2738.Google Scholar
Amaral PP, Clark MB, Gascoigne DK, Dinger ME, Mattick JS: lncRNAdb: A reference database for long noncoding RNAs. Nucleic Acids Res. 2011, D146-D151. 39 Database
Wilming LG, Gilbert JG, Howe K, Trevanion S, et al: The vertebrate genome annotation (Vega) database. Nucleic Acids Res. 2008, D753-D760. 36 Database
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright BioMed Central 2011
Abstract
Previously, the majority of the human genome was thought to be 'junk' DNA with no functional purpose. Over the past decade, the field of RNA research has rapidly expanded, with a concomitant increase in the number of non-protein coding RNA (ncRNA) genes identified in this 'junk'. Many of the encoded ncRNAs have already been shown to be essential for a variety of vital functions, and this wealth of annotated human ncRNAs requires standardised naming in order to aid effective communication. The HUGO Gene Nomenclature Committee (HGNC) is the only organisation authorised to assign standardised nomenclature to human genes. Of the 30,000 approved gene symbols currently listed in the HGNC database (http://www.genenames.org/search), the majority represent protein-coding genes; however, they also include pseudogenes, phenotypic loci and some genomic features. In recent years the list has also increased to include almost 3,000 named human ncRNA genes. HGNC is actively engaging with the RNA research community in order to provide unique symbols and names for each sequence that encodes an ncRNA. Most of the classical small ncRNA genes have now been provided with a unique nomenclature, and work on naming the long (> 200 nucleotides) non-coding RNAs (lncRNAs) is ongoing.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer