Back to Search Start Over

From identification to validation to gene count

Authors :
Amid, Clara
Frankish, Adam
HAVANA
Aken, Bronwen
Ezkurdia, Iakes
Kokocinski, Felix
Gilbert, James
White, Simon
Carninci, Piero
Gingeras, Thomas R.
Guigó Serra, Roderic
Searle, Stephen
Tress, Michael
Harrow, Jennifer
Hubbard, Tim J.
Source :
Genome Biology
Publication Year :
2010
Publisher :
BioMed Central, 2010.

Abstract

The current GENCODE gene count of ~ 30,000, including 21,727 protein-coding and 8,483 RNA genes, is significantly lower than the 100,000 genes anticipated by early estimates. Accurate annotation of protein-coding and non-coding genes and pseudogenes is essential in calculating the true gene count and gaining insight into human evolution. As part of the GENCODE Consortium, the HAVANA team produces high quality manual gene annotation, which forms the basis for the reference gene set being used by the ENCODE project and provides a rich annotation of alternative splice variants and assignment of functional potential. However, the protein-coding potential of some splice variants is uncertain and valid splice variants can remain unannotated if they are absent from current cDNA libraries. Recent technological developments in sequencing and mass spectrometry have created a vast amount of new transcript and protein data that facilitate the identification and validation of new and existing transcripts, while harboring their own limitations and problems.

Details

Language :
English
ISSN :
14656914 and 14656906
Volume :
11
Issue :
Suppl 1
Database :
OpenAIRE
Journal :
Genome Biology
Accession number :
edsair.doi.dedup.....47fdd787c7351cce9b98c1dcc78d01b1