Back to Search
Start Over
From identification to validation to gene count
- Source :
- Genome Biology
- Publication Year :
- 2010
- Publisher :
- BioMed Central, 2010.
-
Abstract
- The current GENCODE gene count of ~ 30,000, including 21,727 protein-coding and 8,483 RNA genes, is significantly lower than the 100,000 genes anticipated by early estimates. Accurate annotation of protein-coding and non-coding genes and pseudogenes is essential in calculating the true gene count and gaining insight into human evolution. As part of the GENCODE Consortium, the HAVANA team produces high quality manual gene annotation, which forms the basis for the reference gene set being used by the ENCODE project and provides a rich annotation of alternative splice variants and assignment of functional potential. However, the protein-coding potential of some splice variants is uncertain and valid splice variants can remain unannotated if they are absent from current cDNA libraries. Recent technological developments in sequencing and mass spectrometry have created a vast amount of new transcript and protein data that facilitate the identification and validation of new and existing transcripts, while harboring their own limitations and problems.
- Subjects :
- Genetics
0303 health sciences
GENCODE
Pseudogene
Alternative splicing
Selected Oral Presentation
Gene Annotation
Biology
ENCODE
RNA missatger
03 medical and health sciences
Annotation
0302 clinical medicine
030220 oncology & carcinogenesis
splice
Gene
Proteïnes
Genètica
030304 developmental biology
Subjects
Details
- Language :
- English
- ISSN :
- 14656914 and 14656906
- Volume :
- 11
- Issue :
- Suppl 1
- Database :
- OpenAIRE
- Journal :
- Genome Biology
- Accession number :
- edsair.doi.dedup.....47fdd787c7351cce9b98c1dcc78d01b1