Current status and new features of the Consensus Coding Sequence database

Authors :
Catherine Snow
Robert Baertsch
Marie-Marthe Suner
Shashikant Pujar
Susan M. Hiatt
Toby Hunt
Tim Hubbard
José M. González
Wendy Wu
Lillian D. Riddick
Kim D. Pruitt
M. Kay
Janet Weber
David Haussler
Garth Brown
Nuala A. O'Leary
Adam Frankish
Jennifer Harrow
James G. R. Gilbert
Bronwen Aken
Ruth Bennett
Jeena Rajan
Andrei Shkeda
Jonathan M. Mudge
Laurens G. Wilming
Stephen J. Trevanion
Kelly M. McGarvey
Pamela Tamez
Jennifer Hart
Mark Diekhans
Stephen M. J. Searle
James Ostell
Bhanu Rajput
Rachel A. Harte
Craig Wallin
Michael R. Murphy
Mark G. Thomas
Charles A. Steward
Jane E. Loveland
Catherine M. Farrell
Sanjida H. Rangwala
Daniel Barrell
David Webb
Source :
Nucleic Acids Research, vol 42, iss Database issue, Europe PubMed Central
Publication Year :
2013

Abstract

The Consensus Coding Sequence (CCDS) project (http://www.ncbi.nlm.nih.gov/CCDS/) is a collaborative effort to maintain a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assemblies by the National Center for Biotechnology Information (NCBI) and Ensembl genome annotation pipelines. Identical annotations that pass quality assurance tests are tracked with a stable identifier (CCDS ID). Members of the collaboration, who are from NCBI, the Wellcome Trust Sanger Institute and the University of California Santa Cruz, provide coordinated and continuous review of the dataset to ensure high-quality CCDS representations. We describe here the current status and recent growth in the CCDS dataset, as well as recent changes to the CCDS web and FTP sites. These changes include more explicit reporting about the NCBI and Ensembl annotation releases being compared, new search and display options, the addition of biologically descriptive information and our approach to representing genes for which support evidence is incomplete. We also present a summary of recent and future curation targets.

Details

ISSN :
1362-4962
Volume :
42
Database :
OpenAIRE
Journal :
Nucleic acids research
Accession number :
edsair.doi.dedup.....4da6021ec661ab2f93e222547170d758