Gerwald A. Köhler, Ursula Oberholzer, Carol A. Munro, Alexander D. Johnson, Jan Ihmels, Martine Raymond, Dominique Sanglard, Anastasia Levitin, George Newport, Jan Dungan, M. Andrew Uhl, Catherine Bachewich, Edwin Wang, Sadri Znaidi, Michael C. Lorenz, Anne Marcil, Steve Bates, Hervé Hogues, Gavin Sherlock, Bernard Turcotte, Aaron P. Mitchell, Marco van het Hoog, Burkhard R. Braun, Tatiana Iouk, Kim Rutherford, Lionel Frangeul, Lois L. Hoyer, Judith Berman, Fredj Tekaia, Daniel Dignard, Mikhail Martchenko, Malcolm Whiteway, Christophe d'Enfert, Neil A. R. Gow, Nina Agabian, Rosa Zito, Joachim Morschhäuser, Diane O. Inglis, Alan Kuo, Matthew Berriman, André Nantel, Maria C. Costanzo, Doreen Harcus, Department of Microbiology and Immunology, University of California, Biotechnology Research Institute (Environmental Biotechnology Sector), National Research Council of Canada (NRC), Biologie et Pathogénicité fongiques, Institut Pasteur [Paris]-Institut National de la Recherche Agronomique (INRA), Department of stomatology, University of São Paulo (USP), The Wellcome Trust Sanger Institute [Cambridge], University of Texas, Intégration et Analyse Génomique (Plate-Forme 4) (PF4), Institut Pasteur [Paris], Génétique Moléculaire des Levures, Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut Pasteur [Paris]-Centre National de la Recherche Scientifique (CNRS), School of Medical Sciences, University of Aberdeen, University of Illinois System, University of Würzburg, Université de Montréal (UdeM), McGill University = Université McGill [Montréal, Canada], Sanford University School of Medecine, Partenaires INRAE, Department of Molecular Genetics, Weizmann Institute of Science [Rehovot, Israël], Department of Genetics, Cell Biology and Development, University of Minnesota [Twin Cities] (UMN), University of Minnesota System-University of Minnesota System, Institut de Microbiologie, Centre Hospitalier Universitaire Vaudois [Lausanne] (CHUV), Columbia University [New York], University of California [San Francisco] (UCSF), University of California (UC), Biologie et Pathogénicité fongiques (BPF), Institut National de la Recherche Agronomique (INRA)-Institut Pasteur [Paris] (IP), Universidade de São Paulo = University of São Paulo (USP), Institut Pasteur [Paris] (IP), Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut Pasteur [Paris] (IP)-Centre National de la Recherche Scientifique (CNRS), University of California [San Francisco] (UC San Francisco), and Snyder, Michael
Recent sequencing and assembly of the genome for the fungal pathogen Candida albicans used simple automated procedures for the identification of putative genes. We have reviewed the entire assembly, both by hand and with additional bioinformatic resources, to accurately map and describe 6,354 genes and to identify 246 genes whose original database entries contained sequencing errors (or possibly mutations) that affect their reading frame. Comparison with other fungal genomes permitted the identification of numerous fungus-specific genes that might be targeted for antifungal therapy. We also observed that, compared to other fungi, the protein-coding sequences in the C. albicans genome are especially rich in short sequence repeats. Finally, our improved annotation permitted a detailed analysis of several multigene families, and comparative genomic studies showed that C. albicans has a far greater catabolic range, encoding respiratory Complex 1, several novel oxidoreductases and ketone body degrading enzymes, malonyl-CoA and enoyl-CoA carriers, several novel amino acid degrading enzymes, a variety of secreted catabolic lipases and proteases, and numerous transporters to assimilate the resulting nutrients. The results of these efforts will ensure that the Candida research community has uniform and comprehensive genomic information for medical research as well as for future diagnostic and therapeutic applications., Synopsis Candida albicans is a commonly encountered fungal pathogen usually responsible for superficial infections (thrush and vaginitis). However, an estimated 30% of severe fungal infections, most due to Candida, result in death. Those who are most at risk include individuals taking immune-suppressive drugs following organ transplantation, people with HIV infection, premature infants, and cancer patients undergoing chemotherapy. Current therapies for this pathogen are made more difficult by the significant secondary effects of anti-fungal drugs that target proteins that are also found in the human host. Recent sequencing and assembly of the genome for the fungal pathogen C. albicans used simple automated procedures for the identification of putative genes. Here, we report a detailed annotation of the 6,354 genes that are present in the genome sequence of this organism, essentially writing the dictionary of the C. albicans genome. Comparison with other fungal genomes permitted the identification of numerous fungus-specific genes that are absent from the human genome and whose products might be targeted for antifungal therapy. The results of these efforts will thus ensure that the Candida research community has uniform and comprehensive genomic information for medical research, for the development of functional genomic tools as well as for future diagnostic and therapeutic applications.