
Automatic Generation of Functional Annotation Rules Using Inferred GO-Domain Associations

Authors :
Alborzi, Seyed Ziaeddin
Devignes, Marie-Dominique
Aridhi, Sabeur
Saidi, Rabie
Renaux, Alexandre
Martin, Maria J.
Ritchie, David
Computational Algorithms for Protein Structures and Interactions (CAPSID)
Inria Nancy - Grand Est
Institut National de Recherche en Informatique et en Automatique (Inria)-Department of Complex Systems, Artificial Intelligence & Robotics (LORIA - AIS)
Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA)
Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA)
Institut National de Recherche en Informatique et en Automatique (Inria)-Université de Lorraine (UL)-Centre National de la Recherche Scientifique (CNRS)
European Bioinformatics Institute [Hinxton] (EMBL-EBI)
EMBL Heidelberg
biofunctionprediction.org
Source :
Function-SIG ISMB/ECCB 2017, biofunctionprediction.org, Jul 2017, Prague, Czech Republic
Publication Year :
2017
Publisher :
HAL CCSD, 2017.

Abstract

The Gene Ontology (GO) is widely used for the functional annotation of genes and proteins. It describes biological processes (BP), molecular functions (MF), and cellular components (CC) in three distinct hierarchical controlled vocabularies. At the molecular level, functions are often performed by highly conserved parts of proteins, which are identified by sequence or structure alignments and classified into domains or families (SCOP, CATH, Pfam, TIGRFAMs, etc.). The InterPro database provides a valuable integrated classification of protein sequences and domains that is linked to nearly all other existing classifications. Several InterPro families have been manually annotated with GO terms using expert knowledge and the literature. However, the list of such annotations is incomplete: only 20% of Pfam domains and families have an MF GO annotation. We therefore developed the GODomainMiner approach to expand the available functional annotations of protein domains and families. Building on our ECDomainMiner approach, GODomainMiner uses the known associations of protein sequences with GO terms, and of protein sequences with protein domains, to infer direct associations between GO terms and protein domains. Finally, we used the calculated GO-Domain associations to devise a systematic method, called AutoProf-Annotator, for generating high-confidence rules for protein sequence (or structure) annotation.
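
The inference step described in the abstract (combining protein-to-GO and protein-to-domain associations to obtain domain-to-GO associations) can be illustrated with a small sketch. This is not the GODomainMiner algorithm, whose details the abstract does not give; a plain co-occurrence score is swapped in purely for illustration. The function names, the thresholds `min_support` and `min_confidence`, and all identifiers in the toy data are hypothetical.

```python
from collections import defaultdict


def infer_go_domain_associations(protein_go, protein_domains,
                                 min_support=2, min_confidence=0.8):
    """Score (domain, GO term) pairs by co-occurrence on the same protein.

    Illustrative only: the thresholds and the scoring are assumptions,
    not the method used by GODomainMiner.
    """
    domain_count = defaultdict(int)  # annotated proteins carrying each domain
    pair_count = defaultdict(int)    # proteins carrying both domain and GO term

    # Only proteins present in both mappings can contribute evidence.
    for protein in protein_go.keys() & protein_domains.keys():
        for domain in protein_domains[protein]:
            domain_count[domain] += 1
            for go_term in protein_go[protein]:
                pair_count[(domain, go_term)] += 1

    associations = {}
    for (domain, go_term), n in pair_count.items():
        confidence = n / domain_count[domain]
        if n >= min_support and confidence >= min_confidence:
            associations[(domain, go_term)] = confidence
    return associations


def annotate(domains, associations):
    """Apply the inferred pairs as simple 'has domain => GO term' rules."""
    return {go_term for (domain, go_term) in associations if domain in domains}


# Toy data with made-up identifiers (not real UniProt, Pfam, or GO entries).
protein_go = {"P1": {"GO_1"}, "P2": {"GO_1", "GO_2"}, "P3": {"GO_2"}}
protein_domains = {"P1": {"DOM_A"}, "P2": {"DOM_A", "DOM_B"}, "P3": {"DOM_B"}}

associations = infer_go_domain_associations(protein_go, protein_domains)
predicted = annotate({"DOM_A"}, associations)
```

On the toy data, only the pairs that co-occur on every protein carrying the domain pass both thresholds, so a new sequence containing `DOM_A` would receive `GO_1`. A rule generator in the spirit of AutoProf-Annotator would presumably apply stricter, curated confidence criteria than this sketch.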

Details

Language :
English
Database :
OpenAIRE
Journal :
Function-SIG ISMB/ECCB 2017, biofunctionprediction.org, Jul 2017, Prague, Czech Republic
Accession number :
edsair.dedup.wf.001..e6c1ab4040379cd019d95daf9a560abf