19 results on '"Arighi A."'
Search Results
2. Utility of visual rating scales in primary progressive aphasia
- Author
-
Falgàs, Neus, Sacchi, Luca, Carandini, Tiziana, Montagut, Nuria, Conte, Giorgio, Triulzi, Fabio, Galimberti, Daniela, Arighi, Andrea, Sanchez-Valle, Raquel, and Fumagalli, Giorgio Giulio
- Published
- 2024
- Full Text
- View/download PDF
3. A systematic review of progranulin concentrations in biofluids in over 7,000 people—assessing the pathogenicity of GRN mutations and other influencing factors
- Author
-
Swift, Imogen J., Rademakers, Rosa, Finch, NiCole, Baker, Matt, Ghidoni, Roberta, Benussi, Luisa, Binetti, Giuliano, Rossi, Giacomina, Synofzik, Matthis, Wilke, Carlo, Mengel, David, Graff, Caroline, Takada, Leonel T., Sánchez-Valle, Raquel, Antonell, Anna, Galimberti, Daniela, Fenoglio, Chiara, Serpente, Maria, Arcaro, Marina, Schreiber, Stefanie, Vielhaber, Stefan, Arndt, Philipp, Santana, Isabel, Almeida, Maria Rosario, Moreno, Fermín, Barandiaran, Myriam, Gabilondo, Alazne, Stubert, Johannes, Gómez-Tortosa, Estrella, Agüero, Pablo, Sainz, M. José, Gohda, Tomohito, Murakoshi, Maki, Kamei, Nozomu, Kittel-Schneider, Sarah, Reif, Andreas, Weigl, Johannes, Jian, Jinlong, Liu, Chuanju, Serrero, Ginette, Greither, Thomas, Theil, Gerit, Lohmann, Ebba, Gazzina, Stefano, Bagnoli, Silvia, Coppola, Giovanni, Bruni, Amalia, Quante, Mirja, Kiess, Wieland, Hiemisch, Andreas, Jurkutat, Anne, Block, Matthew S., Carlson, Aaron M., Bråthen, Geir, Sando, Sigrid Botne, Grøntvedt, Gøril Rolfseng, Lauridsen, Camilla, Heslegrave, Amanda, Heller, Carolin, Abel, Emily, Gómez-Núñez, Alba, Puey, Roger, Arighi, Andrea, Rotondo, Enmanuela, Jiskoot, Lize C., Meeter, Lieke H. H., Durães, João, Lima, Marisa, Tábuas-Pereira, Miguel, Lemos, João, Boeve, Bradley, Petersen, Ronald C., Dickson, Dennis W., Graff-Radford, Neill R., LeBer, Isabelle, Sellami, Leila, Lamari, Foudil, Clot, Fabienne, Borroni, Barbara, Cantoni, Valentina, Rivolta, Jasmine, Lleó, Alberto, Fortea, Juan, Alcolea, Daniel, Illán-Gala, Ignacio, Andres-Cerezo, Lucie, Van Damme, Philip, Clarimon, Jordi, Steinacker, Petra, Feneberg, Emily, Otto, Markus, van der Ende, Emma L., van Swieten, John C., Seelaar, Harro, Zetterberg, Henrik, Sogorb-Esteve, Aitana, and Rohrer, Jonathan D.
- Published
- 2024
- Full Text
- View/download PDF
4. Correction: Italian adaptation of the Uniform Data Set Neuropsychological Test Battery (I‑UDSNB 1.0): development and normative data
- Author
-
Conca, Francesca, Esposito, Valentina, Rundo, Francesco, Quaranta, Davide, Muscio, Cristina, Manenti, Rosa, Caruso, Giulia, Lucca, Ugo, Galbussera, Alessia Antonella, Di Tella, Sonia, Baglio, Francesca, L’Abbate, Federica, Canu, Elisa, Catania, Valentina, Filippi, Massimo, Mattavelli, Giulia, Poletti, Barbara, Silani, Vincenzo, Lodi, Raffaele, De Matteis, Maddalena, Maserati, Michelangelo Stanzani, Arighi, Andrea, Rotondo, Emanuela, Tanzilli, Antonio, Pace, Andrea, Garramone, Federica, Cavaliere, Carlo, Pardini, Matteo, Rizzetto, Cristiano, Sorbi, Sandro, Perri, Roberta, Tiraboschi, Pietro, Canessa, Nicola, Cotelli, Maria, Ferri, Raffaele, Weintraub, Sandra, Marra, Camillo, Tagliavini, Fabrizio, Catricalà, Eleonora, and Cappa, Stefano Francesco
- Published
- 2023
- Full Text
- View/download PDF
5. Aquaporin-4 cerebrospinal fluid levels are higher in neurodegenerative dementia: looking at glymphatic system dysregulation
- Author
-
Arighi, Andrea, Arcaro, Marina, Fumagalli, Giorgio Giulio, Carandini, Tiziana, Pietroboni, Anna Margherita, Sacchi, Luca, Fenoglio, Chiara, Serpente, Maria, Sorrentino, Federica, Isgrò, Giovanni, Turkheimer, Federico, Scarpini, Elio, and Galimberti, Daniela
- Published
- 2022
- Full Text
- View/download PDF
6. Amyloid PET imaging and dementias: potential applications in detecting and quantifying early white matter damage
- Author
-
Pietroboni, Anna M., Colombi, Annalisa, Carandini, Tiziana, Sacchi, Luca, Fenoglio, Chiara, Marotta, Giorgio, Arighi, Andrea, De Riz, Milena A., Fumagalli, Giorgio G., Castellani, Massimo, Bozzali, Marco, Scarpini, Elio, and Galimberti, Daniela
- Published
- 2022
- Full Text
- View/download PDF
7. Italian adaptation of the Uniform Data Set Neuropsychological Test Battery (I-UDSNB 1.0): development and normative data
- Author
-
Conca, Francesca, Esposito, Valentina, Rundo, Francesco, Quaranta, Davide, Muscio, Cristina, Manenti, Rosa, Caruso, Giulia, Lucca, Ugo, Galbussera, Alessia Antonella, Di Tella, Sonia, Baglio, Francesca, L’Abbate, Federica, Canu, Elisa, Catania, Valentina, Filippi, Massimo, Mattavelli, Giulia, Poletti, Barbara, Silani, Vincenzo, Lodi, Raffaele, De Matteis, Maddalena, Stanzani Maserati, Michelangelo, Arighi, Andrea, Rotondo, Emanuela, Tanzilli, Antonio, Pace, Andrea, Garramone, Federica, Cavaliere, Carlo, Pardini, Matteo, Rizzetto, Cristiano, Sorbi, Sandro, Perri, Roberta, Tiraboschi, Pietro, Canessa, Nicola, Cotelli, Maria, Ferri, Raffaele, Weintraub, Sandra, Marra, Camillo, Tagliavini, Fabrizio, Catricalà, Eleonora, and Cappa, Stefano Francesco
- Published
- 2022
- Full Text
- View/download PDF
8. Testing the 2018 NIA-AA research framework in a retrospective large cohort of patients with cognitive impairment: from biological biomarkers to clinical syndromes
- Author
-
Carandini, Tiziana, Arighi, Andrea, Sacchi, Luca, Fumagalli, Giorgio G., Pietroboni, Anna M., Ghezzi, Laura, Colombi, Annalisa, Scarioni, Marta, Fenoglio, Chiara, De Riz, Milena A., Marotta, Giorgio, Scarpini, Elio, and Galimberti, Daniela
- Published
- 2019
- Full Text
- View/download PDF
9. Distinct patterns of brain atrophy in Genetic Frontotemporal Dementia Initiative (GENFI) cohort revealed by visual rating scales
- Author
-
Fumagalli, Giorgio G., Basilico, Paola, Arighi, Andrea, Bocchetta, Martina, Dick, Katrina M., Cash, David M., Harding, Sophie, Mercurio, Matteo, Fenoglio, Chiara, Pietroboni, Anna M., Ghezzi, Laura, van Swieten, John, Borroni, Barbara, de Mendonça, Alexandre, Masellis, Mario, Tartaglia, Maria C., Rowe, James B., Graff, Caroline, Tagliavini, Fabrizio, Frisoni, Giovanni B., Laforce, Jr, Robert, Finger, Elizabeth, Sorbi, Sandro, Scarpini, Elio, Rohrer, Jonathan D., Galimberti, Daniela, and on behalf of the Genetic FTD Initiative (GENFI)
- Published
- 2018
- Full Text
- View/download PDF
10. BioCreative III interactive task: an overview
- Author
-
Arighi, C, Roberts, P, Agarwal, S, Bhattacharya, S, Cesareni, G, Chatr-aryamontri, A, Clematide, S; https://orcid.org/0000-0003-1365-0662, Gaudet, P, Giglio, M, Harrow, I, Huala, E, Krallinger, M, Leser, U, Li, D, Liu, F, Lu, Z, Maltais, L, Okazaki, N, Perfetto, L, Rinaldi, Fabio; https://orcid.org/0000-0001-5718-5462, Saetre, R, Salgado, D, Srinivasan, P, Thomas, P, Toldo, L, Hirschman, L, Wu, C, Arighi, C, Roberts, P, Agarwal, S, Bhattacharya, S, Cesareni, G, Chatr-aryamontri, A, Clematide, S; https://orcid.org/0000-0003-1365-0662, Gaudet, P, Giglio, M, Harrow, I, Huala, E, Krallinger, M, Leser, U, Li, D, Liu, F, Lu, Z, Maltais, L, Okazaki, N, Perfetto, L, Rinaldi, Fabio; https://orcid.org/0000-0001-5718-5462, Saetre, R, Salgado, D, Srinivasan, P, Thomas, P, Toldo, L, Hirschman, L, and Wu, C
- Abstract
BACKGROUND: The BioCreative challenge evaluation is a community-wide effort for evaluating text mining and information extraction systems applied to the biological domain. The biocurator community, as an active user of biomedical literature, provides a diverse and engaged end user group for text mining tools. Earlier BioCreative challenges involved many text mining teams in developing basic capabilities relevant to biological curation, but they did not address the issues of system usage, insertion into the workflow and adoption by curators. Thus in BioCreative III (BC-III), the InterActive Task (IAT) was introduced to address the utility and usability of text mining tools for real-life biocuration tasks. To support the aims of the IAT in BC-III, involvement of both developers and end users was solicited, and the development of a user interface to address the tasks interactively was requested. RESULTS: A User Advisory Group (UAG) actively participated in the IAT design and assessment. The task focused on gene normalization (identifying gene mentions in the article and linking these genes to standard database identifiers), gene ranking based on the overall importance of each gene mentioned in the article, and gene-oriented document retrieval (identifying full text papers relevant to a selected gene). Six systems participated and all processed and displayed the same set of articles. The articles were selected based on content known to be problematic for curation, such as ambiguity of gene names, coverage of multiple genes and species, or introduction of a new gene name. Members of the UAG curated three articles for training and assessment purposes, and each member was assigned a system to review. A questionnaire related to the interface usability and task performance (as measured by precision and recall) was answered after systems were used to curate articles. Although the limited number of articles analyzed and users involved in the IAT experiment precluded rigorous q
- Published
- 2011
11. A framework for biomedical figure segmentation towards image-based document retrieval
- Author
-
Jingyi Yu, Cecilia N. Arighi, Manabu Torii, Cathy H. Wu, Luis D. Lopez, Catalina O. Tudor, Hongzhan Huang, and K. Vijay-Shanker
- Subjects
Information retrieval ,Biomedical Research ,Computer science ,Interface (Java) ,Document classification ,Applied Mathematics ,Research ,Computational Biology ,Information Storage and Retrieval ,Image processing ,Image segmentation ,computer.software_genre ,Computer Science Applications ,Computer graphics ,Modal ,Structural Biology ,Modeling and Simulation ,Modelling and Simulation ,Computer Graphics ,Image Processing, Computer-Assisted ,Segmentation ,Document retrieval ,computer ,Molecular Biology - Abstract
The figures included in many of the biomedical publications play an important role in understanding the biological experiments and facts described within. Recent studies have shown that it is possible to integrate the information that is extracted from figures in classical document classification and retrieval tasks in order to improve their accuracy. One important observation about the figures included in biomedical publications is that they are often composed of multiple subfigures or panels, each describing different methodologies or results. The use of these multimodal figures is a common practice in bioscience, as experimental results are graphically validated via multiple methodologies or procedures. Thus, for a better use of multimodal figures in document classification or retrieval tasks, as well as for providing the evidence source for derived assertions, it is important to automatically segment multimodal figures into subfigures and panels. This is a challenging task, however, as different panels can contain similar objects (i.e., barcharts and linecharts) with multiple layouts. Also, certain types of biomedical figures are text-heavy (e.g., DNA sequences and protein sequences images) and they differ from traditional images. As a result, classical image segmentation techniques based on low-level image features, such as edges or color, are not directly applicable to robustly partition multimodal figures into single modal panels. In this paper, we describe a robust solution for automatically identifying and segmenting unimodal panels from a multimodal figure. Our framework starts by robustly harvesting figure-caption pairs from biomedical articles. We base our approach on the observation that the document layout can be used to identify encoded figures and figure boundaries within PDF files. Taking into consideration the document layout allows us to correctly extract figures from the PDF document and associate their corresponding caption. We combine pixel-level representations of the extracted images with information gathered from their corresponding captions to estimate the number of panels in the figure. Thus, our approach simultaneously identifies the number of panels and the layout of figures. In order to evaluate the approach described here, we applied our system on documents containing protein-protein interactions (PPIs) and compared the results against a gold standard that was annotated by biologists. Experimental results showed that our automatic figure segmentation approach surpasses pure caption-based and image-based approaches, achieving a 96.64% accuracy. To allow for efficient retrieval of information, as well as to provide the basis for integration into document classification and retrieval systems among other, we further developed a web-based interface that lets users easily retrieve panels containing the terms specified in the user queries.
- Published
- 2013
12. TGF-beta signaling proteins and the Protein Ontology
- Author
-
Barry Smith, Cathy H. Wu, Darren A. Natale, Hongfang Liu, Judith A. Blake, Harold J. Drabkin, Cecilia N. Arighi, and Winona C. Barker
- Subjects
Information Storage and Retrieval ,Computational biology ,Ontology (information science) ,Biology ,Proteomics ,computer.software_genre ,Biochemistry ,Open Biomedical Ontologies ,03 medical and health sciences ,Annotation ,User-Computer Interface ,0302 clinical medicine ,Structural Biology ,Transforming Growth Factor beta ,OBO Foundry ,Databases, Genetic ,Humans ,Databases, Protein ,Molecular Biology ,030304 developmental biology ,0303 health sciences ,Applied Mathematics ,Alternative splicing ,Intracellular Signaling Peptides and Proteins ,Computational Biology ,Structural Classification of Proteins database ,Computer Science Applications ,Proceedings ,Data mining ,DNA microarray ,computer ,030217 neurology & neurosurgery - Abstract
Background The Protein Ontology (PRO) is designed as a formal and principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from a classification of proteins on the basis of evolutionary relationships at the homeomorphic level to the representation of the multiple protein forms of a gene, including those resulting from alternative splicing, cleavage and/or post-translational modifications. Focusing specifically on the TGF-beta signaling proteins, we describe the building, curation, usage and dissemination of PRO. Results PRO is manually curated on the basis of PrePRO, an automatically generated file with content derived from standard protein data sources. Manual curation ensures that the treatment of the protein classes and the internal and external relationships conform to the PRO framework. The current release of PRO is based upon experimental data from mouse and human proteins wherein equivalent protein forms are represented by single terms. In addition to the PRO ontology, the annotation of PRO terms is released as a separate PRO association file, which contains, for each given PRO term, an annotation from the experimentally characterized sub-types as well as the corresponding database identifiers and sequence coordinates. The annotations are added in the form of relationship to other ontologies. Whenever possible, equivalent forms in other species are listed to facilitate cross-species comparison. Splice and allelic variants, gene fusion products and modified protein forms are all represented as entities in the ontology. Therefore, PRO provides for the representation of protein entities and a resource for describing the associated data. This makes PRO useful both for proteomics studies where isoforms and modified forms must be differentiated, and for studies of biological pathways, where representations need to take account of the different ways in which the cascade of events may depend on specific protein modifications. Conclusion PRO provides a framework for the formal representation of protein classes and protein forms in the OBO Foundry. It is designed to enable data retrieval and integration and machine reasoning at the molecular level of proteins, thereby facilitating cross-species comparisons, pathway analysis, disease modeling and the generation of new hypotheses.
- Published
- 2009
13. Framework for a Protein Ontology
- Author
-
Cathy H. Wu, Zhang-Zhi Hu, Darren A. Natale, Barry Smith, Winona C. Barker, Judith A. Blake, Hongfang Liu, Cecilia N. Arighi, and Ti-Cheng Chang
- Subjects
Information Storage and Retrieval ,Sequence alignment ,Computational biology ,Biology ,Ontology (information science) ,Biochemistry ,Open Biomedical Ontologies ,Evolution, Molecular ,03 medical and health sciences ,User-Computer Interface ,0302 clinical medicine ,Disease Ontology ,Protein Annotation ,Structural Biology ,OBO Foundry ,Databases, Protein ,Gene ,Molecular Biology ,030304 developmental biology ,0303 health sciences ,SNOMED CT ,Applied Mathematics ,Alternative splicing ,Proteins ,Structural Classification of Proteins database ,Data science ,Computer Science Applications ,ComputingMethodologies_PATTERNRECOGNITION ,Proceedings ,030220 oncology & carcinogenesis ,Database Management Systems ,Sequence Alignment ,Sequence Analysis - Abstract
Biomedical ontologies are emerging as critical tools in genomic and proteomic research, where complex data in disparate resources need to be integrated. A number of ontologies describe properties that can be attributed to proteins. For example, protein functions are described by the Gene Ontology (GO) and human diseases by SNOMED CT or ICD10. There is, however, a gap in the current set of ontologies – one that describes the protein entities themselves and their relationships. We have designed the PRotein Ontology (PRO) to facilitate protein annotation and to guide new experiments. The components of PRO extend from the classification of proteins on the basis of evolutionary relationships to the representation of the multiple protein forms of a gene (products generated by genetic variation, alternative splicing, proteolytic cleavage, and other post-translational modifications). PRO will allow the specification of relationships between PRO, GO and other ontologies in the OBO Foundry. Here we describe the initial development of PRO, illustrated using human and mouse proteins involved in the transforming growth factor-beta and bone morphogenetic protein signaling pathways.
- Published
- 2007
14. A framework for biomedical figure segmentation towards image-based document retrieval.
- Author
-
Lopez, Luis D., Jingyi Yu, Arighi, Cecilia, Tudor, Catalina O., Manabu Torii, Hongzhan Huang, Vijay-Shanker, K., and Cathy Wu
- Abstract
The figures included in many of the biomedical publications play an important role in understanding the biological experiments and facts described within. Recent studies have shown that it is possible to integrate the information that is extracted from figures in classical document classification and retrieval tasks in order to improve their accuracy. One important observation about the figures included in biomedical publications is that they are often composed of multiple subfigures or panels, each describing different methodologies or results. The use of these multimodal figures is a common practice in bioscience, as experimental results are graphically validated via multiple methodologies or procedures. Thus, for a better use of multimodal figures in document classification or retrieval tasks, as well as for providing the evidence source for derived assertions, it is important to automatically segment multimodal figures into subfigures and panels. This is a challenging task, however, as different panels can contain similar objects (i.e., barcharts and linecharts) with multiple layouts. Also, certain types of biomedical figures are text-heavy (e.g., DNA sequences and protein sequences images) and they differ from traditional images. As a result, classical image segmentation techniques based on low-level image features, such as edges or color, are not directly applicable to robustly partition multimodal figures into single modal panels. In this paper, we describe a robust solution for automatically identifying and segmenting unimodal panels from a multimodal figure. Our framework starts by robustly harvesting figure-caption pairs from biomedical articles. We base our approach on the observation that the document layout can be used to identify encoded figures and figure boundaries within PDF files. Taking into consideration the document layout allows us to correctly extract figures from the PDF document and associate their corresponding caption. We combine pixel-level representations of the extracted images with information gathered from their corresponding captions to estimate the number of panels in the figure. Thus, our approach simultaneously identifies the number of panels and the layout of figures. In order to evaluate the approach described here, we applied our system on documents containing proteinprotein interactions (PPIs) and compared the results against a gold standard that was annotated by biologists. Experimental results showed that our automatic figure segmentation approach surpasses pure caption-based and image-based approaches, achieving a 96.64% accuracy. To allow for efficient retrieval of information, as well as to provide the basis for integration into document classification and retrieval systems among other, we further developed a web-based interface that lets users easily retrieve panels containing the terms specified in the user queries. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
15. BioCreative III interactive task: an overview.
- Author
-
Arighi, Cecilia N, Roberts, Phoebe M, Agarwal, Shashank, Bhattacharya, Sanmitra, Cesareni, Gianni, Chatr-aryamontri, Andrew, Clematide, Simon, Gaudet, Pascale, Giglio, Michelle Gwinn, Harrow, Ian, Huala, Eva, Krallinger, Martin, Leser, Ulf, Li, Donghui, Liu, Feifan, Lu, Zhiyong, Maltais, Lois J, Okazaki, Naoaki, Perfetto, Livia, and Rinaldi, Fabio
- Subjects
- *
TEXT mining , *INFORMATION retrieval , *METHODS engineering , *TASK analysis , *ACCESS to information - Abstract
Background: The BioCreative challenge evaluation is a community-wide effort for evaluating text mining and information extraction systems applied to the biological domain. The biocurator community, as an active user of biomedical literature, provides a diverse and engaged end user group for text mining tools. Earlier BioCreative challenges involved many text mining teams in developing basic capabilities relevant to biological curation, but they did not address the issues of system usage, insertion into the workflow and adoption by curators. Thus in BioCreative III (BC-III), the InterActive Task (IAT) was introduced to address the utility and usability of text mining tools for real-life biocuration tasks. To support the aims of the IAT in BC-III, involvement of both developers and end users was solicited, and the development of a user interface to address the tasks interactively was requested. Results: A User Advisory Group (UAG) actively participated in the IAT design and assessment. The task focused on gene normalization (identifying gene mentions in the article and linking these genes to standard database identifiers), gene ranking based on the overall importance of each gene mentioned in the article, and geneoriented document retrieval (identifying full text papers relevant to a selected gene). Six systems participated and all processed and displayed the same set of articles. The articles were selected based on content known to be problematic for curation, such as ambiguity of gene names, coverage of multiple genes and species, or introduction of a new gene name. Members of the UAG curated three articles for training and assessment purposes, and each member was assigned a system to review. A questionnaire related to the interface usability and task performance (as measured by precision and recall) was answered after systems were used to curate articles. Although the limited number of articles analyzed and users involved in the IAT experiment precluded rigorous quantitative analysis of the results, a qualitative analysis provided valuable insight into some of the problems encountered by users when using the systems. The overall assessment indicates that the system usability features appealed to most users, but the system performance was suboptimal (mainly due to low accuracy in gene normalization). Some of the issues included failure of species identification and gene name ambiguity in the gene normalization task leading to an extensive list of gene identifiers to review, which, in some cases, did not contain the relevant genes. The document retrieval suffered from the same shortfalls. The UAG favored achieving high performance (measured by precision and recall), but strongly recommended the addition of features that facilitate the identification of correct gene and its identifier, such as contextual information to assist in disambiguation. Discussion: The IAT was an informative exercise that advanced the dialog between curators and developers and increased the appreciation of challenges faced by each group. A major conclusion was that the intended users should be actively involved in every phase of software development, and this will be strongly encouraged in future tasks. The IAT Task provides the first steps toward the definition of metrics and functional requirements that are necessary for designing a formal evaluation of interactive curation systems in the BioCreative IV challenge. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
16. Overview of the BioCreative III Workshop.
- Author
-
Arighi, Cecilia N., Lu, Zhiyong, Krallinger, Martin, Cohen, Kevin B., Wilbur, W. John, Valencia, Alfonso, Hirschman, Lynette, and Wu, Cathy H.
- Subjects
- *
TEXT mining , *INFORMATION retrieval , *DATA mining , *ELECTRONIC data processing , *MEDICINE - Abstract
Background: The overall goal of the BioCreative Workshops is to promote the development of text mining and text processing tools which are useful to the communities of researchers and database curators in the biological sciences. To this end BioCreative I was held in 2004, BioCreative II in 2007, and BioCreative II.5 in 2009. Each of these workshops involved humanly annotated test data for several basic tasks in text mining applied to the biomedical literature. Participants in the workshops were invited to compete in the tasks by constructing software systems to perform the tasks automatically and were given scores based on their performance. The results of these workshops have benefited the community in several ways. They have 1) provided evidence for the most effective methods currently available to solve specific problems; 2) revealed the current state of the art for performance on those problems; 3) and provided gold standard data and results on that data by which future advances can be gauged. This special issue contains overview papers for the three tasks of BioCreative III. Results: The BioCreative III Workshop was held in September of 2010 and continued the tradition of a challenge evaluation on several tasks judged basic to effective text mining in biology, including a gene normalization (GN) task and two protein-protein interaction (PPI) tasks. In total the Workshop involved the work of twenty-three teams. Thirteen teams participated in the GN task which required the assignment of EntrezGene IDs to all named genes in full text papers without any species information being provided to a system. Ten teams participated in the PPI article classification task (ACT) requiring a system to classify and rank a PubMed® record as belonging to an article either having or not having "PPI relevant" information. Eight teams participated in the PPI interaction method task (IMT) where systems were given full text documents and were required to extract the experimental methods used to establish PPIs and a text segment supporting each such method. Gold standard data was compiled for each of these tasks and participants competed in developing systems to perform the tasks automatically. BioCreative III also introduced a new interactive task (IAT), run as a demonstration task. The goal was to develop an interactive system to facilitate a user's annotation of the unique database identifiers for all the genes appearing in an article. This task included ranking genes by importance (based preferably on the amount of described experimental information regarding genes). There was also an optional task to assist the user in finding the most relevant articles about a given gene. For BioCreative III, a user advisory group (UAG) was assembled and played an important role 1) in producing some of the gold standard annotations for the GN task, 2) in critiquing IAT systems,. 3) in providing guidance for a future more rigorous evaluation of IAT systems. Six teams participated in the IAT demonstration task and received feedback on their systems from the UAG group. Besides innovations in the GN and PPI tasks making them more realistic and practical and the introduction of the IAT task, discussions were begun on community data standards to promote interoperability and on user requirements and evaluation metrics to address utility and usability of systems. Conclusions: In this paper we give a brief history of the BioCreative Workshops and how they relate to other text mining competitions in biology. This is followed by a synopsis of the three tasks GN, PPI, and IAT in BioCreative III with figures for best participant performance on the GN and PPI tasks. These results are discussed and compared with results from previous BioCreative Workshops and we conclude that the best performing systems for GN, PPIACT and PPI-IMT in realistic settings are not sufficient for fully automatic use. This provides evidence for the importance of interactive systems and we present our vision of how best to construct an interactive system for a GN or PPI like task in the remainder of the paper. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
17. TGF-beta signaling proteins and the Protein Ontology.
- Author
-
Arighi, Cecilia N., Hongfang Liu, Natale, Darren A., Barker, Winona C., Drabkin, Harold, Blake, Judith A., Smith, Barry, and Wu, Cathy H.
- Subjects
- *
ONTOLOGY , *TRANSFORMING growth factors-beta , *PROTEINS , *RNA splicing , *GENE fusion , *PROTEOMICS - Abstract
Background: The Protein Ontology (PRO) is designed as a formal and principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from a classification of proteins on the basis of evolutionary relationships at the homeomorphic level to the representation of the multiple protein forms of a gene, including those resulting from alternative splicing, cleavage and/or posttranslational modifications. Focusing specifically on the TGF-beta signaling proteins, we describe the building, curation, usage and dissemination of PRO. Results: PRO is manually curated on the basis of PrePRO, an automatically generated file with content derived from standard protein data sources. Manual curation ensures that the treatment of the protein classes and the internal and external relationships conform to the PRO framework. The current release of PRO is based upon experimental data from mouse and human proteins wherein equivalent protein forms are represented by single terms. In addition to the PRO ontology, the annotation of PRO terms is released as a separate PRO association file, which contains, for each given PRO term, an annotation from the experimentally characterized sub-types as well as the corresponding database identifiers and sequence coordinates. The annotations are added in the form of relationship to other ontologies. Whenever possible, equivalent forms in other species are listed to facilitate cross-species comparison. Splice and allelic variants, gene fusion products and modified protein forms are all represented as entities in the ontology. Therefore, PRO provides for the representation of protein entities and a resource for describing the associated data. This makes PRO useful both for proteomics studies where isoforms and modified forms must be differentiated, and for studies of biological pathways, where representations need to take account of the different ways in which the cascade of events may depend on specific protein modifications. Conclusion: PRO provides a framework for the formal representation of protein classes and protein forms in the OBO Foundry. It is designed to enable data retrieval and integration and machine reasoning at the molecular level of proteins, thereby facilitating cross-species comparisons, pathway analysis, disease modeling and the generation of new hypotheses. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
18. An improved ontological representation of dendritic cells as a paradigm for all cell types.
- Author
-
Masci, Anna Maria, Arighi, Cecilia N., Diehl, Alexander D., Lieberman, Anne E., Mungall, Chris, Scheuermann, Richard H., Smith, Barry, and Cowell, Lindsay G.
- Subjects
- *
ONTOLOGY , *CYTOLOGICAL research , *DENDRITIC cells , *LYMPHOID tissue , *LIFE sciences , *BIOINFORMATICS - Abstract
Background: Recent increases in the volume and diversity of life science data and information and an increasing emphasis on data sharing and interoperability have resulted in the creation of a large number of biological ontologies, including the Cell Ontology (CL), designed to provide a standardized representation of cell types for data annotation. Ontologies have been shown to have significant benefits for computational analyses of large data sets and for automated reasoning applications, leading to organized attempts to improve the structure and formal rigor of ontologies to better support computation. Currently, the CL employs multiple is_a relations, defining cell types in terms of histological, functional, and lineage properties, and the majority of definitions are written with sufficient generality to hold across multiple species. This approach limits the CL's utility for computation and for cross-species data integration. Results: To enhance the CL's utility for computational analyses, we developed a method for the ontological representation of cells and applied this method to develop a dendritic cell ontology (DC-CL). DC-CL subtypes are delineated on the basis of surface protein expression, systematically including both species-general and species-specific types and optimizing DC-CL for the analysis of flow cytometry data. We avoid multiple uses of is_a by linking DC-CL terms to terms in other ontologies via additional, formally defined relations such as has_function. Conclusion: This approach brings benefits in the form of increased accuracy, support for reasoning, and interoperability with other ontology resources. Accordingly, we propose our method as a general strategy for the ontological representation of cells. DC-CL is available from http://www.obofoundry.org. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
19. Framework for a Protein Ontology.
- Author
-
Natale, Darren A., Arighi, Cecilia N., Barker, Winona C., Blake, Judith, Ti-Cheng Chang, Zhangzhi Hu, Hongfang Liu, Smith, Barry, and Wu, Cathy H.
- Subjects
- *
PROTEINS , *GENES , *TRANSFORMING growth factors-beta , *BONE morphogenetic proteins , *BIOINFORMATICS - Abstract
Biomedical ontologies are emerging as critical tools in genomic and proteomic research, where complex data in disparate resources need to be integrated. A number of ontologies describe properties that can be attributed to proteins. For example, protein functions are described by the Gene Ontology (GO) and human diseases by SNOMED CT or ICD10. There is, however, a gap in the current set of ontologies -- one that describes the protein entities themselves and their relationships. We have designed the PRotein Ontology (PRO) to facilitate protein annotation and to guide new experiments. The components of PRO extend from the classification of proteins on the basis of evolutionary relationships to the representation of the multiple protein forms of a gene (products generated by genetic variation, alternative splicing, proteolytic cleavage, and other post-translational modifications). PRO will allow the specification of relationships between PRO, GO and other ontologies in the OBO Foundry. Here we describe the initial development of PRO, illustrated using human and mouse proteins involved in the transforming growth factor-beta and bone morphogenetic protein signaling pathways. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.