26 results on '"Profiti, Giuseppe"'
Search Results
2. The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens
- Author
-
Zhou, Naihui, Jiang, Yuxiang, Bergquist, Timothy R., Lee, Alexandra J., Kacsoh, Balint Z., Crocker, Alex W., Lewis, Kimberley A., Georghiou, George, Nguyen, Huy N., Hamid, Md Nafiz, Davis, Larry, Dogan, Tunca, Atalay, Volkan, Rifaioglu, Ahmet S., Dalkıran, Alperen, Cetin Atalay, Rengul, Zhang, Chengxin, Hurto, Rebecca L., Freddolino, Peter L., Zhang, Yang, Bhat, Prajwal, Supek, Fran, Fernández, José M., Gemovic, Branislava, Perovic, Vladimir R., Davidović, Radoslav S., Sumonja, Neven, Veljkovic, Nevena, Asgari, Ehsaneddin, Mofrad, Mohammad R.K., Profiti, Giuseppe, Savojardo, Castrense, Martelli, Pier Luigi, Casadio, Rita, Boecker, Florian, Schoof, Heiko, Kahanda, Indika, Thurlby, Natalie, McHardy, Alice C., Renaux, Alexandre, Saidi, Rabie, Gough, Julian, Freitas, Alex A., Antczak, Magdalena, Fabris, Fabio, Wass, Mark N., Hou, Jie, Cheng, Jianlin, Wang, Zheng, Romero, Alfonso E., Paccanaro, Alberto, Yang, Haixuan, Goldberg, Tatyana, Zhao, Chenguang, Holm, Liisa, Törönen, Petri, Medlar, Alan J., Zosa, Elaine, Borukhov, Itamar, Novikov, Ilya, Wilkins, Angela, Lichtarge, Olivier, Chi, Po-Han, Tseng, Wei-Cheng, Linial, Michal, Rose, Peter W., Dessimoz, Christophe, Vidulin, Vedrana, Dzeroski, Saso, Sillitoe, Ian, Das, Sayoni, Lees, Jonathan Gill, Jones, David T., Wan, Cen, Cozzetto, Domenico, Fa, Rui, Torres, Mateo, Warwick Vesztrocy, Alex, Rodriguez, Jose Manuel, Tress, Michael L., Frasca, Marco, Notaro, Marco, Grossi, Giuliano, Petrini, Alessandro, Re, Matteo, Valentini, Giorgio, Mesiti, Marco, Roche, Daniel B., Reeb, Jonas, Ritchie, David W., Aridhi, Sabeur, Alborzi, Seyed Ziaeddin, Devignes, Marie-Dominique, Koo, Da Chen Emily, Bonneau, Richard, Gligorijević, Vladimir, Barot, Meet, Fang, Hai, Toppo, Stefano, Lavezzo, Enrico, Falda, Marco, Berselli, Michele, Tosatto, Silvio C.E., Carraro, Marco, Piovesan, Damiano, Ur Rehman, Hafeez, Mao, Qizhong, Zhang, Shanshan, Vucetic, Slobodan, Black, Gage S., Jo, Dane, Suh, Erica, Dayton, Jonathan B., Larsen, Dallas J., Omdahl, Ashton R., McGuffin, Liam J., Brackenridge, Danielle A., Babbitt, Patricia C., Yunes, Jeffrey M., Fontana, Paolo, Zhang, Feng, Zhu, Shanfeng, You, Ronghui, Zhang, Zihan, Dai, Suyang, Yao, Shuwei, Tian, Weidong, Cao, Renzhi, Chandler, Caleb, Amezola, Miguel, Johnson, Devon, Chang, Jia-Ming, Liao, Wen-Hung, Liu, Yi-Wei, Pascarelli, Stefano, Frank, Yotam, Hoehndorf, Robert, Kulmanov, Maxat, Boudellioua, Imane, Politano, Gianfranco, Di Carlo, Stefano, Benso, Alfredo, Hakala, Kai, Ginter, Filip, Mehryary, Farrokh, Kaewphan, Suwisa, Björne, Jari, Moen, Hans, Tolvanen, Martti E.E., Salakoski, Tapio, Kihara, Daisuke, Jain, Aashish, Šmuc, Tomislav, Altenhoff, Adrian, Ben-Hur, Asa, Rost, Burkhard, Brenner, Steven E., Orengo, Christine A., Jeffery, Constance J., Bosco, Giovanni, Hogan, Deborah A., Martin, Maria J., O’Donovan, Claire, Mooney, Sean D., Greene, Casey S., Radivojac, Predrag, and Friedberg, Iddo
- Published
- 2019
- Full Text
- View/download PDF
3. AlignBucket: a tool to speed up ‘all-against-all’ protein sequence alignments optimizing length constraints
- Author
-
Profiti, Giuseppe, Fariselli, Piero, and Casadio, Rita
- Published
- 2015
- Full Text
- View/download PDF
4. COVID-19: A Perspective for the Italian Health Service
- Author
-
Profiti, Giuseppe
- Subjects
Health Care System ,Public Investments, Health Care System, Public Spending, Fiscal Policies ,Fiscal Policies ,Public Investments ,Public Spending - Published
- 2020
5. Protein Functional Annotation
- Author
-
Luigi Martelli, Pier, Profiti, Giuseppe, and Casadio, Rita
- Published
- 2017
- Full Text
- View/download PDF
6. Fido-SNP: the first webserver for scoring the impact of single nucleotide variants in the dog genome.
- Author
-
Capriotti, Emidio, Montanucci, Ludovica, Profiti, Giuseppe, Rossi, Ivan, Giannuzzi, Diana, Aresu, Luca, and Fariselli, Piero
- Published
- 2019
- Full Text
- View/download PDF
7. A graph-based approach for predicting protein function: challenges in interconnected data
- Author
-
PROFITI, GIUSEPPE, FARISELLI, PIERO, MARTELLI, PIER LUIGI, AGGAZIO, FRANCESCO, CASADIO, RITA, G. Profiti, P. Fariselli, P.L. Martelli, F. Aggazio, and R. Casadio
- Subjects
protein function prediction, graph database, electronic annotation - Published
- 2015
8. BUSCA: an integrative web server to predict subcellular localization of proteins.
- Author
-
Savojardo, Castrense, Martelli, Pier Luigi, Fariselli, Piero, Profiti, Giuseppe, and Casadio, Rita
- Published
- 2018
- Full Text
- View/download PDF
9. Extended and Robust Protein Sequence Annotation over Conservative Nonhierarchical Clusters
- Author
-
PIOVESAN, DAMIANO, PROFITI, GIUSEPPE, MARTELLI, PIER LUIGI, FARISELLI, PIERO, CASADIO, RITA, Damiano Piovesan, Giuseppe Profiti, Pier Luigi Martelli, Piero Fariselli, and Rita Casadio
- Subjects
cross-genome comparison ,ABC TRANSPORTERS ,profile HMMs ,Protein functional annotation ,ATP-binding domain ,distantly related homolog - Abstract
Genome annotation is one of the most important issues in the genomic era. The exponential growth rate of newly sequenced genomes and proteomes urges the development of fast and reliable annotation methods, suited to exploit all the information available in curated databases of protein sequences and structures. To this aim we developed BAR+, the Bologna Annotation Resource.1 The basic notion is that sequences with high identity value to a counterpart can inherit the same function/s and structure, if available. As a case study we describe how the ATP-binding domain of the ABC transporters can be found and modeled in over 30,000 new sequences not annotated before. We also mapped into BAR+ all the ABC transporters listed in the Transporter Classification DataBase2 and found that within our environment annotation could be extended to another 256,866 sequences.
- Published
- 2013
10. BAR-PIG: a database of the pig proteome with structural and functional statistically validated annotation
- Author
-
PIOVESAN, DAMIANO, MARTELLI, PIER LUIGI, FARISELLI, PIERO, PROFITI, GIUSEPPE, FONTANESI, LUCA, CASADIO, RITA, D. Piovesan, PL. Martelli, P. Fariselli, G. Profiti, L. Fontanesi, and R. Casadio
- Subjects
PIG ,PROTEIN FUNCTIONAL ANNOTATION ,SUS SCROFA ,PROTEIN STRUCTURAL ANNOTATION ,BOLOGNA ANNOTATION RESOURCE - Abstract
We describe a database of pig proteins, comprising a total of 35,381 sequences collected by merging 19,576 and 23,118 chains retrieved respectively from UniProtKB, one of the major resources of protein sequences freely available, and from Ensembl, the genome database for eukaryotic species. Some 90% of these chains are poorly annotated and their existence is inferred automatically by sequence alignment towards the entire protein universe database. Given the relevance of the pig proteome in different studies, including human complex maladies, a statistical validation of the annotation is required for a better understanding of the role of specific genes and proteins in the complex networks underlying biological processes in the animal. We introduce BAR-PIG, a database in which some 21,793 sequences are endowed with a statistically validated annotation. Statistical validation is determined by adopting a cluster-centric annotation procedure that allows different types of annotation from structure to function and when possible to both structure and function. Each sequence in the database can be associated with a set of statistically validated Gene Ontologies (GO) of the three main routes (Molecular Function, Biological Process, Cellular Component), with Pfam functional domains and when possible with a cluster HMM model that allows building of the three dimensional structure of the protein. A database search allows some statistics demonstrating the enrichment in both GO and Pfam terms of the pig proteins as compared to the UniProtKB annotation. Conclusion: Protein sequence annotation after cluster statistical validation is at the basis of the database that we present in this paper. Searching in the BAR-PIG database allows retrieval of the pig protein annotation for further analysis. The search is also possible on the basis of specific GO terms and this allows retrieval of all the pig sequences participating into a given biological process, after annotation with our system. Alternatively the search is possible on the basis of structural information, allowing retrieval of all the pig sequences with the same structural characteristics
- Published
- 2012
11. eDGAR: a database of Disease-Gene Associations with annotated Relationships among genes.
- Author
-
Babbi, Giulia, Martelli, Pier Luigi, Profiti, Giuseppe, Bovo, Samuele, Savojardo, Castrense, and Casadio, Rita
- Subjects
GENES ,DISEASES ,PROTEIN-protein interactions ,PHENOTYPES ,PATHOGENIC microorganisms - Abstract
Background: Genetic investigations, boosted by modern sequencing techniques, allow dissecting the genetic component of different phenotypic traits. These efforts result in the compilation of lists of genes related to diseases and show that an increasing number of diseases is associated with multiple genes. Investigating functional relations among genes associated with the same disease contributes to highlighting molecular mechanisms of the pathogenesis. Results: We present eDGAR, a database collecting and organizing the data on gene/disease associations as derived from OMIM, Humsavar and ClinVar. For each disease-associated gene, eDGAR collects information on its annotation. Specifically, for lists of genes, eDGAR provides information on: i) interactions retrieved from PDB, BIOGRID and STRING; ii) co-occurrence in stable and functional structural complexes; iii) shared Gene Ontology annotations; iv) shared KEGG and REACTOME pathways; v) enriched functional annotations computed with NET-GE; vi) regulatory interactions derived from TRRUST; vii) localization on chromosomes and/or co-localisation in neighboring loci. The present release of eDGAR includes 2672 diseases, related to 3658 different genes, for a total number of 5729 gene-disease associations. 71% of the genes are linked to 621 multigenic diseases and eDGAR highlights their common GO terms, KEGG/REACTOME pathways, physical and regulatory interactions. eDGAR includes a network based enrichment method for detecting statistically significant functional terms associated to groups of genes. Conclusions: eDGAR offers a resource to analyze disease-gene associations. In multigenic diseases genes can share physical interactions and/or co-occurrence in the same functional processes. eDGAR is freely available at: edgar. biocomp.unibo.it. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
12. The Bologna Annotation Resource (BAR 3.0): improving protein functional annotation.
- Author
-
Profiti, Giuseppe, Pier Luigi Martelli, and Casadio, Rita
- Published
- 2017
- Full Text
- View/download PDF
13. An expanded evaluation of protein function prediction methods shows an improvement in accuracy.
- Author
-
Yuxiang Jiang, Tal Ronnen Oron, Clark, Wyatt T., Bankapur, Asma R., D'Andrea, Daniel, Lepore, Rosalba, Funk, Christopher S., Kahanda, Indika, Verspoor, Karin M., Asa Ben-Hur, Da Chen Emily Koo, Penfold-Brown, Duncan, Shasha, Dennis, Noah Youngs, Bonneau, Richard, Lin, Alexandra, Sahraeian, Sayed M. E., Martelli, Pier Luigi, Profiti, Giuseppe, and Casadio, Rita
- Published
- 2016
- Full Text
- View/download PDF
14. Extended and Robust Protein Sequence Annotation over Conservative Nonhierarchical Clusters: The Case Study of the ABC Transporters.
- Author
-
PIOVESAN, DAMIANO, PROFITI, GIUSEPPE, MARTELLI, PIER LUIGI, FARISELLI, PIERO, and CASADIO, RITA
- Subjects
AMINO acid sequence ,ATP-binding cassette transporters ,GENOMES ,PROTEOMICS ,DATABASES - Abstract
Genome annotation is one of the most important issues in the genomic era. The exponential growth rate of newly sequenced genomes and proteomes urges the development of fast and reliable annotation methods, suited to exploit all the information available in curated databases of protein sequences and structures. To this aim we developed BAR+, the Bologna Annotation Resource.
1 The basic notion is that sequences with high identity value to a counterpart can inherit the same function/s and structure, if available. As a case study we describe how the ATP-binding domain of the ABC transporters can be found and modeled in over 30,000 new sequences not annotated before. We also mapped into BAR+ all the ABC transporters listed in the Transporter Classification DataBase2 and found that within our environment annotation could be extended to another 256,866 sequences. [ABSTRACT FROM AUTHOR]- Published
- 2013
- Full Text
- View/download PDF
15. How to inherit statistically validated annotation within BAR+ protein clusters.
- Author
-
Piovesan, Damiano, Martelli, Pier Luigi, Fariselli, Piero, Profiti, Giuseppe, Zaul, Andrea, Rossi, Ivan, and Casadio, Rita
- Subjects
AMINO acid sequence ,SEQUENCE alignment ,GENE ontology ,BIOCHEMICAL templates ,GENETIC translation ,NUCLEOTIDE sequence - Abstract
Background: In the genomic era a key issue is protein annotation, namely how to endow protein sequences, upon translation from the corresponding genes, with structural and functional features. Routinely this operation is electronically done by deriving and integrating information from previous knowledge. The reference database for protein sequences is UniProtKB divided into two sections, UniProtKB/TrEMBL which is automatically annotated and not reviewed and UniProtKB/Swiss-Prot which is manually annotated and reviewed. The annotation process is essentially based on sequence similarity search. The question therefore arises as to which extent annotation based on transfer by inheritance is valuable and specifically if it is possible to statistically validate inherited features when little homology exists among the target sequence and its template(s). Results: In this paper we address the problem of annotating protein sequences in a statistically validated manner considering as a reference annotation resource UniProtKB. The test case is the set of 48,298 proteins recently released by the Critical Assessment of Function Annotations (CAFA) organization. We show that we can transfer after validation, Gene Ontology (GO) terms of the three main categories and Pfam domains to about 68% and 72% of the sequences, respectively. This is possible after alignment of the CAFA sequences towards BAR+, our annotation resource that allows discriminating among statistically validated and not statistically validated annotation. By comparing with a direct UniProtKB annotation, we find that besides validating annotation of some 78% of the CAFA set, we assign new and statistically validated annotation to 14.8% of the sequences and find new structural templates for about 25% of the chains, half of which share less than 30% sequence identity to the corresponding template/s. Conclusion: Inheritance of annotation by transfer generally requires a careful selection of the identity value among the target and the template in order to transfer structural and/or functional features. Here we prove that even distantly remote homologs can be safely endowed with structural templates and GO and/or Pfam terms provided that annotation is done within clusters collecting cluster-related protein sequences and where a statistical validation of the shared structural and functional features is possible. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
16. SUS-BAR: a database of pig proteins with statistically validated structural and functional annotation.
- Author
-
Piovesan, Damiano, Profiti, Giuseppe, Martelli, Pier Luigi, Fariselli, Piero, Fontanesi, Luca, and Casadio, Rita
- Subjects
- *
BIOLOGICAL databases , *BIOMOLECULES , *DATABASE management , *PROTEINS , *MARKOV processes - Abstract
Given the relevance of the pig proteome in different studies, including human complex maladies, a statistical validation of the annotation is required for a better understanding of the role of specific genes and proteins in the complex networks underlying biological processes in the animal. Presently, approximately 80% of the pig proteome is still poorly annotated, and the existence of protein sequences is routinely inferred automatically by sequence alignment towards preexisting sequences. In this article, we introduce SUS-BAR, a database that derives information mainly from UniProt Knowledgebase and that includes 26 206 pig protein sequences. In SUS-BAR, 16 675 of the pig protein sequences are endowed with statistically validated functional and structural annotation. Our statistical validation is determined by adopting a cluster-centric annotation procedure that allows transfer of different types of annotation, including structure and function. Each sequence in the database can be associated with a set of statistically validated Gene Ontologies (GOs) of the three main subontologies (Molecular Function, Biological Process and Cellular Component), with Pfam functional domains, and when possible, with a cluster Hidden Markov Model that allows modelling the 3D structure of the protein. A database search allows some statistics demonstrating the enrichment in both GO and Pfam annotations of the pig proteins as compared with UniProt Knowledgebase annotation. Searching in SUS-BAR allows retrieval of the pig protein annotation for further analysis. The search is also possible on the basis of specific GO terms and this allows retrieval of all the pig sequences participating into a given biological process, after annotation with our system. Alternatively, the search is possible on the basis of structural information, allowing retrieval of all the pig sequences with the same structural characteristics. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
17. The human "magnesome": detecting magnesium binding sites on human proteins.
- Author
-
Piovesan, Damiano, Profiti, Giuseppe, Martelli, Pier Luigi, and Casadio, Rita
- Subjects
- *
MAGNESIUM , *PROTEINS , *CATIONS , *CALCIUM , *ZINC - Abstract
Background: Magnesium research is increasing in molecular medicine due to the relevance of this ion in several important biological processes and associated molecular pathogeneses. It is still difficult to predict from the protein covalent structure whether a human chain is or not involved in magnesium binding. This is mainly due to little information on the structural characteristics of magnesium binding sites in proteins and protein complexes. Magnesium binding features, differently from those of other divalent cations such as calcium and zinc, are elusive. Here we address a question that is relevant in protein annotation: how many human proteins can bind Mg2+? Our analysis is performed taking advantage of the recently implemented Bologna Annotation Resource (BAR-PLUS), a non hierarchical clustering method that relies on the pair wise sequence comparison of about 14 millions proteins from over 300.000 species and their grouping into clusters where annotation can safely be inherited after statistical validation. Results: After cluster assignment of the latest version of the human proteome, the total number of human proteins for which we can assign putative Mg binding sites is 3,751. Among these proteins, 2,688 inherit annotation directly from human templates and 1,063 inherit annotation from templates of other organisms. Protein structures are highly conserved inside a given cluster. Transfer of structural properties is possible after alignment of a given sequence with the protein structures that characterise a given cluster as obtained with a Hidden Markov Model (HMM) based procedure. Interestingly a set of 370 human sequences inherit Mg2+ binding sites from templates sharing less than 30% sequence identity with the template. Conclusion: We describe and deliver the "human magnesome", a set of proteins of the human proteome that inherit putative binding of magnesium ions. With our BAR-hMG, 251 clusters including 1,341 magnesium binding protein structures corresponding to 387 sequences are sufficient to annotate some 13,689 residues in 3,751 human sequences as "magnesium binding". Protein structures act therefore as three dimensional seeds for structural and functional annotation of human sequences. The data base collects specifically all the human proteins that can be annotated according to our procedure as "magnesium binding", the corresponding structures and BAR+ clusters from where they derive the annotation (http://bar.biocomp.unibo.it/mg). [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
18. Whole Genome Sequence Analysis of Brucella abortus Isolates from Various Regions of South Africa.
- Author
-
Ledwaba, Maphuti Betty, Glover, Barbara Akorfa, Matle, Itumeleng, Profiti, Giuseppe, Martelli, Pier Luigi, Casadio, Rita, Zilli, Katiuscia, Janowicz, Anna, Marotta, Francesca, Garofolo, Giuliano, van Heerden, Henriette, and Yagupsky, Pablo V.
- Subjects
BRUCELLA abortus ,BACTERIAL genomes ,SEQUENCE analysis ,SINGLE nucleotide polymorphisms ,COMPARATIVE genomics ,ANIMAL vaccination ,BRUCELLOSIS - Abstract
The availability of whole genome sequences in public databases permits genome-wide comparative studies of various bacterial species. Whole genome sequence-single nucleotide polymorphisms (WGS-SNP) analysis has been used in recent studies and allows the discrimination of various Brucella species and strains. In the present study, 13 Brucella spp. strains from cattle of various locations in provinces of South Africa were typed and discriminated. WGS-SNP analysis indicated a maximum pairwise distance ranging from 4 to 77 single nucleotide polymorphisms (SNPs) between the South African Brucella abortus virulent field strains. Moreover, it was shown that the South African B. abortus strains grouped closely to B. abortus strains from Mozambique and Zimbabwe, as well as other Eurasian countries, such as Portugal and India. WGS-SNP analysis of South African B. abortus strains demonstrated that the same genotype circulated in one farm (Farm 1), whereas another farm (Farm 2) in the same province had two different genotypes. This indicated that brucellosis in South Africa spreads within the herd on some farms, whereas the introduction of infected animals is the mode of transmission on other farms. Three B. abortus vaccine S19 strains isolated from tissue and aborted material were identical, even though they originated from different herds and regions of South Africa. This might be due to the incorrect vaccination of animals older than the recommended age of 4–8 months or might be a problem associated with vaccine production. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
19. Fido-SNP: the first webserver for scoring the impact of single nucleotide variants in the dog genome
- Author
-
Giuseppe Profiti, Diana Giannuzzi, Ludovica Montanucci, Emidio Capriotti, Ivan Rossi, Luca Aresu, Piero Fariselli, Capriotti, Emidio, Montanucci, Ludovica, Profiti, Giuseppe, Rossi, Ivan, Giannuzzi, Diana, Aresu, Luca, and Fariselli, Piero
- Subjects
genomic variants, variant interpretation, dog genome, machine learning ,Genotype ,Genomics ,Single-nucleotide polymorphism ,Genome-wide association study ,Computational biology ,Biology ,Genome ,Polymorphism, Single Nucleotide ,03 medical and health sciences ,0302 clinical medicine ,Dogs ,Genetics ,SNP ,Animals ,030304 developmental biology ,0303 health sciences ,Internet ,Genetic Variation ,Matthews correlation coefficient ,Binary classification ,Web Server Issue ,Human genome ,030217 neurology & neurosurgery ,Algorithms ,Software ,Genome-Wide Association Study - Abstract
As the amount of genomic variation data increases, tools that are able to score the functional impact of single nucleotide variants become more and more necessary. While there are several prediction servers available for interpreting the effects of variants in the human genome, only few have been developed for other species, and none were specifically designed for species of veterinary interest such as the dog. Here, we present Fido-SNP the first predictor able to discriminate between Pathogenic and Benign single-nucleotide variants in the dog genome. Fido-SNP is a binary classifier based on the Gradient Boosting algorithm. It is able to classify and score the impact of variants in both coding and non-coding regions based on sequence features within seconds. When validated on a previously unseen set of annotated variants from the OMIA database, Fido-SNP reaches 88% overall accuracy, 0.77 Matthews correlation coefficient and 0.91 Area Under the ROC Curve.
- Published
- 2019
20. The Bologna Annotation Resource (BAR 3.0): improving protein functional annotation
- Author
-
Giuseppe Profiti, Pier Luigi Martelli, Rita Casadio, Profiti, Giuseppe, Martelli, Pier Luigi, and Casadio, Rita
- Subjects
0301 basic medicine ,protein structure rpediction ,Bar (music) ,Biology ,Bioinformatics ,03 medical and health sciences ,Annotation ,Similarity (network science) ,Sequence Analysis, Protein ,Genetics ,Cluster Analysis ,protein interaction ,Cluster analysis ,Hidden Markov model ,Sequence ,Internet ,Information retrieval ,Proteins ,Molecular Sequence Annotation ,sequence similarity ,030104 developmental biology ,Web Server Issue ,Graph (abstract data type) ,UniProt ,Software ,protein function prediction ,clustering - Abstract
BAR 3.0 updates our server BAR (Bologna Annotation Resource) for predicting protein structural and functional features from sequence. We increase data volume, query capabilities and information conveyed to the user. The core of BAR 3.0 is a graph-based clustering procedure of UniProtKB sequences, following strict pairwise similarity criteria (sequence identity ≥40% with alignment coverage ≥90%). Each cluster contains the available annotation downloaded from UniProtKB, GO, PFAM and PDB. After statistical validation, GO terms and PFAM domains are cluster-specific and annotate new sequences entering the cluster after satisfying similarity constraints. BAR 3.0 includes 28 869 663 sequences in 1 361 773 clusters, of which 22.2% (22 241 661 sequences) and 47.4% (24 555 055 sequences) have at least one validated GO term and one PFAM domain, respectively. 1.4% of the clusters (36% of all sequences) include PDB structures and the cluster is associated to a hidden Markov model that allows building template-target alignment suitable for structural modeling. Some other 3 399 026 sequences are singletons. BAR 3.0 offers an improved search interface, allowing queries by UniProtKB-accession, Fasta sequence, GO-term, PFAM-domain, organism, PDB and ligand/s. When evaluated on the CAFA2 targets, BAR 3.0 largely outperforms our previous version and scores among state-of-the-art methods. BAR 3.0 is publicly available and accessible at http://bar.biocomp.unibo.it/bar3.
- Published
- 2017
21. eDGAR: a database of Disease-Gene Associations with annotated Relationships among genes
- Author
-
Giulia Babbi, Rita Casadio, Samuele Bovo, Giuseppe Profiti, Castrense Savojardo, Pier Luigi Martelli, Babbi, Giulia, Martelli, Pier Luigi, Profiti, Giuseppe, Bovo, Samuele, Savojardo, Castrense, and Casadio, Rita
- Subjects
0301 basic medicine ,lcsh:QH426-470 ,lcsh:Biotechnology ,Genomics ,Biology ,Protein functional annotation ,computer.software_genre ,03 medical and health sciences ,Annotation ,0302 clinical medicine ,Protein-protein interaction ,lcsh:TP248.13-248.65 ,Databases, Genetic ,Genetics ,Humans ,Protein Interaction Maps ,KEGG ,Gene ,Gene/disease relationship ,Database ,Research ,Genetic Diseases, Inborn ,Molecular Sequence Annotation ,Phenotypic trait ,lcsh:Genetics ,030104 developmental biology ,DNA microarray ,Functional enrichment ,computer ,Functional genomics ,030217 neurology & neurosurgery ,Metabolic Networks and Pathways ,Biotechnology - Abstract
Background Genetic investigations, boosted by modern sequencing techniques, allow dissecting the genetic component of different phenotypic traits. These efforts result in the compilation of lists of genes related to diseases and show that an increasing number of diseases is associated with multiple genes. Investigating functional relations among genes associated with the same disease contributes to highlighting molecular mechanisms of the pathogenesis. Results We present eDGAR, a database collecting and organizing the data on gene/disease associations as derived from OMIM, Humsavar and ClinVar. For each disease-associated gene, eDGAR collects information on its annotation. Specifically, for lists of genes, eDGAR provides information on: i) interactions retrieved from PDB, BIOGRID and STRING; ii) co-occurrence in stable and functional structural complexes; iii) shared Gene Ontology annotations; iv) shared KEGG and REACTOME pathways; v) enriched functional annotations computed with NET-GE; vi) regulatory interactions derived from TRRUST; vii) localization on chromosomes and/or co-localisation in neighboring loci. The present release of eDGAR includes 2672 diseases, related to 3658 different genes, for a total number of 5729 gene-disease associations. 71% of the genes are linked to 621 multigenic diseases and eDGAR highlights their common GO terms, KEGG/REACTOME pathways, physical and regulatory interactions. eDGAR includes a network based enrichment method for detecting statistically significant functional terms associated to groups of genes. Conclusions eDGAR offers a resource to analyze disease-gene associations. In multigenic diseases genes can share physical interactions and/or co-occurrence in the same functional processes. eDGAR is freely available at: edgar.biocomp.unibo.it Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3911-3) contains supplementary material, which is available to authorized users.
- Published
- 2017
22. AlignBucket: a tool to speed up 'all-against-all' protein sequence alignments optimizing length constraints
- Author
-
Giuseppe Profiti, Rita Casadio, Piero Fariselli, Profiti, Giuseppe, Fariselli, Piero, and Casadio, Rita
- Subjects
Statistics and Probability ,Computer science ,Nearest neighbor search ,Sequence alignment ,Bioinformatics ,Biochemistry ,Protein sequencing ,Humans ,Databases, Protein ,sequence alignment ,optimization ,constraints ,similarity search ,sequence alignment, optimization, constraints, similarity search ,Molecular Biology ,Sequence ,Computational Biology ,Proteins ,Partition (database) ,Computer Science Applications ,Computational Mathematics ,Range (mathematics) ,Computational Theory and Mathematics ,Algorithm ,Algorithms ,Software - Abstract
Motivation: The next-generation sequencing era requires reliable, fast and efficient approaches for the accurate annotation of the ever-increasing number of biological sequences and their variations. Transfer of annotation upon similarity search is a standard approach. The procedure of all-against-all protein comparison is a preliminary step of different available methods that annotate sequences based on information already present in databases. Given the actual volume of sequences, methods are necessary to pre-process data to reduce the time of sequence comparison. Results: We present an algorithm that optimizes the partition of a large volume of sequences (the whole database) into sets where sequence length values (in residues) are constrained depending on a bounded minimal and expected alignment coverage. The idea is to optimally group protein sequences according to their length, and then computing the all-against-all sequence alignments among sequences that fall in a selected length range. We describe a mathematically optimal solution and we show that our method leads to a 5-fold speed-up in real world cases. Availability and implementation: The software is available for downloading at http://www.biocomp.unibo.it/∼giuseppe/partitioning.html. Contact: giuseppe.profiti2@unibo.it Supplementary information: Supplementary data are available at Bioinformatics online.
- Published
- 2015
23. Tools and data services registry: a community effort to document bioinformatics resources
- Author
-
Callum Smith, Paolo Uva, Thomas Gatter, Peter Løngreen, Peter Juvan, Hans Ienasescu, Giuseppe Profiti, Aleksandra Nenadic, Kristoffer Rapacki, Chris Morris, Paola Roncaglia, Steffen Möller, Laura Emery, Søren Brunak, Maria Maddalena Sperotto, Heinz Stockinger, Kristian Davidsen, Federico Zambelli, Helen Parkinson, Olivia Doppelt-Azeroual, Luana Licata, Tatyana Goldberg, Andrea Schafferhans, Elisabeth Gasteiger, Emil Karol Rydza, Camille Laibe, Victor De La Torre, Marie Grosjean, Manuela Helmer-Citterich, Hervé Ménager, Radka Svobodová Vařeková, Rafael C. Jimenez, Martin Closter Jespersen, Anthony Bretaudeau, Jan Brezovsky, Tunca Doğan, Matúš Kalaš, Peter M. Rice, Ivan Mičetić, Rune Møllegaard Friborg, Maximilian Koch, Silvio C. E. Tosatto, Nick Juty, Björn Grüning, Gianmauro Cuccuru, Frederik Coppens, Gianni Cesareni, Jon Ison, Rabie Saidi, Sébastien Moretti, Rita Casadio, Gert Vriend, Guy Yachdav, Niall Beard, Timothy F. Booth, Michael Cornell, Piotr Jaroslaw Chmura, Veit Schwämmle, Karel Berka, Dan Bolser, Vassilios Ioannidis, Jing-Woei Li, Burkhard Rost, Gianluca Della Vedova, Fabien Mareuil, Hedi Peterson, Allegra Via, Paolo Romano, Christian Anthon, Technical University of Denmark [Lyngby] (DTU), Institut Pasteur de Madagascar, Réseau International des Instituts Pasteur (RIIP), University of Bergen (UIB), University of Copenhagen = Københavns Universitet (KU), University of Manchester, Palacky University, European Bioinformatics Institute, NEBC Wallingford, Institut de Génétique, Environnement et Protection des Plantes (IGEPP), Institut National de la Recherche Agronomique (INRA)-Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)-AGROCAMPUS OUEST, Masaryk University, University of Bologna, Università degli Studi di Roma Tor Vergata [Roma], Ghent University [Belgium] (UGENT), Flanders Institute for Biotechnology, CRS4 Bioinformat, Università degli studi di Milano-Bicocca, Swiss Institute of Bioinformatics, Universität Bielefeld = Bielefeld University, Tumor Biology Center, Centre National de la Recherche Scientifique (CNRS), University of Freiburg, University of Ljubljana, The Chinese University of Hong Kong [Hong Kong], Universita degli Studi di Padova, Bioinformatics Research Centre, Université de Lausanne, CCLRC Daresbury Laboratory, Universität zu Lübeck [Lübeck] - University of Lübeck [Lübeck], Universität Rostock, University of Tartu, Imperial College London, IRCCS Azienda Ospedaliera Universitaria Integrata San Martino (IRCCS AOU San Martino), University of Southern Denmark (SDU), WTCHG, Central European Institute of Technology [Brno] (CEITEC), Instituto Nacional de Bioinformática, Sapienza University of Rome (DIAG), Consiglio Nazionale delle Ricerche, University of Milan, Radboud University Nijmegen, Ison, J, Rapacki, K, Ménager, H, Kalaš, M, Rydza, E, Chmura, P, Anthon, C, Beard, N, Berka, K, Bolser, D, Booth, T, Bretaudeau, A, Brezovsky, J, Casadio, R, Cesareni, G, Coppens, F, Cornell, M, Cuccuru, G, Davidsen, K, DELLA VEDOVA, G, Dogan, T, Doppelt Azeroual, O, Emery, L, Gasteiger, E, Gatter, T, Goldberg, T, Grosjean, M, Grüning, B, Helmer Citterich, M, Ienasescu, H, Ioannidis, V, Jespersen, M, Jimenez, R, Juty, N, Juvan, P, Koch, M, Laibe, C, Li, J, Licata, L, Mareuil, F, Mičetić, I, Friborg, R, Moretti, S, Morris, C, Möller, S, Nenadic, A, Peterson, H, Profiti, G, Rice, P, Romano, P, Roncaglia, P, Saidi, R, Schafferhans, A, Schwämmle, V, Smith, C, Sperotto, M, Stockinger, H, Vařeková, R, Tosatto, S, de la Torre, V, Uva, P, Via, A, Yachdav, G, Zambelli, F, Vriend, G, Rost, B, Parkinson, H, Løngreen, P, Brunak, S, University of Bergen (UiB), Palacky University Olomouc, Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro)-Institut national d'enseignement supérieur pour l'agriculture, l'alimentation et l'environnement (Institut Agro), Masaryk University [Brno] (MUNI), Universiteit Gent = Ghent University [Belgium] (UGENT), Università degli Studi di Milano-Bicocca [Milano] (UNIMIB), Swiss Institute of Bioinformatics [Lausanne] (SIB), Université de Lausanne (UNIL), Universität zu Lübeck [Lübeck], Central European Institute of Technology [Brno] (CEITEC MU), Brno University of Technology [Brno] (BUT), Università degli Studi di Roma 'La Sapienza' = Sapienza University [Rome], Danmarks Tekniske Universitet = Technical University of Denmark (DTU), University of Copenhagen = Københavns Universitet (UCPH), Institut National de la Recherche Agronomique (INRA)-Université de Rennes (UR)-AGROCAMPUS OUEST, University of Bologna/Università di Bologna, Universiteit Gent = Ghent University (UGENT), Università degli Studi di Milano-Bicocca = University of Milano-Bicocca (UNIMIB), Université de Lausanne = University of Lausanne (UNIL), Università degli Studi di Padova = University of Padua (Unipd), Universität zu Lübeck = University of Lübeck [Lübeck], Università degli Studi di Roma 'La Sapienza' = Sapienza University [Rome] (UNIROMA), Università degli Studi di Milano = University of Milan (UNIMI), Ison, Jon, Rapacki, Kristoffer, Ménager, Hervé, Kalaš, Matúš, Rydza, Emil, Chmura, Piotr, Anthon, Christian, Beard, Niall, Berka, Karel, Bolser, Dan, Booth, Tim, Bretaudeau, Anthony, Brezovsky, Jan, Casadio, Rita, Cesareni, Gianni, Coppens, Frederik, Cornell, Michael, Cuccuru, Gianmauro, Davidsen, Kristian, Vedova, Gianluca Della, Dogan, Tunca, Doppelt-Azeroual, Olivia, Emery, Laura, Gasteiger, Elisabeth, Gatter, Thoma, Goldberg, Tatyana, Grosjean, Marie, Grüning, Björn, Helmer-Citterich, Manuela, Ienasescu, Han, Ioannidis, Vassilio, Jespersen, Martin Closter, Jimenez, Rafael, Juty, Nick, Juvan, Peter, Koch, Maximilian, Laibe, Camille, Li, Jing-Woei, Licata, Luana, Mareuil, Fabien, Mičetić, Ivan, Friborg, Rune Møllegaard, Moretti, Sebastien, Morris, Chri, Möller, Steffen, Nenadic, Aleksandra, Peterson, Hedi, Profiti, Giuseppe, Rice, Peter, Romano, Paolo, Roncaglia, Paola, Saidi, Rabie, Schafferhans, Andrea, Schwämmle, Veit, Smith, Callum, Sperotto, Maria Maddalena, Stockinger, Heinz, Vařeková, Radka Svobodová, Tosatto, Silvio C E, de la Torre, Victor, Uva, Paolo, Via, Allegra, Yachdav, Guy, Zambelli, Federico, Vriend, Gert, Rost, Burkhard, Parkinson, Helen, Løngreen, Peter, and Brunak, Søren
- Subjects
0301 basic medicine ,[SDV]Life Sciences [q-bio] ,registry ,Bioinformatics ,computer.software_genre ,Matematikk og naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Systemutvikling og -arbeid: 426 [VDP] ,Task (project management) ,Documentation ,Data and Information ,Database Issue ,Registries ,bioinformatique ,Data Curation ,base de données ,Settore BIO/11 ,gestion de données ,tool ,SOFTWARE-DEVELOPMENT ,bioinformatics ,ddc ,outil informatique ,Tools and data services registry ,SEQANSWERS ,Web service ,MOLECULAR-BIOLOGY ,Biology ,Ecology and Environment ,03 medical and health sciences ,SDG 3 - Good Health and Well-being ,Genetics ,Implementation ,Dissemination ,Bioinformatikk / Bioinformatics ,Data curation ,bioinformatic ,business.industry ,Computational Biology ,Software ,Software development ,bioinformatics, tools, registry, elixir ,Biology and Life Sciences ,Mathematics and natural scienses: 400::Information and communication science: 420::System development and design: 426 [VDP] ,FRAMEWORK ,ELIXIR ,Settore BIO/18 - Genetica ,030104 developmental biology ,tools ,Data as a service ,COMPILATION ,business ,COLLECTION ,Nanomedicine Radboud Institute for Molecular Life Sciences [Radboudumc 19] ,computer ,WEB SERVICES ,LIFE SCIENCES - Abstract
Contains fulltext : 171819.pdf (Publisher’s version ) (Open Access) Life sciences are yielding huge data sets that underpin scientific discoveries fundamental to improvement in human health, agriculture and the environment. In support of these discoveries, a plethora of databases and tools are deployed, in technically complex and diverse implementations, across a spectrum of scientific disciplines. The corpus of documentation of these resources is fragmented across the Web, with much redundancy, and has lacked a common standard of information. The outcome is that scientists must often struggle to find, understand, compare and use the best resources for the task at hand.Here we present a community-driven curation effort, supported by ELIXIR-the European infrastructure for biological information-that aspires to a comprehensive and consistent registry of information about bioinformatics resources. The sustainable upkeep of this Tools and Data Services Registry is assured by a curation effort driven by and tailored to local needs, and shared amongst a network of engaged partners.As of November 2015, the registry includes 1785 resources, with depositions from 126 individual registrations including 52 institutional providers and 74 individuals. With community support, the registry can become a standard for dissemination of information about bioinformatics resources: we welcome everyone to join us in this common endeavour. The registry is freely available at https://bio.tools.
- Full Text
- View/download PDF
24. An expanded evaluation of protein function prediction methods shows an improvement in accuracy.
- Author
-
Jiang Y, Oron TR, Clark WT, Bankapur AR, D'Andrea D, Lepore R, Funk CS, Kahanda I, Verspoor KM, Ben-Hur A, Koo da CE, Penfold-Brown D, Shasha D, Youngs N, Bonneau R, Lin A, Sahraeian SM, Martelli PL, Profiti G, Casadio R, Cao R, Zhong Z, Cheng J, Altenhoff A, Skunca N, Dessimoz C, Dogan T, Hakala K, Kaewphan S, Mehryary F, Salakoski T, Ginter F, Fang H, Smithers B, Oates M, Gough J, Törönen P, Koskinen P, Holm L, Chen CT, Hsu WL, Bryson K, Cozzetto D, Minneci F, Jones DT, Chapman S, Bkc D, Khan IK, Kihara D, Ofer D, Rappoport N, Stern A, Cibrian-Uhalte E, Denny P, Foulger RE, Hieta R, Legge D, Lovering RC, Magrane M, Melidoni AN, Mutowo-Meullenet P, Pichler K, Shypitsyna A, Li B, Zakeri P, ElShal S, Tranchevent LC, Das S, Dawson NL, Lee D, Lees JG, Sillitoe I, Bhat P, Nepusz T, Romero AE, Sasidharan R, Yang H, Paccanaro A, Gillis J, Sedeño-Cortés AE, Pavlidis P, Feng S, Cejuela JM, Goldberg T, Hamp T, Richter L, Salamov A, Gabaldon T, Marcet-Houben M, Supek F, Gong Q, Ning W, Zhou Y, Tian W, Falda M, Fontana P, Lavezzo E, Toppo S, Ferrari C, Giollo M, Piovesan D, Tosatto SC, Del Pozo A, Fernández JM, Maietta P, Valencia A, Tress ML, Benso A, Di Carlo S, Politano G, Savino A, Rehman HU, Re M, Mesiti M, Valentini G, Bargsten JW, van Dijk AD, Gemovic B, Glisic S, Perovic V, Veljkovic V, Veljkovic N, Almeida-E-Silva DC, Vencio RZ, Sharan M, Vogel J, Kansakar L, Zhang S, Vucetic S, Wang Z, Sternberg MJ, Wass MN, Huntley RP, Martin MJ, O'Donovan C, Robinson PN, Moreau Y, Tramontano A, Babbitt PC, Brenner SE, Linial M, Orengo CA, Rost B, Greene CS, Mooney SD, Friedberg I, and Radivojac P
- Subjects
- Algorithms, Databases, Protein, Gene Ontology, Humans, Molecular Sequence Annotation, Proteins genetics, Computational Biology, Proteins chemistry, Software, Structure-Activity Relationship
- Abstract
Background: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging., Results: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2., Conclusions: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent.
- Published
- 2016
- Full Text
- View/download PDF
25. Ancient pathogen-driven adaptation triggers increased susceptibility to non-celiac wheat sensitivity in present-day European populations.
- Author
-
Sazzini M, De Fanti S, Cherubini A, Quagliariello A, Profiti G, Martelli PL, Casadio R, Ricci C, Campieri M, Lanzini A, Volta U, Caio G, Franceschi C, Spisni E, and Luiselli D
- Abstract
Background: Non-celiac wheat sensitivity is an emerging wheat-related syndrome showing peak prevalence in Western populations. Recent studies hypothesize that new gliadin alleles introduced in the human diet by replacement of ancient wheat with modern varieties can prompt immune responses mediated by the CXCR3-chemokine axis potentially underlying such pathogenic inflammation. This cultural shift may also explain disease epidemiology, having turned European-specific adaptive alleles previously targeted by natural selection into disadvantageous ones., Methods: To explore this evolutionary scenario, we performed ultra-deep sequencing of genes pivotal in the CXCR3-inflammatory pathway on individuals diagnosed for non-celiac wheat sensitivity and we applied anthropological evolutionary genetics methods to sequence data from worldwide populations to investigate the genetic legacy of natural selection on these loci., Results: Our results indicate that balancing selection has maintained two divergent CXCL10/CXCL11 haplotypes in Europeans, one responsible for boosting inflammatory reactions and another for encoding moderate chemokine expression., Conclusions: This led to considerably higher occurrence of the former haplotype in Western people than in Africans and East Asians, suggesting that they might be more prone to side effects related to the consumption of modern wheat varieties. Accordingly, this study contributed to shed new light on some of the mechanisms potentially involved in the disease etiology and on the evolutionary bases of its present-day epidemiological patterns. Moreover, overrepresentation of disease homozygotes for the dis-adaptive haplotype plausibly accounts for their even more enhanced CXCR3-axis expression and for their further increase in disease risk, representing a promising finding to be validated by larger follow-up studies.
- Published
- 2016
- Full Text
- View/download PDF
26. Tools and data services registry: a community effort to document bioinformatics resources.
- Author
-
Ison J, Rapacki K, Ménager H, Kalaš M, Rydza E, Chmura P, Anthon C, Beard N, Berka K, Bolser D, Booth T, Bretaudeau A, Brezovsky J, Casadio R, Cesareni G, Coppens F, Cornell M, Cuccuru G, Davidsen K, Vedova GD, Dogan T, Doppelt-Azeroual O, Emery L, Gasteiger E, Gatter T, Goldberg T, Grosjean M, Grüning B, Helmer-Citterich M, Ienasescu H, Ioannidis V, Jespersen MC, Jimenez R, Juty N, Juvan P, Koch M, Laibe C, Li JW, Licata L, Mareuil F, Mičetić I, Friborg RM, Moretti S, Morris C, Möller S, Nenadic A, Peterson H, Profiti G, Rice P, Romano P, Roncaglia P, Saidi R, Schafferhans A, Schwämmle V, Smith C, Sperotto MM, Stockinger H, Vařeková RS, Tosatto SC, de la Torre V, Uva P, Via A, Yachdav G, Zambelli F, Vriend G, Rost B, Parkinson H, Løngreen P, and Brunak S
- Subjects
- Data Curation, Software, Computational Biology, Registries
- Abstract
Life sciences are yielding huge data sets that underpin scientific discoveries fundamental to improvement in human health, agriculture and the environment. In support of these discoveries, a plethora of databases and tools are deployed, in technically complex and diverse implementations, across a spectrum of scientific disciplines. The corpus of documentation of these resources is fragmented across the Web, with much redundancy, and has lacked a common standard of information. The outcome is that scientists must often struggle to find, understand, compare and use the best resources for the task at hand.Here we present a community-driven curation effort, supported by ELIXIR-the European infrastructure for biological information-that aspires to a comprehensive and consistent registry of information about bioinformatics resources. The sustainable upkeep of this Tools and Data Services Registry is assured by a curation effort driven by and tailored to local needs, and shared amongst a network of engaged partners.As of November 2015, the registry includes 1785 resources, with depositions from 126 individual registrations including 52 institutional providers and 74 individuals. With community support, the registry can become a standard for dissemination of information about bioinformatics resources: we welcome everyone to join us in this common endeavour. The registry is freely available at https://bio.tools., (© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2016
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.