662 results on '"Sequence logo"'
Search Results
652. Sequence banks: Searching for sequence similarities
- Author
-
Temple F. Smith and Christian Burks
- Subjects
Sequence logo ,Multidisciplinary ,Computational biology ,Biology ,Sequence (medicine) - Published
- 1983
- Full Text
- View/download PDF
653. The nucleotide sequence of a developmentally regulated cDNA fromPhysarum polycephalum
- Author
-
Gérald Lemieux, André Laroche, and Dominick Pallotta
- Subjects
Genetics ,Base Sequence ,biology ,Molecular Sequence Data ,Nucleic acid sequence ,RNA, Fungal ,Physarum polycephalum ,DNA ,Mycetozoa ,biology.organism_classification ,Cell biology ,Physarum ,Sequence logo ,Complementary DNA ,Consensus sequence ,Base sequence ,Amino Acid Sequence ,RNA, Messenger ,Gene - Published
- 1989
- Full Text
- View/download PDF
654. Nucleotide Sequence of a Gene: First Complete Specification
- Author
-
T. M. Sonneborn
- Subjects
Sequence logo ,Multidisciplinary ,Chemistry ,Nucleic acid sequence ,Consensus sequence ,Computational biology ,Gene - Published
- 1965
- Full Text
- View/download PDF
655. Atlas of Protein Sequence and Structure 1969
- Author
-
Alfred Burger
- Subjects
Sequence logo ,Protein sequencing ,Chemistry ,Atlas (topology) ,Drug Discovery ,Consensus sequence ,Molecular Medicine ,Computational biology - Published
- 1970
- Full Text
- View/download PDF
656. Post-main Sequence Evolution
- Author
-
L. G. Henyey, R. Ulrich, P. Bodenheimer, and F. Over
- Subjects
Sequence logo ,Space and Planetary Science ,Astronomy and Astrophysics ,Computational biology ,Mathematics ,Sequence (medicine) - Published
- 1965
- Full Text
- View/download PDF
657. Atlas of Protein Sequence and Structure, 1966
- Author
-
Robert T. Hersh, Richard V. Eck, and Margaret O. Dayhoff
- Subjects
Sequence logo ,Protein sequencing ,Atlas (topology) ,Genetics ,Consensus sequence ,Computational biology ,Biology ,Ecology, Evolution, Behavior and Systematics - Published
- 1967
- Full Text
- View/download PDF
658. Characterization and identification of ubiquitin conjugation sites with E3 ligase recognition specificities
- Author
-
Chien Hsun Huang, Tzu Hao Chang, Kai-Yao Huang, Van Nui Nguyen, Julia Tzu-Ya Weng, Neil Arvin Bretaña, Tzong-Yi Lee, K. Robert Lai, Nguyen, Van-Nui, Huang, Kai-Yao, Huang, Chien-Hsun, Chang, Tzu-Hao, Bretaña, Neil Arvin, Lai, K Robert, Weng, Julia Tzu-Ya, Lee, Tzong-Yi, and Thirteenth Asia Pacific Bioinformatics Conference (APBC 2015) Hsinchu, Taiwan 21-23 January 2015
- Subjects
Ubiquitin-Protein Ligases ,substrate site specificity ,Plasma protein binding ,Computational biology ,Protein degradation ,Biology ,ubiquitination ,Biochemistry ,Cross-validation ,Substrate Specificity ,Mice ,Ubiquitin ,Structural Biology ,Animals ,Humans ,Position-Specific Scoring Matrices ,Molecular Biology ,Applied Mathematics ,Ubiquitination ,Computational Biology ,Ubiquitin ligase ,profile hidden Markov model ,Computer Science Applications ,Sequence logo ,Proceedings ,maximal dependence decomposition ,biology.protein ,ubiquitin conjugation ,DNA microarray ,Protein Binding - Abstract
Background: In eukaryotes, ubiquitin-conjugation is an important mechanism underlying proteasome-mediated degradation of proteins, and as such, plays an essential role in the regulation of many cellular processes. In the ubiquitin-proteasome pathway, E3 ligases play important roles by recognizing a specific protein substrate and catalyzing the attachment of ubiquitin to a lysine (K) residue. As more and more experimental data on ubiquitin conjugation sites become available, it becomes possible to develop prediction models that can be scaled to big data. However, no development that focuses on the investigation of ubiquitinated substrate specificities has existed. Herein, we present an approach that exploits an iteratively statistical method to identify ubiquitin conjugation sites with substrate site specificities. Results: In this investigation, totally 6259 experimentally validated ubiquitinated proteins were obtained from dbPTM. After having filtered out homologous fragments with 40% sequence identity, the training data set contained 2658 ubiquitination sites (positive data) and 5532 non-ubiquitinated sites (negative data). Due to the difficulty in characterizing the substrate site specificities of E3 ligases by conventional sequence logo analysis, a recursively statistical method has been applied to obtain significant conserved motifs. The profile hidden Markov model (profile HMM) was adopted to construct the predictive models learned from the identified substrate motifs. A five-fold cross validation was then used to evaluate the predictive model, achieving sensitivity, specificity, and accuracy of 73.07%, 65.46%, and 67.93%, respectively. Additionally, an independent testing set, completely blind to the training data of the predictive model, was used to demonstrate that the proposed method could provide a promising accuracy (76.13%) and outperform other ubiquitination site prediction tool. Conclusion: A case study demonstrated the effectiveness of the characterized substrate motifs for identifying ubiquitination sites. The proposed method presents a practical means of preliminary analysis and greatly diminishes the total number of potential targets required for further experimental confirmation. This method may help unravel their mechanisms and roles in E3 recognition and ubiquitin-mediated protein degradation. Refereed/Peer-reviewed
- Full Text
- View/download PDF
659. MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue-residue contacts
- Author
-
Jianlin Cheng and Xin Deng
- Subjects
Structural alignment ,Sequence alignment ,Computational biology ,Biology ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,Protein Structure, Secondary ,03 medical and health sciences ,Structural Biology ,Sequence Analysis, Protein ,Protein function prediction ,Nucleotide Motifs ,lcsh:QH301-705.5 ,Molecular Biology ,Alignment-free sequence analysis ,Phylogeny ,030304 developmental biology ,Probability ,Genetics ,0303 health sciences ,Multiple sequence alignment ,Applied Mathematics ,030302 biochemistry & molecular biology ,Proteins ,Protein structure prediction ,Computer Science Applications ,Sequence logo ,MUSCLE ,lcsh:Biology (General) ,lcsh:R858-859.7 ,Sequence Alignment ,Algorithms ,Software ,Research Article - Abstract
Background Multiple Sequence Alignment (MSA) is a basic tool for bioinformatics research and analysis. It has been used essentially in almost all bioinformatics tasks such as protein structure modeling, gene and protein function prediction, DNA motif recognition, and phylogenetic analysis. Therefore, improving the accuracy of multiple sequence alignment is important for advancing many bioinformatics fields. Results We designed and developed a new method, MSACompro, to synergistically incorporate predicted secondary structure, relative solvent accessibility, and residue-residue contact information into the currently most accurate posterior probability-based MSA methods to improve the accuracy of multiple sequence alignments. The method is different from the multiple sequence alignment methods (e.g. 3D-Coffee) that use the tertiary structure information of some sequences since the structural information of our method is fully predicted from sequences. To the best of our knowledge, applying predicted relative solvent accessibility and contact map to multiple sequence alignment is novel. The rigorous benchmarking of our method to the standard benchmarks (i.e. BAliBASE, SABmark and OXBENCH) clearly demonstrated that incorporating predicted protein structural information improves the multiple sequence alignment accuracy over the leading multiple protein sequence alignment tools without using this information, such as MSAProbs, ProbCons, Probalign, T-coffee, MAFFT and MUSCLE. And the performance of the method is comparable to the state-of-the-art method PROMALS of using structural features and additional homologous sequences by slightly lower scores. Conclusion MSACompro is an efficient and reliable multiple protein sequence alignment tool that can effectively incorporate predicted protein structural information into multiple sequence alignment. The software is available at http://sysbio.rnet.missouri.edu/multicom_toolbox/.
- Full Text
- View/download PDF
660. Not all transmembrane helices are born equal: Towards the extension of the sequence homology concept to membrane proteins
- Author
-
Sebastian Maurer-Stroh, Wing-Cheong Wong, and Frank Eisenhaber
- Subjects
Immunology ,Sequence alignment ,Computational biology ,Protein Sorting Signals ,Biology ,Protein Structure, Secondary ,General Biochemistry, Genetics and Molecular Biology ,Homology modeling ,Loop modeling ,Databases, Protein ,lcsh:QH301-705.5 ,Peptide sequence ,Ecology, Evolution, Behavior and Systematics ,Genetics ,Membranes ,Sequence Homology, Amino Acid ,Agricultural and Biological Sciences(all) ,Biochemistry, Genetics and Molecular Biology(all) ,Research ,Applied Mathematics ,Computational Biology ,Membrane Proteins ,Protein structure prediction ,Protein Structure, Tertiary ,Transmembrane domain ,Sequence logo ,lcsh:Biology (General) ,Modeling and Simulation ,Threading (protein sequence) ,General Agricultural and Biological Sciences ,Hydrophobic and Hydrophilic Interactions ,Sequence Alignment ,Software - Abstract
Background Sequence homology considerations widely used to transfer functional annotation to uncharacterized protein sequences require special precautions in the case of non-globular sequence segments including membrane-spanning stretches composed of non-polar residues. Simple, quantitative criteria are desirable for identifying transmembrane helices (TMs) that must be included into or should be excluded from start sequence segments in similarity searches aimed at finding distant homologues. Results We found that there are two types of TMs in membrane-associated proteins. On the one hand, there are so-called simple TMs with elevated hydrophobicity, low sequence complexity and extraordinary enrichment in long aliphatic residues. They merely serve as membrane-anchoring device. In contrast, so-called complex TMs have lower hydrophobicity, higher sequence complexity and some functional residues. These TMs have additional roles besides membrane anchoring such as intra-membrane complex formation, ligand binding or a catalytic role. Simple and complex TMs can occur both in single- and multi-membrane-spanning proteins essentially in any type of topology. Whereas simple TMs have the potential to confuse searches for sequence homologues and to generate unrelated hits with seemingly convincing statistical significance, complex TMs contain essential evolutionary information. Conclusion For extending the homology concept onto membrane proteins, we provide a necessary quantitative criterion to distinguish simple TMs (and a sufficient criterion for complex TMs) in query sequences prior to their usage in homology searches based on assessment of hydrophobicity and sequence complexity of the TM sequence segments. Reviewers This article was reviewed by Shamil Sunyaev, L. Aravind and Arcady Mushegian.
- Full Text
- View/download PDF
661. Sequence Diversity Diagram for comparative analysis of multiple sequence alignments
- Author
-
Ryo Sakai and Jan Aerts
- Subjects
Multiple sequence alignment ,business.industry ,Computer science ,Research ,Diagram ,Representation (systemics) ,Pattern recognition ,General Medicine ,computer.software_genre ,General Biochemistry, Genetics and Molecular Biology ,Domain (software engineering) ,Set (abstract data type) ,Sequence logo ,ComputingMethodologies_PATTERNRECOGNITION ,Position (vector) ,Artificial intelligence ,Data mining ,business ,computer ,Sequence (medicine) - Abstract
Background The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Methods Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. Results The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
- Full Text
- View/download PDF
662. Computational evaluation of TIS annotation for prokaryotic genomes
- Author
-
Lining Ju, Gangqing Hu, Huaiqiu Zhu, Zhen-Su She, and Xiaobin Zheng
- Subjects
Molecular Sequence Data ,Codon, Initiator ,Biology ,computer.software_genre ,lcsh:Computer applications to medicine. Medical informatics ,Sensitivity and Specificity ,Biochemistry ,Set (abstract data type) ,Annotation ,Structural Biology ,Databases, Genetic ,RefSeq ,Linear combination ,lcsh:QH301-705.5 ,Molecular Biology ,Genetics ,Base Sequence ,Methodology Article ,Applied Mathematics ,Chromosome Mapping ,Reproducibility of Results ,Estimator ,Sequence Analysis, DNA ,Weighting ,Computer Science Applications ,Sequence logo ,Variable (computer science) ,Prokaryotic Cells ,lcsh:Biology (General) ,lcsh:R858-859.7 ,Data mining ,Sequence Alignment ,computer ,Algorithms - Abstract
Background Accurate annotation of translation initiation sites (TISs) is essential for understanding the translation initiation mechanism. However, the reliability of TIS annotation in widely used databases such as RefSeq is uncertain due to the lack of experimental benchmarks. Results Based on a homogeneity assumption that gene translation-related signals are uniformly distributed across a genome, we have established a computational method for a large-scale quantitative assessment of the reliability of TIS annotations for any prokaryotic genome. The method consists of modeling a positional weight matrix (PWM) of aligned sequences around predicted TISs in terms of a linear combination of three elementary PWMs, one for true TIS and the two others for false TISs. The three elementary PWMs are obtained using a reference set with highly reliable TIS predictions. A generalized least square estimator determines the weighting of the true TIS in the observed PWM, from which the accuracy of the prediction is derived. The validity of the method and the extent of the limitation of the assumptions are explicitly addressed by testing on experimentally verified TISs with variable accuracy of the reference sets. The method is applied to estimate the accuracy of TIS annotations that are provided on public databases such as RefSeq and ProTISA and by programs such as EasyGene, GeneMarkS, Glimmer 3 and TiCo. It is shown that RefSeq's TIS prediction is significantly less accurate than two recent predictors, Tico and ProTISA. With convincing proofs, we show two general preferential biases in the RefSeq annotation, i.e. over-annotating the longest open reading frame (LORF) and under-annotating ATG start codon. Finally, we have established a new TIS database, SupTISA, based on the best prediction of all the predictors; SupTISA has achieved an average accuracy of 92% over all 532 complete genomes. Conclusion Large-scale computational evaluation of TIS annotation has been achieved. A new TIS database much better than RefSeq has been constructed, and it provides a valuable resource for further TIS studies.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.