19 results on '"Deng EZ"'
Search Results
2. RummaGEO: Automatic mining of human and mouse gene sets from GEO.
- Author
-
Marino GB, Clarke DJB, Lachmann A, Deng EZ, and Ma'ayan A
- Abstract
The Gene Expression Omnibus (GEO) has millions of samples from thousands of studies. While users of GEO can search the metadata describing studies, there is a need for methods to search GEO at the data level. RummaGEO is a gene expression signature search engine for human and mouse RNA sequencing perturbation studies extracted from GEO. To develop RummaGEO, we automatically identified groups of samples and computed differential expressions to extract gene sets from each study. The contents of RummaGEO are served for gene set, PubMed, and metadata search. Next, we analyzed the contents of RummaGEO to identify patterns and perform global analyses. Overall, RummaGEO provides a resource that is enabling users to identify relevant GEO studies based on their own gene expression results. Users of RummaGEO can incorporate RummaGEO into their analysis workflows for integrative analyses and hypothesis generation., Competing Interests: The authors declare no competing interests., (© 2024 The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
3. Multiomics2Targets identifies targets from cancer cohorts profiled with transcriptomics, proteomics, and phosphoproteomics.
- Author
-
Deng EZ, Marino GB, Clarke DJB, Diamant I, Resnick AC, Ma W, Wang P, and Ma'ayan A
- Subjects
- Humans, Cohort Studies, Gene Expression Profiling methods, Software, Computational Biology methods, Proteomics methods, Neoplasms genetics, Neoplasms metabolism, Transcriptome, Phosphoproteins metabolism, Phosphoproteins genetics
- Abstract
The availability of data from profiling of cancer patients with multiomics is rapidly increasing. However, integrative analysis of such data for personalized target identification is not trivial. Multiomics2Targets is a platform that enables users to upload transcriptomics, proteomics, and phosphoproteomics data matrices collected from the same cohort of cancer patients. After uploading the data, Multiomics2Targets produces a report that resembles a research publication. The uploaded matrices are processed, analyzed, and visualized using the tools Enrichr, KEA3, ChEA3, Expression2Kinases, and TargetRanger to identify and prioritize proteins, genes, and transcripts as potential targets. Figures and tables, as well as descriptions of the methods and results, are automatically generated. Reports include an abstract, introduction, methods, results, discussion, conclusions, and references and are exportable as citable PDFs and Jupyter Notebooks. Multiomics2Targets is applied to analyze version 3 of the Clinical Proteomic Tumor Analysis Consortium (CPTAC3) pan-cancer cohort, identifying potential targets for each CPTAC3 cancer subtype. Multiomics2Targets is available from https://multiomics2targets.maayanlab.cloud/., Competing Interests: Declaration of interests The authors declare no competing interests., (Copyright © 2024 The Author(s). Published by Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
4. Rummagene: massive mining of gene sets from supporting materials of biomedical research publications.
- Author
-
Clarke DJB, Marino GB, Deng EZ, Xie Z, Evangelista JE, and Ma'ayan A
- Subjects
- Animals, Software, Databases, Factual, Gene Expression Regulation, Mammals, Data Mining, Biomedical Research
- Abstract
Many biomedical research publications contain gene sets in their supporting tables, and these sets are currently not available for search and reuse. By crawling PubMed Central, the Rummagene server provides access to hundreds of thousands of such mammalian gene sets. So far, we scanned 5,448,589 articles to find 121,237 articles that contain 642,389 gene sets. These sets are served for enrichment analysis, free text, and table title search. Investigating statistical patterns within the Rummagene database, we demonstrate that Rummagene can be used for transcription factor and kinase enrichment analyses, and for gene function predictions. By combining gene set similarity with abstract similarity, Rummagene can find surprising relationships between biological processes, concepts, and named entities. Overall, Rummagene brings to surface the ability to search a massive collection of published biomedical datasets that are currently buried and inaccessible. The Rummagene web application is available at https://rummagene.com ., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
5. RummaGEO: Automatic Mining of Human and Mouse Gene Sets from GEO.
- Author
-
Marino GB, Clarke DJB, Deng EZ, and Ma'ayan A
- Abstract
The Gene Expression Omnibus (GEO) is a major open biomedical research repository for transcriptomics and other omics datasets. It currently contains millions of gene expression samples from tens of thousands of studies collected by many biomedical research laboratories from around the world. While users of the GEO repository can search the metadata describing studies for locating relevant datasets, there are currently no methods or resources that facilitate global search of GEO at the data level. To address this shortcoming, we developed RummaGEO, a webserver application that enables gene expression signature search of a large collection of human and mouse RNA-seq studies deposited into GEO. To develop the search engine, we performed offline automatic identification of sample conditions from the uniformly aligned GEO studies available from ARCHS4. We then computed differential expression signatures to extract gene sets from these studies. In total, RummaGEO currently contains 135,264 human and 158,062 mouse gene sets extracted from 23,395 GEO studies. Next, we analyzed the contents of the RummaGEO database to identify statistical patterns and perform various global analyses. The contents of the RummaGEO database are provided as a web-server search engine with signature search, PubMed search, and metadata search functionalities. Overall, RummaGEO provides an unprecedented resource for the biomedical research community enabling hypothesis generation for many future studies. The RummaGEO search engine is available from: https://rummageo.com/., Competing Interests: Competing Interests The authors declare that they do not have any competing interests.
- Published
- 2024
- Full Text
- View/download PDF
6. Pan-cancer proteogenomics characterization of tumor immunity.
- Author
-
Petralia F, Ma W, Yaron TM, Caruso FP, Tignor N, Wang JM, Charytonowicz D, Johnson JL, Huntsman EM, Marino GB, Calinawan A, Evangelista JE, Selvan ME, Chowdhury S, Rykunov D, Krek A, Song X, Turhan B, Christianson KE, Lewis DA, Deng EZ, Clarke DJB, Whiteaker JR, Kennedy JJ, Zhao L, Segura RL, Batra H, Raso MG, Parra ER, Soundararajan R, Tang X, Li Y, Yi X, Satpathy S, Wang Y, Wiznerowicz M, González-Robles TJ, Iavarone A, Gosline SJC, Reva B, Robles AI, Nesvizhskii AI, Mani DR, Gillette MA, Klein RJ, Cieslik M, Zhang B, Paulovich AG, Sebra R, Gümüş ZH, Hostetter G, Fenyö D, Omenn GS, Cantley LC, Ma'ayan A, Lazar AJ, Ceccarelli M, and Wang P
- Subjects
- Humans, Combined Modality Therapy, Genomics, Proteomics, Tumor Escape, Neoplasms genetics, Neoplasms immunology, Neoplasms therapy, Proteogenomics
- Abstract
Despite the successes of immunotherapy in cancer treatment over recent decades, less than <10%-20% cancer cases have demonstrated durable responses from immune checkpoint blockade. To enhance the efficacy of immunotherapies, combination therapies suppressing multiple immune evasion mechanisms are increasingly contemplated. To better understand immune cell surveillance and diverse immune evasion responses in tumor tissues, we comprehensively characterized the immune landscape of more than 1,000 tumors across ten different cancers using CPTAC pan-cancer proteogenomic data. We identified seven distinct immune subtypes based on integrative learning of cell type compositions and pathway activities. We then thoroughly categorized unique genomic, epigenetic, transcriptomic, and proteomic changes associated with each subtype. Further leveraging the deep phosphoproteomic data, we studied kinase activities in different immune subtypes, which revealed potential subtype-specific therapeutic targets. Insights from this work will facilitate the development of future immunotherapy strategies and enhance precision targeting with existing agents., Competing Interests: Declaration of interests R. Sebra is currently a paid consultant and equity holder at GeneDx. L.C.C. is a founder and member of the board of directors of Agios Pharmaceuticals; is a founder and receives research support from Petra Pharmaceuticals; has equity in and consults for Cell Signaling Technologies, Volastra, Larkspur, and 1 Base Pharmaceuticals; and consults for Loxo-Lilly. J.L.J. has received consulting fees from Scorpion Therapeutics and Volastra Therapeutics. T.M.Y. is a co-founder and stockholder of DeStroke., (Copyright © 2024 The Author(s). Published by Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
7. Engineering a Z-Scheme Heterostructure on ZnIn 2 S 4 @NH 2 -MIL-125 Composites for Boosting the Photocatalytic Performance.
- Author
-
Deng EZ, Fan YZ, Wang HP, Li Y, Peng C, and Liu J
- Abstract
Constructing a Z-scheme heterostructure on a metal-organic framework (MOF) composite with an explicit charge transfer mechanism at the interface is considered to be an effective strategy for improving the photocatalytic performance of MOFs. Herein, an internal electric field (IEF)-induced Z-scheme heterostructure on the ZnIn
2 S4 @NH2 -MIL-125 composite is designed and fabricated by a facile electrostatic self-assembly process. Systematic investigations reveal that close interfacial contact and difference in work function between NH2 -MIL-125 and ZnIn2 S4 enable the formation of the IEF, which drives the Z-scheme charge transfer as revealed by the in situ irradiated X-ray photoelectron spectroscopy (ISI-XPS), photoirradiated Kelvin probe force microscope (KPFM) measurement, electron paramagnetic resonance (EPR) radical trapping experiment, as well as density functional theory (DFT) calculation; meanwhile, directions of the interfacial IEFs are determined. Benefiting from the unique merit of IEF-induced Z-scheme charge transfer, the optimized ZnIn2 S4 @NH2 -MIL-125 composite exhibits significantly enhanced photocatalytic activity for the photoreduction of 4-nitroaniline (4-NA) to p -phenylenediamine (PPD) under visible light irradiation. This work not only provides in-depth insights for charge transfer in the IEF-induced Z scheme heterostructure but also affords useful inspirations on designing the Z-scheme MOF composite to boost the photocatalytic performance.- Published
- 2024
- Full Text
- View/download PDF
8. GeneRanger and TargetRanger: processed gene and protein expression levels across cells and tissues for target discovery.
- Author
-
Marino GB, Ngai M, Clarke DJB, Fleishman RH, Deng EZ, Xie Z, Ahmed N, and Ma'ayan A
- Subjects
- Humans, Cell Line, RNA-Seq, Internet, Proteomics, Pseudogenes, Software
- Abstract
Several atlasing efforts aim to profile human gene and protein expression across tissues, cell types and cell lines in normal physiology, development and disease. One utility of these resources is to examine the expression of a single gene across all cell types, tissues and cell lines in each atlas. However, there is currently no centralized place that integrates data from several atlases to provide this type of data in a uniform format for visualization, analysis and download, and via an application programming interface. To address this need, GeneRanger is a web server that provides access to processed data about gene and protein expression across normal human cell types, tissues and cell lines from several atlases. At the same time, TargetRanger is a related web server that takes as input RNA-seq data from profiled human cells and tissues, and then compares the uploaded input data to expression levels across the atlases to identify genes that are highly expressed in the input and lowly expressed across normal human cell types and tissues. Identified targets can be filtered by transmembrane or secreted proteins. The results from GeneRanger and TargetRanger are visualized as box and scatter plots, and as interactive tables. GeneRanger and TargetRanger are available from https://generanger.maayanlab.cloud and https://targetranger.maayanlab.cloud, respectively., (© The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2023
- Full Text
- View/download PDF
9. Computational screen to identify potential targets for immunotherapeutic identification and removal of senescence cells.
- Author
-
Deng EZ, Fleishman RH, Xie Z, Marino GB, Clarke DJB, and Ma'ayan A
- Subjects
- Humans, Aged, Gene Expression Profiling, Cell Line, Immunotherapy, Cellular Senescence genetics, Aging genetics
- Abstract
To prioritize gene and protein candidates that may enable the selective identification and removal of senescent cells, we compared gene expression signatures from replicative senescent cells to transcriptomics and proteomics atlases of normal human tissues and cell types. RNA-seq samples from in vitro senescent cells (6 studies, 13 conditions) were analyzed for identifying targets at the gene and transcript levels that are highly expressed in senescent cells compared to their expression in normal human tissues and cell types. A gene set made of 301 genes called SenoRanger was established based on consensus analysis across studies and backgrounds. Of the identified senescence-associated targets, 29% of the genes in SenoRanger are also highly differentially expressed in aged tissues from GTEx. The SenoRanger gene set includes previously known as well as novel senescence-associated genes. Pathway analysis that connected the SenoRanger genes to their functional annotations confirms their potential role in several aging and senescence-related processes. Overall, SenoRanger provides solid hypotheses about potentially useful targets for identifying and removing senescence cells., (© 2023 The Authors. Aging Cell published by Anatomical Society and John Wiley & Sons Ltd.)
- Published
- 2023
- Full Text
- View/download PDF
10. Replication of neural responses to monetary incentives and exploration of reward-influenced network connectivity in fibromyalgia.
- Author
-
Park SH, Deng EZ, Baker AK, MacNiven KH, Knutson B, and Martucci KT
- Abstract
Neuroimaging research has begun to implicate alterations of brain reward systems in chronic pain. Previously, using functional magnetic resonance imaging (fMRI) and a monetary incentive delay (MID) task, Martucci et al. (2018) showed that neural responses to reward anticipation and outcome are altered in fibromyalgia. In the present study, we aimed to test the replicability of these altered neural responses to reward in a separate fibromyalgia cohort. In addition, the present study was conducted at a distinct U.S. location but involved a similar study design. For the present study, 20 patients with fibromyalgia and 20 healthy controls participated in MID task fMRI scan procedures and completed clinical/psychological questionnaires. fMRI analyses comparing patient and control groups revealed a consistent trend of main results which were largely similar to the prior reported results. Specifically, in the replication fibromyalgia cohort, medial prefrontal cortex (MPFC) response was reduced during gain anticipation and was increased during no-loss (non-punishment) outcome compared to controls. Also consistent with previous findings, the nucleus accumbens response to gain anticipation did not differ in patients vs. controls. Further, results from similarly-designed behavioral, correlational, and exploratory analyses were complementary to previous findings. Finally, a novel network-based functional connectivity analysis of the MID task fMRI data across patients vs. controls implied enhanced connectivity within the default mode network in participants with fibromyalgia. Together, based on replicating prior univariate results and new network-based functional connectivity analyses of MID task fMRI data, we provide further evidence of altered brain reward responses, particularly in the MPFC response to reward outcomes, in patients with fibromyalgia., Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
- Published
- 2022
- Full Text
- View/download PDF
11. Evaluation by Survival Analysis of Cold Pain Tolerance in Patients with Fibromyalgia and Opioid Use.
- Author
-
Deng EZ, Weikel DP, and Martucci KT
- Abstract
Purpose: The cold pressor test (CPT) is a clinical pain research method used to measure cold pain tolerance. During this test, participants immerse an extremity (ie, hand or foot) into cold water for as long as tolerable. The duration of the test (traditionally up to an experimentally imposed cut-off at 2 minutes) indicates the amount of cold pain tolerance by the participant. Prior research studies have investigated cold pain tolerance in patients with chronic pain. However, few of these studies have used survival analysis, which allows for proper handling of data censoring and is therefore, an optimal statistical method for CPT data analysis. The goal of the present study was to use survival analysis to evaluate cold pain tolerance in patients with fibromyalgia. Furthermore, we aimed to model relationships between psychological and clinical variables as well as opioid medication use and cold pain tolerance., Patients and Methods: A total of 85 patients with fibromyalgia (42 who were taking opioids) and 47 healthy pain-free controls provided CPT and questionnaire data (collected across 2 study sites) for a case-control study. We used survival analysis using Cox regression to evaluate group differences (patients vs controls) in cold pain tolerance and to evaluate cold pain tolerance relationships with psychological, clinical, and medication use., Results: As compared to healthy controls, patients with fibromyalgia exhibited significantly lower CPT survival (HR = 2.17, 95% CI: [1.42, 3.31], p = 0.00035). As indicated by Cox regression models, the significant group difference in CPT survival did not relate to our selected psychological and clinical measures (p > 0.05). The groups of non-opioid-taking patients and healthy controls showed consistent CPT survival across study sites. However, patients taking opioid pain medications showed differences in CPT survival across study sites., Conclusion: By using survival analysis, an optimal method for time-to-event pain measures such as the CPT, we confirmed previously identified reductions in cold pain tolerance in patients with fibromyalgia. While our selected psychological and clinical measures were not significantly associated with cold pain tolerance, our data suggest that opioid medication use may impart greater cold pain tolerance in some patients., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships, other than the described funding sources, that could be construed as a potential conflict of interest., (© 2022 Deng et al.)
- Published
- 2022
- Full Text
- View/download PDF
12. Pro54DB: a database for experimentally verified sigma-54 promoters.
- Author
-
Liang ZY, Lai HY, Yang H, Zhang CJ, Yang H, Wei HH, Chen XX, Zhao YW, Su ZD, Li WC, Deng EZ, Tang H, Chen W, and Lin H
- Subjects
- Bacteria genetics, Databases, Genetic, Promoter Regions, Genetic, RNA Polymerase Sigma 54 metabolism
- Abstract
Summary: In prokaryotes, the σ54 promoters are unique regulatory elements and have attracted much attention because they are in charge of the transcription of carbon and nitrogen-related genes and participate in numerous ancillary processes and environmental responses. All findings on σ54 promoters are favorable for a better understanding of their regulatory mechanisms in gene transcription and an accurate discovery of genes missed by the wet experimental evidences. In order to provide an up-to-date, interactive and extensible database for σ54 promoter, a free and easy accessed database called Pro54DB (σ54 promoter database) was built to collect information of σ54 promoter. In the current version, it has stored 210 experimental-confirmed σ54 promoters with 297 regulated genes in 43 species manually extracted from 133 publications, which is helpful for researchers in fields of bioinformatics and molecular biology., Availability and Implementation: Pro54DB is freely available on the web at http://lin.uestc.edu.cn/database/pro54db with all major browsers supported., Contacts: greatchen@ncst.edu.cn or hlin@uestc.edu.cn
- Published
- 2017
- Full Text
- View/download PDF
13. Predicting the subcellular localization of mycobacterial proteins by incorporating the optimal tripeptides into the general form of pseudo amino acid composition.
- Author
-
Zhu PP, Li WC, Zhong ZJ, Deng EZ, Ding H, Chen W, and Lin H
- Subjects
- Databases, Protein, Protein Transport, Subcellular Fractions metabolism, Amino Acids metabolism, Bacterial Proteins metabolism, Mycobacterium tuberculosis metabolism, Peptides metabolism
- Abstract
Mycobacterium tuberculosis is a bacterium that causes tuberculosis, one of the most prevalent infectious diseases. Predicting the subcellular localization of mycobacterial proteins in this bacterium may provide vital clues for the prediction of protein function as well as for drug discovery and design. Therefore, a computational method that can predict the subcellular localization of mycobacterial proteins with high precision is highly desirable. We propose a computational method to predict the subcellular localization of mycobacterial proteins. An objective and strict benchmark dataset was constructed after collecting 272 non-redundant proteins from the universal protein resource (the UniProt database). Subsequently, a novel feature selection strategy based on binomial distribution was used to optimize the feature vector. Finally, a subset containing 219 chosen tripeptide features was imported into a support vector machine-based method to estimate the performance of the dataset in accurately and sensitively identifying these proteins. We found that the proposed method gave a maximum overall accuracy of 89.71% with an average accuracy of 81.12% in the jackknife cross-validation. The results indicate that our prediction method gave an efficient and powerful performance when compared with other published methods. We made the proposed method available on a purpose built Web server called MycoSub that is freely accessible at . We anticipate that MycoSub will become a useful tool for studying the functions of mycobacterial proteins and for designing and developing anti-mycobacterium drugs.
- Published
- 2015
- Full Text
- View/download PDF
14. iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition.
- Author
-
Lin H, Deng EZ, Ding H, Chen W, and Chou KC
- Subjects
- Genome, Bacterial, Nucleotides chemistry, Peptide Chain Initiation, Translational, Transcription Initiation Site, Promoter Regions, Genetic, RNA Polymerase Sigma 54 metabolism, Sequence Analysis, DNA methods, Software
- Abstract
The σ(54) promoters are unique in prokaryotic genome and responsible for transcripting carbon and nitrogen-related genes. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapidly and effectively identifying the σ(54) promoters. Here, a predictor called 'iPro54-PseKNC' was developed. In the predictor, the samples of DNA sequences were formulated by a novel feature vector called 'pseudo k-tuple nucleotide composition', which was further optimized by the incremental feature selection procedure. The performance of iPro54-PseKNC was examined by the rigorous jackknife cross-validation tests on a stringent benchmark data set. As a user-friendly web-server, iPro54-PseKNC is freely accessible at http://lin.uestc.edu.cn/server/iPro54-PseKNC. For the convenience of the vast majority of experimental scientists, a step-by-step protocol guide was provided on how to use the web-server to get the desired results without the need to follow the complicated mathematics that were presented in this paper just for its integrity. Meanwhile, we also discovered through an in-depth statistical analysis that the distribution of distances between the transcription start sites and the translation initiation sites were governed by the gamma distribution, which may provide a fundamental physical principle for studying the σ(54) promoters., (© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2014
- Full Text
- View/download PDF
15. Sequence analysis of origins of replication in the Saccharomyces cerevisiae genomes.
- Author
-
Li WC, Zhong ZJ, Zhu PP, Deng EZ, Ding H, Chen W, and Lin H
- Abstract
DNA replication is a highly precise process that is initiated from origins of replication (ORIs) and is regulated by a set of regulatory proteins. The mining of DNA sequence information will be not only beneficial for understanding the regulatory mechanism of replication initiation but also for accurately identifying ORIs. In this study, the GC profile and GC skew were calculated to analyze the compositional bias in the Saccharomyces cerevisiae genome. We found that the GC profile in the region of ORIs is significantly lower than that in the flanking regions. By calculating the information redundancy, an estimation of the correlation of nucleotides, we found that the intensity of adjoining correlation in ORIs is dramatically higher than that in flanking regions. Furthermore, the relationships between ORIs and nucleosomes as well as transcription start sites were investigated. Results showed that ORIs are usually not occupied by nucleosomes. Finally, we calculated the distribution of ORIs in yeast chromosomes and found that most ORIs are in transcription terminal regions. We hope that these results will contribute to the identification of ORIs and the study of DNA replication mechanisms.
- Published
- 2014
- Full Text
- View/download PDF
16. iTIS-PseTNC: a sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition.
- Author
-
Chen W, Feng PM, Deng EZ, Lin H, and Chou KC
- Subjects
- Base Sequence, Genome, Human genetics, Humans, Internet, Support Vector Machine, User-Computer Interface, Algorithms, Genomics methods, Oligonucleotides genetics, Peptide Chain Initiation, Translational
- Abstract
Translation is a key process for gene expression. Timely identification of the translation initiation site (TIS) is very important for conducting in-depth genome analysis. With the avalanche of genome sequences generated in the postgenomic age, it is highly desirable to develop automated methods for rapidly and effectively identifying TIS. Although some computational methods were proposed in this regard, none of them considered the global or long-range sequence-order effects of DNA, and hence their prediction quality was limited. To count this kind of effects, a new predictor, called "iTIS-PseTNC," was developed by incorporating the physicochemical properties into the pseudo trinucleotide composition, quite similar to the PseAAC (pseudo amino acid composition) approach widely used in computational proteomics. It was observed by the rigorous cross-validation test on the benchmark dataset that the overall success rate achieved by the new predictor in identifying TIS locations was over 97%. As a web server, iTIS-PseTNC is freely accessible at http://lin.uestc.edu.cn/server/iTIS-PseTNC. To maximize the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web server to obtain the desired results without the need to go through detailed mathematical equations, which are presented in this paper just for the integrity of the new prection method., (Copyright © 2014 Elsevier Inc. All rights reserved.)
- Published
- 2014
- Full Text
- View/download PDF
17. Identifying the subfamilies of voltage-gated potassium channels using feature selection technique.
- Author
-
Liu WX, Deng EZ, Chen W, and Lin H
- Subjects
- Databases, Protein, Internet, Oligopeptides chemistry, Oligopeptides metabolism, Support Vector Machine, User-Computer Interface, Algorithms, Computational Biology, Potassium Channels, Voltage-Gated analysis
- Abstract
Voltage-gated K+ channel (VKC) plays important roles in biology procession, especially in nervous system. Different subfamilies of VKCs have different biological functions. Thus, knowing VKCs' subfamilies has become a meaningful job because it can guide the direction for the disease diagnosis and drug design. However, the traditional wet-experimental methods were costly and time-consuming. It is highly desirable to develop an effective and powerful computational tool for identifying different subfamilies of VKCs. In this study, a predictor, called iVKC-OTC, has been developed by incorporating the optimized tripeptide composition (OTC) generated by feature selection technique into the general form of pseudo-amino acid composition to identify six subfamilies of VKCs. One of the remarkable advantages of introducing the optimized tripeptide composition is being able to avoid the notorious dimension disaster or over fitting problems in statistical predictions. It was observed on a benchmark dataset, by using a jackknife test, that the overall accuracy achieved by iVKC-OTC reaches to 96.77% in identifying the six subfamilies of VKCs, indicating that the new predictor is promising or at least may become a complementary tool to the existing methods in this area. It has not escaped our notice that the optimized tripeptide composition can also be used to investigate other protein classification problems.
- Published
- 2014
- Full Text
- View/download PDF
18. iNuc-PseKNC: a sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition.
- Author
-
Guo SH, Deng EZ, Xu LQ, Ding H, Lin H, Chen W, and Chou KC
- Subjects
- Animals, Caenorhabditis elegans genetics, DNA chemistry, Drosophila melanogaster genetics, Genome, Genomics methods, Humans, Nucleotides analysis, Software, Nucleosomes chemistry, Sequence Analysis, DNA methods
- Abstract
Motivation: Nucleosome positioning participates in many cellular activities and plays significant roles in regulating cellular processes. With the avalanche of genome sequences generated in the post-genomic age, it is highly desired to develop automated methods for rapidly and effectively identifying nucleosome positioning. Although some computational methods were proposed, most of them were species specific and neglected the intrinsic local structural properties that might play important roles in determining the nucleosome positioning on a DNA sequence., Results: Here a predictor called 'iNuc-PseKNC' was developed for predicting nucleosome positioning in Homo sapiens, Caenorhabditis elegans and Drosophila melanogaster genomes, respectively. In the new predictor, the samples of DNA sequences were formulated by a novel feature-vector called 'pseudo k-tuple nucleotide composition', into which six DNA local structural properties were incorporated. It was observed by the rigorous cross-validation tests on the three stringent benchmark datasets that the overall success rates achieved by iNuc-PseKNC in predicting the nucleosome positioning of the aforementioned three genomes were 86.27%, 86.90% and 79.97%, respectively. Meanwhile, the results obtained by iNuc-PseKNC on various benchmark datasets used by the previous investigators for different genomes also indicated that the current predictor remarkably outperformed its counterparts., Availability: A user-friendly web-server, iNuc-PseKNC is freely accessible at http://lin.uestc.edu.cn/server/iNuc-PseKNC., (© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.)
- Published
- 2014
- Full Text
- View/download PDF
19. iCTX-type: a sequence-based predictor for identifying the types of conotoxins in targeting ion channels.
- Author
-
Ding H, Deng EZ, Yuan LF, Liu L, Lin H, Chen W, and Chou KC
- Subjects
- Algorithms, Amino Acid Sequence, Calcium Channels chemistry, Calcium Channels drug effects, Conotoxins chemistry, Conotoxins classification, Humans, Neuropeptides chemistry, Neuropeptides classification, Peptides chemistry, Potassium Channels chemistry, Potassium Channels drug effects, Sodium Channels chemistry, Sodium Channels drug effects, Amino Acids chemistry, Conotoxins metabolism, Neuropeptides metabolism, Peptides metabolism
- Abstract
Conotoxins are small disulfide-rich neurotoxic peptides, which can bind to ion channels with very high specificity and modulate their activities. Over the last few decades, conotoxins have been the drug candidates for treating chronic pain, epilepsy, spasticity, and cardiovascular diseases. According to their functions and targets, conotoxins are generally categorized into three types: potassium-channel type, sodium-channel type, and calcium-channel types. With the avalanche of peptide sequences generated in the postgenomic age, it is urgent and challenging to develop an automated method for rapidly and accurately identifying the types of conotoxins based on their sequence information alone. To address this challenge, a new predictor, called iCTX-Type, was developed by incorporating the dipeptide occurrence frequencies of a conotoxin sequence into a 400-D (dimensional) general pseudoamino acid composition, followed by the feature optimization procedure to reduce the sample representation from 400-D to 50-D vector. The overall success rate achieved by iCTX-Type via a rigorous cross-validation was over 91%, outperforming its counterpart (RBF network). Besides, iCTX-Type is so far the only predictor in this area with its web-server available, and hence is particularly useful for most experimental scientists to get their desired results without the need to follow the complicated mathematics involved.
- Published
- 2014
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.