23 results on '"differential coexpression"'
Search Results
2. BioNetStat: A Tool for Biological Networks Differential Analysis
- Author
-
Vinícius Carvalho Jardim, Suzana de Siqueira Santos, Andre Fujita, and Marcos Silveira Buckeridge
- Subjects
differential network analysis ,coexpression network ,correlation network ,systems biology ,systems biology tool ,differential coexpression ,Genetics ,QH426-470 - Abstract
The study of interactions among biological components can be carried out by using methods grounded on network theory. Most of these methods focus on the comparison of two biological networks (e.g., control vs. disease). However, biological systems often present more than two biological states (e.g., tumor grades). To compare two or more networks simultaneously, we developed BioNetStat, a Bioconductor package with a user-friendly graphical interface. BioNetStat compares correlation networks based on the probability distribution of a feature of the graph (e.g., centrality measures). The analysis of the structural alterations on the network reveals significant modifications in the system. For example, the analysis of centrality measures provides information about how the relevance of the nodes changes among the biological states. We evaluated the performance of BioNetStat in both, toy models and two case studies. The latter related to gene expression of tumor cells and plant metabolism. Results based on simulated scenarios suggest that the statistical power of BioNetStat is less sensitive to the increase of the number of networks than Gene Set Coexpression Analysis (GSCA). Also, besides being able to identify nodes with modified centralities, BioNetStat identified altered networks associated with signaling pathways that were not identified by other methods.
- Published
- 2019
- Full Text
- View/download PDF
3. Differential coexpression networks in bronchiolitis and emphysema phenotypes reveal heterogeneous mechanisms of chronic obstructive pulmonary disease.
- Author
-
Qin, Jiangyue, Yang, Ting, Zeng, Ni, Wan, Chun, Gao, Lijuan, Li, Xiaoou, Chen, Lei, Shen, Yongchun, and Wen, Fuqiang
- Subjects
OBSTRUCTIVE lung diseases ,BCL genes ,B cell receptors ,IMMUNOGLOBULIN class switching ,GENE regulatory networks ,PHENOTYPES ,GENE expression - Abstract
Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease with multiple molecular mechanisms. To investigate and contrast the molecular processes differing between bronchiolitis and emphysema phenotypes of COPD, we downloaded the GSE69818 microarray data set from the Gene Expression Omnibus (GEO), which based on lung tissues from 38 patients with emphysema and 32 patients with bronchiolitis. Then, weighted gene coexpression network analysis (WGCNA) and differential coexpression (DiffCoEx) analysis were performed, followed by gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes enrichment analysis (KEGG) analysis. Modules and hub genes for bronchiolitis and emphysema were identified, and we found that genes in modules linked to neutrophil degranulation, Rho protein signal transduction and B cell receptor signalling were coexpressed in emphysema. DiffCoEx analysis showed that four hub genes (IFT88, CCDC103, MMP10 and Bik) were consistently expressed in emphysema patients; these hub genes were enriched, respectively, for functions of cilium assembly and movement, proteolysis and apoptotic mitochondrial changes. In our re‐analysis of GSE69818, gene expression networks in relation to emphysema deepen insights into the molecular mechanism of COPD and also identify some promising therapeutic targets. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
4. BioNetStat: A Tool for Biological Networks Differential Analysis.
- Author
-
Jardim, Vinícius Carvalho, Santos, Suzana de Siqueira, Fujita, Andre, and Buckeridge, Marcos Silveira
- Subjects
BIOLOGICAL networks ,BIOLOGICAL systems ,PLANT metabolism ,CELL metabolism ,GENE regulatory networks ,STATISTICAL power analysis - Abstract
The study of interactions among biological components can be carried out by using methods grounded on network theory. Most of these methods focus on the comparison of two biological networks (e.g., control vs. disease). However, biological systems often present more than two biological states (e.g., tumor grades). To compare two or more networks simultaneously, we developed BioNetStat , a Bioconductor package with a user-friendly graphical interface. BioNetStat compares correlation networks based on the probability distribution of a feature of the graph (e.g., centrality measures). The analysis of the structural alterations on the network reveals significant modifications in the system. For example, the analysis of centrality measures provides information about how the relevance of the nodes changes among the biological states. We evaluated the performance of BioNetStat in both, toy models and two case studies. The latter related to gene expression of tumor cells and plant metabolism. Results based on simulated scenarios suggest that the statistical power of BioNetStat is less sensitive to the increase of the number of networks than Gene Set Coexpression Analysis (GSCA). Also, besides being able to identify nodes with modified centralities, BioNetStat identified altered networks associated with signaling pathways that were not identified by other methods. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
5. Limitation of permutation-based differential correlation analysis.
- Author
-
Song H and Wu MC
- Subjects
- Humans, Statistics as Topic, Genomics
- Abstract
The comparison of biological systems, through the analysis of molecular changes under different conditions, has played a crucial role in the progress of modern biological science. Specifically, differential correlation analysis (DCA) has been employed to determine whether relationships between genomic features differ across conditions or outcomes. Because ascertaining the null distribution of test statistics to capture variations in correlation is challenging, several DCA methods utilize permutation which can loosen parametric (e.g., normality) assumptions. However, permutation is often problematic for DCA due to violating the assumption that samples are exchangeable under the null. Here, we examine the limitations of permutation-based DCA and investigate instances where the permutation-based DCA exhibits poor performance. Experimental results show that the permutation-based DCA often fails to control the type I error under the null hypothesis of equal correlation structures., (© 2023 Wiley Periodicals LLC.)
- Published
- 2023
- Full Text
- View/download PDF
6. Combining differential expression and differential coexpression analysis identifies optimal gene and gene set in cervical cancer.
- Author
-
Fang, Sheng-Quan, Gao, Min, Xiong, Shi-Lu, Chen, Hai-Yan, Hu, Shan-Shan, and Cai, Hong-Bing
- Subjects
- *
CERVICAL cancer patients , *CERVICAL cancer treatment , *MORTALITY , *PAPILLOMAVIRUS diseases , *MICROARRAY technology , *ALGORITHMS , *DATABASES , *GENES , *MOLECULAR structure , *BIOINFORMATICS , *GENE expression profiling , *STATISTICAL models ,CERVIX uteri tumors - Abstract
Objective: The objective of this study is to investigate the optimal gene and functional-related gene set in cervical cancer through combing the differential expression (DE) and differential coexpression (DC) analysis.Materials and Methods: To achieve this, we first measured expression data of cervical cancer by incorporating DE and DC effects utilizing absolute t-value in t-statistic and Z-test, respectively. Then, we selected the optimal threshold pair to determine both high DE and high DC (HDE_HDC) partition on the basis of Chi-square maximization, and the best threshold pair divided all genes into four parts, including HDE_HDC, high DE and low DC (HDE_LDC), low DE and high DC (LDE_HDC), and low DE and low DC (LDE_LDC). Using the known functional gene sets, functional relevance of partition genes was explored to determine the best-associated gene set based on the functional information (FI) conception.Results: Under the optimal threshold pair of 3.629 and 1.108 for DE and DC, respectively genes were divided into four partitions: HDE_HDC (311 genes), HDE_LDC (2072 genes), LDE_HDC (seventy genes), and LDE_LDC (1623 genes). Meanwhile, the gene set epidermis development was the best-associated gene set with the largest △G* = 10.496. Among the genes of epidermis development, zinc finger protein 135 (ZNF135) attained highest minimum FI gain of 41.226.Conclusion: The combination of DE and DC analysis showed higher mean FI relative to individual DE and DC analyses. We successfully exhibited the optimal gene set epidermis development and gene ZNF135, which might be crucial for the prevention and treatment of cervical cancer. [ABSTRACT FROM AUTHOR]- Published
- 2018
- Full Text
- View/download PDF
7. Identification of CD34-Positive Cells by Multiparameter Flow Cytometry
- Author
-
Sovalat, H., Serke, S., Wunder, Eckart W., editor, and Henon, Philippe R., editor
- Published
- 1993
- Full Text
- View/download PDF
8. DGCA: A comprehensive R package for Differential Gene Correlation Analysis.
- Author
-
McKenzie, Andrew T., Katsyv, Igor, Won-Min Song, Minghui Wang, and Bin Zhang
- Subjects
- *
GENETIC correlations , *CLUSTER analysis (Statistics) , *RNA sequencing , *BREAST cancer , *BIOLOGICAL systems - Abstract
Background: Dissecting the regulatory relationships between genes is a critical step towards building accurate predictive models of biological systems. A powerful approach towards this end is to systematically study the differences in correlation between gene pairs in more than one distinct condition. Results: In this study we develop an R package, DGCA (for Differential Gene Correlation Analysis), which offers a suite of tools for computing and analyzing differential correlations between gene pairs across multiple conditions. To minimize parametric assumptions, DGCA computes empirical p-values via permutation testing. To understand differential correlations at a systems level, DGCA performs higher-order analyses such as measuring the average difference in correlation and multiscale clustering analysis of differential correlation networks. Through a simulation study, we show that the straightforward z-score based method that DGCA employs significantly outperforms the existing alternative methods for calculating differential correlation. Application of DGCA to the TCGA RNA-seq data in breast cancer not only identifies key changes in the regulatory relationships between TP53 and PTEN and their target genes in the presence of inactivating mutations, but also reveals an immune-related differential correlation module that is specific to triple negative breast cancer (TNBC). Conclusions: DGCA is an R package for systematically assessing the difference in gene-gene regulatory relationships under different conditions. This user-friendly, effective, and comprehensive software tool will greatly facilitate the application of differential correlation analysis in many biological studies and thus will help identification of novel signaling pathways, biomarkers, and targets in complex biological systems and diseases. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
9. Gene set analysis using sufficient dimension reduction.
- Author
-
Huey-Miin Hsueh and Chen-An Tsai
- Subjects
- *
GENES , *PHENOTYPES , *GENE expression , *DIMENSION reduction (Statistics) , *ERROR rates - Abstract
Background: Gene set analysis (GSA) aims to evaluate the association between the expression of biological pathways, or a priori defined gene sets, and a particular phenotype. Numerous GSA methods have been proposed to assess the enrichment of sets of genes. However, most methods are developed with respect to a specific alternative scenario, such as a differential mean pattern or a differential coexpression. Moreover, a very limited number of methods can handle either binary, categorical, or continuous phenotypes. In this paper, we develop two novel GSA tests, called SDRs, based on the sufficient dimension reduction technique, which aims to capture sufficient information about the relationship between genes and the phenotype. The advantages of our proposed methods are that they allow for categorical and continuous phenotypes, and they are also able to identify a variety of enriched gene sets. Results: Through simulation studies, we compared the type I error and power of SDRs with existing GSA methods for binary, triple, and continuous phenotypes. We found that SDR methods adequately control the type I error rate at the pre-specified nominal level, and they have a satisfactory power to detect gene sets with differential coexpression and to test non-linear associations between gene sets and a continuous phenotype. In addition, the SDR methods were compared with seven widely-used GSA methods using two real microarray datasets for illustration. Conclusions: We concluded that the SDR methods outperform the others because of their flexibility with regard to handling different kinds of phenotypes and their power to detect a wide range of alternative scenarios. Our real data analysis highlights the differences between GSA methods for detecting enriched gene sets. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
10. Differential coexpression networks in bronchiolitis and emphysema phenotypes reveal heterogeneous mechanisms of chronic obstructive pulmonary disease
- Author
-
Lei Chen, Lijuan Gao, Ni Zeng, Fuqiang Wen, Chun Wan, Ting Yang, Xiaoou Li, Jiangyue Qin, and Yongchun Shen
- Subjects
0301 basic medicine ,phenotype ,Biology ,chronic obstructive pulmonary disease ,Pulmonary Disease, Chronic Obstructive ,03 medical and health sciences ,0302 clinical medicine ,Apoptotic mitochondrial changes ,Gene expression ,Cluster Analysis ,Humans ,Gene Regulatory Networks ,KEGG ,Gene ,Emphysema ,Microarray analysis techniques ,Gene Expression Profiling ,Molecular Sequence Annotation ,Original Articles ,Cell Biology ,Cilium assembly ,Phenotype ,respiratory tract diseases ,Gene Ontology ,030104 developmental biology ,Gene Expression Regulation ,030220 oncology & carcinogenesis ,Neutrophil degranulation ,Cancer research ,Bronchiolitis ,Molecular Medicine ,Original Article ,differential coexpression ,Signal Transduction - Abstract
Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease with multiple molecular mechanisms. To investigate and contrast the molecular processes differing between bronchiolitis and emphysema phenotypes of COPD, we downloaded the GSE69818 microarray data set from the Gene Expression Omnibus (GEO), which based on lung tissues from 38 patients with emphysema and 32 patients with bronchiolitis. Then, weighted gene coexpression network analysis (WGCNA) and differential coexpression (DiffCoEx) analysis were performed, followed by gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes enrichment analysis (KEGG) analysis. Modules and hub genes for bronchiolitis and emphysema were identified, and we found that genes in modules linked to neutrophil degranulation, Rho protein signal transduction and B cell receptor signalling were coexpressed in emphysema. DiffCoEx analysis showed that four hub genes (IFT88, CCDC103, MMP10 and Bik) were consistently expressed in emphysema patients; these hub genes were enriched, respectively, for functions of cilium assembly and movement, proteolysis and apoptotic mitochondrial changes. In our re‐analysis of GSE69818, gene expression networks in relation to emphysema deepen insights into the molecular mechanism of COPD and also identify some promising therapeutic targets.
- Published
- 2019
- Full Text
- View/download PDF
11. SUBSPACE DIFFERENTIAL COEXPRESSION ANALYSIS: PROBLEM DEFINITION AND A GENERAL APPROACH.
- Author
-
GANG FANG, RUI KUANG, PANDEY, GAURAV, STEINBACH, MICHAEL, MYERS, CHAD L., and KUMAR, VIPIN
- Subjects
GENE expression ,BIOLOGICAL networks ,DNA microarrays ,PERTURBATION theory ,HUMAN phenotype - Published
- 2009
12. Invariance and plasticity in the Drosophila melanogaster metabolomic network in response to temperature.
- Author
-
Hariharan, Ramkumar, Hoffman, Jessica M., Thomas, Ariel S., Soltow, Quinlyn A., Jones, Dean P., and Promislow, Daniel E. L.
- Subjects
- *
DROSOPHILA melanogaster , *MATERIAL plasticity , *TEMPERATURE , *THERMAL stresses , *GLOBAL temperature changes , *BIOLOGICAL networks , *PHYSIOLOGY - Abstract
Background Metabolomic responses to extreme thermal stress have recently been investigated in Drosophila melanogaster. However, a network level understanding of metabolomic responses to longer and less drastic temperature changes, which more closely reflect variation in natural ambient temperatures experienced during development and adulthood, is currently lacking. Here we use high-resolution, non-targeted metabolomics to dissect metabolomic changes in D. melanogaster elicited by moderately cool (18?C) or warm (27?C) developmental and adult temperature exposures. Results We find that temperature at which larvae are reared has a dramatic effect on metabolomic network structure measured in adults. Using network analysis, we are able to identify modules that are highly differentially expressed in response to changing developmental temperature, as well as modules whose correlation structure is strongly preserved across temperature. Conclusions Our results suggest that the effect of temperature on the metabolome provides an easily studied and powerful model for understanding the forces that influence invariance and plasticity in biological networks. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
13. No3CoGP: non-conserved and conserved coexpressed gene pairs.
- Author
-
Mal, Chittabrata, Aftabuddin, Md., and Kundu, Sudip
- Subjects
- *
DNA microarrays , *GENES , *GENETIC software , *COMPUTER software , *CENTRAL processing units - Abstract
Background Analyzing the microarray data of different conditions, one can identify the conserved and condition-specific genes and gene modules, and thus can infer the underlying cellular activities. All the available tools based on Bioconductor and R packages differ in how they extract differential coexpression and at what level they study. There is a need for a user-friendly, flexible tool which can start analysis using raw or preprocessed microarray data and can report different levels of useful information. Findings We present a GUI software, No3CoGP: Non-Conserved and Conserved Coexpressed Gene Pairs which takes Affymetrix microarray data (.CEL files or log2 normalized .txt files) along with annotation file (.csv file), Chip Definition File (CDF file) and probe file as inputs, utilizes the concept of network density cut-off and Fisher's z-test to extract biologically relevant information. It can identify four possible types of gene pairs based on their coexpression relationships. These are (i) gene pair showing coexpression in one condition but not in the other, (ii) gene pair which is positively coexpressed in one condition but negatively coexpressed in the other condition, (iii) positively and (iv) negatively coexpressed in both the conditions. Further, it can generate modules of coexpressed genes. Conclusion Easy-to-use GUI interface enables researchers without knowledge in R language to use No3CoGP. Utilization of one or more CPU cores, depending on the availability, speeds up the program. The output files stored in the respective directories under the user-defined project offer the researchers to unravel condition-specific functionalities of gene, gene sets or modules. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
14. Avoiding pitfalls in gene (co)expression meta-analysis.
- Author
-
Östlund, Gabriel and Sonnhammer, Erik L.L.
- Subjects
- *
GENE expression , *META-analysis , *MOLECULAR genetics , *BIOINFORMATICS , *COMPARATIVE studies , *DATA analysis - Abstract
Abstract: Differential gene expression analysis between healthy and diseased groups is a widely used approach to understand the molecular underpinnings of disease. A wide variety of experimental and bioinformatics techniques are available for this type of analysis, yet their impact on the reliability of the results has not been systematically studied. We performed a large scale comparative analysis of clinical expression data, using several background corrections and differential expression metrics. The agreement between studies was analyzed for study pairs of same cancer type, of different cancer types, and between cancer and non-cancer studies. We also replicated the analysis using differential coexpression. We found that agreement of differential expression is primarily dictated by the microarray platform, while differential coexpression requires large sample sizes. Two studies using different differential expression metrics may show no agreement, even if they agree strongly using the same metric. Our analysis provides practical recommendations for gene (co)expression analysis. [Copyright &y& Elsevier]
- Published
- 2014
- Full Text
- View/download PDF
15. Detecting Differentially Coexpressed Genesfrom Labeled Expression Data: A Brief Review.
- Author
-
Kayano, Mitsunori, Shiga, Motoki, and Mamitsuka, Hiroshi
- Abstract
We review methods for capturing differential coexpression, which can be divided into two cases by the size of gene sets: 1) two paired genes and 2) multiple genes. In the first case, two genes are positively and negatively correlated with each other under one and the other conditions, respectively. In the second case, multiple genes are coexpressed and randomly expressed under one and the other conditions, respectively. We summarize a variety of methods for the first and second cases into four and three approaches, respectively. We describe each of these approaches in detail technically, being followed by thorough comparative experiments with both synthetic and real data sets. Our experimental results imply high possibility of improving the efficiency of the current methods, particularly in the case of multiple genes, because of low performance achieved by the best methods which are relatively simple intuitive ones. [ABSTRACT FROM PUBLISHER]
- Published
- 2014
- Full Text
- View/download PDF
16. Using gene-network landscape to dissect genotype effects of TCF7L2 genetic variant on diabetes and cardiovascular risk.
- Author
-
Vaquero, Andre R., Ferreira, Noely E., Omae, Samantha V., Rodrigues, Mariliza V., Teixeira, Samantha K., Krieger, Jose E., and Pereira, Alexandre C.
- Abstract
The single nucleotide polymorphism (SNP) within the TCF7L2 gene, rs7903146, is, to date, the most significant genetic marker associated with Type 2 diabetes mellitus (T2DM) risk. Nonetheless, its functional role in disease pathology is poorly understood. The aim of the present study was to investigate, in vascular smooth muscle cells from 92 patients undergoing aortocoronary bypass surgery, the contribution of this SNP in T2DM using expression levels and expression correlation comparison approaches, which were visually represented as gene interaction networks. Initially, the expression levels of 41 genes (seven TCF7L2 splice forms and 40 other T2DM relevant genes) were compared between rs7903146 wild-type (CC) and T2DM-risk (CT + TT) genotype groups. Next, we compared the expression correlation patterns of these 41 genes between groups to observe if the relationships between genes were different. Five TCF7L2 splice forms and nine genes showed significant expression differences between groups. RXR gene was pinpointed as showing the most different expression correlation pattern with other genes. Therefore, T2DM risk alleles appear to be influencing TCF7L2 splice form's expression in vascular smooth muscle cells, and RXRα gene is pointed out as a treatment target candidate for risk reduction in individuals with high risk of developing T2DM, especially individuals harboring TCF7L2 risk genotypes. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
17. Combining differential expression and differential coexpression analysis identifies optimal gene and gene set in cervical cancer
- Author
-
Hong-Bing Cai, Shan-Shan Hu, Min Gao, Shi-Lu Xiong, Hai-Yan Chen, and Sheng-Quan Fang
- Subjects
0301 basic medicine ,Uterine Cervical Neoplasms ,Dc analysis ,Functional genes ,Computational biology ,Biology ,lcsh:RC254-282 ,differential expression ,Set (abstract data type) ,03 medical and health sciences ,0302 clinical medicine ,Databases, Genetic ,medicine ,Humans ,Gene Regulatory Networks ,Radiology, Nuclear Medicine and imaging ,Differential expression ,Gene ,Zinc finger ,Cervical cancer ,Models, Statistical ,Gene Expression Profiling ,Computational Biology ,Chi-square maximization ,General Medicine ,medicine.disease ,lcsh:Neoplasms. Tumors. Oncology. Including cancer and carcinogens ,Gene Expression Regulation, Neoplastic ,030104 developmental biology ,Oncology ,Expression data ,030220 oncology & carcinogenesis ,Female ,differential coexpression ,Transcriptome ,Algorithms - Abstract
Objective: The objective of this study is to investigate the optimal gene and functional-related gene set in cervical cancer through combing the differential expression (DE) and differential coexpression (DC) analysis. Materials and Methods: To achieve this, we first measured expression data of cervical cancer by incorporating DE and DC effects utilizing absolute t-value in t -statistic and Z -test, respectively. Then, we selected the optimal threshold pair to determine both high DE and high DC (HDE_HDC) partition on the basis of Chi-square maximization, and the best threshold pair divided all genes into four parts, including HDE_HDC, high DE and low DC (HDE_LDC), low DE and high DC (LDE_HDC), and low DE and low DC (LDE_LDC). Using the known functional gene sets, functional relevance of partition genes was explored to determine the best-associated gene set based on the functional information (FI) conception. Results: Under the optimal threshold pair of 3.629 and 1.108 for DE and DC, respectively genes were divided into four partitions: HDE_HDC (311 genes), HDE_LDC (2072 genes), LDE_HDC (seventy genes), and LDE_LDC (1623 genes). Meanwhile, the gene set epidermis development was the best-associated gene set with the largest △G* = 10.496. Among the genes of epidermis development, zinc finger protein 135 ( ZNF135) attained highest minimum FI gain of 41.226. Conclusion: The combination of DE and DC analysis showed higher mean FI relative to individual DE and DC analyses. We successfully exhibited the optimal gene set epidermis development and gene ZNF135 , which might be crucial for the prevention and treatment of cervical cancer.
- Published
- 2018
18. Network Analysis of Microarray Data.
- Author
-
Pavel A, Serra A, Cattelani L, Federico A, and Greco D
- Subjects
- Algorithms, Gene Expression Profiling, Humans, Oligonucleotide Array Sequence Analysis, Gene Regulatory Networks
- Abstract
DNA microarrays are widely used to investigate gene expression. Even though the classical analysis of microarray data is based on the study of differentially expressed genes, it is well known that genes do not act individually. Network analysis can be applied to study association patterns of the genes in a biological system. Moreover, it finds wide application in differential coexpression analysis between different systems. Network based coexpression studies have for example been used in (complex) disease gene prioritization, disease subtyping, and patient stratification.In this chapter we provide an overview of the methods and tools used to create networks from microarray data and describe multiple methods on how to analyze a single network or a group of networks. The described methods range from topological metrics, functional group identification to data integration strategies, topological pathway analysis as well as graphical models., (© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.)
- Published
- 2022
- Full Text
- View/download PDF
19. Avoiding pitfalls in gene (co)expression meta-analysis
- Author
-
Erik L. L. Sonnhammer and Gabriel Östlund
- Subjects
Genetics ,Models, Molecular ,Computational Biology ,Reproducibility of Results ,Cancer gene expression ,Computational biology ,Biology ,Microarray Analysis ,Expression (mathematics) ,Gene Expression Regulation, Neoplastic ,Differential coexpression ,Differential expression ,Expression data ,Meta-analysis ,Neoplasms ,Expression analysis ,Metric (mathematics) ,Databases, Genetic ,Humans ,Microarray data processing ,Gene ,Differential (mathematics) - Abstract
Differential gene expression analysis between healthy and diseased groups is a widely used approach to understand the molecular underpinnings of disease. A wide variety of experimental and bioinformatics techniques are available for this type of analysis, yet their impact on the reliability of the results has not been systematically studied.We performed a large scale comparative analysis of clinical expression data, using several background corrections and differential expression metrics. The agreement between studies was analyzed for study pairs of same cancer type, of different cancer types, and between cancer and non-cancer studies. We also replicated the analysis using differential coexpression.We found that agreement of differential expression is primarily dictated by the microarray platform, while differential coexpression requires large sample sizes. Two studies using different differential expression metrics may show no agreement, even if they agree strongly using the same metric. Our analysis provides practical recommendations for gene (co)expression analysis.
- Published
- 2014
- Full Text
- View/download PDF
20. DGCA: A comprehensive R package for Differential Gene Correlation Analysis
- Author
-
Won-Min Song, Minghui Wang, Andrew M. McKenzie, Igor Katsyv, and Bin Zhang
- Subjects
0301 basic medicine ,Theoretical computer science ,Computer science ,Systems biology ,Triple Negative Breast Neoplasms ,RNA-Seq ,Computational biology ,Multiscale clustering analysis ,Correlation ,Differential correlation ,Differential coexpression ,03 medical and health sciences ,Breast cancer ,Structural Biology ,Modelling and Simulation ,Humans ,Triple negative breast cancer ,TP53 ,Cluster analysis ,Molecular Biology ,Parametric statistics ,Gene Expression Profiling ,Applied Mathematics ,R package ,PTEN Phosphohydrolase ,Computational Biology ,3. Good health ,Computer Science Applications ,Gene expression profiling ,Identification (information) ,030104 developmental biology ,Receptors, Estrogen ,Modeling and Simulation ,Mutation ,Tumor Suppressor Protein p53 ,Software ,Differential (mathematics) ,Research Article - Abstract
Background Dissecting the regulatory relationships between genes is a critical step towards building accurate predictive models of biological systems. A powerful approach towards this end is to systematically study the differences in correlation between gene pairs in more than one distinct condition. Results In this study we develop an R package, DGCA (for Differential Gene Correlation Analysis), which offers a suite of tools for computing and analyzing differential correlations between gene pairs across multiple conditions. To minimize parametric assumptions, DGCA computes empirical p-values via permutation testing. To understand differential correlations at a systems level, DGCA performs higher-order analyses such as measuring the average difference in correlation and multiscale clustering analysis of differential correlation networks. Through a simulation study, we show that the straightforward z-score based method that DGCA employs significantly outperforms the existing alternative methods for calculating differential correlation. Application of DGCA to the TCGA RNA-seq data in breast cancer not only identifies key changes in the regulatory relationships between TP53 and PTEN and their target genes in the presence of inactivating mutations, but also reveals an immune-related differential correlation module that is specific to triple negative breast cancer (TNBC). Conclusions DGCA is an R package for systematically assessing the difference in gene-gene regulatory relationships under different conditions. This user-friendly, effective, and comprehensive software tool will greatly facilitate the application of differential correlation analysis in many biological studies and thus will help identification of novel signaling pathways, biomarkers, and targets in complex biological systems and diseases. Electronic supplementary material The online version of this article (doi:10.1186/s12918-016-0349-1) contains supplementary material, which is available to authorized users.
- Published
- 2016
- Full Text
- View/download PDF
21. Gene set analysis using sufficient dimension reduction
- Author
-
Chen-An Tsai and Huey-Miin Hsueh
- Subjects
Male ,0301 basic medicine ,Genotype ,Gene regulatory network ,Sufficient dimension reduction ,Computational biology ,Biology ,01 natural sciences ,Biochemistry ,Differential coexpression ,010104 statistics & probability ,03 medical and health sciences ,Structural Biology ,Humans ,Computer Simulation ,Gene Regulatory Networks ,0101 mathematics ,Molecular Biology ,Categorical variable ,Oligonucleotide Array Sequence Analysis ,Genetics ,Non-linear associations ,Gene Expression Profiling ,Methodology Article ,Applied Mathematics ,Computational Biology ,Prostatic Neoplasms ,Computer Science Applications ,Nominal level ,Black or African American ,Gene expression profiling ,Range (mathematics) ,Phenotype ,Gene set analysis ,030104 developmental biology ,Tumor Suppressor Protein p53 ,DNA microarray ,Type I and type II errors - Abstract
Background Gene set analysis (GSA) aims to evaluate the association between the expression of biological pathways, or a priori defined gene sets, and a particular phenotype. Numerous GSA methods have been proposed to assess the enrichment of sets of genes. However, most methods are developed with respect to a specific alternative scenario, such as a differential mean pattern or a differential coexpression. Moreover, a very limited number of methods can handle either binary, categorical, or continuous phenotypes. In this paper, we develop two novel GSA tests, called SDRs, based on the sufficient dimension reduction technique, which aims to capture sufficient information about the relationship between genes and the phenotype. The advantages of our proposed methods are that they allow for categorical and continuous phenotypes, and they are also able to identify a variety of enriched gene sets. Results Through simulation studies, we compared the type I error and power of SDRs with existing GSA methods for binary, triple, and continuous phenotypes. We found that SDR methods adequately control the type I error rate at the pre-specified nominal level, and they have a satisfactory power to detect gene sets with differential coexpression and to test non-linear associations between gene sets and a continuous phenotype. In addition, the SDR methods were compared with seven widely-used GSA methods using two real microarray datasets for illustration. Conclusions We concluded that the SDR methods outperform the others because of their flexibility with regard to handling different kinds of phenotypes and their power to detect a wide range of alternative scenarios. Our real data analysis highlights the differences between GSA methods for detecting enriched gene sets. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-0928-6) contains supplementary material, which is available to authorized users.
- Published
- 2016
- Full Text
- View/download PDF
22. Invariance and plasticity in the Drosophila melanogastermetabolomic network in response to temperature
- Author
-
Ramkumar Hariharan, Quinlyn A. Soltow, Daniel E. L. Promislow, Ariel S Thomas, Jessica M. Hoffman, and Dean P. Jones
- Subjects
Systems biology ,Plasticity ,Bioinformatics ,Models, Biological ,Differential coexpression ,Metabolomics ,Structural Biology ,Modelling and Simulation ,Network level ,Melanogaster ,Metabolome ,Animals ,Molecular Biology ,biology ,Applied Mathematics ,Age Factors ,Temperature ,biology.organism_classification ,Computer Science Applications ,Drosophila melanogaster ,Evolutionary biology ,Larva ,Modeling and Simulation ,Networks ,Biological network ,Research Article - Abstract
Background Metabolomic responses to extreme thermal stress have recently been investigated in Drosophila melanogaster. However, a network level understanding of metabolomic responses to longer and less drastic temperature changes, which more closely reflect variation in natural ambient temperatures experienced during development and adulthood, is currently lacking. Here we use high-resolution, non-targeted metabolomics to dissect metabolomic changes in D. melanogaster elicited by moderately cool (18°C) or warm (27°C) developmental and adult temperature exposures. Results We find that temperature at which larvae are reared has a dramatic effect on metabolomic network structure measured in adults. Using network analysis, we are able to identify modules that are highly differentially expressed in response to changing developmental temperature, as well as modules whose correlation structure is strongly preserved across temperature. Conclusions Our results suggest that the effect of temperature on the metabolome provides an easily studied and powerful model for understanding the forces that influence invariance and plasticity in biological networks. Electronic supplementary material The online version of this article (doi:10.1186/s12918-014-0139-6) contains supplementary material, which is available to authorized users.
- Published
- 2014
- Full Text
- View/download PDF
23. No3CoGP: non-conserved and conserved coexpressed gene pairs
- Author
-
Chittabrata Mal, Aftabuddin, and Sudip Kundu
- Subjects
Medicine(all) ,Genetics ,Biochemistry, Genetics and Molecular Biology(all) ,Microarray analysis techniques ,Affymetrix microarray ,fungi ,Gene Expression ,food and beverages ,Gene modules ,General Medicine ,Biology ,Gene coexpression ,General Biochemistry, Genetics and Molecular Biology ,Bioconductor ,Differential coexpression ,Gene Modules ,Gene expression ,Technical Note ,Microarray databases ,Programming Languages ,Gene ,Oligonucleotide Array Sequence Analysis - Abstract
Background Analyzing the microarray data of different conditions, one can identify the conserved and condition-specific genes and gene modules, and thus can infer the underlying cellular activities. All the available tools based on Bioconductor and R packages differ in how they extract differential coexpression and at what level they study. There is a need for a user-friendly, flexible tool which can start analysis using raw or preprocessed microarray data and can report different levels of useful information. Findings We present a GUI software, No3CoGP: Non-Conserved and Conserved Coexpressed Gene Pairs which takes Affymetrix microarray data (.CEL files or log2 normalized.txt files) along with annotation file (.csv file), Chip Definition File (CDF file) and probe file as inputs, utilizes the concept of network density cut-off and Fisher’s z-test to extract biologically relevant information. It can identify four possible types of gene pairs based on their coexpression relationships. These are (i) gene pair showing coexpression in one condition but not in the other, (ii) gene pair which is positively coexpressed in one condition but negatively coexpressed in the other condition, (iii) positively and (iv) negatively coexpressed in both the conditions. Further, it can generate modules of coexpressed genes. Conclusion Easy-to-use GUI interface enables researchers without knowledge in R language to use No3CoGP. Utilization of one or more CPU cores, depending on the availability, speeds up the program. The output files stored in the respective directories under the user-defined project offer the researchers to unravel condition-specific functionalities of gene, gene sets or modules. Electronic supplementary material The online version of this article (doi:10.1186/1756-0500-7-886) contains supplementary material, which is available to authorized users.
- Published
- 2014
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.