31 results on '"causal discovery"'
Search Results
2. Using GPT-4 to guide causal machine learning
- Author
-
Constantinou, Anthony C., Kitson, Neville K., and Zanga, Alessio
- Published
- 2025
- Full Text
- View/download PDF
3. Efficient Nonlinear DAG Learning Under Projection Framework
- Author
-
Yin, Naiyu, Yu, Yue, Gao, Tian, Ji, Qiang, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
4. TSLiNGAM: DirectLiNGAM Under Heavy Tails.
- Author
-
Leyder, Sarah, Raymaekers, Jakob, and Verdonck, Tim
- Subjects
- *
DIRECTED acyclic graphs , *CAUSAL models , *STRUCTURAL models , *SUPPLY chain management , *NOISE - Abstract
AbstractOne of the established approaches to causal discovery consists of combining directed acyclic graphs (DAGs) with structural causal models (SCMs) to describe the functional dependencies of effects on their causes. Possible identifiability of SCMs given data depends on assumptions made on the noise variables and the functional classes in the SCM. For instance, in the LiNGAM model, the functional class is restricted to linear functions and the disturbances have to be non-Gaussian. In this work, we propose TSLiNGAM, a new method for identifying the DAG of a causal model based on observational data. TSLiNGAM builds on DirectLiNGAM, a popular algorithm which uses simple OLS regression for identifying causal directions between variables. TSLiNGAM leverages the non-Gaussianity assumption of the error terms in the LiNGAM model to obtain more efficient and robust estimation of the causal structure. TSLiNGAM is justified theoretically and is studied empirically in an extensive simulation study. It performs significantly better on heavy-tailed and skewed data and demonstrates a high small-sample efficiency. In addition, TSLiNGAM also shows better robustness properties as it is more resilient to contamination. Supplementary materials for this article are available online. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Invited commentary: where do the causal DAGS come from?
- Author
-
Didelez, Vanessa
- Subjects
- *
STATISTICAL models , *CAUSAL models , *DATA analysis , *CAUSALITY (Physics) , *LIFE course approach , *MATHEMATICAL models , *STATISTICS , *THEORY , *ALGORITHMS - Abstract
How do we construct our causal directed acyclic graphs (DAGs)—for example, for life-course modeling and analysis? In this commentary, I review how the data-driven construction of causal DAGs (causal discovery) has evolved, what promises it holds, and what limitations or caveats must be considered. I find that expert- or theory-driven model-building might benefit from some more checking against the data and that causal discovery could bring new ideas to old theories. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Causal Structure Learning with Conditional and Unique Information Groups-Decomposition Inequalities.
- Author
-
Chicharro, Daniel and Nguyen, Julia K.
- Subjects
- *
DISTRIBUTION (Probability theory) , *ELECTRONIC data processing - Abstract
The causal structure of a system imposes constraints on the joint probability distribution of variables that can be generated by the system. Archetypal constraints consist of conditional independencies between variables. However, particularly in the presence of hidden variables, many causal structures are compatible with the same set of independencies inferred from the marginal distributions of observed variables. Additional constraints allow further testing for the compatibility of data with specific causal structures. An existing family of causally informative inequalities compares the information about a set of target variables contained in a collection of variables, with a sum of the information contained in different groups defined as subsets of that collection. While procedures to identify the form of these groups-decomposition inequalities have been previously derived, we substantially enlarge the applicability of the framework. We derive groups-decomposition inequalities subject to weaker independence conditions, with weaker requirements in the configuration of the groups, and additionally allowing for conditioning sets. Furthermore, we show how constraints with higher inferential power may be derived with collections that include hidden variables, and then converted into testable constraints using data processing inequalities. For this purpose, we apply the standard data processing inequality of conditional mutual information and derive an analogous property for a measure of conditional unique information recently introduced to separate redundant, synergistic, and unique contributions to the information that a set of variables has about a target. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Choice Function-Based Hyper-Heuristics for Causal Discovery under Linear Structural Equation Models.
- Author
-
Dang, Yinglong, Gao, Xiaoguang, and Wang, Zidong
- Subjects
- *
STRUCTURAL equation modeling , *METAHEURISTIC algorithms , *LINEAR equations , *HEURISTIC , *DIRECTED acyclic graphs , *HEURISTIC algorithms - Abstract
Causal discovery is central to human cognition, and learning directed acyclic graphs (DAGs) is its foundation. Recently, many nature-inspired meta-heuristic optimization algorithms have been proposed to serve as the basis for DAG learning. However, a single meta-heuristic algorithm requires specific domain knowledge and empirical parameter tuning and cannot guarantee good performance in all cases. Hyper-heuristics provide an alternative methodology to meta-heuristics, enabling multiple heuristic algorithms to be combined and optimized to achieve better generalization ability. In this paper, we propose a multi-population choice function hyper-heuristic to discover the causal relationships encoded in a DAG. This algorithm provides a reasonable solution for combining structural priors or possible expert knowledge with swarm intelligence. Under a linear structural equation model (SEM), we first identify the partial v-structures through partial correlation analysis as the structural priors of the next nature-inspired swarm intelligence approach. Then, through partial correlation analysis, we can limit the search space. Experimental results demonstrate the effectiveness of the proposed methods compared to the earlier state-of-the-art methods on six standard networks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Delving into Causal Discovery in Health-Related Quality of Life Questionnaires.
- Author
-
Ganopoulou, Maria, Kontopoulos, Efstratios, Fokianos, Konstantinos, Koparanis, Dimitris, Angelis, Lefteris, Kotsianidis, Ioannis, and Moysiadis, Theodoros
- Subjects
- *
QUALITY of life , *RDF (Document markup language) , *MACHINE learning , *DIRECTED acyclic graphs , *KNOWLEDGE graphs - Abstract
Questionnaires on health-related quality of life (HRQoL) play a crucial role in managing patients by revealing insights into physical, psychological, lifestyle, and social factors affecting well-being. A methodological aspect that has not been adequately explored yet, and is of considerable potential, is causal discovery. This study explored causal discovery techniques within HRQoL, assessed various considerations for reliable estimation, and proposed means for interpreting outcomes. Five causal structure learning algorithms were employed to examine different aspects in structure estimation based on simulated data derived from HRQoL-related directed acyclic graphs. The performance of the algorithms was assessed based on various measures related to the differences between the true and estimated structures. Moreover, the Resource Description Framework was adopted to represent the responses to the HRQoL questionnaires and the detected cause–effect relationships among the questions, resulting in semantic knowledge graphs which are structured representations of interconnected information. It was found that the structure estimation was impacted negatively by the structure's complexity and favorably by increasing the sample size. The performance of the algorithms over increasing sample size exhibited a similar pattern, with distinct differences being observed for small samples. This study illustrates the dynamics of causal discovery in HRQoL-related research, highlights aspects that should be addressed in estimation, and fosters the shareability and interoperability of the output based on globally established standards. Thus, it provides critical insights in this context, further promoting the critical role of HRQoL questionnaires in advancing patient-centered care and management. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Overcoming Confounding Bias in Causal Discovery Using Minimum Redundancy and Maximum Relevancy Constraint
- Author
-
Havisha Nadendla, Pujit Pavan Etha, and Pradeep Chowriappa
- Subjects
Causal discovery ,confounding bias ,directed acyclic graphs ,information theory ,Naïve Bayes ,tree augmented Naïve Bayes ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Causal discovery is the process of modeling cause and effect relationships among features. Unlike traditional model-based approaches, that rely on fitting data to the models, methods of causal discovery determine the causal structure from data. In clinical and EHR data analysis, causal discovery is used to identify dependencies among features that are difficult to estimate using model-based approaches. The resultant structures are represented as Directed Acyclic Graphs (DAG) consisting of nodes and arcs. Here, the direction of the arcs in a DAG indicates the influence of one feature over the other. These dependencies are fundamental to the discovery of novel insights obtained from data. However, causal discovery solely relies on establishing feature dependencies based on their conditional dependencies, that could lead to inaccurate inferences brought about by confounding bias. Our contribution in this work is ‘Non-Confounding Causal Discovery’ (NCCD), a framework aimed at overcoming confounding bias leveraging maximum relevancy and minimum redundancy between features using the concepts of information theory. The work presented uses threshold conditioned values on which the features in the graphical structure are connected to one another. Validation was carried out on three clinical trial benchmark datasets and compared the results against the previously known Naïve Bayes (NB) and Tree Augmented Naïve Bayes (TAN) algorithms. We observe a reduction in the complexity of the graph, evidenced by a decrease in the number of arcs. Notably, the graphs generated through NCCD exhibited a capacity to eliminate confounding dependencies while concurrently preserving the overall score of the network.
- Published
- 2024
- Full Text
- View/download PDF
10. Functional Bayesian networks for discovering causality from multivariate functional data.
- Author
-
Zhou, Fangting, He, Kejun, Wang, Kunbo, Xu, Yanxun, and Ni, Yang
- Subjects
- *
BAYESIAN analysis , *DIRECTED acyclic graphs , *GAUSSIAN processes - Abstract
Multivariate functional data arise in a wide range of applications. One fundamental task is to understand the causal relationships among these functional objects of interest. In this paper, we develop a novel Bayesian network (BN) model for multivariate functional data where conditional independencies and causal structure are encoded by a directed acyclic graph. Specifically, we allow the functional objects to deviate from Gaussian processes, which is the key to unique causal structure identification even when the functions are measured with noises. A fully Bayesian framework is designed to infer the functional BN model with natural uncertainty quantification through posterior summaries. Simulation studies and real data examples demonstrate the practical utility of the proposed model. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
11. Constructing Causal Life-Course Models: Comparative Study of Data-Driven and Theory-Driven Approaches.
- Author
-
Petersen, Anne Helby, Ekstrøm, Claus Thorn, Spirtes, Peter, and Osler, Merete
- Subjects
- *
STATISTICS , *LIFE course approach , *MATHEMATICAL models , *COMPARATIVE studies , *THEORY , *DESCRIPTIVE statistics , *RESEARCH funding , *STATISTICAL models , *DATA analysis , *ALGORITHMS , *CAUSAL models , *LONGITUDINAL method - Abstract
Life-course epidemiology relies on specifying complex (causal) models that describe how variables interplay over time. Traditionally, such models have been constructed by perusing existing theory and previous studies. By comparing data-driven and theory-driven models, we investigated whether data-driven causal discovery algorithms can help in this process. We focused on a longitudinal data set on a cohort of Danish men (the Metropolit Study, 1953–2017). The theory-driven models were constructed by 2 subject-field experts. The data-driven models were constructed by use of the temporal Peter-Clark (TPC) algorithm. The TPC algorithm utilizes the temporal information embedded in life-course data. We found that the data-driven models recovered some, but not all, causal relationships included in the theory-driven expert models. The data-driven method was especially good at identifying direct causal relationships that the experts had high confidence in. Moreover, in a post hoc assessment, we found that most of the direct causal relationships proposed by the data-driven model but not included in the theory-driven model were plausible. Thus, the data-driven model may propose additional meaningful causal hypotheses that are new or have been overlooked by the experts. In conclusion, data-driven methods can aid causal model construction in life-course epidemiology, and combining both data-driven and theory-driven methods can lead to even stronger models. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
12. The impact of prior knowledge on causal structure learning.
- Author
-
Constantinou, Anthony C., Guo, Zhigao, and Kitson, Neville K.
- Subjects
MACHINE learning ,PRIOR learning ,BAYESIAN analysis ,BIG data ,COMPUTATIONAL complexity ,KNOWLEDGE transfer - Abstract
Causal Bayesian networks have become a powerful technology for reasoning under uncertainty in areas that require transparency and explainability, by relying on causal assumptions that enable us to simulate hypothetical interventions. The graphical structure of such models can be estimated by structure learning algorithms, domain knowledge, or a combination of both. Various knowledge approaches have been proposed in the literature that enables us to specify prior knowledge that constrains or guides these algorithms. This paper introduces some novel, and also describes some existing, knowledge-based approaches that enable us to combine structure learning with knowledge obtained from heterogeneous sources. We investigate the impact of these approaches on structure learning across different algorithms, case studies and settings that we might encounter in practice. Each approach is assessed in terms of effectiveness and efficiency, including graphical accuracy, model fitting, complexity, and runtime; making this the first paper that provides a comparative evaluation of a wide range of knowledge approaches for structure learning. Because the value of knowledge depends on what data are available, we illustrate the results both with limited and big data. While the overall results show that knowledge becomes less important with big data due to higher learning accuracy rendering knowledge less important, some of the knowledge approaches are found to be more important with big data. Amongst the main conclusions is the observation that reduced search space obtained from knowledge does not always imply reduced computational complexity, perhaps because the relationships implied by the data and knowledge are in tension. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
13. Causal Discovery via Causal Star Graphs.
- Author
-
BOXIANG ZHAO, SHULIANG WANG, LIANHUA CHI, QI LI, XIAOJIA LIU, and JING GENG
- Subjects
DIRECTED acyclic graphs ,CAUSAL models ,DIRECTED graphs ,DATA mining ,STELLAR structure - Abstract
Discovering causal relationships among observed variables is an important research focus in data mining. Existing causal discovery approaches are mainly based on constraint-based methods and functional causal models (FCMs). However, the constraint-based method cannot identify the Markov equivalence class and the functional causal models cannot identify the complex interrelationships when multiple variables affect one variable. To address the two aforementioned problems, we propose a new graph structure Causal Star Graph (CSG) and a corresponding framework Causal Discovery via Causal Star Graphs (CD-CSG) to divide a causal directed acyclic graph into multiple CSGs for causal discovery. In this framework, we also propose a generalized learning in CSGs based on a variational approach to learn the representative intermediate variable of CSG’s non-central variables. Through the generalized learning in CSGs, the asymmetry in the forward and backward model of CD-CSG can be found to identify the causal directions in the directed acyclic graphs. We further divide the CSGs into three categories and provide the causal identification principle under each category in our proposed framework. Experiments using synthetic data show that the causal relationships between variables can be effectively identified with CD-CSG and the accuracy of CD-CSG is higher than the best existing model. By applying CD-CSG to real-world data, our proposed method can greatly augment the applicability and effectiveness of causal discovery. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. A Survey on Causal Discovery
- Author
-
Zhou, Wenxiu, Chen, QingCai, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Sun, Maosong, editor, Qi, Guilin, editor, Liu, Kang, editor, Ren, Jiadong, editor, Xu, Bin, editor, Feng, Yansong, editor, Liu, Yongbin, editor, and Chen, Yubo, editor
- Published
- 2022
- Full Text
- View/download PDF
15. Causal discovery and epidemiology: a potential for synergy.
- Author
-
Petersen, Anne Helby, Ekstrøm, Claus Thorn, Spirtes, Peter, and Osler, Merete
- Subjects
- *
STATISTICAL models , *CONSENSUS (Social sciences) , *CAUSAL models , *DECISION making , *MACHINE learning , *EPIDEMIOLOGICAL research , *ALGORITHMS - Abstract
The article addresses a question posed by V. Didelez regarding the expert consensus meeting and provides additional topics for further research that is believed to aid both epidemiology and causal discovery research. Topics include expert discussions when creating the consensus model, usefulness of knowing more about the traditional approach, and a benchmarking question on whether causal discovery is a useful aid for actual scientific progress.
- Published
- 2024
- Full Text
- View/download PDF
16. D'ya Like DAGs? A Survey on Structure Learning and Causal Discovery.
- Author
-
VOWELS, MATTHEW J., CIHAN CAMGOZ, NECATI, and BOWDEN, RICHARD
- Subjects
- *
REFERENCE sources , *INTEGRATED software , *ANTHROPOSOPHY - Abstract
Causal reasoning is a crucial part of science and human intelligence. In order to discover causal relationships from data, we need structure discovery methods. We provide a review of background theory and a survey of methods for structure discovery. We primarily focus on modern, continuous optimization methods, and provide reference to further resources such as benchmark datasets and software packages. Finally, we discuss the assumptive leap required to take us from structure to causality. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
17. GREEDY CAUSAL DISCOVERY IS GEOMETRIC.
- Author
-
LINUSSON, SVANTE, RESTADH, PETTER, and SOLUS, LIAM
- Subjects
- *
DIRECTED acyclic graphs , *SIMPLEX algorithm , *CAUSATION (Philosophy) , *GREEDY algorithms - Abstract
Finding a directed acyclic graph (DAG) that best encodes the conditional independence statements observable from data is a central question within causality. Algorithms that greedily transform one candidate DAG into another given a fixed set of moves have been particularly successful, for example, the greedy equivalence search, greedy interventional equivalence search, and max-min hill climbing algorithms. In 2010, Studeny\', Hemmecke, and Lindner introduced the characteristic imset (CIM) polytope, CIMp, whose vertices correspond to Markov equivalence classes, as a way of transforming causal discovery into a linear optimization problem. We show that the moves of the aforementioned algorithms are included within classes of edges of CIMp and that restrictions placed on the skeleton of the candidate DAGs correspond to faces of CIMp. Thus, we observe that greedy equivalence search, greedy interventional equivalence search, and max-min hill climbing all have geometric realizations as greedy edge-walks along CIMp. Furthermore, the identified edges of CIMp strictly generalize the moves of these algorithms. Exploiting this generalization, we introduce a greedy simplex-type algorithm called greedy CIM, and a hybrid variant, skeletal greedy CIM, that outperforms current competitors among hybrid and constraint-based algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Chain Graph Reduction Into Power Chain Graphs.
- Author
-
Franco, Vithor Rosa, Wang Barros, Guilherme, Wiberg, Marie, and Laros, Jacob Arie
- Subjects
- *
CHAIN graphs , *DIMENSION reduction (Statistics) , *DIRECTED acyclic graphs , *COMPUTER simulation , *MATHEMATICAL variables - Abstract
Reduction of graphs is a class of procedures used to decrease the dimensionality of a given graph in which the properties of the reduced graph are to be induced from the properties of the larger original graph. This paper introduces both a new method for reducing chain graphs to simpler directed acyclic graphs (DAGs), that we call power chain graphs (PCG), as well as a procedure for structure learning of this new type of graph from correlational data of a Gaussian graphical model. A definition for PCGs is given, directly followed by the reduction method. The structure learning procedure is a two-step approach: first, the correlation matrix is used to cluster the variables; and then, the averaged correlation matrix is used to discover the DAGs using the PC-stable algorithm. The results of simulations are provided to illustrate the theoretical proposal, which demonstrate initial evidence for the validity of our procedure to recover the structure of power chain graphs. The paper ends with a discussion regarding suggestions for future studies as well as some practical implications. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
19. Data-driven discovery of interpretable causal relations for deep learning material laws with uncertainty propagation.
- Author
-
Sun, Xiao, Bahmani, Bahador, Vlassis, Nikolaos N., Sun, WaiChing, and Xu, Yanxun
- Subjects
- *
DEEP learning , *DIRECTED acyclic graphs , *DIRECTED graphs , *CIVIL engineering , *ELASTOPLASTICITY , *CIVIL engineers , *PREDICTION models - Abstract
This paper presents a computational framework that generates ensemble predictive mechanics models with uncertainty quantification (UQ). We first develop a causal discovery algorithm to infer causal relations among time-history data measured during each representative volume element (RVE) simulation through a directed acyclic graph. With multiple plausible sets of causal relationships estimated from multiple RVE simulations, the predictions are propagated in the derived causal graph while using a deep neural network equipped with dropout layers as a Bayesian approximation for UQ. We select two representative numerical examples (traction-separation laws for frictional interfaces, elastoplasticity models for granular assembles) to examine the accuracy and robustness of the proposed causal discovery method for the common material law predictions in civil engineering applications. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
20. Sparse estimation of Linear Non-Gaussian Acyclic Model for Causal Discovery.
- Author
-
Harada, Kazuharu and Fujisawa, Hironori
- Subjects
- *
ACYCLIC model , *INDEPENDENT component analysis , *CAUSAL models , *DIRECTED acyclic graphs , *ALGORITHMS - Abstract
• Existing algorithms for LiNGAM do not use sparseness and non-Gaussianity sufficiently. • Penalized likelihood-based approach enables us to estimate the model efficiently. • Solutions can be obtained by ADMM-based algorithm with some devices. • Our method exceeds the existing methods in numerical experiments and real data analysis. We consider the problem of inferring the causal structure from observational data, especially when the structure is sparse. This type of problem is usually formulated as an inference of a Directed Acyclic Graph (DAG) model. The Linear Non-Gaussian Acyclic Model (LiNGAM) is one of the most successful DAG models, and various estimation methods have been developed. However, existing methods are not efficient for some reasons: (i) the sparse structure is not always incorporated in causal order estimation, and (ii) the information of higher-order moments of the data is not used in parameter estimation. To address these issues, we propose a new estimation method for a linear DAG model with non-Gaussian noises. The proposed method is based on a single statistical criterion that includes the log-likelihood of independent component analysis (ICA) and two penalty terms. The two penalties are related to the sparsity and the consistency condition, respectively. This criterion enables us to leverage the sparse structure and the information of higher-order moments throughout the estimation. For stable and efficient optimization, we propose some devices, such as a modified natural gradient. Numerical experiments show that the proposed method outperforms the existing methods. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
21. Incorporating structural constraints into continuous optimization for causal discovery.
- Author
-
Wang, Zidong, Gao, Xiaoguang, Liu, Xiaohan, Ru, Xinxin, and Zhang, Qingfu
- Subjects
- *
DIRECTED acyclic graphs , *THRESHOLDING algorithms - Abstract
Directed Acyclic Graphs (DAGs) provide an efficient framework to describe the causal relations in actual applications, and it appears more and more important to learn a DAG from training data in causal discovery. Recently, a novel methodology, which projects the acyclic constraints by an algebraic characterization and employs continuous optimization to carry the causal discovery, gradually became the mainstream. However, such methods focus on a best-fitting to the training data and cannot utilize the prior knowledge in an efficient way. To resolve this problem, we suggest incorporating structural constraints into continuous optimization. For edge constraints, we regard the activation value of the difference between the constraint matrix after thresholding and the weight matrix as the optimization goal. For path constraints, we use the deviation concluded from the power matrix on k th path graphs to design the penalty functions. For ordering constraints, we exploit the representation based on the negative edge/path constraints. The mathematical derivations prove that equality constraint program (ECP), in which proposed equality constraints powerfully embody the required structural restrictions, are solvable. Furthermore, the experimental evaluations indicate that the proposed method develops higher scalability and accuracy against state-of-the-art algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Discovering the effective connectome of the brain with dynamic Bayesian DAG learning.
- Author
-
Bagheri, Abdolmahdi, Pasande, Mohammad, Bello, Kevin, Araabi, Babak Nadjar, and Akhondi-Asl, Alireza
- Subjects
- *
DIRECTED acyclic graphs , *TRUST , *PRIOR learning , *FUNCTIONAL magnetic resonance imaging , *DATA quality - Abstract
Understanding the complex mechanisms of the brain can be unraveled by extracting the Dynamic Effective Connectome (DEC). Recently, score-based Directed Acyclic Graph (DAG) discovery methods have shown significant improvements in extracting the causal structure and inferring effective connectivity. However, learning DEC through these methods still faces two main challenges: one with the fundamental impotence of high-dimensional dynamic DAG discovery methods and the other with the low quality of fMRI data. In this paper, we introduce Bayesian Dynamic DAG learning with M-matrices Acyclicity characterization (BDyMA) method to address the challenges in discovering DEC. The presented dynamic DAG enables us to discover direct feedback loop edges as well. Leveraging an unconstrained framework in the BDyMA method leads to more accurate results in detecting high-dimensional networks, achieving sparser outcomes, making it particularly suitable for extracting DEC. Additionally, the score function of the BDyMA method allows the incorporation of prior knowledge into the process of dynamic causal discovery which further enhances the accuracy of results. Comprehensive simulations on synthetic data and experiments on Human Connectome Project (HCP) data demonstrate that our method can handle both of the two main challenges, yielding more accurate and reliable DEC compared to state-of-the-art and traditional methods. Additionally, we investigate the trustworthiness of DTI data as prior knowledge for DEC discovery and show the improvements in DEC discovery when the DTI data is incorporated into the process. • We introduce BDyMA to discover dynamic causal structure of high-dimensional networks. • We demonstrate the effectiveness of the BDyMA in comparison to both existing method. • We show that our method enhances the intrasubject and intersubject reliability. • We examine the trustworthiness of DTI data as prior knowledge of DEC discovery. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Causal structure learning for high-dimensional non-stationary time series.
- Author
-
Chen, Siya, Wu, HaoTian, and Jin, Guang
- Subjects
- *
GRAPH neural networks , *BOOLEAN matrices , *DIRECTED acyclic graphs , *TIME series analysis , *MACHINE learning - Abstract
Learning the causal structure of high-dimensional non-stationary time series can help in understanding the data generation mechanism, which is a crucial task in machine learning. However, current causal discovery methods for high-dimensional non-stationary time series face several challenges, including the inability to effectively capture non-stationarity, failure to ensure acyclicity of causal graphs, and reliance on subjective threshold definitions, leading to suboptimal algorithm performance. To address these challenges, we introduce a novel Causal Structure Learning model for High-dimensional Non-stationary Time Series (CSL-HNTS). Firstly, this model presents a graph neural network to model the non-stationarity of time series. Secondly, it introduces a novel Directed Acyclic Graph (DAG) sampling method that transforms the space of DAGs into a continuous space, enabling the search for causal graphs within this continuous space to ensure acyclicity. Finally, the model proposes an automatic threshold definition method, without prior knowledge, to convert the weighted adjacency matrix into the Boolean adjacency matrix of the causal graph, thereby avoiding time-consuming postprocessing steps. The proposed approach is validated using simulation datasets and two real datasets, and is benchmarked against current state-of-the-art methods and ablation experiments. The results demonstrate a significant improvement over existing methods, highlighting the efficacy of the proposed model. • A Graph Neural Network (GNN) to model the non-stationarity of high-dimensional time series is proposed. • A novel method to transform the discrete space of directed acyclic graphs into a continuous space is introduced. • An automatic threshold definition method that does not rely on any prior knowledge to transform the weighted adjacency matrix into the Boolean adjacency matrix of a causal graph is presented. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Learning Bayesian Networks That Enable Full Propagation of Evidence
- Author
-
Anthony C. Constantinou
- Subjects
Causal discovery ,conditional independence ,directed acyclic graphs ,probabilistic graphical models ,structure learning ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
This paper builds on recent developments in Bayesian network (BN) structure learning under the controversial assumption that the input variables are dependent. This assumption can be viewed as a learning constraint geared towards cases where the input variables are known or assumed to be dependent. It addresses the problem of learning multiple disjoint subgraphs that do not enable full propagation of evidence. This problem is highly prevalent in cases where the sample size of the input data is low with respect to the dimensionality of the model, which is often the case when working with real data. The paper presents a novel hybrid structure learning algorithm, called SaiyanH, that addresses this issue. The results show that this constraint helps the algorithm to estimate the number of true edges with higher accuracy compared to the state-of-the-art. Out of the 13 algorithms investigated, the results rank SaiyanH 4th in reconstructing the true DAG, with accuracy scores lower by 8.1% (F1), 10.2% (BSF), and 19.5% (SHD) compared to the top ranked algorithm, and higher by 75.5% (F1), 118% (BSF), and 4.3% (SHD) compared to the bottom ranked algorithm. Overall, the results suggest that the proposed algorithm discovers satisfactorily accurate connected DAGs in cases where other algorithms produce multiple disjoint subgraphs that often underfit the true graph.
- Published
- 2020
- Full Text
- View/download PDF
25. Large-scale empirical validation of Bayesian Network structure learning algorithms with noisy data.
- Author
-
Constantinou, Anthony C., Liu, Yang, Chobtham, Kiattikun, Guo, Zhigao, and Kitson, Neville K.
- Subjects
- *
MACHINE learning , *BIG data , *SAMPLE size (Statistics) , *MEASUREMENT errors - Abstract
Numerous Bayesian Network (BN) structure learning algorithms have been proposed in the literature over the past few decades. Each publication makes an empirical or theoretical case for the algorithm proposed in that publication and results across studies are often inconsistent in their claims about which algorithm is 'best'. This is partly because there is no agreed evaluation approach to determine their effectiveness. Moreover, each algorithm is based on a set of assumptions, such as complete data and causal sufficiency, and tend to be evaluated with data that conforms to these assumptions, however unrealistic these assumptions may be in the real world. As a result, it is widely accepted that synthetic performance overestimates real performance, although to what degree this may happen remains unknown. This paper investigates the performance of 15 state-of-the-art, well-established, or recent promising structure learning algorithms. We propose a methodology that applies the algorithms to data that incorporates synthetic noise, in an effort to better understand the performance of structure learning algorithms when applied to real data. Each algorithm is tested over multiple case studies, sample sizes, types of noise, and assessed with multiple evaluation criteria. This work involved learning approximately 10,000 graphs with a total structure learning runtime of seven months. In investigating the impact of data noise, we provide the first large scale empirical comparison of BN structure learning algorithms under different assumptions of data noise. The results suggest that traditional synthetic performance may overestimate real-world performance by anywhere between 10% and more than 50%. They also show that while score-based learning is generally superior to constraint-based learning, a higher fitting score does not necessarily imply a more accurate causal graph. The comparisons extend to other outcomes of interest, such as runtime, reliability, and resilience to noise, assessed over both small and large networks, and with both limited and big data. To facilitate comparisons with future studies, we have made all data, raw results, graphs and BN models freely available online. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
26. The Reduced PC-Algorithm: Improved Causal Structure Learning in Large Random Networks.
- Author
-
Sondhi, Arjun and Shojaie, Ali
- Subjects
- *
NETWORK hubs , *DIRECTED acyclic graphs , *RANDOM graphs , *STRUCTURAL equation modeling , *BIOLOGICAL systems , *LINEAR equations - Abstract
We consider the task of estimating a high-dimensional directed acyclic graph, given observations from a linear structural equation model with arbitrary noise distribution. By exploiting properties of common random graphs, we develop a new algorithm that requires conditioning only on small sets of variables. The proposed algorithm, which is essentially a modified version of the PC-Algorithm, offers significant gains in both computational complexity and estimation accuracy. In particular, it results in more efficient and accurate estimation in large networks containing hub nodes, which are common in biological systems. We prove the consistency of the proposed algorithm, and show that it also requires a less stringent faithfulness assumption than the PC-Algorithm. Simulations in low and high-dimensional settings are used to illustrate these findings. An application to gene expression data suggests that the proposed algorithm can identify a greater number of clinically relevant genes than current methods. [ABSTRACT FROM AUTHOR]
- Published
- 2019
27. Estimation of a causal directed acyclic graph process using non-gaussianity.
- Author
-
Einizade, Aref, Giraldo, Jhony H., Malliaros, Fragkiskos D., and Hajipour Sardouie, Sepideh
- Subjects
- *
DIRECTED acyclic graphs , *ACYCLIC model , *DIRECTED graphs , *SIGNAL processing , *DATA mining , *MACHINE learning - Abstract
In machine learning and data mining, causal relationship discovery is a critical task. While the state-of-the-art Vector Auto-Regressive Linear Non-Gaussian Acyclic Model (VAR-LiNGAM) method excels in uncovering both instantaneous and time-lagged connections, it entails analyzing multiple VAR matrices, leading to heightened parameter complexity. To address this challenge, we introduce the Causal Graph Process-LiNGAM (CGP-LiNGAM), a novel approach that significantly reduces parameter load by focusing on a single causal graph, a Directed Acyclic Graph (DAG). Leveraging Graph Signal Processing (GSP) techniques, our method interprets causal relations with graph shift invariance and uniqueness. Our experimental results demonstrate the superiority and robustness of CGP-LiNGAM, particularly in high-noise environments. Moreover, we showcase its real-world applicability in studying brain connectivity during sleep, underlining its compatibility with previous sleep-related neuroscientific research. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. A metaheuristic causal discovery method in directed acyclic graphs space.
- Author
-
Liu, Xiaohan, Gao, Xiaoguang, Wang, Zidong, Ru, Xinxin, and Zhang, Qingfu
- Subjects
- *
DIRECTED acyclic graphs , *DIRECTED graphs , *METAHEURISTIC algorithms , *SEARCH algorithms - Abstract
Causal discovery plays a vital role in the human understanding of the world. Searching a directed acyclic graph (DAG) from observed data is one of the most widely used methods. However, in most existing approaches, the global search has poor scalability, and the local search is often insufficient to discover a reliable causal graph. In this paper, we propose a generic metaheuristic method to discover the causal relationship in the DAG itself instead in of any equivalent but indirect substitutes. We first propose several novel heuristic factors to expand the search space and maintain acyclicity. Second, using these factors, we propose a metaheuristic algorithm to further search for the optimal solution closer to real causality in the DAG space. Theoretical studies show the correctness of our proposed method. Extensive experiments are conducted to verify its generalization ability, scalability, and effectiveness on real-world and simulated structures for both discrete and continuous models by comparing it with other state-of-the-art causal solvers. We also compare the performance of our method with that of a state-of-the-art approach on well-known medical data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
29. Causal Graph Attention Network with Disentangled Representations for Complex Systems Fault Detection.
- Author
-
Liu, Jie, Zheng, Shuwen, and Wang, Chong
- Subjects
- *
DIRECTED acyclic graphs , *DATA structures , *SYSTEM failures , *BRAKE systems , *HIGH speed trains , *GRAPH algorithms , *FEATURE extraction - Abstract
• A Causal-GAT is proposed with adaptive attention on cause variables. • Node representations are disentangled to improve feature extraction efficiency. • Expertise and data are combined for charactering the high-dimensional data structure. • A real dataset concerning HST braking systems fault detection is conducted. • Interpretability of the proposed method is investigated. Considering the importance of complex systems fault detection, much efforts have been dedicated to fault feature extraction with monitoring data. The graph-based approach has become a trending topic, which exploits the non-Euclidean structure and generates representation based on spatial information. However, most graph-based models are built based on correlation assumption, and disregards the causality which are intrinsic in system and its failure process. In this paper, a causal graph attention network with disentangled representations (Causal-GAT) is proposed for fault detection. High-dimensional variables are first characterized into directed acyclic graphs using data-driven causal discovery combining expertise. The causal graph, which represents variables' cause-effect relations, is fed into the Causal-GAT. Disentangled Causal Attention (DC-Attention) is proposed to adaptively aggregate cause variables for embedding the effect variables. To improve feature extraction efficiency in the multi-head attention, the DC-Attention enforces disentangled node representation by regularizing it with a specified causal condition. To verify the effectiveness of the proposed method, a real case study concerning the high-speed train braking systems is considered. Experimental results with the benchmark methods demonstrate the advantages of the proposed method. Validities of causal graph construction, representation disentanglement, as well as interpretability of the model are also discussed in this work. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
30. The importance of temporal information in Bayesian network structure learning.
- Author
-
Constantinou, Anthony C.
- Subjects
- *
INFORMATION networks , *MACHINE learning , *ALGORITHMS - Abstract
Several algorithms have been proposed towards discovering the graphical structure of Bayesian networks. Most of these algorithms are restricted to observational data and some enable us to incorporate knowledge as constraints in terms of what can and cannot be discovered by an algorithm. A common type of such knowledge involves the temporal order of the variables in the data. For example, knowledge that event B occurs after observing A and hence, the constraint that B cannot cause A. This paper investigates real-world case studies that incorporate interesting properties of objective temporal variable order, and the impact these temporal constraints have on the learnt graph. The results show that most of the learnt graphs are subject to major modifications after incorporating incomplete temporal objective information. Because temporal information is widely viewed as a form of knowledge that is subjective, rather than as a form of data that tends to be objective, it is generally disregarded and reduced to an optional piece of information that only few of the structure learning algorithms may consider. The paper argues that objective temporal information should form part of observational data, to reduce the risk of disregarding such information when available and to encourage its reusability across related studies. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
31. Causal network learning with non-invertible functional relationships.
- Author
-
Wang, Bingling and Zhou, Qing
- Subjects
- *
DIRECTED acyclic graphs , *STRUCTURAL equation modeling , *CAUSAL models , *TRANSCRIPTION factors , *NONLINEAR equations - Abstract
Discovery of causal relationships from observational data is an important problem in many areas. Several recent results have established the identifiability of causal directed acyclic graphs (DAGs) with non-Gaussian and/or nonlinear structural equation models (SEMs). Focusing on nonlinear SEMs defined by non-invertible functions, which exist in many data domains, a novel test is proposed for non-invertible bivariate causal models. Algorithms are further developed to incorporate this test in structure learning of DAGs that contain both linear and nonlinear causal relations. Extensive numerical comparisons show that the proposed algorithms outperform existing DAG learning methods in identifying causal graphical structures. The practical application of the methods is illustrated by learning causal networks for combinatorial binding of transcription factors from ChIP-Seq data. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.