Back to Search
Start Over
ParallelPC: An R Package for Efficient Causal Exploration in Genomic Data
- Source :
- Lecture Notes in Computer Science ISBN: 9783030045029, PAKDD (Workshops)
- Publication Year :
- 2018
- Publisher :
- Springer International Publishing, 2018.
-
Abstract
- Discovering causal relationships from genomic data is the ultimate goal in gene regulation research. Constraint based causal exploration algorithms, such as PC, FCI, RFCI, PC-simple, IDA and Joint-IDA have achieved significant progress and have many applications. However, their applications in bioinformatics are still limited due to their high computational complexity. In this paper, we present an R package, ParallelPC, that includes the parallelised versions of these causal exploration algorithms and 12 different conditional independence tests for each. The parallelised algorithms help speed up the procedure of experimenting large biological datasets and reduce the memory used when running the algorithms. Our experiment results on a real gene expression dataset show that using the parallelised algorithms it is now practical to explore causal relationships in high dimensional datasets with thousands of variables in a personal multicore computer. We present some typical applications in bioinformatics using different algorithms in ParallelPC. ParallelPC is available in CRAN repository at https://cran.r-project.org/web/packages/ParallelPC/index.html Refereed/Peer-reviewed
- Subjects :
- 0301 basic medicine
Multi-core processor
Speedup
Computational complexity theory
parallel computing
Computer science
business.industry
Genomic data
Bayesian network
Machine learning
computer.software_genre
01 natural sciences
Constraint (information theory)
010104 statistics & probability
03 medical and health sciences
R package
Bayesian networks
030104 developmental biology
constraint-based methods
Conditional independence
Artificial intelligence
0101 mathematics
business
computer
causality discovery
Subjects
Details
- ISBN :
- 978-3-030-04502-9
- ISBNs :
- 9783030045029
- Database :
- OpenAIRE
- Journal :
- Lecture Notes in Computer Science ISBN: 9783030045029, PAKDD (Workshops)
- Accession number :
- edsair.doi.dedup.....cf96a9a3c540b81189c31c26bb652e3b
- Full Text :
- https://doi.org/10.1007/978-3-030-04503-6_22