Back to Search Start Over

ParallelPC: An R Package for Efficient Causal Exploration in Genomic Data

Authors :
Jiuyong Li
Lin Liu
Hu Shu
Thuc Duy Le
Tao Hoang
Taosheng Xu
Le, Thuc Duy
Xu, Taosheng
Liu, Lin
Shu, Hu
Hoang, Tao
Li, Jiuyong
22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2018 Melbourne, Australia 3 June 2018
Source :
Lecture Notes in Computer Science ISBN: 9783030045029, PAKDD (Workshops)
Publication Year :
2018
Publisher :
Springer International Publishing, 2018.

Abstract

Discovering causal relationships from genomic data is the ultimate goal in gene regulation research. Constraint based causal exploration algorithms, such as PC, FCI, RFCI, PC-simple, IDA and Joint-IDA have achieved significant progress and have many applications. However, their applications in bioinformatics are still limited due to their high computational complexity. In this paper, we present an R package, ParallelPC, that includes the parallelised versions of these causal exploration algorithms and 12 different conditional independence tests for each. The parallelised algorithms help speed up the procedure of experimenting large biological datasets and reduce the memory used when running the algorithms. Our experiment results on a real gene expression dataset show that using the parallelised algorithms it is now practical to explore causal relationships in high dimensional datasets with thousands of variables in a personal multicore computer. We present some typical applications in bioinformatics using different algorithms in ParallelPC. ParallelPC is available in CRAN repository at https://cran.r-project.org/web/packages/ParallelPC/index.html Refereed/Peer-reviewed

Details

ISBN :
978-3-030-04502-9
ISBNs :
9783030045029
Database :
OpenAIRE
Journal :
Lecture Notes in Computer Science ISBN: 9783030045029, PAKDD (Workshops)
Accession number :
edsair.doi.dedup.....cf96a9a3c540b81189c31c26bb652e3b
Full Text :
https://doi.org/10.1007/978-3-030-04503-6_22