Back to Search Start Over

PARALLEL ALGORITHMS OF RANDOM FORESTS FOR CLASSIFYING VERY LARGE DATASETS

Authors :
Do Thanh Nghi
Pham Nguyen Khang
Nguyen Van Hoa
Ly Hoang Trong
Source :
Tạp chí Khoa học Đại học Đà Lạt, Vol 3, Iss 2 (2013)
Publication Year :
2013
Publisher :
Dalat University, 2013.

Abstract

The random forests algorithm proposed by Breiman is an ensemble-based approach with very high accuracy. The learning and classification tasks of a set of decision trees take a lot of time, make it intractable when dealing with very large datasets. There is a need to scale up the random forests algorithm to handle massive datasets. We propose parallel algorithms of random forests to take into account the benefits of Grids computing. These algorithms improve training and classification time compared with the original ones. The experimental results on large datasets including Forest cover type,KDD Cup 1999, Connect-4 from the UCI data repository showed that the training and classification time of parallel algorithms are significantly reduced.

Details

Language :
English, Vietnamese
ISSN :
0866787X
Volume :
3
Issue :
2
Database :
Directory of Open Access Journals
Journal :
Tạp chí Khoa học Đại học Đà Lạt
Publication Type :
Academic Journal
Accession number :
edsdoj.6359738de3634b1c9cb675977c075c46
Document Type :
article
Full Text :
https://doi.org/10.37569/DalatUniversity.3.2.247(2013)