
Comparison of Classification Algorithms for Detection of Phishing Websites.

Authors :
VAITKEVICIUS, Paulius
MARCINKEVICIUS, Virginijus
Source :
Informatica; 2020, Vol. 31 Issue 1, p143-160, 18p
Publication Year :
2020

Abstract

Phishing activities remain a persistent security threat, with global losses exceeding 2.7 billion USD in 2018, according to the FBI's Internet Crime Complaint Center. In literature, different generations of phishing websites detection methods have been observed. The oldest methods include manual blacklisting of known phishing websites' URLs in the centralized database, but they have not been able to detect newly launched phishing websites. More recent studies have attempted to solve phishing websites detection as a supervised machine learning problem on phishing datasets, designed on features extracted from phishing websites' URLs. These studies have shown some classification algorithms performing better than others on differently designed datasets but have not distinguished the best classification algorithm for the phishing websites detection problem in general. The purpose of this research is to compare classic supervised machine learning algorithms on all publicly available phishing datasets with predefined features and to distinguish the best performing algorithm for solving the problem of phishing websites detection, regardless of a specific dataset design. Eight widely used classification algorithms were configured in Python using the Scikit Learn library and tested for classification accuracy on all publicly available phishing datasets. Later, classification algorithms were ranked by accuracy on different datasets using three different ranking techniques while testing the results for a statistically significant difference using Welch's T-Test. The comparison results are presented in this paper, showing ensembles and neural networks outperforming other classical algorithms. [ABSTRACT FROM AUTHOR]
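
Illustrative note (not part of the authors' abstract): the ranking and significance-testing step mentioned above might look roughly like the sketch below. It ranks two classifiers by mean accuracy and computes Welch's t statistic with the Welch-Satterthwaite degrees of freedom. The classifier names, the per-fold accuracy values, and the helper `welch_t` are all hypothetical and invented for illustration; they are not results or code from the paper, and the paper's three ranking techniques are not reproduced here.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom)."""
    # Squared standard error of each sample mean (sample variance / n).
    se_a = variance(a) / len(a)
    se_b = variance(b) / len(b)
    t = (mean(a) - mean(b)) / sqrt(se_a + se_b)
    df = (se_a + se_b) ** 2 / (
        se_a ** 2 / (len(a) - 1) + se_b ** 2 / (len(b) - 1)
    )
    return t, df


# Hypothetical per-fold accuracies, invented for illustration only.
fold_accuracy = {
    "classifier_a": [0.96, 0.97, 0.95, 0.96, 0.97],
    "classifier_b": [0.91, 0.93, 0.90, 0.92, 0.92],
}

# Simplest possible ranking: order classifiers by mean accuracy, best first.
ranking = sorted(
    fold_accuracy, key=lambda name: mean(fold_accuracy[name]), reverse=True
)

t_stat, dof = welch_t(
    fold_accuracy["classifier_a"], fold_accuracy["classifier_b"]
)
print(ranking, t_stat, dof)
```

Converting the statistic to a p-value requires the t-distribution; with SciPy installed, `scipy.stats.ttest_ind(a, b, equal_var=False)` performs the same Welch test and returns the p-value directly.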

Details

Language :
English
ISSN :
08684952
Volume :
31
Issue :
1
Database :
Complementary Index
Journal :
Informatica
Publication Type :
Academic Journal
Accession number :
146255744
Full Text :
https://doi.org/10.15388/20-INFOR404