Phishing Website Detection With Semantic Features Based on Machine Learning Classifiers

Almomani, A, Alauthman, M, Shatnawi, M T, Alweshah, M, Alrosan, A, Alomoush, W, Gupta, B B and Gupta, B B (2022) Phishing Website Detection With Semantic Features Based on Machine Learning Classifiers. International Journal on Semantic Web and Information Systems, 18 (1). pp. 1-24. ISSN 1552-6283

[thumbnail of 68.pdf] Text
68.pdf - Published Version

Download (2MB)

Abstract

The phishing attack is one of the main cybersecurity threats in web phishing and spear phishing. Phishing websites continue to be a problem. One of the main contributions to our study was working and extracting the URL & Domain Identity feature, Abnormal Features, HTML and JavaScript Features, and Domain Features as semantic features to detect phishing websites, which makes the process of classification using those semantic features, more controllable and more effective. The current study used machine learning model algorithms to detect phishing websites, and comparisons were made. We have used 16 machine learning models adopted with 10 semantic features that represent the most effective features for the detection of phishing webpages extracted from two datasets. The GradientBoostingClassifier and RandomForestClassifier had the best accuracy based on the comparison results (i.e., about 97%). In contrast, GaussianNB and the stochastic gradient descent (SGD) classifier represent the lowest accuracy results; 84% and 81% respectively, in comparison with other classifiers.

Affiliation: Skyline University College
SUC Author(s): Almomani, A ORCID: https://orcid.org/0000-0002-8808-6114, Alrosan, A ORCID: https://orcid.org/0000-0001-9400-4077 and Alomoush, W ORCID: https://orcid.org/0000-0002-2937-4327
All Author(s): Almomani, A, Alauthman, M, Shatnawi, M T, Alweshah, M, Alrosan, A, Alomoush, W, Gupta, B B and Gupta, B B
Item Type: Article
Uncontrolled Keywords: Machine Learning Models, Phishing Website, Semantic Classification, Semantic Features
Subjects: B Information Technology > BL Machine Learning
Divisions: Skyline University College > School of IT
Depositing User: Mr Veeramani Rasu
Date Deposited: 21 May 2022 10:39
Last Modified: 21 May 2022 10:39
URI: https://research.skylineuniversity.ac.ae/id/eprint/197
Publisher URL: https://doi.org/10.4018/IJSWIS.297032
Publisher OA policy: https://v2.sherpa.ac.uk/id/publication/17961
Related URLs:

Actions (login required)

View Item
View Item
Statistics for SkyRep ePrint 197 Statistics for this ePrint Item