Back to Search Start Over

A Novel Approach for Deciphering Big Data Value Using Dark Data.

Authors :
Bhatia, Surbhi
Alojail, Mohammed
Source :
Intelligent Automation & Soft Computing; 2022, Vol. 33 Issue 2, p1261-1271, 11p
Publication Year :
2022

Abstract

The last decade has seen a rapid increase in big data, which has led to a need for more tools that can help organizations in their data management and decision making. Business intelligence tools have removed many of the obstacles to data visibility, and numerous data mining technologies are playing an essential role in this visibility. However, the increase in big data has also led to an increase in 'dark data', data that does not have any predefined structure and is not generated intentionally. In this paper, we show how dark data can be mined for practical purposes and utilized to gain business insight. The most common type of dark data is a log file generated on a web server. Using the example of log files generated by e-commerce transactions, this paper shows how residual data and data trails can prove to be valuable when an actual dataset is inaccessible, and explains the usage of residual data for modeling purposes. The work uses a system identification approach, based on natural language processing for log file tokenization and feature extraction. The features are then embedded into the next step, which uses a deep neural network to identify customers for targeted advertising. The results achieve a significant accuracy and show how dark data has the potential to deliver value for business. Locating, organizing, and understanding dark data can unlock its relevance, usefulness, and potential monetization, but it is important to act when the benefits of use outweigh the costs of access and analysis. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10798587
Volume :
33
Issue :
2
Database :
Complementary Index
Journal :
Intelligent Automation & Soft Computing
Publication Type :
Academic Journal
Accession number :
155230810
Full Text :
https://doi.org/10.32604/iasc.2022.023501