1. Statistical Modeling and Simulation of Online Shopping Customer Loyalty Based on Machine Learning and Big Data Analysis
- Author
-
Ching-Tang Hsieh, Po-Chang Ko, Cher-Min Fong, Hsin-Hung Chen, Sn-Man Lai, and Jui-Chan Huang
- Subjects
Science (General) ,Article Subject ,Association rule learning ,Computer Networks and Communications ,Computer science ,Big data ,Hash function ,02 engineering and technology ,Machine learning ,computer.software_genre ,030218 nuclear medicine & medical imaging ,Loyalty business model ,03 medical and health sciences ,Q1-390 ,0302 clinical medicine ,Web page ,0202 electrical engineering, electronic engineering, information engineering ,Data deduplication ,T1-995 ,Cluster analysis ,Technology (General) ,business.industry ,020206 networking & telecommunications ,Artificial intelligence ,Web crawler ,business ,computer ,Information Systems - Abstract
With the increase in the number of online shopping users, customer loyalty is directly related to product sales. This research mainly explores the statistical modeling and simulation of online shopping customer loyalty based on machine learning and big data analysis. This research mainly uses machine learning clustering algorithm to simulate customer loyalty. Call the k-means interactive mining algorithm based on the Hash structure to perform data mining on the multidimensional hierarchical tree of corporate credit risk, continuously adjust the support thresholds for different levels of data mining according to specific requirements and select effective association rules until satisfactory results are obtained. After conducting credit risk assessment and early warning modeling for the enterprise, the initial preselected model is obtained. The information to be collected is first obtained by the web crawler from the target website to the temporary web page database, where it will go through a series of preprocessing steps such as completion, deduplication, analysis, and extraction to ensure that the crawled web page is correctly analyzed, to avoid incorrect data due to network errors during the crawling process. The correctly parsed data will be stored for the next step of data cleaning or data analysis. For writing a Java program to parse HTML documents, first set the subject keyword and URL and parse the HTML from the obtained file or string by analyzing the structure of the website. Secondly, use the CSS selector to find the web page list information, retrieve the data, and store it in Elements. In the overall fit test of the model, the root mean square error approximation (RMSEA) value is 0.053, between 0.05 and 0.08. The results show that the model designed in this study achieves a relatively good fitting effect and strengthens customers’ perception of shopping websites, and relationship trust plays a greater role in maintaining customer loyalty.
- Published
- 2021