16 results
Search Results
2. Intelligent Low-Consumption Optimization Strategies: Economic Operation of Hydropower Stations Based on Improved LSTM and Random Forest Machine Learning Algorithm.
- Author
-
Pan, Hong, Yang, Jie, Yu, Yang, Zheng, Yuan, Zheng, Xiaonan, and Hang, Chenyang
- Subjects
- *
MACHINE learning , *RANDOM forest algorithms , *WATER consumption , *WATER power , *PARTICLE swarm optimization , *WATER efficiency , *RENEWABLE energy sources - Abstract
The economic operation of hydropower stations has the potential to increase water use efficiency. However, there are some challenges, such as the fixed and unchangeable flow characteristic curve of the hydraulic turbines, and the large number of variables in optimal load distribution, which limit the progress of research. In this paper, we propose a new optimal method of the economic operation of hydropower stations based on improved Long Short-Term Memory neural network (I-LSTM) and Random Forest (RF) algorithm. Firstly, in order to accurately estimate the water consumption, the LSTM model's hyperparameters are optimized using improved particle swarm optimization, and the I-LSTM method is proposed to fit the flow characteristic curve of the hydraulic turbines. Secondly, the Random Forest machine learning algorithm is introduced to establish a load-distribution model with its powerful feature extraction and learning ability. To improve the accuracy of the load-distribution model, we use the K-means algorithm to cluster the historical data and optimize the parameters of the Random Forest model. A Hydropower Station in China is selected for a case study. It is shown that (1) the I-LSTM method fits the operating characteristics under various working conditions and actual operating characteristics of hydraulic turbines, ensuring that they are closest to the actual operating state; (2) the I-LSTM method is compared with Support Vector Machine (SVM), Extreme Learning Machine (ELM) and Long Short-Term Memory neural network (LSTM). The prediction results of SVM have a large error, but compared with ELM and LSTM, MSE is reduced by about 46% and 38% respectively. MAE is reduced by about 25% and 21%, respectively. RMSE is reduced by about 27% and 24%, respectively; (3) the RF algorithm performs better than the traditional dynamic programming algorithm in load distribution. With the passage of time and the increase in training samples, the prediction accuracy of the Random Forest model has steadily improved, which helps to achieve optimal operation of the units, reducing their average total water consumption by 1.24%. This study provides strong support for the application of intelligent low-consumption optimization strategies in hydropower fields, which can bring higher economic benefits and resource savings to renewable energy production. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. A Study on the Effects of Digital Finance on Green Low-Carbon Circular Development Based on Machine Learning Models.
- Author
-
Zhang, Xuewei, Ai, Xiaoqing, Wang, Xiaoxiang, Zong, Gang, and Zhang, Jinghao
- Subjects
- *
MACHINE learning , *HIGH technology industries , *ARTIFICIAL intelligence , *RANDOM forest algorithms , *BLOCKCHAINS , *CARBON nanofibers , *CLOUD computing , *TECHNOLOGICAL innovations - Abstract
With technological transformations such as big data, blockchain, artificial intelligence, and cloud computing, digital techniques are infiltrating the field of finance. Digital finance (DF) is a resource-saving and environmentally friendly innovative financial service. It shows great green attributes and can drive the flow of financial resources towards environmentally-friendly enterprises, thereby promoting green low-carbon circular development (GLCD). However, few studies have explored the coupling mechanism between DF and GLCD. To fill this gap, this paper explores the effect of DF on GLCD, and established a mediating effect model to investigate the mechanism of DF in promoting GLCD. Additionally, this paper established a random forest model and a CatBoost model based on machine learning to examine the relative importance of DF and the factors affecting GLCD. The results show that DF has significant positive effects on GLCD, and technological innovation plays a key role in the effect of DF on GLCD; meanwhile, the effect of DF on GLCD shows nonlinear features with an increasing "marginal effect"; moreover, both DF and conventional factors have significant impacts on GLCD. Our study highlights the effect of DF on GLCD and underscores the importance of developing policies for DF and GLCD. This study provides an empirical basis and path reference for DF to achieve "carbon peak, carbon neutralization" in China. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
4. Healthcare Cost Prediction Based on Hybrid Machine Learning Algorithms.
- Author
-
Zou, Shujie, Chu, Chiawei, Shen, Ning, and Ren, Jia
- Subjects
- *
MEDICAL care costs , *MACHINE learning , *LONG-term memory , *BAYESIAN analysis , *SIMPLE machines , *RANDOM forest algorithms - Abstract
Healthcare cost is an issue of concern right now. While many complex machine learning algorithms have been proposed to analyze healthcare cost and address the shortcomings of linear regression and reliance on expert analyses, these algorithms do not take into account whether each characteristic variable contained in the healthcare data has a positive effect on predicting healthcare cost. This paper uses hybrid machine learning algorithms to predict healthcare cost. First, network structure learning algorithms (a score-based algorithm, constraint-based algorithm, and hybrid algorithm) for a Conditional Gaussian Bayesian Network (CGBN) are used to learn the isolated characteristic variables in healthcare data without changing the data properties (i.e., discrete or continuous). Then, the isolated characteristic variables are removed from the original data and the remaining data used to train regression algorithms. Two public healthcare datasets are used to test the performance of the proposed hybrid machine learning algorithm model. Experiments show that when compared to popular single machine learning algorithms (Long Short Term Memory, Random Forest, etc.) the proposed scheme can obtain similar or higher prediction accuracy with a reduced amount of data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
5. Predicting Intensive Care Unit Patients' Discharge Date with a Hybrid Machine Learning Model That Combines Length of Stay and Days to Discharge.
- Author
-
Cuadrado, David, Valls, Aida, and Riaño, David
- Subjects
- *
MACHINE learning , *INTENSIVE care patients , *HOSPITAL admission & discharge , *INTENSIVE care units , *RANDOM forest algorithms - Abstract
Background: Accurate planning of the duration of stays at intensive care units is of utmost importance for resource planning. Currently, the discharge date used for resource management is calculated only at admission time and is called length of stay. However, the evolution of the treatment may be different from one patient to another, so a recalculation of the date of discharge should be performed, called days to discharge. The prediction of days to discharge during the stay at the ICU with statistical and data analysis methods has been poorly studied with low-quality results. This study aims to improve the prediction of the discharge date for any patient in intensive care units using artificial intelligence techniques. Methods: The paper proposes a hybrid method based on group-conditioned models obtained with machine learning techniques. Patients are grouped into three clusters based on an initial length of stay estimation. On each group (grouped by first days of stay), we calculate the group-conditioned length of stay value to know the predicted date of discharge, then, after a given number of days, another group-conditioned prediction model must be used to calculate the days to discharge in order to obtain a more accurate prediction of the number of remaining days. The study is performed with the eICU database, a public dataset of USA patients admitted to intensive care units between 2014 and 2015. Three machine learning methods (i.e., Random Forest, XGBoost, and lightGBM) are used to generate length of stay and days to discharge predictive models for each group. Results: Random Forest is the algorithm that obtains the best days to discharge predictors. The proposed hybrid method achieves a root mean square error (RMSE) and mean average error (MAE) below one day on the eICU dataset for the last six days of stay. Conclusions: Machine learning models improve quality of predictions for the days to discharge and length of stay for intensive care unit patients. The results demonstrate that the hybrid model, based on Random Forest, improves the accuracy for predicting length of stay at the start and days to discharge at the end of the intensive care unit stay. Implementing these prediction models may help in the accurate estimation of bed occupancy at intensive care units, thus improving the planning for these limited and critical health-care resources. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
6. Research on Emotional Infection of Passengers during the SRtP of a Cruise Ship by Combining an SIR Model and Machine Learning.
- Author
-
Xiong, Gaohan, Cai, Wei, Hu, Min, and Yu, Zhiyan
- Subjects
- *
MACHINE learning , *CRUISE ships , *RANDOM forest algorithms , *EMOTIONAL contagion , *AFFECTIVE forecasting (Psychology) - Abstract
The Safe Return to Port issue regarding cruise ships has been extensively researched, covering aspects such as performance, operations, and electrical systems. However, an often overlooked aspect is the potential eruption of negative emotions among passengers during SRtP. This study aims to investigate the prediction of collective emotions to facilitate timely safety planning and enhance the safety of the Safe Return to Port process. To achieve this objective, an improved susceptible-infectious-recovered model with bidirectional infection is proposed to describe the emotional contagion process during the Safe Return to Port process. This model classifies the population into five emotional (extremely anxious–anxious–normal–calm–very calm) states and introduces two sources of infection. Moreover, it allows for emotions to transition both positively and negatively, making it a more realistic representation of scenarios resembling long-term refuge scenarios. In this study, questionnaire data, collected and statistically analyzed, serve as the primary dataset. A machine learning technique (the weighted random forest algorithm) is integrated with the model to make predictions. The accuracy, precision, recall, and the F-measure of prediction results demonstrate good performance. Additionally, through simulation, this study illustrates the fluctuating nature of emotional changes during the Safe Return to Port process of the cruise ship and analyzes the effects of varying parameters. The findings suggest that the improved susceptible-infectious-recovered model proposed in this paper can provide valuable insights for cruise ship emergency planning and positively contribute to maintaining passenger emotional stability during the Safe Return to Port process. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
7. Multiaxial Strength Criterion Model of Concrete Based on Random Forest.
- Author
-
Chen, Xingqiao, Zheng, Dongjian, Liu, Yongtao, Wu, Xin, Jiang, Haifeng, and Qiu, Jianchun
- Subjects
- *
RANDOM forest algorithms , *COMPOSITE columns , *MACHINE learning , *BACK propagation , *CONVEX functions , *CONCRETE - Abstract
The concrete strength criterion is the basis of strength analysis and evaluation under a complex stress state. In this paper, a large number of multiaxial strength tests were carried out, and many mathematical expressions of strength criteria were proposed based on the geometric characteristics and the assumption of a convex function. However, the rationality of the assumption of a convex function limits the use of these strength criteria. In particular, misjudgment will occur near the failure curve surface. Therefore, this paper does not assume the shape function of the criterion in advance. By collecting experimental data and using a machine learning method, it proposes a method of hidden function of failure curve surface. Based on 777 groups of experimental data, the random forest (RF), the back propagation neural network (BP) and the radial basis neural network (RBF) models were used to analyze and verify the feasibility and effectiveness of the method. Subsequently, the results were compared with the Ottosen strength criterion, the Guo Wang strength criterion and the Drucker–Prager (DP) strength criterion. The results show that the consistency between the strength criterion model established by the machine learning algorithm (especially random forest) and the experimental data is higher than the convex function multiaxis strength criterion of the preset failure surface shape. Moreover, the physical significance is clearer, the deficiency of the convex function failure surface hypothesis is avoided and the established multiaxial strength criterion of concrete is more universal. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
8. New Machine-Learning Control Charts for Simultaneous Monitoring of Multivariate Normal Process Parameters with Detection and Identification.
- Author
-
Sabahno, Hamed and Niaki, Seyed Taghi Akhavan
- Subjects
- *
QUALITY control charts , *MACHINE learning , *PARAMETER identification , *SUPPORT vector machines , *RANDOM forest algorithms - Abstract
Simultaneous monitoring of the process parameters in a multivariate normal process has caught researchers' attention during the last two decades. However, only statistical control charts have been developed so far for this purpose. On the other hand, machine-learning (ML) techniques have rarely been developed to be used in control charts. In this paper, three ML control charts are proposed using the concepts of artificial neural networks, support vector machines, and random forests techniques. These ML techniques are trained to obtain linear outputs, and then based on the concepts of memory-less control charts, the process is classified into in-control or out-of-control states. Two different input scenarios and two different training methods are used for the proposed ML structures. In addition, two different process control scenarios are utilized. In one, the goal is only the detection of the out-of-control situation. In the other one, the identification of the responsible variable (s)/process parameter (s) for the out-of-control signal is also an aim (detection–identification). After developing the ML control charts for each scenario, we compare them to one another, as well as to the most recently developed statistical control charts. The results show significantly better performance of the proposed ML control charts against the traditional memory-less statistical control charts in most compared cases. Finally, an illustrative example is presented to show how the proposed scheme can be implemented in a healthcare process. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. The Fama–French Five-Factor Model with Hurst Exponents Compared with Machine Learning Methods.
- Author
-
Li, Yicun and Teng, Yuanyang
- Subjects
- *
MACHINE learning , *EXPONENTS , *RANDOM forest algorithms , *INVESTORS - Abstract
Scholars and investors have been interested in factor models for a long time. This paper builds models using the monthly data of the A-share market. We construct a seven-factor model by adding the Hurst exponent factor and the momentum factor to a Fama–French five-factor model and find that there is a 7% improvement in the average R–squared. Then, we compare five machine learning algorithms with ordinary least squares (OLS) in one representative stock and all A-Share stocks. We find that regularization algorithms, such as lasso and ridge, have worse performance than OLS. SVM and random forests have a good improvement in fitting power, while the neural network is not always better than OLS, depending on the data, frequency, period, etc. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Automated EEG Pathology Detection Based on Significant Feature Extraction and Selection.
- Author
-
Zhong, Yunning, Wei, Hongyu, Chen, Lifei, and Wu, Tao
- Subjects
- *
ELECTROENCEPHALOGRAPHY , *FEATURE selection , *FEATURE extraction , *PATHOLOGY , *RANDOM forest algorithms , *MACHINE learning , *WAKEFULNESS - Abstract
Neurological diseases are a significant health threat, often presenting through abnormalities in electroencephalogram (EEG) signals during seizures. In recent years, machine learning (ML) technologies have been explored as a means of automated EEG pathology diagnosis. However, existing ML-based EEG binary classification methods largely focus on extracting EEG-related features, which may lead to poor performance in classifying EEG signals by overlooking potentially redundant information. In this paper, we propose a novel Kruskal–Wallis (KW) test-based framework for EEG pathology detection. Our framework first divides EEG data into frequency sub-bands using wavelet packet decomposition and then extracts statistical characteristics from each selected coefficient. Next, the piecewise aggregation approximation technique is used to obtain the aggregated feature vectors, followed by the KW statistical test methodology to select significant features. Finally, three ensemble learning classifiers, random forest, categorical boosting (CatBoost), and light gradient boosting machine, are used to classify the extracted significant features into normal or abnormal classes. Our proposed framework achieves an accuracy of 89.13%, F1-score of 87.60%, and G-mean of 88.60%, respectively, outperforming other competing techniques on the same dataset, which shows the great promise in EEG pathology detection. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
11. RanKer : An AI-Based Employee-Performance Classification Scheme to Rank and Identify Low Performers.
- Author
-
Patel, Keyur, Sheth, Karan, Mehta, Dev, Tanwar, Sudeep, Florea, Bogdan Cristian, Taralunga, Dragos Daniel, Altameem, Ahmed, Altameem, Torki, and Sharma, Ravi
- Subjects
- *
EMPLOYEE reviews , *ARTIFICIAL intelligence , *JOB performance , *RANDOM forest algorithms , *DECISION trees - Abstract
An organization's success depends on its employees, and an employee's performance decides whether the organization is successful. Employee performance enhances the productivity and output of organizations, i.e., the performance of an employee paves the way for the organization's success. Hence, analyzing employee performance and giving performance ratings to employees is essential for companies nowadays. It is evident that different people have different skill sets and behavior, so data should be gathered from all parts of an employee's life. This paper aims to provide the performance rating of an employee based on various factors. First, we compare various AI-based algorithms, such as random forest, artificial neural network, decision tree, and XGBoost. Then, we propose an ensemble approach, RanKer, combining all the above approaches. The empirical results illustrate that the efficacy of the proposed model compared to traditional models such as random forest, artificial neural network, decision tree, and XGBoost is high in terms of precision, recall, F1-score, and accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
12. Quantitative Analysis of Anesthesia Recovery Time by Machine Learning Prediction Models.
- Author
-
Yang, Shumin, Li, Huaying, Lin, Zhizhe, Song, Youyi, Lin, Cheng, and Zhou, Teng
- Subjects
- *
DEEP learning , *MACHINE learning , *PREDICTION models , *QUANTITATIVE research , *ANESTHESIA , *RANDOM forest algorithms - Abstract
It is significant for anesthesiologists to have a precise grasp of the recovery time of the patient after anesthesia. Accurate prediction of anesthesia recovery time can support anesthesiologist decision-making during surgery to help reduce the risk of surgery in patients. However, effective models are not proposed to solve this problem for anesthesiologists. In this paper, we seek to find effective forecasting methods. First, we collect 1824 patient anesthesia data from the eye center and then performed data preprocessing. We extracted 85 variables to predict recovery time from anesthesia. Second, we extract anesthesia information between variables for prediction using machine learning methods, including Bayesian ridge, lightGBM, random forest, support vector regression, and extreme gradient boosting. We also design simple deep learning models as prediction models, including linear residual neural networks and jumping knowledge linear neural networks. Lastly, we perform a comparative experiment of the above methods on the dataset. The experiment demonstrates that the machine learning method performs better than the deep learning model mentioned above on a small number of samples. We find random forest and XGBoost are more efficient than other methods to extract information between variables on postoperative anesthesia recovery time. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
13. Towards Explainable Machine Learning for Bank Churn Prediction Using Data Balancing and Ensemble-Based Methods.
- Author
-
Tékouabou, Stéphane C. K., Gherghina, Ștefan Cristian, Toulni, Hamza, Mata, Pedro Neves, and Martins, José Moleiro
- Subjects
- *
CUSTOMER relationship management , *CLASSIFICATION algorithms , *CUSTOMER loyalty , *ENTERPRISE resource planning , *CUSTOMER relationship management software , *RANDOM forest algorithms , *ELECTRONIC data processing , *MACHINE learning - Abstract
The diversity of data collected on both social networks and digital interfaces is extremely increased, raising the problem of heterogeneous variables that are not often favourable to classification algorithms. Despite the significant improvement in machine learning (ML) and predictive analysis efficiency for classification in customer relationship management systems (CRM), their performance remains very limited by heterogeneous data processing, class imbalance, and feature scales. This impact turned out to be more important for simple ML methods which in addition often suffer from over-fitting. This paper proposes a succinct and detailed ML model building process including cross-validation of the combination of SMOTE to balance data and ensemble methods for modelling. From the conducted experiments, the random forest (RF) model yielded the best performance of 0.86 in terms of accuracy and f1-scoreusing balanced data. It confirms the literature summary about this topic which shows that RF was among the most effective algorithms for customer predictive classification issues. The constructed and optimized models were interpreted by Shapley values and feature importance analysis which shows that the "age" feature was the most significant while "HasCrCard" was the less one. This process has proven effective in bridging previously reported research gaps and the resulting model should be used for supporting bank customer loyalty decision-making. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
14. Novel Ensemble Tree Solution for Rockburst Prediction Using Deep Forest.
- Author
-
Li, Diyuan, Liu, Zida, Armaghani, Danial Jahed, Xiao, Peng, and Zhou, Jian
- Subjects
- *
DEEP learning , *SUPPORT vector machines , *GOLD mining , *MACHINE learning , *RANDOM forest algorithms - Abstract
The occurrence of rockburst can cause significant disasters in underground rock engineering. It is crucial to predict and prevent rockburst in deep tunnels and mines. In this paper, the deficiencies of ensemble learning algorithms in rockburst prediction were investigated. Aiming at these shortages, a novel machine learning model, deep forest, was proposed to predict rockburst risk. The deep forest combines the characteristics of deep learning and ensemble models, which can solve complex problems. To develop the deep forest model for rockburst prediction, 329 real rockburst cases were collected to build a comprehensive database for intelligent analysis. Bayesian optimization was proposed to tune the hyperparameters of the deep forest. As a result, the deep forest model achieved 100% training accuracy and 92.4% testing accuracy, and it has more outstanding capability to forecast rockburst disasters compared to other widely used models (i.e., random forest, boosting tree models, neural network, support vector machine, etc.). The results of sensitivity analysis revealed the impact of variables on rockburst levels and the applicability of deep forest with a few input parameters. Eventually, real cases of rockburst in two gold mines, China, were used for validation purposes while the needed data sets were prepared by field observations and laboratory tests. The promoting results of the developed model during the validation phase confirm that it can be used with a high level of accuracy by practicing engineers for predicting rockburst occurrences. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
15. A Goal Programming-Based Methodology for Machine Learning Model Selection Decisions: A Predictive Maintenance Application.
- Author
-
Mallidis, Ioannis, Yakavenka, Volha, Konstantinidis, Anastasios, and Sariannidis, Nikolaos
- Subjects
- *
MACHINE learning , *RANDOM forest algorithms , *SUPPORT vector machines , *STOCHASTIC programming , *DECISION trees , *REGRESSION trees - Abstract
The paper develops a goal programming-based multi-criteria methodology, for assessing different machine learning (ML) regression models under accuracy and time efficiency criteria. The developed methodology provides users with high flexibility in assessing the models as it allows for a fast and computationally efficient sensitivity analysis of accuracy and time significance weights as well as accuracy and time significance threshold values. Four regression models were assessed, namely the decision tree, random forest, support vector and the neural network. The developed methodology was employed to forecast the time to failures of NASA Turbofans. The results reveal that decision tree regression (DTR) seems to be preferred for low values of accuracy weights (up to 30%) and low accuracy and time efficiency threshold values. As the accuracy weights tend to increase and for higher accuracy and time efficiency threshold values, random forest regression (RFR) seems to be the best choice. The preference for the RFR model however, seems to change towards the adoption of the neural network for accuracy weights equal to and higher than 90%. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
16. Deep Learning-Based Survival Analysis for High-Dimensional Survival Data.
- Author
-
Hao, Lin, Kim, Juncheol, Kwon, Sookhee, and Ha, Il Do
- Subjects
- *
SURVIVAL analysis (Biometry) , *RANDOM forest algorithms , *DEEP learning , *MACHINE learning , *PREDICTION models , *CENSORING (Statistics) , *DATA analysis - Abstract
With the development of high-throughput technologies, more and more high-dimensional or ultra-high-dimensional genomic data are being generated. Therefore, effectively analyzing such data has become a significant challenge. Machine learning (ML) algorithms have been widely applied for modeling nonlinear and complicated interactions in a variety of practical fields such as high-dimensional survival data. Recently, multilayer deep neural network (DNN) models have made remarkable achievements. Thus, a Cox-based DNN prediction survival model (DNNSurv model), which was built with Keras and TensorFlow, was developed. However, its results were only evaluated on the survival datasets with high-dimensional or large sample sizes. In this paper, we evaluated the prediction performance of the DNNSurv model using ultra-high-dimensional and high-dimensional survival datasets and compared it with three popular ML survival prediction models (i.e., random survival forest and the Cox-based LASSO and Ridge models). For this purpose, we also present the optimal setting of several hyperparameters, including the selection of a tuning parameter. The proposed method demonstrated via data analysis that the DNNSurv model performed well overall as compared with the ML models, in terms of the three main evaluation measures (i.e., concordance index, time-dependent Brier score, and the time-dependent AUC) for survival prediction performance. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.