Descriptor: "ensemble classifier" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"ensemble classifier"' showing total 1,163 results

Start Over Descriptor "ensemble classifier"

1,163 results on '"ensemble classifier"'

1. Emovere Agnitio by Textual Tweets Using Machine Learning

Author: Kumar, B. Prasanna, Rejeti, Venkata Kishore Kumar, Shanvitha, T. Bhavani, Jyothi Sri, S., Prathyusha, Y., Keerthana, S., Anand, D., Chaari, Fakher, Series Editor, Gherardini, Francesco, Series Editor, Ivanov, Vitalii, Series Editor, Haddar, Mohamed, Series Editor, Cavas-Martínez, Francisco, Editorial Board Member, di Mare, Francesca, Editorial Board Member, Kwon, Young W., Editorial Board Member, Tolio, Tullio A. M., Editorial Board Member, Trojanowska, Justyna, Editorial Board Member, Schmitt, Robert, Editorial Board Member, Xu, Jinyang, Editorial Board Member, Deepak, B B V L, editor, Bahubalendruni, M.V.A. Raju, editor, Parhi, D.R.K., editor, and Biswal, B. B., editor
Published: 2025
Full Text: View/download PDF

2. A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier.

Author: Edward, Jafhate, Rosli, Marshima Mohd, and Seman, Ali
Abstract: In medical data, addressing imbalanced datasets is paramount for accurate predictive modeling. This paper delves into exploring a well-established rebalancing framework proposed in previous research. While acknowledged for its effectiveness, the adaptability of this framework across diverse medical datasets remains unexplored. We conduct a comprehensive investigation to bridge this gap by integrating an ensemble-based classifier into the existing framework. By leveraging seven imbalanced medical binary datasets, our study comprises three distinct experiments: utilizing standard baseline classifiers from the framework (original), incorporating the baseline with an ensemble-based classifier, and introducing our novel ensemble-based classifier with the self-paced ensemble (SPE) algorithm. Our novel ensemble, composed of decision tree (DT), radial support vector machine (R.SVM), and extreme gradient boosting (XGB) classifiers, serves as the foundation for the SPE. Our primary objective is to demonstrate the potential improvement of the existing framework's overall performance through the integration of an ensemble. Experimental results reveal significant enhancements, with our proposed ensemble classifier outperforming the original by 4.96%, 5.89%, 5.68%, 7.85%, and 6.84% in terms of accuracy, precision, recall, F-score, and G-mean, respectively. This study contributes valuable insights into the adaptability and performance augmentation achievable through ensemble methods in addressing class imbalances within the medical domain. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

3. A novel secure image steganography scheme based on Hamming encoding and LSB matching revisited using Lah Transform.

Author: Nguyen, Tuan Duc and Le, Hai Quoc
Subjects: HAMMING codes, TWO-dimensional bar codes, INTEGERS, INTERNET, ENCODING
Abstract: Data hiding is a technique that conceals a given mystery message within a cover medium in a secure way. It provides an additional security layer to protect valuable data embedded into the cover medium and transferred via the Internet. Steganography approaches based on transform domains are the most popular due to their high security in image steganography. Unfortunately, these approaches have two major existing disadvantages significant computing power and limited embedding capacity. To overcome these drawbacks, in this paper a hybrid scheme based on Lah transform is proposed. At first, the Lah transform is utilized to generate the Lah coefficients from the considered 4-pixel blocks of the considered cover image. Then, these Lah components are modified to embed the valuable data. For embedding rates equal to or less than 0.5 bpp, a modified Hamming code based method (MHCBM) is employed to conceal the given message bits in the generated Lah components. For payload larger than 0.5 bpp into the Lah transform coefficients, a modified LSB matching revisited approach (MLSBMR) is used. This principle allows us to minimize statistical distortion to the stego images, while at the same time enhancing the security of the hidden message against statistical steganalysis approaches. Experimental results indicated that the proposed scheme achieved better perceptual quality and higher security performance compared to the existing Lah transform based method and the relevant integer based transform approaches. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

4. Automatic Classification of Anomalous ECG Heartbeats from Samples Acquired by Compressed Sensing.

Author: Picariello, Enrico, Picariello, Francesco, Tudosa, Ioan, Rajan, Sreeraman, and De Vito, Luca
Subjects: *DISCRETE cosine transforms, *SIGNAL classification, *COMPRESSED sensing, *K-nearest neighbor classification, *AUTOMATIC classification
Abstract: In this paper, a method for the classification of anomalous heartbeats from compressed ECG signals is proposed. The method operating on signals acquired by compressed sensing is based on a feature extraction stage consisting of the evaluation of the Discrete Cosine Transform (DCT) coefficients of the compressed signal and a classification stage performed by means of a set of k-nearest neighbor ensemble classifiers. The method was preliminarily tested on five classes of anomalous heartbeats, and it achieved a classification accuracy of 99.40%. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. Dog behaviors identification model using ensemble convolutional neural long short-term memory networks.

Author: Abd El-Latif, Eman I., El-dosuky, Mohamed, Darwish, Ashraf, and Hassanien, Aboul Ella
Abstract: This paper presents a new model based on Convolutional Neural Networks (CNN) with a long short-term memory network (LSTM) and ensemble technique for identifying seven different dogs' behaviors. The proposed model uses data collected from two sensors attached to the dog's back and neck. In the initial step in the model, the undefined tasks are removed, and the synthetic minority oversampling technique (SMOTE) is performed to address the imbalanced data problem. Then, CNN_LSTM and ensemble classifier are adapted to identify various dog behaviors. Finally, two experiments are performed to evaluate the model. The first experiment is performed utilizing the original data (imbalanced datasets) while the second uses a balanced dataset. Experimental results can identify seven dog behaviors with a potential accuracy of 96.73%, 96.76% sensitivity, 96.73% specificity, and 96.73% F1 score. Therefore, the SMOTE method, a data balancing strategy, not only overcomes the unbalanced data problem but also significantly improves minority class accuracy. Additionally, the suggested model is tested against cutting-edge algorithms, and the outcomes demonstrate its superior performance. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. Screening of Key Transcripts from Expression Data Using Applied Artificial Intelligence for Cancer Prediction

Author: Anju Pratap and Michiaki Hamada
Subjects: Ensemble classifier, Transcripts, Cancer prediction, Mitranscriptome, Electronic computers. Computer science, QA75.5-76.95
Abstract: Abstract Research on the effects of artificial intelligence (AI)-driven solutions in the field of oncology is still being conducted all around the world. Applications of AI to identify the transcripts that cause cancer are being investigated, particularly when employing ensemble learning techniques. Ensemble feature fusion is the process of distributing the feature selection process and combining the local features that were chosen to create a smaller global feature set. This article addresses the use of ensemble feature fusion in the field of differential transcript expression analysis to screen for important transcripts linked to a disease. Owing to the necessity and significance of research in cancer diagnosis, an ensemble feature fusion approach experimental case study has been carried out using 109 liver cancer samples obtained from Mitranscriptome dataset. It was found that the expression data were used to filter the most pertinent transcripts, which allowed for the best possible differentiation between sample type. Accordingly, a set of 26 significant liver cancer causing transcripts has been screened using unanimous voting-scheme, giving an accuracy of percentage of 96. During generalization testing, cancer prediction classifiers constructed with this essential transcript collection shown excellent discriminating power and performed well in differentiating between normal and malignant cells. By resolving the “high dimension-low sample (High p Low n)” issue that is typically present in the expression data, this improves the predicting ability of cancer diagnostic systems. In the field of oncology, artificial intelligence-powered solutions will facilitate the development of applications that prioritize Sustainable Development Goal 3: Good Health and Well-Being.
Published: 2024
Full Text: View/download PDF

7. Computer-aided diagnosis system for grading brain tumor using histopathology images based on color and texture features

Author: Naira Elazab, Wael Gab Allah, and Mohammed Elmogy
Subjects: Glioma computational pathology, Cancer grades, Color features, Texture features, Ensemble classifier, Medical technology, R855-855.5
Abstract: Abstract Background Cancer pathology shows disease development and associated molecular features. It provides extensive phenotypic information that is cancer-predictive and has potential implications for planning treatment. Based on the exceptional performance of computational approaches in the field of digital pathogenic, the use of rich phenotypic information in digital pathology images has enabled us to identify low-level gliomas (LGG) from high-grade gliomas (HGG). Because the differences between the textures are so slight, utilizing just one feature or a small number of features produces poor categorization results. Methods In this work, multiple feature extraction methods that can extract distinct features from the texture of histopathology image data are used to compare the classification outcomes. The successful feature extraction algorithms GLCM, LBP, multi-LBGLCM, GLRLM, color moment features, and RSHD have been chosen in this paper. LBP and GLCM algorithms are combined to create LBGLCM. The LBGLCM feature extraction approach is extended in this study to multiple scales using an image pyramid, which is defined by sampling the image both in space and scale. The preprocessing stage is first used to enhance the contrast of the images and remove noise and illumination effects. The feature extraction stage is then carried out to extract several important features (texture and color) from histopathology images. Third, the feature fusion and reduction step is put into practice to decrease the number of features that are processed, reducing the computation time of the suggested system. The classification stage is created at the end to categorize various brain cancer grades. We performed our analysis on the 821 whole-slide pathology images from glioma patients in the Cancer Genome Atlas (TCGA) dataset. Two types of brain cancer are included in the dataset: GBM and LGG (grades II and III). 506 GBM images and 315 LGG images are included in our analysis, guaranteeing representation of various tumor grades and histopathological features. Results The fusion of textural and color characteristics was validated in the glioma patients using the 10-fold cross-validation technique with an accuracy equals to 95.8%, sensitivity equals to 96.4%, DSC equals to 96.7%, and specificity equals to 97.1%. The combination of the color and texture characteristics produced significantly better accuracy, which supported their synergistic significance in the predictive model. The result indicates that the textural characteristics can be an objective, accurate, and comprehensive glioma prediction when paired with conventional imagery. Conclusion The results outperform current approaches for identifying LGG from HGG and provide competitive performance in classifying four categories of glioma in the literature. The proposed model can help stratify patients in clinical studies, choose patients for targeted therapy, and customize specific treatment schedules.
Published: 2024
Full Text: View/download PDF

8. DEVELOPMENT OF A NEURAL NETWORK WITH A LAYER OF TRAINABLE ACTIVATION FUNCTIONS FOR THE SECOND STAGE OF THE ENSEMBLE CLASSIFIER WITH STACKING.

Author: Galchonkov, Oleg, Baranov, Oleksii, Antoshchuk, Svetlana, Maslov, Oleh, and Babych, Mykola
Abstract: One of the promising directions for improving the quality of object recognition in images and parallelizing calculations is the use of ensemble classifiers with stacking. A neural network at the second level makes it possible to achieve the resulting quality of classification, which is significantly higher than each of the networks of the first level separately. The classification quality of the entire ensemble classifier with stacking depends on the efficiency of the neural networks at the first stage, their number, and the quality of the classification of the neural network of the second stage. This paper proposes a neural network architecture for the second stage of the ensemble classifier, which combines the approximating properties of traditional neurons and learning activation functions. Gaussian Radial Basis Functions (RBFs) were chosen to implement the learned activation functions, which are summed with the learned weights. The experimental studies showed that when working with the CIFAR-10 data set, the best results are obtained when six RBFs are used. A comparison with the use of multilayer perceptron (MLP) in the second stage showed a reduction in classification errors by 0.45–1.9 % depending on the number of neural networks in the first stage. At the same time, the proposed neural network architecture for the second degree had 1.69–3.7 times less learning coefficients than MLP. This result is explained by the fact that the use of an output layer with ordinary neurons allowed us not to enter into the architecture many learning activation functions for each output signal of the first stage, but to limit ourselves to only one. Since the results were obtained on the CIFAR-10 universal data set, a similar effect could be obtained on a large number of similar practical data sets. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

9. Securing IoT network with hybrid evolutionary lion intrusion detection system: a composite motion optimisation algorithm for feature selection and ensemble classification.

Author: Subramaniam, Anuvelavan, Chelladurai, Sureshkumar, Ande, Stanly Kumar, and Srinivasan, Sathiyandrakumar
Subjects: *OPTIMIZATION algorithms, *FEATURE selection, *INTRUSION detection systems (Computer security), *INTERNET of things, *ERROR rates, *CLASSIFICATION
Abstract: Networks connected to the Internet of Things (IoT) are often vulnerable to attacks. Several existing methods in the intrusion detection system for securing IoT have been presented with ensemble classifier, but it does not accurately classify attack, and also it takes high computation time. With intention of solving the security issues, Intrusion Detection System using Hybrid Evolutionary Lion and Balancing Composite Motion Optimisation Algorithm espoused feature selection with Ensemble Classifier (IDS-IoT-Hybrid ELOA-BCMOA-Ensemble-DT-LSVM-RF-XGBoost) is proposed for Securing IoT Network. At first, data were accumulated from the NSL-KDD data set. Afterward, data is fed to pre-processing, where it restored missing value using mean curvature flow method. At feature selection, optimum features are compiled under Hybrid Evolutionary Lion and Balancing Composite Motion Optimisation Algorithm. Based upon the optimum features, intruders of IoT data are categorised as denial-of-service (DoS), probe, remote to local attack (R2L), user to root attack (U2R), normal (no attack) with the help of Ensemble classifier. Proposed IDS-IoT-Hybrid ELOA-BCMOA-Ensemble-DT-LSVM-RF-XGBoost approach is constructed utilising Python. Then, proposed IDS-IoT-Hybrid ELOA-BCMOA-Ensemble-DT-LSVM-RF-XGBoost approach attains 21.11%, 19.58%, 24.61% and 9.52% higher accuracy; 94.47%, 93.95%, 93.08% and 90.59% lower error rate, and 62.94%, 36.69%, 64.17% and 50.97% less computation time analysed with existing models. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. Computer-aided diagnosis system for grading brain tumor using histopathology images based on color and texture features.

Author: Elazab, Naira, Gab Allah, Wael, and Elmogy, Mohammed
Subjects: COMPUTER-aided diagnosis, TUMOR grading, FEATURE extraction, HISTOPATHOLOGY, BRAIN tumors, PATHOLOGY
Abstract: Background: Cancer pathology shows disease development and associated molecular features. It provides extensive phenotypic information that is cancer-predictive and has potential implications for planning treatment. Based on the exceptional performance of computational approaches in the field of digital pathogenic, the use of rich phenotypic information in digital pathology images has enabled us to identify low-level gliomas (LGG) from high-grade gliomas (HGG). Because the differences between the textures are so slight, utilizing just one feature or a small number of features produces poor categorization results. Methods: In this work, multiple feature extraction methods that can extract distinct features from the texture of histopathology image data are used to compare the classification outcomes. The successful feature extraction algorithms GLCM, LBP, multi-LBGLCM, GLRLM, color moment features, and RSHD have been chosen in this paper. LBP and GLCM algorithms are combined to create LBGLCM. The LBGLCM feature extraction approach is extended in this study to multiple scales using an image pyramid, which is defined by sampling the image both in space and scale. The preprocessing stage is first used to enhance the contrast of the images and remove noise and illumination effects. The feature extraction stage is then carried out to extract several important features (texture and color) from histopathology images. Third, the feature fusion and reduction step is put into practice to decrease the number of features that are processed, reducing the computation time of the suggested system. The classification stage is created at the end to categorize various brain cancer grades. We performed our analysis on the 821 whole-slide pathology images from glioma patients in the Cancer Genome Atlas (TCGA) dataset. Two types of brain cancer are included in the dataset: GBM and LGG (grades II and III). 506 GBM images and 315 LGG images are included in our analysis, guaranteeing representation of various tumor grades and histopathological features. Results: The fusion of textural and color characteristics was validated in the glioma patients using the 10-fold cross-validation technique with an accuracy equals to 95.8%, sensitivity equals to 96.4%, DSC equals to 96.7%, and specificity equals to 97.1%. The combination of the color and texture characteristics produced significantly better accuracy, which supported their synergistic significance in the predictive model. The result indicates that the textural characteristics can be an objective, accurate, and comprehensive glioma prediction when paired with conventional imagery. Conclusion: The results outperform current approaches for identifying LGG from HGG and provide competitive performance in classifying four categories of glioma in the literature. The proposed model can help stratify patients in clinical studies, choose patients for targeted therapy, and customize specific treatment schedules. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. Stock market prediction-COVID-19 scenario with lexicon-based approach.

Author: Ayyappa, Yalanati and Siva Kumar, A.P.
Subjects: *COVID-19 pandemic, *INVESTORS, *MARKETING forecasting, *STATISTICAL models, *DECISION making, *FEATURE extraction, *EXPERT systems
Abstract: Stock market forecasting remains a difficult problem in the economics industry due to its incredible stochastic nature. The creation of such an expert system aids investors in making investment decisions about a certain company. Due to the complexity of the stock market, using a single data source is insufficient to accurately reflect all of the variables that influence stock fluctuations. However, predicting stock market movement is a challenging undertaking that requires extensive data analysis, particularly from a big data perspective. In order to address these problems and produce a feasible solution, appropriate statistical models and artificially intelligent algorithms are needed. This paper aims to propose a novel stock market prediction by the following four stages; they are, preprocessing, feature extraction, improved feature level fusion and prediction. The input data is first put through a preparation step in which stock, news, and Twitter data (related to the COVID-19 epidemic) are processed. Under the big data perspective, the input data is taken into account. These pre-processed data are then put through the feature extraction, The improved aspect-based lexicon generation, PMI, and n-gram-based features in this case are derived from the news and Twitter data, while technical indicator-based features are derived from the stock data. The improved feature-level fusion phase is then applied to the extracted features. The ensemble classifiers, which include DBN, CNN, and DRN, were proposed during the prediction phase. Additionally, a SI-MRFO model is suggested to enhance the efficiency of the prediction model by adjusting the best classifier weights. Finally, SI-MRFO model's effectiveness compared to the existing models with regard to MAE, MAPE, MSE and MSLE. The SI-MRFO accomplished the minimal MAE rate for the 90th learning percentage is approximately 0.015 while other models acquire maximum ratings. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. A Novel Ensemble Machine Learning Algorithm for Predicting the Suitable Crop to Cultivate Based on Soil and Environment Characteristics.

Author: Mariammal, G., Suruliandi, A., Stamenkovic, Z., and Raja, S. P.
Subjects: MACHINE learning, SUPPORT vector machines, ENVIRONMENTAL soil science, PREDICTION models, CLIMATE change
Abstract: Research in agriculture is a promising field, and crop prediction for particular land areas is especially critical to agriculture. Such prediction depends on the soil, minerals, and environment, the last of which has been short-changed by changing climatic conditions. Consequently, crop prediction for a particular zone presents difficulties for farmers. This is where machine learning (ML) steps in with techniques that are widely applied in agriculture. This work proposes a weighted stacked ensemble (WSE) method for the crop prediction process. It combines two base learners or classifiers to construct the WSE, which is a single predictive ensemble model, using weighted instances. The experimental outcomes show that the proposed WSE outperforms other classification and ensemble techniques in terms of improved crop prediction accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. Optimizing Bagged Trees in an Ensemble Classifier for Improved Prediction of Diabetes Prevalence in Women.

Author: Candia Jr., Jose, Adonis, Airish Mae, and Perlas, Jesica
Subjects: FEATURE selection, MATHEMATICAL optimization, BLOOD pressure, DIABETES, INSULIN
Abstract: This study aims to optimize the performance of the bagged tree in an ensemble classifier for predicting diabetes prevalence in women. The study used a dataset of 1,888 women with six features: age, BMI, glucose level, insulin level, blood pressure, and pregnancy status. The dataset was divided into training and testing sets with a 70:30 ratio. The bagged tree ensemble classifier was used for the analysis, and five-fold cross-validation was applied. The study found that using all features during training resulted in a 92.3% training accuracy and a 99.5% testing accuracy. However, applying optimization techniques such as feature selection, parameter tuning, and a maximum number of splits improved model performance. Feature selection optimized the accuracy performance by 0.2%, while parameter tuning improved the test accuracy by 0.2%. Moreover, decreasing the maximum number of splits from 1322 to 800 or 600 resulted in an optimized model with 0.1% higher validation accuracy. Finally, the optimized bagged tree models were evaluated using various performance metrics, including accuracy, precision, recall, and F1 score. The study found that Model 1, which used 800 maximum number of splits and 50 learners, outperformed Model 2 in terms of recall and F1 score, while Model 2, which used 600 maximum number of splits and 50 learners, had a higher precision score. The study concludes that optimization techniques can significantly improve the performance of the bagged tree in predicting diabetes prevalence in women. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

14. An Ensemble Machine Learning Method for Analyzing Various Medical Datasets.

Author: Gupta, Chhaya, Gill, Nasib Singh, Maheshwary, Priti, Pandit, Shraddha V., Gulia, Preeti, and Pareek, Piyush Kumar
Subjects: FEATURE selection, SUPPORT vector machines, RANDOM forest algorithms, GENETIC algorithms, DIAGNOSIS
Abstract: In recent years, machine learning (ML) has shown a significant impact in tackling various complicated problems in different application domains, including healthcare, economics, ecological, stock market, surveillance, and commercial applications. Machine Learning techniques are good enough to deal with a wide range of data, uncover fascinating links, offer insights, and spot trends. ML can improve disease diagnosis accuracy, predictability, performance, and reliability. This paper reviews various machine learning techniques applied to different medical datasets and proposes an ensemble method for helping in the early diagnosis of different diseases. The study compares existing machine learning techniques with the proposed ensemble method. The ensemble method uses the AdaBoost algorithm to combine the traits of choice trees, random forests, and support vector machines. Three feature selection techniques, Fisher’s score, information gain, and genetic algorithm, are used to select appropriate dataset features. The ensemble method also uses the K-fold cross-validation technique (where k=15) for validating results. SMOTE was employed to balance some of the datasets because they were quite unbalanced. All the methods used in this study are evaluated based on accuracy, AU Curve, Recall, Precision, and F1-score. The paper uses different medical datasets at the University of California Irvine and the Kaggle directory to compare machine-learning models with the proposed ensemble method. The encouraging results show that the ensemble method outperforms the existing machine-learning techniques. The paper thoroughly analyzes how machine learning is used in the medical industry, covering established technologies and their impact on medical diagnosis. An early diagnosis is needed to prevent people from deadly diseases. Hence, this study proposes an ensemble method that may be used to diagnose different diseases early. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

15. Machine Learning-Based Classification of Rock Bursts in an Active Coal Mine Dominated by Non-Destructive Tremors.

Author: Wojtecki, Łukasz, Bukowska, Mirosława, Iwaszenko, Sebastian, and Apel, Derek B.
Subjects: ROCK bursts, MACHINE learning, MINES & mineral resources, ANTHRACITE coal, COAL basins, COAL mining accidents, HAZARD mitigation
Abstract: Rock bursts are dynamic phenomena in underground openings, causing damage to support and infrastructure, and are one of the main natural hazards in underground coal mines. The prediction of rock bursts is important for improving safety in mine openings. The hazard of rock bursts is correlated with seismic activity, but rock bursts are rare compared to non-destructive tremors. The five machine learning classifiers (multilayer perceptron, adaptive boosting, gradient boosting, K-nearest neighbors, and Gaussian naïve Bayes), along with an ensemble hard-voting classifier composed of these classifiers, were used to recognize rock bursts among the dominant non-destructive tremors. Machine learning models were trained and tested on ten sets of randomly selected data obtained from one of the active hard coal mines in the Upper Silesian Coal Basin, Poland. For each of the 627 cases in the database, 15 features representing geological, geomechanical, mining, and technical conditions in the opening as well as tremor energy and correlated peak particle velocity were determined. Geological and geomechanical parameters of the coal seams and surrounding rocks were aggregated into a single GEO index. The share of rock bursts in the database was only about 8.5%; therefore, the ADASYN balancing method, which addresses imbalanced datasets, was used. The ensemble hard-voting classifier most effectively classified rock bursts, with an average recall of 0.74. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. Fault diagnosis method of PEMFC system based on ensemble learning.

Author: Zhang, Xuexia, Peng, Lishuo, He, Fei, and Huang, Ruike
Subjects: *FAULT diagnosis, *PROTON exchange membrane fuel cells, *FEATURE extraction, *MACHINE learning, *DIAGNOSIS methods, *FUEL cells
Abstract: Due to the coupling of multiple physical fields inside proton exchange membrane fuel cell (PEMFC) and the complex and changeable external application scenarios, the faults of PEMFC system greatly affects its engineering application. The paper proposes an ensemble learning-based fault diagnosis method for PEMFC systems, utilizing five distinct algorithms to develop an integrated ensemble classifier for fault detection. The original sample dataset collected from fuel cells undergoes a process of deep feature extraction to obtain a composite feature sample dataset, which is then inputted into an ensemble classifier to establish a fault diagnosis model. The experimental data of four kinds of faults of 80 W fuel cell test bench, EC fuel cell system and 50 W water-cooled fuel cell test bench are analyzed by using this classification model. The results show that the accuracy and generalization of the ensemble learning algorithm for different fault diagnosis of PEMFC system in different application environments are verified. • Adopt multiple classification algorithms for fault modeling and diagnosis. • Good applicability to different faults is shown in different test scenarios. • Compared with a single algorithm, it has higher fault identification accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. A voting-based ensemble classifier to predict phases and crystal structures of high entropy alloys through thermodynamic, electronic, and configurational parameters

Author: Pritam Mandal, Amitava Choudhury, Amitava Basu Mallick, and Manojit Ghosh
Subjects: High entropy alloy, Machine learning, Computational metallurgy, Phase predictions, Ensemble classifier, Mining engineering. Metallurgy, TN1-997
Abstract: This study aims to predict the various phases present in high entropy alloys (HEAs) and consequently classify their crystal structure employing multiple machine learning (ML) algorithms utilizing five thermodynamic, electronic and configurational parameters which are considered to be essential for the formation of HEA phases. The properties of a high entropy alloy can eventually be traced through accurate phase and crystal structure prediction, which is essential for selecting the ideal elements for designs. Twelve distinct ML algorithms were executed to predict the phases of HEAs, adopting an experimental database of 322 different HEAs, involving 33 amorphous (AM), 31 intermetallics (IM), and 258 solid solutions (SS) phases. Among the twelve ML models, Cat Boost Classifier displayed the optimum accuracy of 98.06 % for phase predictions. Further, crystal structure classification of the SS phase (body-centered cubic- BCC, face-centered cubic- FCC, and mixed body-centered and face-centered cubic- BCC+FCC) has endeavoured for better microstructure evolution using a different database containing of 194 additional HEAs data with 61 FCC, 76 BCC, and 57 BCC+FCC crystal structures and in comparison to the other models tested, the Gradient Boosting Classifier evolved with the highest accuracy of 86.90 %. An ensemble classifier was also introduced to improve the performance of the ML models, resulting in an accuracy increase to 98.70 % and 86.95 % for phase and crystal structure predictions, respectively. Additionally, the influence of parameters on model accuracy was determined independently.
Published: 2024
Full Text: View/download PDF

18. Ensembling of Performance Metrics in Credit Risk Assessment Using Machine Learning Analytics

Author: Bhattacharya, Arijit, Biswas, Saroj Kr., Mandal, Ardhendu, Das, Akhil Kumar, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Bansal, Jagdish Chand, editor, Borah, Samarjeet, editor, Hussain, Shahid, editor, and Salhi, Said, editor
Published: 2024
Full Text: View/download PDF

19. An Ensemble Classifier Based on kNN with an Interval Threshold Strategy

Author: Bentkowska, Urszula, Mrukowicz, Marcin, Gałka, Wojciech, Lech, Karol, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Nguyen, Ngoc Thanh, editor, Chbeir, Richard, editor, Manolopoulos, Yannis, editor, Fujita, Hamido, editor, Hong, Tzung-Pei, editor, Nguyen, Le Minh, editor, and Wojtkiewicz, Krystian, editor
Published: 2024
Full Text: View/download PDF

20. Comparing Ensemble Learning Algorithms to Improve Flight Prediction Accuracy and Reliability

Author: Jawahar, Malathy, Raj, J. Jai Ganesh, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Devi, B. Rama, editor, Kumar, Kishore, editor, Raju, M., editor, Raju, K. Srujan, editor, and Sellathurai, Mathini, editor
Published: 2024
Full Text: View/download PDF

21. Prevention and Mitigation of Intrusion Using an Efficient Ensemble Classification in Fog Computing

Author: Paul, P. Mano, Shekhar, R., Jingle, I. Diana Jeba, Jingle, I. Berin Jeba, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Devi, B. Rama, editor, Kumar, Kishore, editor, Raju, M., editor, Raju, K. Srujan, editor, and Sellathurai, Mathini, editor
Published: 2024
Full Text: View/download PDF

22. An Optimized Ensemble Machine Learning Framework for Multi-class Classification of Date Fruits by Integrating Feature Selection Techniques

Author: Maheswara Rao, V. V. R., Silpa, N., Reddy, Shiva Shankar, Hussain, S. Mahaboob, Bonthu, Sridevi, Uppalapati, Padma Jyothi, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin, Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Pareek, Prakash, editor, Gupta, Nishu, editor, and Reis, M. J. C. S., editor
Published: 2024
Full Text: View/download PDF

23. Similar Intensity-Based Euclidean Distance Feature Vector for Mammogram Image Classification

Author: Sharma, Bhanu Prakash, Purwar, Ravindra Kumar, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Tiwari, Shailesh, editor, Trivedi, Munesh C., editor, Kolhe, Mohan L., editor, and Singh, Brajesh Kumar, editor
Published: 2024
Full Text: View/download PDF

24. Non-invasive health status diagnosis of solar PV panel using ensemble classifier

Author: Krishna Veni, K. S., Senthil Kumar, N., and Gnanavadivel, J.
Published: 2024
Full Text: View/download PDF

25. Generalizable disease detection using model ensemble on chest X-ray images

Author: Maider Abad, Jordi Casas-Roma, and Ferran Prados
Subjects: Ensemble classifier, X-ray imaging, Transfer learning, Pre-trained models, Domain adaptation, Medicine, Science
Abstract: Abstract In the realm of healthcare, the demand for swift and precise diagnostic tools has been steadily increasing. This study delves into a comprehensive performance analysis of three pre-trained convolutional neural network (CNN) architectures: ResNet50, DenseNet121, and Inception-ResNet-v2. To ensure the broad applicability of our approach, we curated a large-scale dataset comprising a diverse collection of chest X-ray images, that included both positive and negative cases of COVID-19. The models’ performance was evaluated using separate datasets for internal validation (from the same source as the training images) and external validation (from different sources). Our examination uncovered a significant drop in network efficacy, registering a 10.66% reduction for ResNet50, a 36.33% decline for DenseNet121, and a 19.55% decrease for Inception-ResNet-v2 in terms of accuracy. Best results were obtained with DenseNet121 achieving the highest accuracy at 96.71% in internal validation and Inception-ResNet-v2 attaining 76.70% accuracy in external validation. Furthermore, we introduced a model ensemble approach aimed at improving network performance when making inferences on images from diverse sources beyond their training data. The proposed method uses uncertainty-based weighting by calculating the entropy in order to assign appropriate weights to the outputs of each network. Our results showcase the effectiveness of the ensemble method in enhancing accuracy up to 97.38% for internal validation and 81.18% for external validation, while maintaining a balanced ability to detect both positive and negative cases.
Published: 2024
Full Text: View/download PDF

26. Deep Learning-Based Classification and Semantic Segmentation of Lung Tuberculosis Lesions in Chest X-ray Images.

Author: Ou, Chih-Ying, Chen, I-Yen, Chang, Hsuan-Ting, Wei, Chuan-Yi, Li, Dian-Yu, Chen, Yen-Kai, and Chang, Chuan-Yu
Subjects: *LUNG diseases, *X-ray imaging, *DEEP learning, *DATA augmentation, *CHEST X rays, *SIGNAL convolution
Abstract: We present a deep learning (DL) network-based approach for detecting and semantically segmenting two specific types of tuberculosis (TB) lesions in chest X-ray (CXR) images. In the proposed method, we use a basic U-Net model and its enhanced versions to detect, classify, and segment TB lesions in CXR images. The model architectures used in this study are U-Net, Attention U-Net, U-Net++, Attention U-Net++, and pyramid spatial pooling (PSP) Attention U-Net++, which are optimized and compared based on the test results of each model to find the best parameters. Finally, we use four ensemble approaches which combine the top five models to further improve lesion classification and segmentation results. In the training stage, we use data augmentation and preprocessing methods to increase the number and strength of lesion features in CXR images, respectively. Our dataset consists of 110 training, 14 validation, and 98 test images. The experimental results show that the proposed ensemble model achieves a maximum mean intersection-over-union (MIoU) of 0.70, a mean precision rate of 0.88, a mean recall rate of 0.75, a mean F1-score of 0.81, and an accuracy of 1.0, which are all better than those of only using a single-network model. The proposed method can be used by clinicians as a diagnostic tool assisting in the examination of TB lesions in CXR images. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. Incorporating Multi-Temporal Remote Sensing and a Pixel-Based Deep Learning Classification Algorithm to Map Multiple-Crop Cultivated Areas.

Author: Wang, Xue, Zhang, Jiahua, Wang, Xiaopeng, Wu, Zhenjiang, and Prodhan, Foyez Ahmed
Subjects: MACHINE learning, DEEP learning, CLASSIFICATION algorithms, CONVOLUTIONAL neural networks, REMOTE sensing, HEBBIAN memory, CORN
Abstract: The accurate monitoring of crop areas is essential for food security and agriculture, but accurately extracting multiple-crop distribution over large areas remains challenging. To solve the above issue, in this study, the Pixel-based One-dimensional convolutional neural network (PB-Conv1D) and Pixel-based Bi-directional Long Short-Term Memory (PB-BiLSTM) were proposed to identify multiple-crop cultivated areas using time-series NaE (a combination of NDVI and EVI) as input for generating a baseline classification. Two approaches, Snapshot and Stochastic weighted averaging (SWA), were used in the base-model to minimize the loss function and improve model accuracy. Using an ensemble algorithm consisting of five PB-Conv1D and seven PB-BiLSTM models, the temporal vegetation index information in the base-model was comprehensively exploited for multiple-crop classification and produced the Pixel-Based Conv1D and BiLSTM Ensemble model (PB-CB), and this was compared with the PB-Transformer model to validate the effectiveness of the proposed method. The multiple-crop cultivated area was extracted from 2005, 2010, 2015, and 2020 in North China by using the PB-Conv1D combine Snapshot (PB-CDST) and PB-CB models, which are a performance-optimized single model and an integrated model, respectively. The results showed that the mapping results of the multiple-crop cultivated area derived by PB-CDST (OA: 81.36%) and PB-BiLSTM combined with Snapshot (PB-BMST) (OA: 79.40%) showed exceptional accuracy compared to PB-Transformer combined with Snapshot and SWA (PB-TRSTSA) (OA: 77.91%). Meanwhile, the PB-CB (OA: 83.43%) had the most accuracy compared to the pixel-based single algorithm. The MODIS-derived PB-CB method accurately identified multiple-crop areas for wheat, corn, and rice, showing a strong correlation with statistical data, exceeding 0.7 at the municipal level and 0.6 at the county level. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. Intelligent leather defect classification using Fourier angular radial partitioning algorithm with ensemble classifier.

Author: Jawahar, Malathy, Anbarasi, L. Jani, Anand, S. Mahesh, and Ravi, Vinayakumar
Abstract: Leather quality inspection is essential in determining the usable area of the material. As leather is a natural substance, surface defects can have a significant impact on its overall quality and reduce its usability. The automatic identification of surface defects in leather holds great importance in the inspection process. This study presents an innovative method called the Fourier Angular Radial Partitioning (FARP) algorithm for extracting features, specifically tailored for the identification of surface defects in leather. A cutting-edge industrial prototype machine vision system is designed with innovative capabilities to acquire high-quality entire leather surface image accurately. The FARP algorithm leverages a combination of spatial and radial distributed invariant feature descriptors obtained from the magnitude of the Fourier Transform. Furthermore, by partitioning the image into multiple sub-regions enables the FARP to extract features to effectively analyze both prominent flaws like cuts, scars and subtle imperfections like pinholes. The performance of the proposed FARP algorithm is compared to Gray Level Co-occurrence method and Spatial domain features. Correlation analysis is conducted on the extracted features from these three methods to identify the optimal feature set. Leather defects are classified using a multinomial logistic regression model and an ensemble classifier approach with random forest. Various measures, including accuracy, specificity, sensitivity, F-score, Mathew Correlation Coefficient, and ROC analysis using Z-test, are employed for a comprehensive evaluation. The experimental results indicate that the random forest and the proposed FARP feature set, achieves a remarkable classification accuracy of 88.67% and a notable area under the ROC curve of 0.875. This intelligent solution, which integrates FARP with the Random Forest classifier, surpasses the performance of manual expert leather defect classification, highlighting its superior effectiveness. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

29. Enhanced kinship verification analysis based on color and texture handcrafted techniques.

Author: Nader, Nermeen, EL-Gamal, Fatma EL-Zahraa A., and Elmogy, Mohammed
Subjects: *COLOR space, *KINSHIP, *COMPUTER vision, *EXTRACTION techniques
Abstract: Nowadays, kinship verification is an attractive research area within computer vision. It significantly affects applications in the real world, such as finding missing individuals and forensics. Despite the importance of this research topic, it still faces many challenges, such as low accuracy and illumination variations. Due to the existence of different classes of feature extraction techniques, different types of information can be extracted from the input data. Moreover, the fusion power produces complementary information that can address kinship verification problems. Therefore, this paper proposes a new approach for verifying kinship by fusing features from different perspectives, including color-texture and color features in different color spaces. Besides using promising methods in the field, such as local binary pattern (LBP) and scale-invariant feature transform (SIFT), the paper utilizes other feature extraction methods, which are heterogeneous auto-similarities of characteristics (HASC), color correlogram (CC), and dense color histogram (DCH). As far as we know, these features haven't been employed before in this research area. Accordingly, the proposed approach goes into six stages: preprocessing, feature extraction, feature normalization, feature fusion, feature representation, and kinship verification. The proposed approach was evaluated on the KinFaceW-I and KinFaceW-II field standard datasets, achieving maximum accuracy of 79.54% and 90.65%, respectively. Compared with many state-of-the-art approaches, the results of the proposed approach reflect the promising achievements and encourage the authors to plan for future enhancement. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

30. Diagnosis of bearing fault in induction motor using Bayesian optimization-based ensemble classifier.

Author: Krishna Veni, K. S. and Kumar, N. Senthil
Subjects: *INDUCTION motors, *FAULT diagnosis, *INDUCTION machinery, *ARTIFICIAL intelligence
Abstract: Electrical equipment plays a vital role in industry. Among various electrical equipment, induction motors are quite commonly used in many industrial applications. One of the most common faults that occurs in induction motors is bearing fault. In this article, bearing fault is diagnosed in an induction motor using vibration signals with the help of a simple Artificial Intelligence (AI)-based model. Because, the vibration signals are not dependent on the motor type, simple to measure, cost effective and yields good results. In the proposed system, accurate prediction of bearing condition is carried out using Bayesian optimization-based ensemble classifier (BOEC). The performance of the BOEC-based bearing fault diagnosis system is compared with other conventional techniques and the comparison results confirm the superior performance of the proposed system. Also, the accuracy obtained from the BOEC-based bearing fault diagnosis system is 99.97%. To verify the effectiveness of the proposed system, a hardware prototype is set up in the laboratory and bearing conditions of various induction motors are analyzed. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

31. REDUCING THE VOLUME OF COMPUTATIONS WHEN BUILDING ANALOGS OF NEURAL NETWORKS FOR THE FIRST STAGE OF AN ENSEMBLE CLASSIFIER WITH STACKING.

Author: Galchonkov, Oleg, Baranov, Oleksii, Chervonenko, Petr, and Babilunga, Oksana
Subjects: IMAGE recognition (Computer vision), ECCENTRIC loads, PROBLEM solving
Abstract: The object of research in this work is ensemble classifiers with stacking, intended for the classification of objects in images with the presence of small sets of labeled data for training. To improve the quality of classification at the first stage of such a classifier, it is necessary to place more primary classifiers that differ in heterogeneous structured processing. However, the number of known neural networks with appropriate characteristics is limited. One approach to solving this problem is to build analogs of known neural networks that make classification errors on other images compared to the base network. The disadvantage of the known methods for constructing such analogs is the need to perform additional floating-point operations. The current paper proposes and investigates a new method to form analogs through random cyclic shifts of rows or columns of input images. This has made it possible to completely eliminate additional floating-point operations. The effectiveness of using this method is explained by the structured processing of input images in basic neural networks. The use of analogs obtained by the proposed method does not impose additional restrictions in practice. This is because the heterogeneity of structured processing in basic neural networks is a typical requirement for them in an ensemble classifier with stacking. The simulation for the CIFAR-10 data set demonstrated that the proposed technique for constructing analogs allows for a comparative quality of classification by the ensemble classifier. Using MLP-Mixer analogs provided an improvement of 4.6 %, and CCT analogs – 5.9 %. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

32. Low Dimensional Multi Class Steganalysis of Spatial LSB based Stego Images Using Textural Features.

Author: Thanasekaran, Veena Sivasamy and Selvaraj, Arivazhagan
Published: 2024
Full Text: View/download PDF

33. A new camera model identification method based on color correction features.

Author: Liu, Yuan- yuan, Chen, Chong, Lin, Hong-wei, and Li, Zhu
Abstract: This paper focuses on source camera model identification technology in the field of digital image forensics. The research goal is to identify the source camera model, and researchers generally use the algorithm design of convolutional neural networks combined with noise residuals. However, traditional features such as noise residuals are easily polluted by noise and compression, which substantially affects the classification accuracy of source camera model identification algorithms for traditional features. Based on existing source camera model identification methods, this paper proposes the use of color correction features as the basic features of source camera model identification for the first time and proposes a new algorithm for source camera model identification based on image color correction features. A convolutional neural network is utilized to extract image color correction features and identify and classify source camera models. This paper has carried out experimental verification on a large-scale dataset, and the source camera model recognition accuracy of the proposed method in this paper can reach 97.23%; the recognition accuracy under compression conditions has reached 91.28%. The experimental results show that the image color correction feature is better than the source camera model in terms of recognition and has great research and application potential in the field of recognition. Additionally, the proposed algorithm is highly robust even after image compression and pollution, outperforming other methods under both original image conditions and compressed image conditions. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

34. An ensemble classification approach for cervical cancer prediction using behavioral risk factors

Author: Md Shahin Ali, Md Maruf Hossain, Moutushi Akter Kona, Kazi Rubaya Nowrin, and Md Khairul Islam
Subjects: Cervical cancer, Ensemble classifier, Data normalization, K fold cross-validation, SHapley Additive exPlanations (SHAP), Explainable Artificial Intelligence (XAI), Computer applications to medicine. Medical informatics, R858-859.7
Abstract: Cervical cancer is a significant public health concern among females worldwide. Despite being preventable, it remains a leading cause of mortality. Early detection is crucial for successful treatment and improved survival rates. This study proposes an ensemble Machine Learning (ML) classifier for efficient and accurate identification of cervical cancer using medical data. The proposed methodology involves preparing two datasets using effective preprocessing techniques, extracting essential features using the scikit-learn package, and developing an ensemble classifier based on Random Forest, Support Vector Machine, Gaussian Naïve Bayes, and Decision Tree classifier traits. Comparison with other state-of-the-art algorithms using several ML techniques, including support vector machine, decision tree, random forest, Naïve Bayes, logistic regression, CatBoost, and AdaBoost, demonstrates that the proposed ensemble classifier outperforms them significantly, achieving accuracies of 98.06% and 95.45% for Dataset 1 and Dataset 2, respectively. The proposed ensemble classifier outperforms current state-of-the-art algorithms by 1.50% and 6.67% for Dataset 1 and Dataset 2, respectively, highlighting its superior performance compared to existing methods. The study also utilizes a five-fold cross-validation technique to analyze the benefits and drawbacks of the proposed methodology for predicting cervical cancer using medical data. The Receiver Operating Characteristic (ROC) curves with corresponding Area Under the Curve (AUC) values are 0.95 for Dataset 1 and 0.97 for Dataset 2, indicating the overall performance of the classifiers in distinguishing between the classes. Additionally, we employed SHapley Additive exPlanations (SHAP) as an Explainable Artificial Intelligence (XAI) technique to visualize the classifier’s performance, providing insights into the important features contributing to cervical cancer identification. The results demonstrate that the proposed ensemble classifier can efficiently and accurately identify cervical cancer and potentially improve cervical cancer diagnosis and treatment.
Published: 2024
Full Text: View/download PDF

35. An automatic system to detect colorectal polyp using hybrid fused method from colonoscopy images

Author: Md. Nur-A-Alam, Khandaker Mohammad Mohi Uddin, M.M.R. Manu, Md. Mahbubur Rahman, and Mostofa Kamal Nasir
Subjects: Binary differential evolution (BDE), Colorectal cancer (CRC), Colonoscopy image, Convolutional neural network (CNN), Empirical mode decomposition (EMD), Ensemble classifier, Cybernetics, Q300-390, Electronic computers. Computer science, QA75.5-76.95
Abstract: The third most common disease in the world that leads to mortality is colorectal cancer (CRC). Colonoscopy, which locates and removes colonic polyps, is the most common procedure worldwide for the early identification and preventative treatment of colorectal cancer. The likelihood of a CRC patient dying can be considerably decreased by early diagnosis and treatment of precancerous polyps. Unidentified polyps may eventually turn into cancer. Although several classification algorithms have been put out to identify polyps, their effectiveness has not yet been compared to that of skilled endoscopes. An ensemble machine learning-based classification approach for detecting colorectal cancer from colonoscopy images is presented in this research. Firstly researchers collect colorectal cancer or polyp images from three standard datasets. In the preprocessing phase, researcher convert the RGB image to grayscale image and identified the region of interest (ROI) by removing the unwanted regions. A hybrid anisotropic diffusion filtering (HADF) approach was employed to eliminate noise from each image. Then the system extracts features from individual feature extractor methods and fused extracted features in a vector. The fused features help to detect colorectal polyp or cancer for increasing cancer or polyp identification rates. Finally, an ensemble classifier classifies the colorectal cancer or normal images and achieves better accuracy. The suggested technique exceeds the prior conventional methods, according to tests on widely used public datasets, improving accuracy by 99.45 %, sensitivity by 99.30 %, specificityby 99.58 %, and precision 99.53 %.
Published: 2024
Full Text: View/download PDF

36. A tweet sentiment classification approach using an ensemble classifier

Author: Vidyashree KP, Rajendra AB, Gururaj HL, Vinayakumar Ravi, and Moez Krichen
Subjects: Adaptive boosting, Ensemble classifier, Sentiment analysis, Tweets, Twitter API, Electronic computers. Computer science, QA75.5-76.95, Science
Abstract: Social media users are more receptive to products or events and share their thoughts through raw textual data, which is classified as semi-structured data. This data, which is presented using a variety of terminologies, is noisy by nature but yet contains important information and superfluous details, giving analysts a way to identify patterns and knowledge. This hidden information must be extracted from language data in order to make informed decisions and create strategic plans for entering new markets. Among the most prominent fields of study are natural language processing (NLP) and data mining techniques, especially when it comes to sentiment analysis—the process of identifying the feelings and insights concealed in the data. Twitter is one of the significant microblogging platform with millions of users. These users use Twitter to share sentiments using hash tags on different topics and to make status updates known as tweets. Twitter is therefore regarded as a significant real-time source and as one of the most active opinion indicators. The volume of information is produced by Twitter is enormous and manually scanning the entire data set is difficult process. The paper proposed an ensemble classifier to categorize emotion of the tweets on the basis of polarities such as positive and negative.In our study, we ensemble classifiers which is a combination of Random Forest (RF), Support Vector Machine (SVM) and Decision Tree (DT). The data is collected from Twitter API and the Twitter data is analysed autonomously to define public view on particular topic. The features obtained after the process of dimensionality reduction using LDA undergoes the stage of feature selection using Wrapper based technique. The iterative Wrapper based technique predict score for the features, the features with low score are ignored and high score is proceeded for classification. The ensemble classifier used Adaptive Boosting (AdaBoost) technique where the output from the Machine Learning (ML) classifiers are combined to produce a single output. Adaboost combines the poor classifiers and extracts the prediction value to make a better classifier. The experimental results show that the proposed ensemble classifier provides better accuracy of 93.42 % that is comparatively better than existing Convolutional Bidirectional - Long Short-Term Memory (ConvBiLSTM) classifier and Hybrid Lexicon- Naïve Bayes Classifier (HL-NBC) which produce classification accuracy of 91.53 % and 89.61 % respectively.
Published: 2024
Full Text: View/download PDF

37. Detecting Anomaly Classification Using PCA-Kmeans and Ensembled Classifier for Wind Turbines

Author: Prince Waqas Khan and Yung-Cheol Byun
Subjects: Wind turbine, anomaly detection, principal component analysis, k-means clustering, labeling, ensemble classifier, Distribution or transmission of electric power, TK3001-3521, Production of electric energy or power. Powerplants. Central stations, TK1001-1841
Abstract: Monitoring wind turbine performance is vital for ensuring wind turbines’ safe, efficient, and cost-effective operation over time. Using principal component analysis (PCA), k-means clustering for labeling, and an ensemble classifier for finding outliers, this study suggests a new way to find anomalies in wind turbines. The primary objective is to improve the precision of anomaly detection in wind turbines by leveraging machine-learning techniques. The proposed methodology utilizes the output of the PCA-Kmeans model to label supervisory control and data acquisition (SCADA) data. Furthermore, a stacking ensemble classifier is employed to refine the model’s precision. Our proposed model achieved a classification accuracy of 99%, which is a significant improvement compared to existing approaches. The significance of this study lies in its potential to enable more efficient wind turbine operation by identifying and resolving anomalies that may reduce their performance. This can ultimately contribute to achieving a sustainable and renewable energy future.
Published: 2024
Full Text: View/download PDF

38. Cross Subject Myocardial Infarction Detection From Vectorcardiogram Signals Using Binary Harry Hawks Feature Selection and Ensemble Classifiers

Author: M. Krishna Chaitanya and Lakhan Dev Sharma
Subjects: Vectorcardiography (VCG), myocardial infarction (MI), machine learning, binary Harry Hawks feature selection, ensemble classifier, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Myocardial infarction (MI), widely referred to as a heart attack, is a leading reason for deaths worldwide. It is frequently caused by coronary artery occlusion, resulting in inadequate oxygen and blood supply, which damages the myocardial structure and function. Therefore, innovative diagnostic methods are required for reliable and timely identification of MI. The typical 12-lead electrocardiogram (ECG) technology causes patient discomfort and makes cardiac monitoring challenging. The frontal, sagittal, and transverse planes (3 orthogonal planes) are where vectorcardiogram (VCG) renders an edge over 12-lead ECG. This study, proposes a method for detecting MI utilising VCG signals of four seconds. Circulant singular spectrum analysis (CSSA) and four stage savitzky-golay (SG) filter were used in the filtering stage for the removal of power-line interference and base-line wander. The signal was time-invariantly decomposed using the CSSA, then features were extracted. The binary harry hawks-based feature selection method is employed on the extracted features to choose the optimal feature subspace which was followed by supervised machine learning based classification. The 10-fold cross validation, an even more practical leave-one-out (LOO) cross validation approach, and inter dataset cross validation (IDCV) were used to evaluate the reliability of the suggested method. Voting-based ensemble classification was used in LOO, IDCV validation, which improves the accuracy of this method. The proposed technique achieved an accuracy of 99.97%, 91.03%, and 99.41% for 10-fold, LOO cross validation, and IDCV, out-performing the state-of-the-art methods in the cross validation scenarios. The proposed technique results in an accurate detection of MI. Successful accomplishment of the LOO cross validation demonstrates the applicability and dependability of the suggested technique in the health care applications.
Published: 2024
Full Text: View/download PDF

39. ECG signal classification via ensemble learning: addressing intra and inter-patient variations

Author: Mahajan, Madhavi, Kadam, Sonali, Kulkarni, Vinaya, Gujar, Jotiram, Naik, Sanah, Bibikar, Suruchi, Ochani, Ankita, and Pratap, Sakshi
Published: 2024
Full Text: View/download PDF

40. An Ensemble Classification Model for Medical Databases Using Hybrid Weights

Author: Ahammad, Shaik Hasane, Mohammed, Thayyaba Khatoon, Mandula, Preeti Chenabathini, Nidumolu, Venkatram, Suman, Maloji, Hossain, Md. Amzad, and Rashed, Ahmed Nabih Zaki
Published: 2024
Full Text: View/download PDF

41. Hybrid optimization assisted deep ensemble classification framework for skin cancer detection

Author: Pukhta, Irfan Rashid and Rout, Ranjeet Kumar
Published: 2024
Full Text: View/download PDF

42. A Space Infrared Dim Target Recognition Algorithm Based on Improved DS Theory and Multi-Dimensional Feature Decision Level Fusion Ensemble Classifier.

Author: Chen, Xin, Zhang, Hao, Zhang, Shenghao, Feng, Jiapeng, Xia, Hui, Rao, Peng, and Ai, Jianliang
Subjects: *RADIANT intensity, *RECOGNITION (Psychology), *SUPPORT vector machines, *SITUATIONAL awareness, *FEATURE extraction
Abstract: Space infrared dim target recognition is an important applications of space situational awareness (SSA). Due to the weak observability and lack of geometric texture of the target, it may be unreliable to rely only on grayscale features for recognition. In this paper, an intelligent information decision-level fusion method for target recognition which takes full advantage of the ensemble classifier and Dempster–Shafer (DS) theory is proposed. To deal with the problem that DS produces counterintuitive results when evidence conflicts, a contraction–expansion function is introduced to modify the body of evidence to mitigate conflicts between pieces of evidence. In this method, preprocessing and feature extraction are first performed on the multi-frame dual-band infrared images to obtain the features of the target, which include long-wave radiant intensity, medium–long-wave radiant intensity, temperature, emissivity–area product, micromotion period, and velocity. Then, the radiation intensities are fed to the random convolutional kernel transform (ROCKET) architecture for recognition. For the micromotion period feature, a support vector machine (SVM) classifier is used, and the remaining categories of the features are input into the long short-term memory network (LSTM) for recognition, respectively. The posterior probabilities corresponding to each category, which are the result outputs of each classifier, are constructed using the basic probability assignment (BPA) function of the DS. Finally, the discrimination of the space target category is implemented according to improved DS fusion rules and decision rules. Continuous multi-frame infrared images of six flight scenes are used to evaluate the effectiveness of the proposed method. The experimental results indicate that the recognition accuracy of the proposed method in this paper can reach 93% under the strong noise level (signal-to-noise ratio is 5). Its performance outperforms single-feature recognition and other benchmark algorithms based on DS theory, which demonstrates that the proposed method can effectively enhance the recognition accuracy of space infrared dim targets. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

43. Experimental analysis of intrusion detection systems using machine learning algorithms and artificial neural networks.

Author: Abdulkareem, Ademola, Somefun, Tobiloba Emmanuel, Mutalub, Adesina Lambe, and Adeyinka, Adewale
Abstract: Since the invention of the internet for military and academic research purposes, it has evolved to meet the demands of the increasing number of users on the network, who have their scope beyond military and academics. As the scope of the network expanded maintaining its security became a matter of increasing importance. With various users and interconnections of more diversified networks, the internet needs to be maintained as securely as possible for the transmission of sensitive information to be one hundred per cent safe; several anomalies may intrude on private networks. Several research works have been released around network security and this research seeks to add to the already existing body of knowledge by expounding on these attacks, proffering efficient measures to detect network intrusions, and introducing an ensemble classifier: a combination of 3 different machine learning algorithms. An ensemble classifier is used for detecting remote to local (R2L) attacks, which showed the lowest level of accuracy when the network dataset is tested using single machine learning models but the ensemble classifier gives an overall efficiency of 99.8%. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

44. Ensemble Model-Based Singer Classification with Proposed Vocal Segmentation.

Author: Kumaraswamy, Balachandra
Subjects: CONVOLUTIONAL neural networks, FEATURE extraction, INFORMATION retrieval, SINGERS, CLASSIFICATION
Abstract: Music information retrieval (MIR) is a major topic in the domain of music retrieval and indexing. SID and categorization is a subset of MIR that may be applied to a variety of problems. People have gotten increasingly attached to music, searching for songs based on a genre or a specific performer. Songs with singer information in their tags may be readily filtered, while others cannot be identified devoid of listening. However, the majority of the songs were left out of the specifics. This paper mainly aims to propose the new singer classification model, where, pre-processing is initially done. Further, vocal segmentation is done using time domain filtering (TDF) and frequency domain filtering (FDF) and improved short-time Fourier transform (STFT). Further, timbre features, short-term energy (STE), mel-frequency cepstral coefficients (MFCCs) and improved vibrato estimation features are extracted that are given to EC that includes deep convolutional neural network (DCNN), bidirectional long short-term memory (LSTM) and deep belief network (DBN). Additionally, the final result is derived by averaging each ensemble classifier's efficiency. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

45. A novel approach to recognition of Alzheimer's and Parkinson's diseases: random subspace ensemble classifier based on deep hybrid features with a super-resolution image.

Author: Alhudhaif, Adi
Subjects: ALZHEIMER'S disease, CONVOLUTIONAL neural networks, PARKINSON'S disease, HIGH resolution imaging, FEATURE extraction, DEEP learning
Abstract: Background. Artificial intelligence technologies have great potential in classifying neurodegenerative diseases such as Alzheimer's and Parkinson's. These technologies can aid in early diagnosis, enhance classification accuracy, and improve patient access to appropriate treatments. For this purpose, we focused on AI-based auto-diagnosis of Alzheimer's disease, Parkinson's disease, and healthy MRI images. Methods. In the current study, a deep hybrid network based on an ensemble classifier and convolutional neural network was designed. First, a very deep super-resolution neural network was adapted to improve the resolution of MRI images. Low and high-level features were extracted from the images processed with the hybrid deep convolutional neural network. Finally, these deep features are given as input to the k-nearest neighbor (KNN)-based random subspace ensemble classifier. Results. A 3-class dataset containing publicly available MRI images was utilized to test the proposed architecture. In experimental works, the proposed model produced 99.11% accuracy, 98.75% sensitivity, 99.54% specificity, 98.65% precision, and 98.70% F1-score performance values. The results indicate that our AI system has the potential to provide valuable diagnostic assistance in clinical settings. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

46. Metaverse Applications in Bioinformatics: A Machine Learning Framework for the Discrimination of Anti-Cancer Peptides.

Author: Danish, Sufyan, Khan, Asfandyar, Dang, L. Minh, Alonazi, Mohammed, Alanazi, Sultan, Song, Hyoung-Kyu, and Moon, Hyeonjoon
Subjects: *SHARED virtual environments, *MACHINE learning, *INDIVIDUALIZED medicine, *AMINO acid analysis, *PATIENT experience
Abstract: Bioinformatics and genomics are driving a healthcare revolution, particularly in the domain of drug discovery for anticancer peptides (ACPs). The integration of artificial intelligence (AI) has transformed healthcare, enabling personalized and immersive patient care experiences. These advanced technologies, coupled with the power of bioinformatics and genomic data, facilitate groundbreaking developments. The precise prediction of ACPs from complex biological sequences remains an ongoing challenge in the genomic area. Currently, conventional approaches such as chemotherapy, target therapy, radiotherapy, and surgery are widely used for cancer treatment. However, these methods fail to completely eradicate neoplastic cells or cancer stem cells and damage healthy tissues, resulting in morbidity and even mortality. To control such diseases, oncologists and drug designers highly desire to develop new preventive techniques with more efficiency and minor side effects. Therefore, this research provides an optimized computational-based framework for discriminating against ACPs. In addition, the proposed approach intelligently integrates four peptide encoding methods, namely amino acid occurrence analysis (AAOA), dipeptide occurrence analysis (DOA), tripeptide occurrence analysis (TOA), and enhanced pseudo amino acid composition (EPseAAC). To overcome the issue of bias and reduce true error, the synthetic minority oversampling technique (SMOTE) is applied to balance the samples against each class. The empirical results over two datasets, where the accuracy of the proposed model on the benchmark dataset is 97.56% and on the independent dataset is 95.00%, verify the effectiveness of our ensemble learning mechanism and show remarkable performance when compared with state-of-the-art (SOTA) methods. In addition, the application of metaverse technology in healthcare holds promise for transformative innovations, potentially enhancing patient experiences and providing novel solutions in the realm of preventive techniques and patient care. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

47. Securing AMI-IoT networks against multiple RPL attacks using ensemble learning IDS and light-chain based prediction detection and mitigation mechanisms.

Author: M. M., Savitha and P. I., Basarkod
Subjects: *MACHINE learning, *FEATURE selection, *INTRUSION detection systems (Computer security), *CRYPTOCURRENCIES, *INTERNET of things, *FORECASTING
Abstract: Advanced Metering Infrastructure (AMI) is one of the Internet of Things (IoT) enabled smart applications of smart grids. The Routing Protocol for Low Power and Lossy network (RPL) has been accepted to facilitate effective routing services for the AMI. However, numerous RPL attacks appear in AMI due to resource scarcity and dubious wireless medium, which significantly impedes the successful deployment of AMI-RPL. To enable secure and reliable AMI-RPL, this work proposes a novel Intrusion Detection System (IDS) named AMI Attack-aware Intelligent Machine learning IDS (AIMS). The primary objectives of AIMS are to predict, detect, and mitigate different types of RPL security attacks in the AMI environment. To predict the RPL attacks using the Stacked Ensemble (SE) machine learning model, a novel AMI-RPL Attack Dataset (ARAD) is generated by the Cooja simulator with the suitable pre-processing and the Spider Monkey Optimization (SMO) based feature selection. The advanced prediction of attack nodes improves the performance and significantly diminishes the future damages of AMI. The attack detection is based on immutable blocks of a light-chain model, and the cryptocurrency-based mitigation model effectively isolates the attackers. AIMS mechanism amplifies RPL security with high reliability and maximizes the AMI network lifetime by delivering superior results. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

48. Human addictive behavior prediction by using lime with ensemble model.

Author: Sabapathi, V., Jacob, Selvin Paul Peter, Chembian, Woothukadu Thirumaran, and Thinakaran, Kandasamy
Subjects: COMPULSIVE behavior, HUMAN behavior, CONVOLUTIONAL neural networks, TREATMENT of drug addiction, EMOTIONS, MACHINE theory
Abstract: The data-driven techniques have utilized data mining and machine learning (ML) techniques in the biomedical and healthcare fields. The process of decision-making in uncertain contextual related to human addictions and emotions play an important role in the present research. The main aim of the research is to perform classification and generate a support system for uncertain addiction circumstances by proposing a technique for drug addiction treatment. The human behavior has majority shown challenges for the prediction of human behaviors that includes body poses estimation, movements and interaction with objects. This pose estimation has showed complexity with more pose aspects and the proposed research attempts to understand the human behaviors. The present research uses the local interpretable model-agnostic explanations (LIME) for finding the input features which are most important to generate a particular output based on decision service. LIME understands the model to perturb the data samples as an input and understands shows predictions change. Also, the ensemble classifier contains classifiers group that combines for performing the prediction of all unseen instances based on voting. The proposed LIME Feature-Ensemble classifier obtained 97.54% of accuracy when compared to the existing convolutional neural network (CNN) of 59.33% and Ensemble model of 93.33% accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

49. Breast Cancer Diagnosis Using Majority Voting Ensemble Classifier Approach.

Author: Dhas, M. Mohana and Singh, N. Suresh
Subjects: CANCER diagnosis, PLURALITY voting, IMAGE recognition (Computer vision), BREAST cancer, FEATURE selection
Abstract: One of the most common cancer affects women are the Breast Cancer. These are predicted using the diagnosing methodologies. The CT scans are the most common screening mechanisms is the computer aided prediction mechanism. Still, based on the reasons of genetic information, the microarray data is applied as the solution for diagnosing the cancer cells. Dealing with the microarray data it has various consequences, from those consequences one of them is its high dimensionality. The primary intention of this approach is to introduce an ensemble model to diagnose the cancer cells from the dataset. Initially, Adaptive Guided Bilateral Filter (AGBF) is applied to denoise the input images and to filter the images by sharpening them. Afterwards, to segment the images by the conversion of RGB images into grey levels the threshold mechanism is used. Minimum Redundancy Maximum Relevance (MRMR) feature inclusion and Local Search Adaptive Beta Hill Climbing (ABHC) are two hybrid feature selection approaches used in this article where the features and its importance were stored on a dataset. Then the diagnosing of the breast cancer is done by obtaining the majority voting ensemble classifier. The proposed approach is evaluated using a Breast Cancer Histopathological Image Classification (BreakHis) dataset and achieves the classification accuracy of 99.51%. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

50. A Deep Stacked Ensemble Model for Microarray Data Classification with Boosted Meta Classifier.

Author: Mohanty, Mihir Narayan, Mohapatra, Saumendra Kumar, and Das, Abhishek
Subjects: *LYMPHOBLASTIC leukemia, *ACUTE myeloid leukemia, *EARLY detection of cancer, *SUPPORT vector machines, *CIRRHOSIS of the liver, *LUNGS
Abstract: Classification of microarray data is one of the major research interests in the biomedical field. It allows physicians for early detection of cancer through analysis of the Deoxyribonucleic acid (DNA). Classification of these sensitive data is still challenging due to the small sample and more feature size. In this paper, the authors have used an ensemble model for classifying two types of leukemia as Acute lymphocytic leukemia (ALL) and Acute myelocytic leukemia (AML). The work is carried out with other types of genetic data such as Leukemia, lung tumor, liver cancer, and liver Cirrhosis. The biomedical data are imbalanced. The ensemble classifier is based on a stacked approach where deep neural network (DNN) classifiers are used as the base classifier. The structure of each DNN is chosen as the homogenous type for the same training process for all classifiers. Because of the adaptive nature and random weight initialization, it provides different results for each classifier. The outputs of the base classifiers are again fed to a gradient boosting ensemble model termed a meta classifier. The meta classifier provides the final classification output. For comparison purposes, two types of meta-classifiers such as support vector machine (SVM) and ensemble gradient boosting are used in the proposed work. The performance of the model is verified well and the results are provided in the result section. From the experimental result, it is observed that the classification accuracy is 96% with SVM and 98% with boosted meta classifier for leukemia data, whereas 99.04% for lung tumor and 99.03% for liver Cirrhosis. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

1,163 results on '"ensemble classifier"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources