183 results
Search Results
2. Special issue on deep learning and big data analytics for medical e-diagnosis/AI-based e-diagnosis.
- Author
-
Fong, Simon, Fortino, Giancarlo, Ghista, Dhanjoo, and Piccialli, Francesco
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,MACHINE learning ,ARTIFICIAL intelligence ,BIG data ,CONVOLUTIONAL neural networks - Abstract
The model integrates artificial intelligence (AI) and big data analytics, utilizing IoMT devices for data acquisition and Hadoop ecosystem for managing big data. The field of medical diagnosis is currently undergoing a remarkable transformation with the emergence of artificial intelligence (AI) techniques, particularly deep learning and big data analytics. By harnessing the power of deep learning and big data analytics, AI-based e-diagnosis has the potential to revolutionize healthcare delivery. [Extracted from the article]
- Published
- 2023
- Full Text
- View/download PDF
3. An automatic improved facial expression recognition for masked faces.
- Author
-
ELsayed, Yasmeen, ELSayed, Ashraf, and Abdou, Mohamed A.
- Subjects
FACIAL expression ,CONVOLUTIONAL neural networks ,EMOTION recognition ,FACIAL expression & emotions (Psychology) ,MACHINE learning - Abstract
Automatic facial expression recognition (AFER), sometimes referred to as emotional recognition, is important for socializing. Automatic methods in the past two years faced challenges due to Covid-19 and the vital wearing of a mask. Machine learning techniques tremendously increase the amount of data processed and achieved good results in such AFER to detect emotions; however, those techniques are not designed for masked faces and thus achieved poor recognition. This paper introduces a hybrid convolutional neural network aided by a local binary pattern to extract features in an accurate way, especially for masked faces. The basic seven emotions classified into anger, happiness, sadness, surprise, contempt, disgust, and fear are to be recognized. The proposed method is applied on two datasets: the first represents CK and CK +, while the second represents M-LFW-FER. Obtained results show that emotion recognition with a face mask achieved an accuracy of 70.76% on three emotions. Results are compared to existing techniques and show significant improvement. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
4. Applications of deep learning for mobile malware detection: A systematic literature review.
- Author
-
Catal, Cagatay, Giray, Görkem, and Tekinerdogan, Bedir
- Subjects
FEATURE selection ,DEEP learning ,SUPERVISED learning ,MALWARE ,CONVOLUTIONAL neural networks ,MACHINE learning - Abstract
For detecting and resolving the various types of malware, novel techniques are proposed, among which deep learning algorithms play a crucial role. Although there has been a lot of research on the development of DL-based mobile malware detection approaches, they were not reviewed in detail yet. This paper aims to identify, assess, and synthesize the reported articles related to the application of DL techniques for mobile malware detection. A Systematic Literature Review is performed in which we selected 40 journal articles for in-depth analysis. This SLR presents and categorizes these articles based on machine learning categories, data sources, DL algorithms, evaluation parameters & approaches, feature selection techniques, datasets, and DL implementation platforms. The study also highlights the challenges, proposed solutions, and future research directions on the use of DL in mobile malware detection. This study showed that Convolutional Neural Networks and Deep Neural Networks algorithms are the most used DL algorithms. API calls, Permissions, and System Calls are the most dominant features utilized. Keras and Tensorflow are the most popular platforms. Drebin and VirusShare are the most widely used datasets. Supervised learning and static features are the most preferred machine learning and data source categories. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
5. A review on evaluating mental stress by deep learning using EEG signals.
- Author
-
Badr, Yara, Tariq, Usman, Al-Shargie, Fares, Babiloni, Fabio, Al Mughairbi, Fadwa, and Al-Nashash, Hasan
- Subjects
- *
DEEP learning , *ARTIFICIAL neural networks , *MACHINE learning , *CONVOLUTIONAL neural networks , *ELECTROENCEPHALOGRAPHY , *REPRESENTATIONS of graphs , *JOB stress - Abstract
Mental stress is a common problem that affects individuals all over the world. Stress reduces human functionality during routine work and may lead to severe health defects. Early detection of stress is important for preventing diseases and other negative health-related consequences of stress. Several neuroimaging techniques have been utilized to assess mental stress, however, due to its ease of use, robustness, and non-invasiveness, electroencephalography (EEG) is commonly used. This paper aims to fill a knowledge gap by reviewing the different EEG-related deep learning algorithms with a focus on Convolutional Neural Networks (CNNs) and Long Short-Term Memory networks (LSTMs) for the evaluation of mental stress. The review focuses on data representation, individual deep neural network model architectures, hybrid models, and results amongst others. The contributions of the paper address important issues such as data representation and model architectures. Out of all reviewed papers, 67% used CNN, 9% LSTM, and 24% hybrid models. Based on the reviewed literature, we found that dataset size and different representations contributed to the performance of the proposed networks. Raw EEG data produced classification accuracy around 62% while using spectral and topographical representation produced up to 88%. Nevertheless, the roles of generalizability across different deep learning models and individual differences remain key areas of inquiry. The review encourages the exploration of innovative avenues, such as EEG data image representations concurrently with graph convolutional neural networks (GCN), to mitigate the impact of inter-subject variability. This novel approach not only allows us to harmonize structural nuances within the data but also facilitates the integration of temporal dynamics, thereby enabling a more comprehensive assessment of mental stress levels. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Research on simulation of 3D human animation vision technology based on an enhanced machine learning algorithm.
- Author
-
Yuan, Henning, Lee, Jong Han, and Zhang, Sai
- Subjects
MACHINE learning ,HUMAN activity recognition ,THREE-dimensional modeling ,3-D animation ,COGNITIVE processing speed ,COURSEWARE ,CONVOLUTIONAL neural networks - Abstract
This paper provides an in-depth analysis and study of the simulation of 3D human animation visualization techniques by enhancing machine learning algorithms. Based on the statistical analysis of the data obtained from different measurement methods, the extraction of human body feature parameters based on millimeter-wave point cloud data is realized, and the 3D reconstruction and simulation of the human body are realized using parametric human modeling software. In video-based action recognition, most methods are data-driven and use deep networks to automatically learn features of the entire video image. In this process, specific research on human actions is not included or reflected. However, human action recognition is a processing of the semantic level of video content. Realizing universal human action recognition requires a semantic understanding of human behavior. Firstly, the geometric feature analysis of the 3D scanned human model is performed to extract the human body shape characteristic parameters, and the research on the analysis and estimation methods of body shape characteristic parameters is carried out to establish the human body shape parameter relationship model; then, the millimeter-wave point cloud is calculated and measured, the Li group features extracted using the group skeletal representation model with high data dimensionality, to be able to process the high-dimensional data, while reducing the complexity of the recognition process and speeding up the computation, feature learning and classification are performed with convolutional neural networks. To verify the better library portability and robustness of the method in this paper, the method was tested on a self-built human action database in the laboratory, and an average recognition rate of 97.26% was achieved. Meanwhile, this paper investigates the natural interaction application of virtual characters in a virtual learning environment based on human action recognition. Four testers tested the virtual human–computer interaction system of this paper, respectively, and the final test results show that the system has flexibility and stability. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
7. Vision-based size classification of iron ore pellets using ensembled convolutional neural network.
- Author
-
Deo, Arya Jyoti, Sahoo, Animesh, Behera, Santosh Kumar, and Das, Debi Prasad
- Subjects
CONVOLUTIONAL neural networks ,IRON ores ,PELLETIZING ,DATA augmentation ,MACHINE learning - Abstract
In an iron ore pelletization plant, pellets are produced inside a rotating disc pelletizer. Online pellet size distribution is an important performance indicator of the pelletization process. Image processing-based system is an effective solution for online size analysis of iron ore pellets. This paper proposes a machine learning algorithm for estimating the size class of the pellets during their production by imaging from an area inside the disc pelletizer. Instead of computing the size of each individual pellets in the acquired image, this method proposes a qualitative approach to get the overall size estimate of the pellets in production. The key idea of this paper is to find out whether the disc is producing VERY SMALL, SMALL, MEDIUM, or BIG-sized pellets. A weighted average ensemble of different convolutional neural networks such as VGG16, Mobilenet, and Resnet50 is used to achieve this objective. Furthermore, batch normalization is applied to improve the estimation performance of the proposed model. A novel data augmentation method is applied to the in situ captured images to create the data set used to train and evaluate the proposed ensemble of CNN models. Results of experiments indicate that it is possible to detect the operating state of the pelletization disc by acquiring images from the inside area of the disc with sufficient accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
8. Hierarchical capsule network for hyperspectral image classification.
- Author
-
Shi, Meilin, Wang, Ruoxiang, and Ren, Jiansi
- Subjects
DEEP learning ,CAPSULE neural networks ,IMAGE recognition (Computer vision) ,MACHINE learning ,CONVOLUTIONAL neural networks ,FEATURE extraction - Abstract
Hyperspectral imaging is a highly advanced and sophisticated method for capturing images in hundreds of narrow, contiguous spectral bands. However, processing and analyzing such large amounts of data are challenging. Deep learning algorithms, especially those based on convolutional neural networks (CNNs), effectively extract rich feature representations from complex datasets, such as hyperspectral images (HSIs). These representations capture high-level patterns and characteristics that can facilitate accurate classification. The success of this approach has led to its widespread use in various remote sensing applications. However, the standard CNN inputs and outputs are scalars, ignoring the relative position relationships between features. In this paper, we propose a hierarchical capsule network for HSI classification. This network incorporates a multi-level convolutional structure for feature extraction and fusion. It utilizes convolutional feature maps of various depths to generate initial capsules, followed by vector computation using capsule neurons and a weight matrix to encode spatial location relationships among features. Furthermore, the shallow convolution of the hierarchical capsule network is pre-trained based on transfer learning to further improve the performance of HSI classification. According to experimental results, the proposed method for hyperspectral image classification has been found to outperform other state-of-the-art deep learning models on four benchmark datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. Refinement of ensemble strategy for acute lymphoblastic leukemia microscopic images using hybrid CNN-GRU-BiLSTM and MSVM classifier.
- Author
-
Mohammed, Kamel K., Hassanien, Aboul Ella, and Afify, Heba M.
- Subjects
DEEP learning ,LYMPHOBLASTIC leukemia ,ACUTE leukemia ,MACHINE learning ,CONVOLUTIONAL neural networks ,DATABASES - Abstract
Acute lymphocytic leukemia (ALL) is a common serious cancer in white blood cells (WBC) that advances quickly and produces abnormal cells in the bone marrow. Cancerous cells associated with ALL lead to impairment of body systems. Microscopic examination of ALL in a blood sample is applied manually by hematologists with many defects. Computer-aided leukemia image detection is used to avoid human visual recognition and to provide a more accurate diagnosis. This paper employs the ensemble strategy to detect ALL cells versus normal WBCs using three stages automatically. Firstly, image pre-processing is applied to handle the unbalanced database through the oversampling process. Secondly, deep spatial features are generated using a convolution neural network (CNN). At the same time, the gated recurrent unit (GRU)-bidirectional long short-term memory (BiLSTM) architecture is utilized to extract long-distance dependent information features or temporal features to obtain active feature learning. Thirdly, a softmax function and the multiclass support vector machine (MSVM) classifier are used for the classification mission. The proposed strategy has the resilience to classify the C-NMC 2019 database into two categories by using splitting the entire dataset into 90% as training and 10% as testing datasets. The main motivation of this paper is the novelty of the proposed framework for the purposeful and accurate diagnosis of ALL images. The proposed CNN-GRU-BiLSTM-MSVM is simply stacked by existing tools. However, the empirical results on C-NMC 2019 database show that the proposed framework is useful to the ALL image recognition problem compared to previous works. The DenseNet-201 model yielded an F1-score of 96.23% and an accuracy of 96.29% using the MSVM classifier in the test dataset. The findings exhibited that the proposed strategy can be employed as a complementary diagnostic tool for ALL cells. Further, this proposed strategy will encourage researchers to augment the rare database, such as blood microscopic images by creating powerful applications in terms of combining machine learning with deep learning algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Special issue on towards advancements in machine learning for exploiting large-scale and heterogeneous repositories.
- Author
-
Anwar, Sajid and Rocha, Álvaro
- Subjects
MACHINE learning ,DEEP learning ,INFORMATION technology ,CHILDREN with autism spectrum disorders ,EXPERT systems ,CONVOLUTIONAL neural networks ,COMPUTER vision - Abstract
Experiments on the CheXpert dataset, which is a publicly available multi-disease chest radiograph dataset, and the TBX11K dataset show that the proposed model generates identical results. The MOFPA-WT is tested using a standard EEG signal processing dataset, namely the EEG motor movement/imagery dataset. [Extracted from the article]
- Published
- 2023
- Full Text
- View/download PDF
11. A maximum-entropy-attention-based convolutional neural network for image perception.
- Author
-
Chen, Qili, Zhang, Ancai, and Pan, Guangyuan
- Subjects
CONVOLUTIONAL neural networks ,DEEP learning ,OBJECT recognition (Computer vision) ,FEATURE extraction ,TRAFFIC signs & signals - Abstract
In recent years, image perception such as enhancement, classification and object detection with deep learning has achieved significant successes. However, in real world under extreme conditions, the training of a deep learning model often yields low accuracy, low efficiency in feature extraction and generalizability, due to the inner uncourteous and uninterpretable characteristics. In this paper, a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework is proposed. A maximum entropy algorithm is first used for image feature pre-extraction. An attention mechanism is then proposed by combining the extracted features on original images. By applying the mechanism, the key areas of an image are enhanced, and noised area can be ignored. Afterward, the processed images are transferred into region convolutional neural network, which is a well-known pre-trained CNN model, for further feature learning and extraction. Finally, two real-world experiments on traffic sign recognition and road surface condition monitoring are designed. The results show that the proposed framework has high testing accuracy, with improvements of 17% and 2.9%, compared with some other existing methods. In addition, the features extracted by the model are more easily interpretable. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
12. Special issue on Latin–American computational intelligence.
- Author
-
Estevez, Pablo A., Sbarbaro, Daniel, and Curilem, Millaray
- Subjects
COMPUTATIONAL intelligence ,ANT algorithms ,PARTICLE swarm optimization ,MACHINE learning ,SWARM intelligence ,SUPERVISED learning ,CONVOLUTIONAL neural networks - Abstract
The aim of this Topical Collection is to present the latest advances in Computational Intelligence either performed in Latin America, or with the important participation of Latin-American researchers. Three different algorithms are tuned: an ant colony optimization algorithm for solving the multidimensional knapsack problem, a genetic algorithm for solving landscapes that follow the NK model (N components and degree K), and a particle swarm optimization algorithm for solving continuous optimization problems. A hybrid feature selection method is employed that integrates a population-based meta-heuristic model, called Grey Wolf optimization, and a single solution-based meta-heuristic model, called the vortex search algorithm. [Extracted from the article]
- Published
- 2023
- Full Text
- View/download PDF
13. Discrimination of cycling patterns using accelerometric data and deep learning techniques.
- Author
-
Procházka, Aleš, Charvátová, Hana, Vyšata, Oldřich, Jarchi, Delaram, and Sanei, Saeid
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,GLOBAL Positioning System ,CONVOLUTIONAL neural networks ,SUPPORT vector machines ,ELECTRONIC data processing - Abstract
The monitoring of physical activities and recognition of motion disorders belong to important diagnostical tools in neurology and rehabilitation. The goal of the present paper is in the contribution to this topic by (1) analysis of accelerometric signals recorded by wearable sensors located at specific body positions and by (2) implementation of deep learning methods to classify signal features. This paper uses the general methodology to analysis of accelerometric signals acquired during cycling at different routes followed by the global positioning system. The experimental dataset includes 850 observations that were recorded by a mobile device in the spine area (L3 verterbra) for cycling routes with the different slope. The proposed methodology includes the use of deep learning convolutional neural networks with five layers applied to signal values transformed into the frequency domain without specification of any signal features. The accuracy of discrimination between different motion patterns for the uphill and downhill cycling and recognition of 4 classes associated with different route slopes was 96.6% with the loss criterion of 0.275 for sigmoidal activation functions. These results were compared with those evaluated for selected sets of features estimated for each observation and classified by the support vector machine, Bayesian methods, and the two-layer neural network. The best cross-validation error of 0.361 was achieved for the two-layer neural network model with the sigmoidal and softmax transfer functions. Our methodology suggests that deep learning neural networks are efficient in the assessment of motion activities for automated data processing and have a wide range of applications, including rehabilitation, early diagnosis of neurological problems, and possible use in engineering as well. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
14. DeepAHR: a deep neural network approach for recognizing Arabic handwritten recognition.
- Author
-
AlShehri, Helala
- Subjects
- *
ARTIFICIAL neural networks , *CONVOLUTIONAL neural networks , *PATTERN recognition systems , *DEEP learning - Abstract
Automatic handwritten character recognition plays a significant role in various applications across multiple fields. With the growing interest in automatic handwriting recognition and the advancement of deep learning methods, researchers have achieved significant improvements in the development of English handwriting recognition methods. However, the recognition of Arabic handwriting has received insufficient attention. In this paper, a novel "DeepAHR" model is presented to accurately and efficiently recognize Arabic handwritten characters using deep learning techniques. The "DeepAHR" model is based on a convolutional neural network (CNN) and is trained using two recent public datasets: Hijaa and Arabic handwritten characters dataset (AHCD). The overall accuracies of the proposed model were 98.66% and 88.24% on the AHCD and Hijaa datasets, respectively.The experimental results showed that DeepAHR outperformed state-of-the-art methods in the literature. These promising results provide evidence of the successful use of the DeepAHR model for recognizing handwritten Arabic characters [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Deep learning-powered multimodal biometric authentication: integrating dynamic signatures and facial data for enhanced online security.
- Author
-
Salturk, Serkan and Kahraman, Nihan
- Subjects
- *
DEEP learning , *BIOMETRIC identification , *CONVOLUTIONAL neural networks , *MACHINE learning , *VIRTUAL reality - Abstract
The significant increase in online activities in the wake of recent global events has underlined the importance of biometric person authentication on digital platforms. Although many biometric devices may be used for precise biometric authentication, acquiring the necessary technology, such as 3D sensors or fingerprint scanners, can be prohibitively expensive and logistically challenging. Addressing the demands of online environments, where access to specialized hardware is limited, this paper introduces an innovative approach. In this work, by fusing static and dynamic signature data with facial data captured through regular computer cameras, a dataset of 1750 samples from 25 individuals is constructed. Deep learning models, including convolutional neural networks (CNN), long short-term memory (LSTM), gated recurrent unit (GRU), and temporal convolutional networks (TCN), are employed to craft a robust multi-classification model. This integration of various deep learning algorithms has demonstrated remarkable performance enhancements in biometric authentication. This research also underscores the potential of merging dynamic and static biometric features, derived from readily available sources, to yield a high-performance recognition framework. As online interactions continue to expand, the combination of various biometric modalities holds potential for enhancing the security and usability of virtual environments. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. A modified Adam algorithm for deep neural network optimization.
- Author
-
Reyad, Mohamed, Sarhan, Amany M., and Arafa, M.
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,OPTIMIZATION algorithms ,MACHINE learning ,ALGORITHMS - Abstract
Deep Neural Networks (DNNs) are widely regarded as the most effective learning tool for dealing with large datasets, and they have been successfully used in thousands of applications in a variety of fields. Based on these large datasets, they are trained to learn the relationships between various variables. The adaptive moment estimation (Adam) algorithm, a highly efficient adaptive optimization algorithm, is widely used as a learning algorithm in various fields for training DNN models. However, it needs to improve its generalization performance, especially when training with large-scale datasets. Therefore, in this paper, we propose HN Adam, a modified version of the Adam Algorithm, to improve its accuracy and convergence speed. The HN_Adam algorithm is modified by automatically adjusting the step size of the parameter updates over the training epochs. This automatic adjustment is based on the norm value of the parameter update formula according to the gradient values obtained during the training epochs. Furthermore, a hybrid mechanism was created by combining the standard Adam algorithm and the AMSGrad algorithm. As a result of these changes, the HN_Adam algorithm, like the stochastic gradient descent (SGD) algorithm, has good generalization performance and achieves fast convergence like other adaptive algorithms. To test the proposed HN_Adam algorithm performance, it is evaluated to train a deep convolutional neural network (CNN) model that classifies images using two different standard datasets: MNIST and CIFAR-10. The algorithm results are compared to the basic Adam algorithm and the SGD algorithm, in addition to other five recent SGD adaptive algorithms. In most comparisons, the HN Adam algorithm outperforms the compared algorithms in terms of accuracy and convergence speed. AdaBelief is the most competitive of the compared algorithms. In terms of testing accuracy and convergence speed (represented by the consumed training time), the HN-Adam algorithm outperforms the AdaBelief algorithm by an improvement of 1.0% and 0.29% for the MNIST dataset, and 0.93% and 1.68% for the CIFAR-10 dataset, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
17. A classical–quantum convolutional neural network for detecting pneumonia from chest radiographs.
- Author
-
Kulkarni, Viraj, Pawale, Sanjesh, and Kharat, Amit
- Subjects
CONVOLUTIONAL neural networks ,CHEST X rays ,QUANTUM computing ,COUGH ,NETWORK performance ,MACHINE learning - Abstract
While many quantum computing techniques for machine learning have been proposed, their performance on real-world datasets remains to be studied. In this paper, we explore how a variational quantum circuit could be integrated into a classical neural network for the problem of detecting pneumonia from chest radiographs. We substitute one layer of a classical convolutional neural network with a variational quantum circuit to create a hybrid neural network. We train both networks on an image dataset containing chest radiographs and benchmark their performance. To mitigate the influence of different sources of randomness in network training, we sample the results over multiple rounds. We show that the hybrid network outperforms the classical network on different performance measures and that these improvements are statistically significant. Our work serves as an experimental demonstration of the potential of quantum computing to significantly improve neural network performance for real-world, non-trivial problems relevant to society and industry. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Machine learning-based diffusion model for prediction of coronavirus-19 outbreak.
- Author
-
Raheja, Supriya, Kasturia, Shreya, Cheng, Xiaochun, and Kumar, Manoj
- Subjects
CONVOLUTIONAL neural networks ,PREDICTION models ,MACHINE learning ,SUPPORT vector machines ,COVID-19 ,COVID-19 pandemic - Abstract
The coronavirus pandemic has been globally impacting the health and prosperity of people. A persistent increase in the number of positive cases has boost the stress among governments across the globe. There is a need of approach which gives more accurate predictions of outbreak. This paper presents a novel approach called diffusion prediction model for prediction of number of coronavirus cases in four countries: India, France, China and Nepal. Diffusion prediction model works on the diffusion process of the human contact. Model considers two forms of spread: when the spread takes time after infecting one person and when the spread is immediate after infecting one person. It makes the proposed model different over other state-of-the art models. It is giving more accurate results than other state-of-the art models. The proposed diffusion prediction model forecasts the number of new cases expected to occur in next 4 weeks. The model has predicted the number of confirmed cases, recovered cases, deaths and active cases. The model can facilitate government to be well prepared for any abrupt rise in this pandemic. The performance is evaluated in terms of accuracy and error rate and compared with the prediction results of support vector machine, logistic regression model and convolution neural network. The results prove the efficiency of the proposed model. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
19. Brain-inspired computing and machine learning.
- Author
-
Iliadis, Lazaros S., Kurkova, Vera, and Hammer, Barbara
- Subjects
MACHINE learning ,BIOLOGICALLY inspired computing ,DEEP learning ,CONVOLUTIONAL neural networks ,ABDOMINAL aorta ,COMPUTATIONAL mathematics ,MEDICAL sciences ,RETINAL ganglion cells - Published
- 2020
- Full Text
- View/download PDF
20. Parkinson classification neural network with mass algorithm for processing speech signals.
- Author
-
Akila, B. and Nayahi, J. Jesu Vedha
- Subjects
- *
ARTIFICIAL neural networks , *SIGNAL processing , *DEEP learning , *MACHINE learning , *CONVOLUTIONAL neural networks , *PARKINSON'S disease , *SPEECH perception , *DEEP brain stimulation - Abstract
Parkinson's disease (PD) is a condition that degenerates over time and impairs speech and pronunciation because brain cells have died. This research work aims to predict parkinson disease using the voice features extracted from speech signals recorded from PD individuals with dysphonic speech disorders by employing deep learning algorithms. PD is challenging to diagnose early on in the clinical presentation. To address the issue in machine learning methods, this paper proposes a neural network model by processing speech signals to classify PD using the University of California Irvine (UCI) machine learning repository dataset. Initially, a pre-loss reduction module is created by using pre-sampling to make the dataset balanced by reducing the dimensionality and maintaining the size of the space without influencing the learning process for data preparation. The relevant features are derived using a novel multi-agent salp swarm (MASS) algorithm, and a novel Parkinson classification neural network (PCNN) is proposed to classify Parkinson's patients with high accuracy employing these derived features. The result shows that the models that use MASS-PCNN produce higher classification accuracy of 99.1%, precision of 97.8%, recall of 94.7% and F1-score of 0.995 when paralleled to the existing models. As an outcome, the suggested model will perform superior to common convolutional neural networks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. Personalized learning efficiency data analysis based on multi-scale convolution architecture and hybrid loss.
- Author
-
Jin, Dan, Wen, Xiaolan, and Wen, Yiming
- Subjects
- *
INDIVIDUALIZED instruction , *CONVOLUTIONAL neural networks , *DATA analysis , *EDUCATIONAL outcomes - Abstract
Personalized learning has gained significant attention in education as a means to cater to the diverse needs of learners and optimize educational outcomes. However, ensuring the efficiency of personalized learning remains a challenge. It requires the ability to accurately analyze and interpret vast amounts of data collected from learners. Traditional analytical approaches often struggle to handle the complexity and heterogeneity of this data, limiting the potential for personalized learning interventions. To address these challenges, this paper proposes a personalized learning efficiency data analysis network (PLEDANet) based on machine learning. First, PLEDANet redesigns a convolutional neural network based on the ResNet structure. The network performs convolutions using multiple convolution kernels of different scales to extract diverse feature information from personalized learning efficiency data. To enhance the extraction and representation of fine-grained differentiated features, PLEDANet introduces a hybrid attention module to combine channel and spatial information among feature maps. Second, PLEDANet designs a hybrid loss function for model training, which consists of the AM-softmax loss and the Center loss. The former increases the inter-class distance of features by imposing a fixed angular margin, while the latter reduces the intra-class distance by constraining the samples and feature centers. Finally, extensive experiments are conducted on PLEDANet. The experimental results validate the superiority of PLEDANet for personalized learning efficiency analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Deep learning-based comprehensive review on pulmonary tuberculosis.
- Author
-
Bansal, Twinkle, Gupta, Sheifali, and Jindal, Neeru
- Subjects
- *
TUBERCULOSIS , *CONVOLUTIONAL neural networks , *DELAYED diagnosis , *DEEP learning , *MEDICAL screening , *MACHINE learning - Abstract
In areas with high tuberculosis (TB) prevalence, high mortality rate has significantly increased over the past few decades. Even though tuberculosis can be treated, areas with high disease burden continue to have insufficient screening tools, leading to diagnostic delays and incorrect diagnoses. As a result of these challenges, a computer-aided diagnostics (CAD) system has been developed that can automatically detect tuberculosis. There are few different methods that can be used to screen for tuberculosis; however, chest X-ray (CXR) is most commonly used and strongly suggested because it is so effective in identifying lung irregularities. Over past ten years, we have seen a meteoric rise in amount of research conducted into application of machine learning strategies to examination of chest X-ray images for screening regarding pulmonary abnormalities. Particularly, we have also noticed significant interest in testing for TB. This attentiveness has increased in tandem with phenomenal progress that has been made in deep learning (DL), which is predominately founded on convolutional neural networks (CNNs). Because of these advancements, significant research contributions have been made in field of DL techniques for TB screening by utilizing CXR images. The main focus of this paper is to emphasize favorable methods and data collection, as well as methodological contributions, identify data collections, and identify challenges. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Machine learning applications in detection and diagnosis of urology cancers: a systematic literature review.
- Author
-
Lubbad, M., Karaboga, D., Basturk, A., Akay, B., Nalbantoglu, U., and Pacal, I.
- Subjects
- *
DEEP learning , *MACHINE learning , *CANCER diagnosis , *CONVOLUTIONAL neural networks , *RENAL cell carcinoma , *ARTIFICIAL intelligence - Abstract
Deep learning integration in cancer diagnosis enhances accuracy and diagnosis speed which helps clinical decision-making and improves health outcomes. Despite all these benefits in cancer diagnosis, the present AI models in urology cancer diagnosis have not been sufficiently reviewed systematically. This paper reviews the artificial intelligence approaches used in cancer diagnosis, prediction, and treatment of urology cancer. AI models and their applications in urology subspecialties are evaluated and discussed. The Scopus, Microsoft Academic and PubMed/MEDLINE databases were searched in November 2022 using the terms "artificial intelligence", "neural network", "machine learning," or "deep learning" combined with the phrase "urology cancers". The search was limited to publications published within the previous 20 years to identify cutting-edge deep-learning applications published in English. Irrelevant review articles and publications were eliminated. The included research involves two kinds of research analysis: quantitative and qualitative. 48 articles were included in this survey. 25 studies proposed several approaches for prostate cancers, while 15 were for bladder cancers. 8 studies discussed renal cell carcinoma and kidney cancer. The models presented to detect urology cancers have achieved high detection accuracy (77–95%). Deep learning approaches that use convolutional neural networks have achieved the highest accuracy among other techniques. Although it is still progressing, the development of AI models for urology cancer detection, prediction, and therapy has shown significant promise. Additional research is required to employ more extensive, higher-quality, and more recent datasets to the clinical performance of the proposed AI models in urology cancer applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. The deep learning applications in IoT-based bio- and medical informatics: a systematic literature review.
- Author
-
Amiri, Zahra, Heidari, Arash, Navimipour, Nima Jafari, Esmaeilpour, Mansour, and Yazdani, Yalda
- Subjects
- *
MEDICAL informatics , *DEEP learning , *CONVOLUTIONAL neural networks , *CLINICAL decision support systems , *GENERATIVE adversarial networks , *RECURRENT neural networks , *DRUG discovery - Abstract
Nowadays, machine learning (ML) has attained a high level of achievement in many contexts. Considering the significance of ML in medical and bioinformatics owing to its accuracy, many investigators discussed multiple solutions for developing the function of medical and bioinformatics challenges using deep learning (DL) techniques. The importance of DL in Internet of Things (IoT)-based bio- and medical informatics lies in its ability to analyze and interpret large amounts of complex and diverse data in real time, providing insights that can improve healthcare outcomes and increase efficiency in the healthcare industry. Several applications of DL in IoT-based bio- and medical informatics include diagnosis, treatment recommendation, clinical decision support, image analysis, wearable monitoring, and drug discovery. The review aims to comprehensively evaluate and synthesize the existing body of the literature on applying deep learning in the intersection of the IoT with bio- and medical informatics. In this paper, we categorized the most cutting-edge DL solutions for medical and bioinformatics issues into five categories based on the DL technique utilized: convolutional neural network, recurrent neural network, generative adversarial network, multilayer perception, and hybrid methods. A systematic literature review was applied to study each one in terms of effective properties, like the main idea, benefits, drawbacks, methods, simulation environment, and datasets. After that, cutting-edge research on DL approaches and applications for bioinformatics concerns was emphasized. In addition, several challenges that contributed to DL implementation for medical and bioinformatics have been addressed, which are predicted to motivate more studies to develop medical and bioinformatics research progressively. According to the findings, most articles are evaluated using features like accuracy, sensitivity, specificity, F-score, latency, adaptability, and scalability. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. A hybrid deep learning approach for classification of music genres using wavelet and spectrogram analysis.
- Author
-
Jena, Kalyan Kumar, Bhoi, Sourav Kumar, Mohapatra, Sonalisha, and Bakshi, Sambit
- Subjects
DEEP learning ,POPULAR music genres ,CONVOLUTIONAL neural networks ,WAVELETS (Mathematics) ,MACHINE learning ,HEAVY metal music - Abstract
Manual classification of millions of songs of the same or different genres is a challenging task for human beings. Therefore, there should be a machine intelligent model that can classify the genres of the songs very accurately. In this paper, a deep learning-based hybrid model is proposed for the analysis and classification of different music genre files. The proposed hybrid model mainly uses a combination of multimodal and transfer learning-based models for classification. This model is analyzed using GTZAN and Ballroom datasets. The GTZAN dataset contains 1000 music files classified with 10 different kinds of music genres such as Metal, Classical, Rock, Reggae, Pop, Disco, Blues, Country, Hip-Hop and Jazz, and the duration of each music file is 30 s. The Ballroom dataset contains 698 music files classified into 8 different kinds of music genres such as Tango, ChaChaCha, Rumba, Viennese waltz, Jlive, Waltz, Quickstep and Samba, and the duration of each music file is 30 s. The performance of the model is evaluated using the Python tool. The macro-average and weighted average are taken for computing the percentage of accuracy of each model. From the results, it is found that the proposed hybrid model is able to perform better as compared to other deep learning models such as the convolution neural network model, transfer learning-based model, multimodal model, machine learning models and other existing models in terms of training accuracy, validation accuracy, training loss, validation loss, precision, recall, F1-score and support. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
26. Arbitrary surface data patching method based on geometric convolutional neural network.
- Author
-
Fan, Linyuan, Ji, Dandan, and Lin, Peng
- Subjects
CONVOLUTIONAL neural networks ,MACHINE learning ,REVERSE engineering ,TECHNICAL specifications ,SURFACE reconstruction ,REPRODUCTION ,QUALITY function deployment - Abstract
In free-form surface reconstruction technology, the ultimate goal is to obtain a computer model of a free-form surface. This study discusses the inpainting method of arbitrary surface data based on geometric convolutional neural networks. Reverse engineering is a process of product design technology reproduction, that is, reverse analysis and research of a target product, to deduce and obtain design elements such as the processing flow, organizational structure, functional characteristics and technical specifications of the product to produce functional similar but not identical products. This paper realizes the noise point elimination of point cloud data by studying reconstruction technology of free-form surfaces using a geometric convolutional neural network model. Point cloud data without gross errors are provided for later reconstruction. High-precision arbitrary surface data are realized for surface reconstruction. The repair of arbitrary surface data is completed based on BP and RBF neural networks, and a certain degree of data supplementing for the incomplete point cloud data measured is realized. Finally, free-form surface reconstruction is realized. In reverse engineering, a flexible surface is converted into a rigid surface, and the data points are collected using contact measuring equipment. The influencing factors of reverse engineering are work efficiency, technical ability and the influencing factors of innovative design. The data points obtained in this study are processed by denoising, streamlining and filtering. The sorted data points are used to obtain an optimized mathematical model after curve and surface fitting and smoothing. The research in this study proves that the data points obtained using the reverse engineering method can be very well applied in subsequent work. Therefore, this data collection method can be applied to other instances. In the experiment, the maximum error between the output of the RBF network and the test data is 0.0086 when surface 2 is repaired. When the hybrid learning algorithm is trained with different sample sets, the average width is 0.24, and the number of iterations is 1000. The experimental results of repairing various surface defect data show that the data repairing method has good versatility, fast data repairing speed and high precision, so it has high practical value. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
27. Dummy trajectory generation scheme based on generative adversarial networks.
- Author
-
Yang, Jingkang, Yu, Xiaobo, Meng, Weizhi, and Liu, Yining
- Subjects
GENERATIVE adversarial networks ,FEATURE extraction ,MACHINE learning ,DEEP learning ,CONVOLUTIONAL neural networks - Abstract
Dummy trajectory is widely used to protect the privacy of mobile users' locations. However, two main challenges remain: (1) Map background information has not been modeled by machine learning methods in existing schemes, and (2) it is difficult to generate a good quality dummy trajectory that is similar to the real one. Focused on these two challenges, in this paper, we propose a dummy trajectory generation scheme with conditional generative adversary network (GAN), where the map features are extracted using convolutional neural network, which is regarded as a prior restriction of conditional GAN. Then, the movement pattern of the real trajectory is deduced by an auto-encoder and is involved in the dummy trajectory generation. Our model is trained and evaluated with two real-world datasets. Experimental results demonstrate that our scheme addresses these challenges well and defends against various attacks effectively. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
28. A novel approach for APT attack detection based on combined deep learning model.
- Author
-
Do Xuan, Cho and Dao, Mai Hoang
- Subjects
DEEP learning ,CONVOLUTIONAL neural networks ,MACHINE learning ,BEHAVIORAL assessment ,INTERNET protocol address ,IP networks - Abstract
Advanced persistent threat (APT) attack is a malicious attack type which has intentional and clear targets. This attack technique has become a challenge for information security systems of organizations, governments, and businesses. The approaches of using machine learning or deep learning algorithms to analyze signs and abnormal behaviors of network traffic for detecting and preventing APT attacks have become popular in recent years. However, the APT attack detection approach that uses behavior analysis and evaluation techniques is facing many difficulties due to the lack of typical data of attack campaigns. To handle this situation, recent studies have selected and extracted the APT attack behaviors which based on datasets are built from experimental tools. Consequently, these properties are few and difficult to obtain in practical monitoring systems. Therefore, although the experimental results show good detection, it does not bring high efficiency in practice. For above reasons, in this paper, a new method based on network traffic analysis using a combined deep learning model to detect APT attacks will be proposed. Specifically, individual deep learning networks such as multilayer perceptron (MLP), convolutional neural network (CNN), and long short-term memory (LSTM) will also be sought, built and linked into combined deep learning networks to analyze and detect signs of APT attacks in network traffic. To detect APT attack signals, the combined deep learning models are performed in two main stages including (i) extracting IP features based on flow: In this phase, we will analyze network traffic into networking flows by IP address and then use the combined deep learning models to extract IP features by network flow; (ii) classifying APT attack IPs: Based on IP features extracted in a task (i), the APT attack IPs and normal IPs will be identified and classified. The proposal of a combined deep learning model to detect APT attacks based on network traffic is a new approach, and there is no research proposed and applied yet. In the experimental section, combined deep learning models proved their superior abilities to ensure accuracy on all measurements from 93 to 98%. This is a very good result for APT attack detection based on network traffic. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
29. A dynamic discarding technique to increase speed and preserve accuracy for YOLOv3.
- Author
-
Martinez-Alpiste, Ignacio, Golcarenarenji, Gelayol, Wang, Qi, and Alcaraz-Calero, Jose Maria
- Subjects
SPEED ,EMERGENCY vehicles ,CONVOLUTIONAL neural networks ,MACHINE learning ,AUTONOMOUS vehicles - Abstract
This paper proposes an acceleration technique to minimise the unnecessary operations on a state-of-the-art machine learning model and thus to improve the processing speed while maintaining the accuracy. After the study of the main bottlenecks that negatively affect the performance of convolutional neural networks, this paper designs and implements a discarding technique for YOLOv3-based algorithms to increase the speed and maintain accuracy. After applying the discarding technique, YOLOv3 can achieve a 22% of improvement in terms of speed. Moreover, the results of this new discarding technique were tested on Tiny-YOLOv3 with three output layers on an autonomous vehicle for pedestrian detection and it achieved an improvement of 48.7% in speed. The dynamic discarding technique just needs one training process to create the model and thus execute the approach, which preserves accuracy. The improved detector based on the discarding technique is able to readily alert the operator of the autonomous vehicle to take the emergency brake of the vehicle in order to avoid collision and consequently save lives. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
30. Synergy between traditional classification and classification based on negative features in deep convolutional neural networks.
- Author
-
Milošević, Nemanja and Racković, Miloš
- Subjects
CONVOLUTIONAL neural networks ,MACHINE learning ,COMPUTER vision ,CLASSIFICATION - Abstract
In recent times, convolutional neural networks became an irreplaceable tool in many different machine learning applications, especially in image classification. On the other hand, new research about robustness and susceptibility of these models to different adversarial attacks has emerged. With the rise in usage and widespread adoption of these models, it is very important to make them suitable for critical applications. In our previous work, we experimented with a new type of learning applicable to all convolutional neural networks: classification based on missing (low-impact) features. In the case of partial inputs/image occlusion, we have shown that our new method creates models that are more robust and perform better when compared to traditional models of the same architecture. In this paper, we explore an interesting characteristic of our newly developed models in that while we see a general increase in validation accuracy, we also lose some important knowledge. We propose one solution to overcome this problem and validate our assumptions against CIFAR-10 image classification dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
31. A machine learning approach for cross-domain plant identification using herbarium specimens.
- Author
-
Chulif, Sophia, Lee, Sue Han, Chang, Yang Loong, and Chai, Kok Chin
- Subjects
BOTANICAL specimens ,PLANT identification ,COLLECTION & preservation of plant specimens ,MACHINE learning ,ENDANGERED species ,PLANT species ,DIGITAL image correlation ,IDENTIFICATION - Abstract
The preservation of plant specimens in herbaria has been carried out for centuries in efforts to study and confirm plant taxa. With the increasing collection of herbaria made available digitally, it is practical to use herbarium specimens for the automation of plant identification. They are also substantially more accessible and less expensive to obtain compared to field images. In fact, in remote and inaccessible habitats, field images of rare plant species are still immensely lacking. As a result, rare plant species identification is challenging due to the deficiency of training data. To address this problem, we investigate a cross-domain adaptation approach that allows knowledge transfer from a model learned from herbarium specimens to field images. We propose a model called Herbarium–Field Triplet Loss Network (HFTL network) to learn the mapping between herbarium and field domains. Specifically, the model is trained to maximize the embedding distance of different plant species and minimize the embedding distance of the same plant species given herbarium–field pairs. This paper presents the implementation and performance of the HFTL network to assess the herbarium–field similarity of plants. It corresponds to the cross-domain plant identification challenge in PlantCLEF 2020 and PlantCLEF 2021. Despite the lack of field images, our results show that the network can generalize and identify rare species. Our proposed HFTL network achieved a mean reciprocal rank score of 0.108 and 0.158 on the test set related to the species with few training field photographs in PlantCLEF 2020 and PlantCLEF 2021, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
32. EDense: a convolutional neural network with ELM-based dense connections.
- Author
-
Zhao, Xiangguo, Bi, Xin, Zeng, Xiangyu, Zhang, Yingchun, and Fang, Qiusheng
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,MACHINE learning ,OBJECT recognition (Computer vision) ,DEEP learning - Abstract
The explosive growth of geospatial data is increasing requirements for automatic and efficient data learning abilities. Many deep learning methods have been widely applied for geospatial data understanding tasks, such as road networks and geospatial object detection. However, the demands for more accurate learning of high-level features require the use of deeper neural networks. To further improve the learning efficiency of deep neural networks, in this paper, we propose an improved convolutional neural network named EDense. First, we use its dense connectivity to integrate a CNN with an extreme learning machine. Then, we expand the kernels in the convolutional layers to increase the width of the network model. Furthermore, we propose one-feature EDense (OF-EDense), which is a simplified version of EDense, to fit conditions in which the number of parameters is strictly limited. Finally, the experimental results fully demonstrate the strong learning ability and high learning efficiency of EDense. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
33. CNN-based architecture recognition and contour standardization based on aerial images.
- Author
-
Deng, Yi, Xie, Xiaodan, and Xing, Chengyue
- Subjects
IMAGE recognition (Computer vision) ,DEEP learning ,CONVOLUTIONAL neural networks ,STANDARDIZATION ,IMAGE segmentation ,BUILDING repair ,MACHINE learning - Abstract
The difference between the convolutional neural network and the ordinary neural network is that the convolutional neural network contains a feature extractor composed of a convolutional layer and a subsampling layer. With the development of society and economy, the pace of urbanization is accelerating, and the number and types of urban buildings are also growing rapidly. Digital management has put forward higher requirements for 3D reconstruction of urban buildings. Aiming at explanation the question that sharpness form are proetrate to be blea or bewildered in CNN-supported structure birth from lofty-separation airy conception, an optimise construction birth algorithmic rule is converse to increase the construction brink of proud-resoluteness atmospheric semblance and the twist projection. Remote sensing image target recognition, as the main research content in the current remote sensing image application field, has important theoretical significance and extensive application value. In recent years, deep learning has become an emerging research direction in the field of machine learning, and convolutional neural network is a deep learning model that has been widely studied and applied. More specifically, the construction brink is better by realm vary recursive filter out, and the better appearance is fed into the U-Net nerval netting for making. Afterward, in custom to plentifully take advantage of the sumptuous detail shape of buildings on supercilious sake picture, we tempt to en plot impair from the manage copy and pigeonhole supported on the origin U-Net edifice to increase the school data. These beauty spot can remarkably fortify the procurement of edifice hie viterbilt characteristic in eager and invert intense lore. Finally, construction essence is instrument by mechanical advantage the quotation intense characteristic. The trial effect of edifice extract from the Panjin City have demonstrated that for the hoagie-optimum pattern data with division of shade areas, the everywhere assortment propriety of buildings recognized by U-Net is above 80%, and the zenith everywhere assortment truth of the amended regularity extension 83%. In this paper, through the research on the application of convolutional neural network in the field of image segmentation, the problems of low segmentation accuracy, long time and high cost in the task of aerial image building image segmentation are solved to a certain extent. The detection and segmentation method of buildings in aerial images based on CNN can automatically detect and segment buildings, and can segment a large number of buildings in aerial images in batches. In scenarios with high segmentation efficiency requirements. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
34. WisdomNet: trustable machine learning toward error-free classification.
- Author
-
Tran, Truong X. and Aygun, Ramazan S.
- Subjects
MACHINE learning ,MULTILAYER perceptrons ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,ERROR rates - Abstract
Misclassification is a critical problem in many machine learning applications. Since even the classifier models with high accuracy (e.g., > 95%) still introduce some misclassification error, it may not be possible to rely on the output of a classifier. In this paper, we introduce trustable learning, which prompts the learning model to yield only the true output, thus avoiding misclassifications. Whenever the model cannot decide the output accurately, the learning model should indicate that there could be a misclassification error if it is forced to classify, and hence, it should reject to make a decision or defer it to a human expert. Therefore, we develop a methodology for trustable learning and apply it to artificial neural networks and show that it is possible to develop a classifier with 0% misclassification error. We propose a novel neural network architecture named WisdomNet that could provide zero prediction error by introducing an additional neuron named as conjugate neuron that would indicate whether the network is able to classify the data correctly or not. The WisdomNet architecture can be applied to any previously built model, and we have evaluated WisdomNet with several network architectures such as multilayer perceptron, convolutional neural network, and deep network on different data sets. The results show that the WisdomNet is able to reduce the classification error rate to 0%, while labeling the data is difficult to classify as 'reject' at a low percentage of within around 10%. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
35. Arabic handwriting recognition system using convolutional neural network.
- Author
-
Altwaijry, Najwa and Al-Turaiki, Isra
- Subjects
CONVOLUTIONAL neural networks ,HANDWRITING ,HANDWRITING recognition (Computer science) ,PATTERN recognition systems ,LATIN language - Abstract
Automatic handwriting recognition is an important component for many applications in various fields. It is a challenging problem that has received a lot of attention in the past three decades. Research has focused on the recognition of Latin languages' handwriting. Fewer studies have been done for the Arabic language. In this paper, we present a new dataset of Arabic letters written exclusively by children aged 7–12 which we call Hijja. Our dataset contains 47,434 characters written by 591 participants. In addition, we propose an automatic handwriting recognition model based on convolutional neural networks (CNN). We train our model on Hijja, as well as the Arabic Handwritten Character Dataset (AHCD) dataset. Results show that our model's performance is promising, achieving accuracies of 97% and 88% on the AHCD dataset and the Hijja dataset, respectively, outperforming other models in the literature. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
36. A fine-tuning deep learning with multi-objective-based feature selection approach for the classification of text.
- Author
-
Dhal, Pradip and Azad, Chandrashekhar
- Subjects
- *
DEEP learning , *FEATURE selection , *CONVOLUTIONAL neural networks , *METAHEURISTIC algorithms , *PARTICLE swarm optimization , *MACHINE learning - Abstract
Document classification is becoming increasingly essential for the vast number of documents available in digital libraries, emails, the Internet, etc. Textual records frequently contain non-discriminative (noisy and irrelevant) terms that are also high-dimensional, resulting in higher computing costs and poorer learning performance in Text Classification (TC). Feature selection (FS), which tries to discover discriminate terms or features from the textual data, is one of the most effective tasks for this issue. This paper introduces a novel multi-stage term-weighting scheme-based FS model designed for the single-label TC system to obtain the optimal set of features. We have also developed a hybrid deep learning fine-tuning network based on Bidirectional Long Short-Term Memory (BiLSTM) and Convolutional Neural Network (CNN) for the classification stage. The FS approach is worked on two-stage criteria. The filter model is used in the first stage, and the multi-objective wrapper model, an upgraded version of the Whale Optimization Algorithm (WOA) with Particle Swarm Optimization (PSO), is used in the second stage. The objective function in the above wrapper model is based on a tri-objective principle. It uses the Pareto front technique to discover the optimal set of features. Here in the wrapper model, a novel selection strategy has been introduced to select the whale instead of the random whale. The proposed work is evaluated on four popular benchmark text corpora, of which two are binary class, and two are multi-class. The suggested FS technique is compared against classic Machine Learning (ML) and deep learning classifiers. The results of the experiments reveal that the recommended FS technique is more effective in obtaining better results than the other results. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Optimizing anomaly-based attack detection using classification machine learning.
- Author
-
Gouda, Hany Abdelghany, Ahmed, Mohamed Abdelslam, and Roushdy, Mohamed Ismail
- Subjects
- *
DEEP learning , *MACHINE learning , *CONVOLUTIONAL neural networks , *K-nearest neighbor classification , *DIGITAL technology , *RANDOM forest algorithms - Abstract
One of the significant aspects of our digital world is that data are literally everywhere, and it is increasing. On the other hand, the number of cyberattacks aiming to seize this data and use it illegally is increasing at an exponential rate, and this is the challenge. Therefore, intrusion detection systems (IDS) have attracted considerable interest from researchers and industries. In this regard, machine learning (ML) techniques are playing a pivotal role as they put the responsibility of analyzing enormous amounts of data, finding patterns, classifying intrusions, and solving issues on computers instead of humans. This paper implements two separate classification layers of ML-based algorithms with the recently published NF-UQ-NIDS-v2 dataset, preprocessing two volumes of sample records (100 k and 10 million), utilizing MinMaxScaler, LabelEncoder, selecting superlative features by recursive feature elimination, normalizing the data, and optimizing hyper-parameters for classical algorithms and neural networks. With a small dataset volume, the results of the classical algorithms layer show high detection accuracy rates for support vector (98.26%), decision tree (98.78%), random forest (99.07%), K-nearest neighbors (98.16%), CatBoost (99.04%), and gradient boosting (98.80%). In addition, the layer of neural network algorithms has proven to be a very powerful technology when using deep learning, particularly due to its unique ability to effectively handle enormous amounts of data and detect hidden correlations and patterns; it showed high detection results, which were (98.87%) for long short-term memory and (98.56%) for convolutional neural networks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Effective training of convolutional neural networks for age estimation based on knowledge distillation.
- Author
-
Greco, Antonio, Saggese, Alessia, Vento, Mario, and Vigilante, Vincenzo
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,FACE ,MACHINE learning ,DEEP learning - Abstract
Age estimation from face images can be profitably employed in several applications, ranging from digital signage to social robotics, from business intelligence to access control. Only in recent years, the advent of deep learning allowed for the design of extremely accurate methods based on convolutional neural networks (CNNs) that achieve a remarkable performance in various face analysis tasks. However, these networks are not always applicable in real scenarios, due to both time and resource constraints that the most accurate approaches often do not meet. Moreover, in case of age estimation, there is the lack of a large and reliably annotated dataset for training deep neural networks. Within this context, we propose in this paper an effective training procedure of CNNs for age estimation based on knowledge distillation, able to allow smaller and simpler "student" models to be trained to match the predictions of a larger "teacher" model. We experimentally show that such student models are able to almost reach the performance of the teacher, obtaining high accuracy over the LFW+, LAP 2016 and Adience datasets, but being up to 15 times faster. Furthermore, we evaluate the performance of the student models in the presence of image corruptions, and we demonstrate that some of them are even more resilient to these corruptions than the teacher model. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
39. Daytime sea fog monitoring using multimodal self-supervised learning with band attention mechanism.
- Author
-
Li, Tao, Jin, Wei, Fu, Randi, and He, Caifen
- Subjects
CONVOLUTIONAL neural networks ,SUPERVISED learning ,MULTISPECTRAL imaging ,MACHINE learning ,FOG - Abstract
Sea fog is a dangerous weather phenomenon that seriously affects maritime traffic and other operations at sea. The conventional sea fog detection methods are not only difficult to make full advantage of the multispectral information of cloud images, but also deficient in the exploration of deep-level semantic information, leading to poor detection results. In this paper, we proposed a multimodal self-supervised convolutional neural network incorporating intra-modal band attention mechanism (MSCNN-IBAM) based on multispectral images of Himawari-8. MSCNN-IBAM uses independent branches to extract features from different modality cloud images and characterize the importance of each band through attention mechanisms. Simultaneously, multimodal self-supervised learning and supervised learning are effectively combined to optimize the model by constructing a two-tuple trainset. Experimental results show the accuracy, precision, recall, and F1 score of the proposed method as 97.72%, 95.84%, 96.54%, and 96.08%, respectively, which have the competitive performance and acceptable computational efficiency. And the additional analysis of sea fog cases shows that the proposed method is not only effective in identifying sea fog, but also has the ability to locate sea fog regions. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
40. A multi-scale channel-wise convolution-based multi-level heat stress assessment.
- Author
-
Nagpal, Chetna and Upadhyay, Prabhat Kumar
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,DEEP learning ,MACHINE learning ,FEATURE extraction ,LABORATORY rats ,LOSSY data compression - Abstract
In the previous works on heat stress detection, various state-of-the-art machine learning techniques had been used to detect stress patterns from electroencephalographic (EEG) signals. Since the handcrafted feature engineering-based approaches pose certain limitations and are sensitive to transform the nonlinearities of EEG, deep learning techniques have drawn attention in the domain of EEG-based applications. Moreover, existing approaches for stress detection consider the whole frequency band (delta to gamma) which conceals the redundant and lossy information that increased the false detection rate. Also, these approaches were implemented only to identify the stress in binary classes and confined their application to identifying the level of stress. Therefore, a multi-scale channel-wise convolution-based multi-level heat stress assessment has been proposed in this paper. The novel contributions of this work are (1) designing a multi-scale convolutional neural network (MS-CNN) for extracting precise information from individual frequency bands of EEG, (2) use of sparse connectivity to reduce the redundant information and increase the performance with less trainable parameters, and (3) multi-level heat stress assessment for precise identification of the severity of stress. The proposed approaches were evaluated on pre-recorded data of 10 rats in a simulated laboratory environment. The high accuracy of approximately 96–97% and 90–92% has been achieved for binary and multi-level stress detection. There is approximately a 6 to 50% reduction in the trainable parameters with a 2% improvement of accuracy achieved on adopting channel-wise convolution over MS-CNN and deep convolutional neural network (DCNN). This demonstrates the effectiveness of the adopted multiscale feature extraction and sparse connectivity to improve the performance with a less complex model. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
41. ARDIS: a Swedish historical handwritten digit dataset.
- Author
-
Kusetogullari, Huseyin, Yavariabdi, Amir, Cheddad, Abbas, Grahn, Håkan, and Hall, Johan
- Subjects
HANDWRITING recognition (Computer science) ,CONVOLUTIONAL neural networks ,DEEP learning ,MACHINE learning ,POSTAL service ,SCIENTIFIC community - Abstract
This paper introduces a new image-based handwritten historical digit dataset named Arkiv Digital Sweden (ARDIS). The images in ARDIS dataset are extracted from 15,000 Swedish church records which were written by different priests with various handwriting styles in the nineteenth and twentieth centuries. The constructed dataset consists of three single-digit datasets and one-digit string dataset. The digit string dataset includes 10,000 samples in red–green–blue color space, whereas the other datasets contain 7600 single-digit images in different color spaces. An extensive analysis of machine learning methods on several digit datasets is carried out. Additionally, correlation between ARDIS and existing digit datasets Modified National Institute of Standards and Technology (MNIST) and US Postal Service (USPS) is investigated. Experimental results show that machine learning algorithms, including deep learning methods, provide low recognition accuracy as they face difficulties when trained on existing datasets and tested on ARDIS dataset. Accordingly, convolutional neural network trained on MNIST and USPS and tested on ARDIS provide the highest accuracies 58.80 % and 35.44 % , respectively. Consequently, the results reveal that machine learning methods trained on existing datasets can have difficulties to recognize digits effectively on our dataset which proves that ARDIS dataset has unique characteristics. This dataset is publicly available for the research community to further advance handwritten digit recognition algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
42. Deep learning for high-impedance fault detection and classification: transformer-CNN.
- Author
-
Rai, Khushwant, Hojatpanah, Farnam, Ajaei, Firouz Badrkhani, Guerrero, Josep M., and Grolinger, Katarina
- Subjects
DEEP learning ,CONVOLUTIONAL neural networks ,CAPACITOR switching ,SIGNAL processing ,MACHINE learning - Abstract
High-impedance faults (HIFs) exhibit low current amplitude and highly diverse characteristics, which make them difficult to be detected by conventional overcurrent relays. Various machine learning (ML) techniques have been proposed to detect and classify HIFs; however, these approaches are not reliable in presence of diverse HIF and non-HIF conditions and, moreover, rely on resource-intensive signal processing techniques. Consequently, this paper proposes a novel HIF detection and classification approach based on a state-of-the-art deep learning model, the transformer network, stacked with the Convolutional neural network (CNN). While the transformer network learns the complex HIF pattern in the data, the CNN enhances the generalization to provide robustness against noise. A kurtosis analysis is employed to prevent false detection of non-fault disturbances (e.g., capacitor and load switching) and nonlinear loads as HIFs. The performance of the proposed HIF detection and classification approach is evaluated using the IEEE 13-node test feeder. The results demonstrate that the proposed protection method reliably detects and classifies HIFs, is robust against noise, and outperforms the state-of-the-art techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
43. Detection, localisation and tracking of pallets using machine learning techniques and 2D range data.
- Author
-
Mohamed, Ihab S., Capitanelli, Alessio, Mastrogiovanni, Fulvio, Rovetta, Stefano, and Zaccaria, Renato
- Subjects
CONVOLUTIONAL neural networks ,MACHINE learning ,KALMAN filtering ,ACQUISITION of data ,ARTIFICIAL satellite tracking - Abstract
The problem of autonomous transportation in industrial scenarios is receiving a renewed interest due to the way it can revolutionise internal logistics, especially in unstructured environments. This paper presents a novel architecture allowing a robot to detect, localise, and track (possibly multiple) pallets using machine learning techniques based on an on-board 2D laser rangefinder only. The architecture is composed of two main components: the first stage is a pallet detector employing a Faster Region-Based Convolutional Neural Network (Faster R-CNN) detector cascaded with a CNN-based classifier; the second stage is a Kalman filter for localising and tracking detected pallets, which we also use to defer commitment to a pallet detected in the first stage until sufficient confidence has been acquired via a sequential data acquisition process. For fine-tuning the CNNs, the architecture has been systematically evaluated using a real-world dataset containing 340 labelled 2D scans, which have been made freely available in an online repository. Detection performance has been assessed on the basis of the average accuracy over k-fold cross-validation, and it scored 99.58% in our tests. Concerning pallet localisation and tracking, experiments have been performed in a scenario where the robot is approaching the pallet to fork. Although data have been originally acquired by considering only one pallet as per specification of the use case we consider, artificial data have been generated as well to mimic the presence of multiple pallets in the robot workspace. Our experimental results confirm that the system is capable of identifying, localising and tracking pallets with a high success rate while being robust to false positives. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
44. Effective attention feature reconstruction loss for facial expression recognition in the wild.
- Author
-
Gong, Weijun, Fan, Yingying, and Qian, Yurong
- Subjects
FACIAL expression ,CONVOLUTIONAL neural networks ,MACHINE learning ,PROBLEM solving - Abstract
Facial expression recognition (FER) in the wild is very challenging due to occlusion, posture, illumination, and other uncontrolled factors. Learning discriminant features for FER using Convolutional Neural Networks is a momentous task for the significant class imbalance, wrong labels, inter-class similarities, and intra-class variations. The traditional method utilizes the Cross entropy loss function to optimize the convolutional network to obtain discriminative features for classification. However, this loss function cannot effectively solve the above problems in practice and cannot contribute to obtaining highly discriminant facial features for further analysis. Center loss improves the learning efficiency by reducing the intra-class distance of similar expressions, while the improvement of inter-class similarity, class imbalance, and generalization is insufficient. In this paper, we propose a lightweight Effective Attention Feature Reconstruction loss (EAFR loss), which can further optimize the feature space and enhance the discriminability of expression. The loss model is composed of the Focal Smoothing loss (FS loss) and the Aggregation-Separation loss (AS loss). Firstly, the FS loss can improve the poor recognition performance caused by imbalanced classes and prevent paranoid knowledge learning behaviors. Meanwhile, AS loss further accurately condenses the intra-class expression features and expands the inter-class distance, which is achieved by using progressive stage max-pooling channel and position attention mechanism and lightweight asymmetric autoencoder model for feature reconstruction. Finally, the EAFR loss joins the above two loss functions to more comprehensively solve the above typical problems for FER in the wild. We validate the proposed loss function on three most commonly used large-scale wild expression datasets (RAF-DB, FERPlus, and AffectNet), and the results show that our model achieves superior performance to several state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
45. Data portability for activities of daily living and fall detection in different environments using radar micro-doppler.
- Author
-
Shah, Syed Aziz, Tahir, Ahsen, Le Kernec, Julien, Zoha, Ahmed, and Fioranelli, Francesco
- Subjects
CONTINUOUS wave radar ,ACTIVITIES of daily living ,CONVOLUTIONAL neural networks ,OLDER people ,MACHINE learning ,FRAIL elderly ,OLDER men - Abstract
The health status of an older or vulnerable person can be determined by looking into the additive effects of aging as well as any associated diseases. This status can lead the person to a situation of 'unstable incapacity' for normal aging and is determined by the decrease in response to the environment and to specific pathologies with apparent decrease of independence in activities of daily living (ADL). In this paper, we use micro-Doppler images obtained using a frequency-modulated continuous wave radar (FMCW) operating at 5.8 GHz with 400 MHz bandwidth as the sensor to perform assessment of this health status. The core idea is to develop a generalized system where the data obtained for ADL can be portable across different environments and groups of subjects, and critical events such as falls in mature individuals can be detected. In this context, we have conducted comprehensive experimental campaigns at nine different locations including four laboratory environments and five elderly care homes. A total of 99 subjects participated in the experiments where 1453 micro-Doppler signatures were recorded for six activities. Different machine learning, deep learning algorithms and transfer learning technique were used to classify the ADL. The support vector machine (SVM), K-nearest neighbor (KNN) and convolutional neural network (CNN) provided adequate classification accuracies for particular scenarios; however, the autoencoder neural network outperformed the mentioned classifiers by providing classification accuracy of ~ 88%. The proposed system for fall detection in elderly people can be deployed in care centers and is application for any indoor settings with various age group of people. For future work, we would focus on monitoring multiple older adults, concurrently in indoor settings using continuous radar sensor data stream which is limitation of the present system. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
46. Incremental deep learning for reflectivity data recognition in stomatology.
- Author
-
Procházka, Aleš, Charvát, Jindřich, Vyšata, Oldřich, and Mandic, Danilo
- Subjects
MACHINE learning ,DEEP learning ,ARTIFICIAL intelligence ,ORAL medicine ,CONVOLUTIONAL neural networks ,SUPPORT vector machines ,HYPERSPECTRAL imaging systems - Abstract
The recognition of stomatological disorders and the classification of dental caries are important areas of biomedicine that can hugely benefit from machine learning tools for the construction of relevant mathematical models. This paper explores the possibility of using reflectivity data to distinguish between healthy tissues and caries by deep learning and multilayer convolutional neural networks. The experimental data set includes more than 700 observations recorded in the stomatology laboratory. For rigor, the results obtained from the deep learning systems are compared with those evaluated for selected sets of features estimated for each observation and classified by a decision tree, support vector machine (SVM), k-nearest neighbor, Bayesian methods, and two-layer neural networks. The classification accuracy obtained for the deep learning systems was 98.1% and 94.4% for data in the signal and spectral domains, respectively, in comparison with an accuracy of 97.2% and 87.2% evaluated by the SVM method. The proposed method conclusively demonstrates how the artificial intelligence and deep learning methodology can contribute to improved diagnosis of dental problem in stomatology. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
47. Image recognition algorithm based on artificial intelligence.
- Author
-
Chen, Hong, Geng, Liwei, Zhao, Hongdong, Zhao, Cuijie, and Liu, Aiyong
- Subjects
ARTIFICIAL intelligence ,CONVOLUTIONAL neural networks ,ALGORITHMS ,IMAGE recognition (Computer vision) ,RECURRENT neural networks - Abstract
Convolutional neural networks also encountered some problems in the development of image recognition. The most prominent problem is that it is costly and time-consuming to collect data sets and train models. Limited data sets will cause the trained models to overfit. This paper proposes two methods to reduce overfitting based on the residual neural network architecture. The first type of method proposes a method of cross-combining waivers, reducing the size of the convolution kernel, and reducing the number of convolution kernels. The fitting method uses cross-combination to make the accuracy of Kaggle cat and dog data on the validation data set reach 95.37% and 90.31% on 30 types of engineering practice verification data set. The second method is based on the finetune residual neural network. A method of recurrent finetune residual neural network is proposed to improve the accuracy of the model. The accuracy of the finetune residual neural network on the Kaggle cat and dog validation dataset is 99.37%, and the accuracy of the dataset is verified in 30 types of engineering practice. The accuracy is 99.30%. The residual neural network method achieves 99.68% accuracy in the Kaggle cat and dog validation dataset and 99.61% in the validation dataset for 30 types of engineering practice. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
48. CN-waterfall: a deep convolutional neural network for multimodal physiological affect detection.
- Author
-
Fouladgar, Nazanin, Alirezaie, Marjan, and Främling, Kary
- Subjects
CONVOLUTIONAL neural networks ,AFFECTIVE computing ,DEEP learning ,SIGNAL processing ,MACHINE learning - Abstract
Affective computing solutions, in the literature, mainly rely on machine learning methods designed to accurately detect human affective states. Nevertheless, many of the proposed methods are based on handcrafted features, requiring sufficient expert knowledge in the realm of signal processing. With the advent of deep learning methods, attention has turned toward reduced feature engineering and more end-to-end machine learning. However, most of the proposed models rely on late fusion in a multimodal context. Meanwhile, addressing interrelations between modalities for intermediate-level data representation has been largely neglected. In this paper, we propose a novel deep convolutional neural network, called CN-Waterfall, consisting of two modules: Base and General. While the Base module focuses on the low-level representation of data from each single modality, the General module provides further information, indicating relations between modalities in the intermediate- and high-level data representations. The latter module has been designed based on theoretically grounded concepts in the Explainable AI (XAI) domain, consisting of four different fusions. These fusions are mainly tailored to correlation- and non-correlation-based modalities. To validate our model, we conduct an exhaustive experiment on WESAD and MAHNOB-HCI, two publicly and academically available datasets in the context of multimodal affective computing. We demonstrate that our proposed model significantly improves the performance of physiological-based multimodal affect detection. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
49. Deep convolutional neural network for diabetes mellitus prediction.
- Author
-
Alex, Suja A., Nayahi, J. Jesu Vedha, Shine, H., and Gopirekha, Vaisshalli
- Subjects
CONVOLUTIONAL neural networks ,DIABETES ,OUTLIER detection ,MACHINE learning ,FORECASTING - Abstract
As a widely known disease diabetes mellitus makes the human body produce quite less hormone and also tend to cause increased glucose that results in abnormal metabolism of varied organs in the body like eyes, kidneys, etc. Diabetic analysis has attracted the research community to treat some missing values and class imbalance issues. The performance of diabetes mellitus classification by the usage of machine learning techniques is comparatively low. We suggest this paper on imbalanced dataset with missing values, an efficient prediction algorithm for diabetes mellitus classification using Deep 1D-Convolutional Neural Network values. The outlier detection is used for removing missing values first. Then, oversampling method (SMOTE) is used to reduce the influence of imbalance class on prediction performance. Finally, predictions are produced using a DCNN classifier and are evaluated using a selective set of evaluation indicators. Experiments on the Pima Indian diabetes dataset (PIDD) from UCI Repository (University of California at Irvine) have yielded positive results. Our proposed DCNN algorithm has been shown to be successful and superior. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
50. Deep learning for fake news detection on Twitter regarding the 2019 Hong Kong protests.
- Author
-
Zervopoulos, Alexandros, Alvanou, Aikaterini Georgia, Bezas, Konstantinos, Papamichail, Asterios, Maragoudakis, Manolis, and Kermanidis, Katia
- Subjects
ANTI-extradition bill protests, Hong Kong, China, 2019 ,DEEP learning ,FAKE news ,MACHINE learning ,ARTIFICIAL intelligence ,PUBLIC demonstrations - Abstract
The dissemination of fake news on social media platforms is an issue of considerable interest, as it can be used to misinform people or lead them astray, which is particularly concerning when it comes to political events. The recent event of Hong Kong protests triggered an outburst of fake news posts that were identified on Twitter, which were then promptly removed and compiled into datasets to promote research. These datasets focusing on linguistic content were used in previous work to classify between tweets spreading fake and real news using traditional machine learning algorithms (Zervopoulos et al., in: IFIP international conference on artificial intelligence applications and innovations, Springer, Berlin, 2020). In this paper, the experimentation process on the previously constructed dataset is extended using deep learning algorithms along with a diverse set of input features, ranging from raw text to handcrafted features. Experiments showed that the deep learning algorithms outperformed the traditional approaches, reaching scores as high as 99.3% F1 Score, with the multilingual state-of-the-art model XLM-RoBERTa outperforming other algorithms using raw untranslated text. The combination of both traditional and deep learning algorithms allows for increased performance through the latter, while also gaining insight regarding tweet structure from the interpretability of the former. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.