1,462 results
Search Results
2. Computer vision digitization of smartphone images of anesthesia paper health records from low-middle income countries.
- Author
-
Folks, Ryan D., Naik, Bhiken I., Brown, Donald E., and Durieux, Marcel E.
- Subjects
- *
MEDICAL records , *ARTIFICIAL neural networks , *COMPUTER vision , *DIASTOLIC blood pressure , *MEDICAL personnel , *DEEP learning , *SYSTOLIC blood pressure - Abstract
Background: In low-middle income countries, healthcare providers primarily use paper health records for capturing data. Paper health records are utilized predominately due to the prohibitive cost of acquisition and maintenance of automated data capture devices and electronic medical records. Data recorded on paper health records is not easily accessible in a digital format to healthcare providers. The lack of real time accessible digital data limits healthcare providers, researchers, and quality improvement champions to leverage data to improve patient outcomes. In this project, we demonstrate the novel use of computer vision software to digitize handwritten intraoperative data elements from smartphone photographs of paper anesthesia charts from the University Teaching Hospital of Kigali. We specifically report our approach to digitize checkbox data, symbol-denoted systolic and diastolic blood pressure, and physiological data. Methods: We implemented approaches for removing perspective distortions from smartphone photographs, removing shadows, and improving image readability through morphological operations. YOLOv8 models were used to deconstruct the anesthesia paper chart into specific data sections. Handwritten blood pressure symbols and physiological data were identified, and values were assigned using deep neural networks. Our work builds upon the contributions of previous research by improving upon their methods, updating the deep learning models to newer architectures, as well as consolidating them into a single piece of software. Results: The model for extracting the sections of the anesthesia paper chart achieved an average box precision of 0.99, an average box recall of 0.99, and an mAP0.5-95 of 0.97. Our software digitizes checkbox data with greater than 99% accuracy and digitizes blood pressure data with a mean average error of 1.0 and 1.36 mmHg for systolic and diastolic blood pressure respectively. Overall accuracy for physiological data which includes oxygen saturation, inspired oxygen concentration and end tidal carbon dioxide concentration was 85.2%. Conclusions: We demonstrate that under normal photography conditions we can digitize checkbox, blood pressure and physiological data to within human accuracy when provided legible handwriting. Our contributions provide improved access to digital data to healthcare practitioners in low-middle income countries. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Special Issue "Emerging AI+X-Based Sensor and Networking Technologies including Selected Papers from ICGHIT 2022–2023".
- Author
-
Kim, Byung-Seo, Afzal, Muhammad Khalil, and Ullah, Rehmat
- Subjects
- *
MULTICASTING (Computer networks) , *INFORMATION technology , *SENSOR networks , *ARTIFICIAL neural networks , *DEEP learning , *BEAM steering , *INTEGRATED circuit design , *COMPUTER network security - Abstract
This document is a summary of a special issue of the journal Sensors, titled "Emerging AI+X-Based Sensor and Networking Technologies including Selected Papers from ICGHIT 2022–2023." The special issue features selected papers from the 10th and 11th International Conferences on Green and Human Information Technology (ICGHITs), which were held in Korea and Thailand. The conferences focused on the theme of "Emerging Artificial Intelligent (AI)+X technology" and "Hyper Automation + Human AI" respectively. The selected papers cover various topics such as network security, routing protocols, signal detection, and clustering mechanisms, all incorporating AI-based methods. The issue also includes papers on topics like secure authentication, distance estimation in RFID systems, energy optimization in smart homes, blockchain technology, and radar signal detection. The authors emphasize the importance of both technology and humanity in advancing green and information technologies. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
4. Comment on papers using machine learning for significant wave height time series prediction: Complex models do not outperform auto-regression.
- Author
-
Jiang, Haoyu, Zhang, Yuan, Qian, Chengcheng, and Wang, Xuan
- Subjects
- *
ARTIFICIAL neural networks , *TIME series analysis , *PREDICTION models , *ARTIFICIAL intelligence , *MACHINE learning , *DECOMPOSITION method - Abstract
• Five Machine Learning (ML) models compared for wave height time series prediction. • Complex ML models do not outperform simple AR in wave height time series prediction. • Comment to related papers: signal decomposition in test set series is WRONG. Significant Wave Height (SWH) is crucial in many aspect of ocean engineering. The accurate prediction of SWH has therefore been of immense practical value. Recently, Artificial Intelligence (AI) time series prediction methods have been widely used for single-point short-term SWH time-series forecasting, resulting in many AI-based models claiming to achieve good results. However, the extent to which these complex AI models can outperform traditional methods has largely been overlooked. This study compared five different models - AutoRegressive (AR), eXtreme Gradient Boosting (XGB), Artificial Neural Network (ANN), Long Short-Term Memory (LSTM), and WaveNet - for their performance on SWH time series prediction at 16 buoy locations. Surprisingly, the results suggest that the differences of performance among different models are negligible, indicating that all these AI models have only "learned" the linear auto-regression from the data. Additionally, we noticed that many recent studies used signal decomposition method for such time series prediction, and most of them decomposed the test sets, which is WRONG. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Introduction to the virtual collection of papers on Artificial neural networks: applications in X‐ray photon science and crystallography.
- Author
-
Ekeberg, Tomas
- Subjects
- *
ARTIFICIAL neural networks , *DEEP learning , *CRYSTALLOGRAPHY , *ARTIFICIAL intelligence , *MACHINE learning , *PHOTONS - Abstract
Artificial intelligence is more present than ever, both in our society in general and in science. At the center of this development has been the concept of deep learning, the use of artificial neural networks that are many layers deep and can often reproduce human‐like behavior much better than other machine‐learning techniques. The articles in this collection are some recent examples of its application for X‐ray photon science and crystallography that have been published in Journal of Applied Crystallography. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Advanced Machine Learning and Deep Learning Approaches for Remote Sensing II.
- Author
-
Jeon, Gwanggil
- Subjects
REMOTE sensing ,MACHINE learning ,ARTIFICIAL neural networks ,DEEP learning ,ARTIFICIAL intelligence ,DISTANCE education - Abstract
This document is a summary of a special issue on advanced machine learning and deep learning techniques for remote sensing. The issue includes 16 research papers that cover a range of topics, including hyperspectral image classification, moving point target detection, radar echo extrapolation, and remote sensing object detection. Each paper introduces a novel approach or model and provides extensive testing and evaluation to demonstrate its effectiveness. The insights shared in this special issue are expected to contribute to future advancements in artificial intelligence-based remote sensing research. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
7. Special Issue: Artificial Intelligence Technology in Medical Image Analysis.
- Author
-
Szilágyi, László and Kovács, Levente
- Subjects
DEEP learning ,COMPUTER-assisted image analysis (Medicine) ,IMAGE analysis ,ARTIFICIAL intelligence ,DIAGNOSTIC imaging ,ARTIFICIAL neural networks - Abstract
This document is a summary of a special issue in the journal Applied Sciences titled "Artificial Intelligence Technology in Medical Image Analysis." The special issue explores the applications of artificial intelligence (AI) in medical imaging and its impact on diagnostic and therapeutic processes. The use of AI-powered tools in image interpretation has shown exceptional capabilities in detecting and diagnosing medical conditions from imaging data, particularly in radiology. AI also contributes to improving image quality, automating routine tasks, and streamlining healthcare workflows. However, challenges such as data privacy, ethics, and regulatory frameworks need to be addressed for responsible implementation. The special issue includes several research papers that present advancements in automated medical decision support, age estimation, quality assurance, orthotic insole recommendation, tumor identification, thalamus segmentation, medical image classification, hyperparameter optimization, lung disease classification, and thoracic cavity segmentation. These papers demonstrate the potential of AI in improving accuracy, efficiency, and personalized treatment in medical image analysis. The integration of AI into healthcare requires collaboration between AI researchers, healthcare professionals, and regulatory bodies to ensure responsible and effective deployment. The future of AI in medical image analysis holds promise for improved diagnostic accuracy, early disease detection, and personalized treatment strategies. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
8. Editorial for the Special Issue "Data Science and Big Data in Biology, Physical Science and Engineering".
- Author
-
Mahmoud, Mohammed
- Subjects
PHYSICAL sciences ,BIG data ,DEEP learning ,ARTIFICIAL neural networks ,DATA science ,MACHINE learning ,REINFORCEMENT learning - Abstract
This document is an editorial for a special issue of the journal "Technologies" focused on data science and big data in various fields such as biology, physical science, and engineering. The editorial highlights the importance of analyzing large amounts of data generated by digital technologies and the need for data scientists to use artificial intelligence and machine learning to extract valuable knowledge. The special issue includes 12 papers covering topics such as machine learning techniques for customer churn prediction, agile program management in the U.S. Navy, deep learning for cybersecurity in Industry 5.0, self-directed learning during the COVID-19 era, decision tree-based neural networks for data classification, data-driven governance in technology companies, and more. The papers explore different approaches, models, and tools in the context of data science and big data. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
9. An Experimental Analysis of Various Deep Learning Architectures for the Classification of Cognitive Stimuli based EEG Signals.
- Author
-
Sarkar, Prashant Srinivasan, Mary Kanaga, E. Grace, Bhuvaneshwari, M., Mathew, Joel, and Stephen, Caleb
- Subjects
DEEP learning ,RECURRENT neural networks ,ARTIFICIAL neural networks ,COMPUTER interfaces ,ELECTROENCEPHALOGRAPHY ,SIGNAL classification - Abstract
The human brain functions through electrical signals. By measuring these signals, one can monitor brain activity and gain insights into the brain function of the subject. An electroencephalogram (EEG) allows one to monitor brain activity by having the subject wear an array of sensors on their head. This process is frequently used to diagnose medical conditions such as epilepsy. In recent years, there have been efforts to use EEG signals in concert with deep learning to create a brain computer interface (BCI). Such a device would enable the wearer to communicate to a system via brain signals. While such a system would not be so advanced as to enable the translation of complex thoughts, it would enable a user to command a machine to perform a small number of functions. The objective of this paper was to develop and optimize recurrent neural network architectures for use with a brain computer interface. Using EEG data collected from subjects, a variety of neural network models were created to learn from the data. The models that were used were simple recurrent neural networks (RNN), long short-term memory (LSTM), and gated recurrent units (GRU). This paper proposes a novel approach to EEG signal classification, demonstrating the capabilities of recurrent networks which are seldom explored for this purpose. This study produced promising results for recurrent models, obtaining a 91% accuracy with the 4-layer LSTM architecture. This presents a solid foundation for the argument that LSTM and similar architectures are feasible for BCI applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
10. Privacy-Preserving Federated Deep Learning Diagnostic Method for Multi-Stage Diseases.
- Author
-
Jinbo Yang, Hai Huang, Lailai Yin, Jiaxing Qu, and Wanjuan Xie
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,INTEGRATED circuits ,DATA privacy ,ALGORITHMS ,NATURAL languages ,DEEP learning - Abstract
Diagnosing multi-stage diseases typically requires doctors to consider multiple data sources, including clinical symptoms, physical signs, biochemical test results, imaging findings, pathological examination data, and even genetic data. When applying machine learning modeling to predict and diagnose multi-stage diseases, several challenges need to be addressed. Firstly, the model needs to handle multimodal data, as the data used by doctors for diagnosis includes image data, natural language data, and structured data. Secondly, privacy of patients' data needs to be protected, as these data contain the most sensitive and private information. Lastly, considering the practicality of the model, the computational requirements should not be too high. To address these challenges, this paper proposes a privacy-preserving federated deep learning diagnostic method for multi-stage diseases. This method improves the forward and backward propagation processes of deep neural network modeling algorithms and introduces a homomorphic encryption step to design a federated modeling algorithm without the need for an arbiter. It also utilizes dedicated integrated circuits to implement the hardware Paillier algorithm, providing accelerated support for homomorphic encryption in modeling. Finally, this paper designs and conducts experiments to evaluate the proposed solution. The experimental results show that in privacy-preserving federated deep learning diagnostic modeling, the method in this paper achieves the same modeling performance as ordinary modeling without privacy protection, and has higher modeling speed compared to similar algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. STUDYING THE INFLUENCE OF ENGINE SPEED ON THE ENTIRE PROCESS OF SPAN-LOWERING OF THE HEAVY MECHANIZED BRIDGE.
- Author
-
Duong Van Le, Thang Duc Tran, Quyen Manh Dao, and Dat Van Chu
- Subjects
BRIDGE design & construction ,MILITARY bridges ,ARTIFICIAL intelligence ,ARTIFICIAL neural networks ,DEEP learning - Abstract
The paper presents a dynamic model of the TMM-3M heavy mechanized bridge during the span lowering stage. The model is constructed as a multi-body mechanical system, taking into account the elastic deformation of the cable, rear outriggers, front tires, and front suspension system. It is a mechanical model driven by a cable mechanism. Lagrangian equations of the second kind have been applied to establish a system of differential equations describing the oscillations of the mechanical system and serve as the basis for investigating the dynamics of the span-lowering process. The system of differential equations is solved using numerical methods based on MATLAB simulation software. The study has revealed laws of the displacement, velocity, and acceleration of components within the mechanical system, especially those related to the bridge span depending on the choice of the drive speed of the engine during lowering by operator. The research results show that the lowering time increases from 52 seconds to 104 seconds when the engine speed decreases from 1800 rpm to 900 rpm. The tension force on the cable is surveyed to confirm the safety conditions during the span-lowering process. The study also provides recommendations for selecting appropriate engine speeds to minimize span-lowering time while ensuring the safety conditions of the TMM-3M bridge during the span-lowering process. This research is an important part of a comprehensive study on the working process of the heavy mechanized bridge TMM-3M to make practical improvements, aiming to reduce deployment time, decrease the number of deployment crew members, and increase the automation capability of the equipment. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. EFFECT OF CONTACT BLAST LOADING ON THE PLASTIC DEFORMATION FORMING ABILITY OF LARGE STEEL PIPES.
- Author
-
Quang Duc Vu
- Subjects
STEEL pipe ,ARTIFICIAL intelligence ,ARTIFICIAL neural networks ,DEEP learning ,COMPUTER simulation - Abstract
Plastic deformation forming with metal pipe blanks by contact blast loading inside pipes is an interesting moldless forming technique, also a complex and error-prone process. Some advantages are very characteristic of this forming technique such as no cost of mold, tooling and low energy consumption, no complicated control equipment compared to other forming techniques such as casting, rolling, tube hydrostatic forming, bending - welding. Up to now, the calculation and design of this forming technique mainly use some existing reference empirical formulas, so the experimental results are only suitable in the range of small pipe diameters, and still there are significant deviations for larger pipe diameters. In order to increase the predictability and accuracy of forming process by contact blast loading inside large pipes, this paper presents a study on the influence of the mass of highly explosive material - TNT to the forming ability of large steel pipes from API-5LX-42 mild steel materials by modern 3D numerical simulation using Abaqus/Cae software. Four output criteria with maximum values are used to evaluate the efficiency of this forming process, includ- ing maximum diameter of the blast zone (Dmax ≤2*Do), Von Mises stress (Smax ≤UTS), Hoop plastic strain component (PE22 max ≤1), and Pipe wall thinning rate (€7-max ≤60%). The results of this research on the plastic deformation forming process using numerical simulation can be used for the next experimental step to evaluate the difference between simulation and experiment, as well as use this data in the calculation and design of pipe products with circular or square cross-sections to save both time and money of trial and error before application in actual manufacturing. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. Special Issue on "Process Monitoring and Fault Diagnosis".
- Author
-
Ji, Cheng and Sun, Wei
- Subjects
ARTIFICIAL neural networks ,REMAINING useful life ,CONVOLUTIONAL neural networks ,PATTERN recognition systems ,TRANSFORMER models ,DEEP learning ,STATISTICAL learning ,WATER pipelines - Abstract
This document is a summary of a special issue of the journal Processes titled "Process Monitoring and Fault Diagnosis." The issue explores the application of data analytic techniques to enhance stable operation and safety in chemical processes and related industries. The collection of research papers covers various topics, including process fault detection, bearing fault diagnosis, remaining useful life prediction, and more. The papers introduce cutting-edge methodologies and demonstrate their reliability through validation. The issue aims to foster communication and the development of advanced process monitoring techniques. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
14. Guest editorial: Special Topic on software for atomistic machine learning.
- Author
-
Rupp, Matthias, Küçükbenli, Emine, and Csányi, Gábor
- Subjects
- *
ARTIFICIAL neural networks , *OPEN source software , *KRIGING , *POTENTIAL energy surfaces , *PYTHON programming language , *DEEP learning - Abstract
The Journal of Chemical Physics has released a special issue focused on software for atomistic machine learning. This issue aims to address the lack of journals dedicated to publishing scientific software papers. The collection of papers in this issue provides insight into the tools and goals of software implementations in the field of atomistic machine learning. The articles cover a range of topics, including machine-learning interatomic potentials, sampling, dataset repositories, workflows, and auxiliary tooling and analysis. The article concludes by emphasizing the importance of software implementations in the field and encourages further submissions on relevant topics. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
15. Improving Sentiment Analysis With Neural Networks.
- Author
-
Sârbu, Annamaria, Romaniuc, Alexandru, and Gavrilaş, Anca
- Subjects
ARTIFICIAL neural networks ,SENTIMENT analysis ,DEEP learning ,NATURAL language processing ,MATHEMATICAL regularization - Abstract
This paper investigates the effectiveness of sentiment analysis (SA) methods, ranging from rule-based approaches to deep learning architectures, in analysing textual data. The study focuses on three Python libraries: TextBlob, VADER, and Flair, evaluating their accuracy on a public dataset of Twitter posts. Additionally, custom neural network architectures are developed to optimize sentiment classification. Results indicate that while rule-based libraries offer simplicity, deep learning-based libraries show promise for higher accuracy. The customized LSTM models, particularly LSTM2 with architectural adjustments and regularization techniques, demonstrate improved performance over baseline models with classification accuracy as high as 76.3%. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Maize Leaf Disease Recognition Based on Improved Convolutional Neural Network ShuffleNetV2.
- Author
-
Zhou, Hanmi, Su, Yumin, Chen, Jiageng, Li, Jichen, Ma, Linshuang, Liu, Xingyi, Lu, Sibo, and Wu, Qi
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,CORN diseases ,CORN ,PRECISION farming ,AGRICULTURAL development - Abstract
The occurrence of maize diseases is frequent but challenging to manage. Traditional identification methods have low accuracy and complex model structures with numerous parameters, making them difficult to implement on mobile devices. To address these challenges, this paper proposes a corn leaf disease recognition model SNMPF based on convolutional neural network ShuffleNetV2. In the down-sampling module of the ShuffleNet model, the max pooling layer replaces the deep convolutional layer to perform down-sampling. This improvement helps to extract key features from images, reduce the overfitting of the model, and improve the model's generalization ability. In addition, to enhance the model's ability to express features in complex backgrounds, the Sim AM attention mechanism was introduced. This mechanism enables the model to adaptively adjust focus and pay more attention to local discriminative features. The results on a maize disease image dataset demonstrate that the SNMPF model achieves a recognition accuracy of 98.40%, representing a 4.1 percentage point improvement over the original model, while its size is only 1.56 MB. Compared with existing convolutional neural network models such as EfficientNet, MobileViT, EfficientNetV2, RegNet, and DenseNet, this model offers higher accuracy and a more compact size. As a result, it can automatically detect and classify maize leaf diseases under natural field conditions, boasting high-precision recognition capabilities. Its accurate identification results provide scientific guidance for preventing corn leaf disease and promote the development of precision agriculture. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. A miRNA-disease association prediction model based on tree-path global feature extraction and fully connected artificial neural network with multi-head self-attention mechanism.
- Author
-
Biyu, Hou, Mengshan, Li, Yuxin, Hou, Ming, Zeng, Nan, Wang, and Lixin, Guan
- Subjects
ARTIFICIAL neural networks ,FEATURE extraction ,PREDICTION models ,DATA mining ,ASSOCIATION rule mining - Abstract
Background: MicroRNAs (miRNAs) emerge in various organisms, ranging from viruses to humans, and play crucial regulatory roles within cells, participating in a variety of biological processes. In numerous prediction methods for miRNA-disease associations, the issue of over-dependence on both similarity measurement data and the association matrix still hasn't been improved. In this paper, a miRNA-Disease association prediction model (called TP-MDA) based on tree path global feature extraction and fully connected artificial neural network (FANN) with multi-head self-attention mechanism is proposed. The TP-MDA model utilizes an association tree structure to represent the data relationships, multi-head self-attention mechanism for extracting feature vectors, and fully connected artificial neural network with 5-fold cross-validation for model training. Results: The experimental results indicate that the TP-MDA model outperforms the other comparative models, AUC is 0.9714. In the case studies of miRNAs associated with colorectal cancer and lung cancer, among the top 15 miRNAs predicted by the model, 12 in colorectal cancer and 15 in lung cancer were validated respectively, the accuracy is as high as 0.9227. Conclusions: The model proposed in this paper can accurately predict the miRNA-disease association, and can serve as a valuable reference for data mining and association prediction in the fields of life sciences, biology, and disease genetics, among others. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. A COMPARATIVE EXPLORATION OF ACTIVATION FUNCTIONS FOR IMAGE CLASSIFICATION IN CONVOLUTIONAL NEURAL NETWORKS.
- Author
-
MAKHDOOM, FAIZA and RAHMAN, JAMSHAID UL
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,ARTIFICIAL intelligence ,MACHINE learning ,DIGITAL image processing ,COMPUTER vision - Abstract
Activation functions play a crucial role in enabling neural networks to carry out tasks with increased flexibility by introducing non-linearity. The selection of appropriate activation functions becomes even more crucial, especially in the context of deeper networks where the objective is to learn more intricate patterns. Among various deep learning tools, Convolutional Neural Networks (CNNs) stand out for their exceptional ability to learn complex visual patterns. In practice, ReLu is commonly employed in convolutional layers of CNNs, yet other activation functions like Swish can demonstrate superior training performance while maintaining good testing accuracy on different datasets. This paper presents an optimally refined strategy for deep learning-based image classification tasks by incorporating CNNs with advanced activation functions and an adjustable setting of layers. A thorough analysis has been conducted to support the effectiveness of various activation functions when coupled with the favorable softmax loss, rendering them suitable for ensuring a stable training process. The results obtained on the CIFAR-10 dataset demonstrate the favorability and stability of the adopted strategy throughout the training process. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Early fire detection technology based on improved transformers in aircraft cargo compartments.
- Author
-
Hong-zhou Ai, Dong Han, Xin-zhi Wang, Quan-yi Liu, Yue Wang, Meng-yue Li, and Pei Zhu
- Subjects
FIRE detectors ,ELECTRIC transformers ,DEEP learning ,RECURRENT neural networks ,ARTIFICIAL neural networks - Abstract
The implementation of early and accurate detection of aircraft cargo compartment fire is of great significance to ensure flight safety. The current airborne fire detection technology mostly relies on single-parameter smoke detection using infrared light. This often results in a high false alarm rate in complex air transportation environments. The traditional deep learning model struggles to effectively address the issue of long-term dependency in multivariate fire information. This paper proposes a multi-technology collaborative fire detection method based on an improved transformers model. Dual-wavelength optical sensors, flue gas analyzers, and other equipment are used to carry out multi-technology collaborative detection methods and characterize various feature dimensions of fire to improve detection accuracy. The improved Transformer model which integrates the self-attention mechanism and position encoding mechanism is applied to the problem of long-time series modeling of fire information from a global perspective, which effectively solves the problem of gradient disappearance and gradient explosion in traditional RNN (recurrent neural network) and CNN (convolutional neural network). Two different multi-head self-attention mechanisms are used to classify and model multivariate fire information, respectively, which solves the problem of confusing time series modeling and classification modeling in dealing with multivariate classification tasks by a single attention mechanism. Finally, the output results of the two models are fused through the gate mechanism. The research results show that, compared with the traditional single-feature detection technology, the multi-technology collaborative fire detection method can better capture fire information. Compared with the traditional deep learning model, the multivariate fire prediction model constructed by the improved Transformer can better detect fires, and the accuracy rate is 0.995. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Intelligent Stroke Disease Prediction Model Using Deep Learning Approaches.
- Author
-
Gao, Chunhua, Wang, Hui, and Mezzapesa, Domenico Maria
- Subjects
STROKE diagnosis ,RISK assessment ,RANDOM forest algorithms ,PREDICTION models ,DATABASE management ,RESEARCH funding ,SYMPTOMS ,SUPPORT vector machines ,DEEP learning ,ARTIFICIAL neural networks ,STROKE ,COMPARATIVE studies ,MACHINE learning ,DECISION trees ,REGRESSION analysis ,ALGORITHMS ,DISEASE risk factors - Abstract
Stroke is a high morbidity and mortality disease that poses a serious threat to people's health. Early recognition of the various warning signs of stroke is necessary so that timely clinical intervention can help reduce the severity of stroke. Deep neural networks have powerful feature representation capabilities and can automatically learn discriminant features from large amounts of data. This paper uses a range of physiological characteristic parameters and collaborates with deep neural networks, such as the Wasserstein generative adversarial networks with gradient penalty and regression network, to construct a stroke prediction model. Firstly, to address the problem of imbalance between positive and negative samples in the stroke public data set, we performed positive sample data augmentation and utilized WGAN‐GP to generate stroke data with high fidelity and used it for the training of the prediction network model. Then, the relationship between observable physiological characteristic parameters and the predicted risk of suffering a stroke was modeled as a nonlinear mapping transformation, and a stroke prediction model based on a deep regression network was designed. Finally, the proposed method is compared with commonly used machine learning‐based classification algorithms such as decision tree, random forest, support vector machine, and artificial neural networks. The prediction results of the proposed method are optimal in the comprehensive measurement index F. Further ablation experiments also show that the designed prediction model has certain robustness and can effectively predict stroke diseases. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. Deep learning-based classification of anti-personnel mines and sub-gram metal content in mineralized soil (DL-MMD).
- Author
-
Minhas, Shahab Faiz, Shah, Maqsood Hussain, and Khaliq, Talal
- Subjects
METAL content of soils ,ARTIFICIAL neural networks ,SUPPORT vector machines ,K-nearest neighbor classification ,DEEP learning - Abstract
De-mining operations are of critical importance for humanitarian efforts and safety in conflict-affected regions. In this paper, we address the challenge of enhancing the accuracy and efficiency of mine detection systems. We present an innovative Deep Learning architecture tailored for pulse induction-based Metallic Mine Detectors (MMD), so called DL-MMD. Our methodology leverages deep neural networks to distinguish amongst nine distinct materials with an exceptional validation accuracy of 93.5%. This high level of precision enables us not only to differentiate between anti-personnel mines, without metal plates but also to detect minuscule 0.2-g vertical paper pins in both mineralized soil and non-mineralized environments. Moreover, through comparative analysis, we demonstrate a substantial 3% and 7% improvement (approx.) in accuracy performance compared to the traditional K-Nearest Neighbors and Support Vector Machine classifiers, respectively. The fusion of deep neural networks with the pulse induction-based MMD not only presents a cost-effective solution but also significantly expedites decision-making processes in de-mining operations, ultimately contributing to improved safety and effectiveness in these critical endeavors. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Editorial for the Special Issue "Machine Learning in Computer Vision and Image Sensing: Theory and Applications".
- Author
-
Chakraborty, Subrata and Pradhan, Biswajeet
- Subjects
COMPUTER vision ,MACHINE learning ,ARTIFICIAL neural networks ,DEEP learning ,CONVOLUTIONAL neural networks ,SIGNAL processing ,GAIT in humans - Abstract
This document is an editorial for a special issue titled "Machine Learning in Computer Vision and Image Sensing: Theory and Applications." The editorial highlights the diverse applications of machine learning (ML) models in various domains such as medical imaging, signal processing, remote sensing, and human activity detection. The special issue includes 11 papers that cover topics such as image segmentation, fluvial navigation, Alzheimer's disease classification, pneumothorax detection, lung cancer malignancy prediction, amniotic fluid volume detection, COVID-19 detection, and Parkinson's disease detection. The papers showcase the progress and potential of ML models in computer vision applications and provide valuable insights for future research. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
23. Research on electric vehicle charging load prediction method based on spectral clustering and deep learning network.
- Author
-
Fang Xin, Xie Yang, Wang Beibei, Xu Ruilin, Mei Fei, and Zheng Jianyong
- Subjects
DEEP learning ,ELECTRIC charge ,ELECTRIC vehicles ,ARTIFICIAL neural networks ,STATISTICAL sampling ,CONVOLUTIONAL neural networks - Abstract
With the increasing prominence of environmental and energy issues, electric vehicles (EVs) as representatives of clean energy vehicles have experienced rapid development in recent years, and the charging load has also exhibited statistical characteristics. Accurate prediction of EV charging load is crucial to improve grid load dispatch and intelligent level. However, current research on EV charging load prediction still faces challenges such as data reliability, complexity and variability of charging behavior, uncertainty, and lack of standardization methods. Therefore, this paper proposes an electric vehicle charging load prediction method based on spectral clustering and deep learning network (SC-CNNLSTM). Firstly, to address the insufficient amount of EV charging load data, this paper proposes to use Monte Carlo simulation to sample and simulate historical load data. Then, in order to identify the internal structure and patterns of charging load, the sampled and simulated dataset is clustered using spectral clustering, dividing the data into different clusters, where each cluster represents samples with similar charging load characteristics. Finally, based on the different sample features of each cluster, corresponding CNN-LSTM models are constructed and trained and predict using the respective data. By modifying the model parameters, the prediction accuracy of the model is improved. Through comparative experiments, the proposed method in this paper has significantly improved prediction accuracy compared to traditional prediction methods without clustering, thus validating the effectiveness and practicality of the method. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Deep neural network models for improving truck productivity prediction in openpit mines.
- Author
-
Ugurlu, Omer Faruk, Fan, Chengkai, Bei Jiang, and Liu, Wei Victor
- Abstract
Accurate prediction of truck productivity plays a pivotal role in improving the efficiency and profitability of openpit mining operations. This paper proposes a deep neural network (DNN) model to overcome the challenge of predicting truck productivity in openpit mines. The prediction model was built using eight variables and was optimized by considering different train-test split ratios, numbers of hidden layers and neurons, and activation functions. The proposed model's performance was evaluated using various metrics and compared with other commonly used machine learning algorithms. The results showed that the proposed model outperformed traditional machine learning models by achieving higher prediction accuracy. Moreover, a single-variable sensitivity analysis showed that haul distance is the most influential variable for predicting truck productivity. This study marks a pioneering effort in employing DNN to predict truck productivity in openpit mining, signifying a notable advancement in the field. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. An experimental study of acoustic bird repellents for reducing bird encroachment in pear orchards.
- Author
-
Qing Chen, Jingjing Xie, Qiang Yu, Can Liu, Wenqin Ding, Xiaogang Li, and Hongping Zhou
- Subjects
ARTIFICIAL neural networks ,AGRICULTURAL economics ,PEST control ,AGRICULTURAL development ,CROP yields ,DEEP learning ,COMPUTER vision - Abstract
Bird invasion will reduce the yield of high-value crops, which threatens the healthy development of agricultural economy. Sonic bird repellent has the advantages of large range, no time and geographical restrictions, and low cost, which has attracted people's attention in the field of agriculture. At present, there are few studies on the application of sonic bird repellents in pear orchards to minimize economic losses and prolong the adaptive capacity of birds. In this paper, a sound wave bird repellent system based on computer vision is designed, which combines deep learning target recognition technology to accurately identify birds and drive them away. The neural network model that can recognize birds is first trained and deployed to the server. Live video is captured by an installed webcam, and the sonic bird repellent is powered by an ESP-8266 relay switch. In a pear orchard, two experimental areas were divided into two experimental areas to test the designed sonic bird repellent device, and the number of bad fruits pecked by birds was used as an indicator to evaluate the bird repelling effect. The results showed that the pear pecked fruit rate was 6.03% in the pear orchard area that used the acoustic bird repeller based on computer recognition, 7.29% in the pear orchard area of the control group that used the acoustic bird repeller with continuous operation, and 13.07% in the pear orchard area that did not use any bird repellent device. While acoustic bird repellers based on computer vision can be more effective at repelling birds, they can be used in combination with methods such as fruit bags to reduce the economic damage caused by birds. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Influence of Temperature on Brushless Synchronous Machine Field Winding Interturn Fault Severity Estimation.
- Author
-
Pascual, Rubén, Rivero, Eduardo, Guerrero, José M., Mahtani, Kumar, and Platero, Carlos A.
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,SYNCHRONOUS generators ,ERROR rates ,WINDING machines ,STATORS - Abstract
There are numerous methods for detecting interturn faults (ITFs) in the field winding of synchronous machines (SMs). One effective approach is based on comparing theoretical and measured excitation currents. This method is unaffected by rotor temperature in static excitation SMs. However, this paper investigates the influence of rotor temperature in brushless synchronous machines (BSMs), where rotor temperature significantly impacts the exciter excitation current. Extensive experimental tests were conducted on a special BSM with measurable rotor temperature. Given the challenges of measuring rotor temperature in industrial machines, this paper explores the feasibility of using stator temperature in the exciter field current estimation model. The theoretical exciter field current is calculated using a deep neural network (DNN), which incorporates electrical brushless synchronous generator output values and stator temperature, and it is subsequently compared with the measured exciter field current. This method achieves an error rate below 0.5% under healthy conditions, demonstrating its potential for simple implementation in industrial BSMs for ITF detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Scene representation using a new two-branch neural network model.
- Author
-
Parseh, Mohammad Javad, Rahmanimanesh, Mohammad, Keshavarzi, Parviz, and Azimifar, Zohreh
- Subjects
ARTIFICIAL neural networks ,IMAGE representation ,DEEP learning ,COMPUTER vision ,RECOGNITION (Psychology) ,FEATURE extraction ,CONVOLUTIONAL neural networks - Abstract
Scene classification and recognition have always been one of the most challenging tasks of scene understanding due to the inherent ambiguity in visual scenes. The core of scene classification and recognition tasks is scene representation. Deep learning advances in computer vision, especially deep CNNs, have significantly improved scene representation in the last decade. Deep convolutional features extracted from deep CNNs provide discriminative representations of the images and are widely used in various computer vision tasks, such as scene classification. Deep convolutional features capture the appearance characteristics of the image and the spatial information about different image regions. Meanwhile, the semantic and context information obtained from high-level concepts about scene images, such as objects and their relationships, can significantly contribute to identifying scene images. Therefore, in this paper, we divide visual scenes into two categories, object-based and layout-based. Object-based scenes are scenes that have scene-specific objects and, based on those objects, can be described and identified. In contrast, the layout-based scenes do not have scene-specific objects and are described and identified based on the appearance and layout of the image. This paper proposes a new neural network model for representing and classifying visual scenes, which we call G-CNN (GNN-CNN). The proposed model includes two modules, feature extraction and feature fusion, and the feature extraction module composes of visual and semantic branches. The visual branch is responsible for extracting deep CNN features from the image, and the semantic branch is responsible for extracting semantic GNN features from the scene graph corresponding to the image. The feature fusion module is a novel two-stream neural network that fuses the CNN and GNN feature vectors to produce a comprehensive representation of the scene image. Finally, a fully-connected classifier classified the obtained comprehensive feature vector into one of the pre-defined categories. The proposed model has been evaluated on three benchmark scene datasets, UIUC Sports, MIT67, and SUN397, and obtained classification accuracy of 99.91%, 96.01%, and 85.32%, respectively. In addition, a new dataset named Scene40, which has been introduced in our previous paper, is also used for further evaluation of the proposed method. The comparison results based on classification accuracy criteria show that the proposed model can outperform the best previous methods on three benchmark scene datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Short-time photovoltaic output prediction method based on depthwise separable convolution Visual Geometry group-deep gate recurrent neural network.
- Author
-
Lei Zhang, Shuang Zhao, Guanchao Zhao, Lingyi Wang, Baolin Liu, Zhimin Na, Zhijian Liu, Zhongming Yu, Wei He, Mrzljak, Vedran, and Ling-Ling Li
- Subjects
RECURRENT neural networks ,SOLAR technology ,ARTIFICIAL neural networks ,ELECTRIC power engineering ,GENERATIVE adversarial networks ,PHOTOVOLTAIC power generation ,PHOTOVOLTAIC power systems - Abstract
In response to the issue of short-term fluctuations in photovoltaic (PV) output due to Cloud movement, this paper proposes a method for forecasting short-term PV output based on a Depthwise Separable Convolution Visual Geometry Group (DSCVGG) and a Deep Gate Recurrent Neural Network (DGN). Initially, a cloud motion prediction model is constructed using a DSCVGG, which achieves edge recognition and motion prediction of clouds by replacing the previous convolution layer of the pooling layer in VGG with a depthwise separable convolution. Subsequently, the output results of the DSCVGG network, along with historical PV output data, are introduced into a Deep Gate Recurrent Unit Network (DGN) to establish a PV output prediction model, thereby achieving precise prediction of PV output. Through experiments on actual data, the Mean Absolute Error (MAE) and Mean Squared Error (MSE) of our model are only 2.18% and 5.32 x 10
-5 , respectively, which validates the effectiveness, accuracy, and superiority of the proposed method. This provides new insights and methods for improving the stability of PV power generation. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
29. BFNet: a full-encoder skip connect way for medical image segmentation.
- Author
-
Siyu Zhan, Quan Yuan, Xin Lei, Rui Huang, Lu Guo, Ke Liu, and Rong Chen
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,COMPUTER-assisted image analysis (Medicine) ,DEEP learning ,IMAGE segmentation - Abstract
In recent years, semantic segmentation in deep learning has been widely applied in medical image segmentation, leading to the development of numerous models. Convolutional Neural Network (CNNs) have achieved milestone achievements in medical image analysis. Particularly, deep neural networks based on U-shaped architectures and skip connections have been extensively employed in various medical image tasks. U-Net is characterized by its encoder-decoder architecture and pioneering skip connections, along with multi-scale features, has served as a fundamental network architecture for many modifications. But U-Net cannot fully utilize all the information from the encoder layer in the decoder layer. U-Net++ connects mid parameters of different dimensions through nested and dense skip connections. However, it can only alleviate the disadvantage of not being able to fully utilize the encoder information and will greatly increase the model parameters. In this paper, a novel BFNet is proposed to utilize all feature maps from the encoder at every layer of the decoder and reconnects with the current layer of the encoder. This allows the decoder to better learn the positional information of segmentation targets and improves learning of boundary information and abstract semantics in the current layer of the encoder. Our proposed method has a significant improvement in accuracy with 1.4 percent. Besides enhancing accuracy, our proposed BFNet also reduces network parameters. All the advantages we proposed are demonstrated on our dataset. We also discuss how different loss functions influence this model and some possible improvements. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. FN-GNN: A Novel Graph Embedding Approach for Enhancing Graph Neural Networks in Network Intrusion Detection Systems.
- Author
-
Tran, Dinh-Hau and Park, Minho
- Subjects
ARTIFICIAL neural networks ,GRAPH neural networks ,RECURRENT neural networks ,CONVOLUTIONAL neural networks ,DEEP learning ,INTRUSION detection systems (Computer security) - Abstract
With the proliferation of the Internet, network complexities for both commercial and state organizations have significantly increased, leading to more sophisticated and harder-to-detect network attacks. This evolution poses substantial challenges for intrusion detection systems, threatening the cybersecurity of organizations and national infrastructure alike. Although numerous deep learning techniques such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), and graph neural networks (GNNs) have been applied to detect various network attacks, they face limitations due to the lack of standardized input data, affecting model accuracy and performance. This paper proposes a novel preprocessing method for flow data from network intrusion detection systems (NIDSs), enhancing the efficacy of a graph neural network model in malicious flow detection. Our approach initializes graph nodes with data derived from flow features and constructs graph edges through the analysis of IP relationships within the system. Additionally, we propose a new graph model based on the combination of the graph neural network (GCN) model and SAGEConv, a variant of the GraphSAGE model. The proposed model leverages the strengths while addressing the limitations encountered by the previous models. Evaluations on two IDS datasets, CICIDS-2017 and UNSW-NB15, demonstrate that our model outperforms existing methods, offering a significant advancement in the detection of network threats. This work not only addresses a critical gap in the standardization of input data for deep learning models in cybersecurity but also proposes a scalable solution for improving the intrusion detection accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. Assessment of reinforced concrete corrosion degree based on the quantum particle swarm optimised-generative adversarial network.
- Author
-
Xumei Lin, Shijie Yu, Peng Wang, and Shiyuan Wang
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,GENERATIVE adversarial networks ,PARTICLE swarm optimization ,DEEP learning ,REINFORCED concrete corrosion - Abstract
Reinforced concrete corrosion inspection methods based on deep learning have been widely used in the engineering field to monitor the service status of reinforced concrete. However, in engineering practice, it is difficult to obtain a large amount of reinforced concrete corrosion data of different types, which greatly hinders the improvement of the accuracy of neural network models in predicting corrosion conditions. The classic generative adversarial network (GAN) model gives poor model quality for datasets with small amounts of data and high concentration. This paper proposes an improved generative adversarial network approach to optimise reinforced concrete corrosion data. Firstly, a quantum particle swarm optimisation (QPSO) algorithm is used to improve the generative adversarial network. Then, existing corrosion characteristic data is used to train the improved generative adversarial network until the ideal equilibrium state is reached. Next, the feature data generated by the generator are fused with the original data and the fused data are input into several common machine learning models for training. Experimental results show that compared with other conventional results obtained by directly inputting corrosion data into a neural network model for training, the improved method makes full use of multi-source signal data and achieves better classification performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Node classifications with DjCaNE: Disjoint content and network embedding.
- Author
-
Fazaeli, Mohsen and Momtazi, Saeedeh
- Subjects
ARTIFICIAL neural networks ,GRAPH neural networks ,DEEP learning ,MACHINE learning ,COMPUTATIONAL complexity - Abstract
Machine learning approaches have become a crucial tool in graph analysis. Despite the accurate results of the existing approaches, most of them are not scalable enough to be used in real-world problems. Networks provide two different kinds of information, nodes contents and nodes relations (network structure). Training deep graph neural networks (GNN) over large-scale graphs is challenging due to the limitation of the message passing framework. Graph Convolutional Networks (GCN) work on all node neighbours at once. Furthermore, it is usual to transform node features with a deep neural network before the GC operation. Therefore, the deep transform operation may apply up to hundreds of times for each target node which is heavy computation and hard to batch. This paper presents an abstract framework with two embedding components, the first component embeds node relations, and the second one embeds node contents. The model makes predictions by aggregating these embeddings through a combination component. The presented approach limits the deep transform only to the target node and uses random walk-based embedding instead of the GC operator to reduce the cost. The main goal of the proposed approach is to provide a light framework for the task. To this aim, node relations are embedded based on node neighbourhood structure by a biased variant of the DeepWalk model, called GuidedWalk, and an autoencoder embeds node contents. The experimental results on three well-known datasets show the superiority of the proposed model compared to the state-of-the-art GraphSAGE and TADW models with less computational complexity. On the Citeseer, Cora, and PubMed datasets, the model has achieved 3.23%, 0.88%, and 7.63% improvement in Macro-F1 and 3.25%, 0.7%, and 6.34% improvement in Micro-F1, respectively. Although GNNs are state-of-the-art models, considering node content is their main advantage. This paper shows that even a simple integration of node content to available random walk-based methods improves their performance up to GCNs without increasing the complexity. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. Identification of cervical spine fracture using deep learning.
- Author
-
Gaikwad, D P, Sejal, Ahire, Bagade, Swarupa, Ghodekar, Netra, and Labade, Srushti
- Subjects
CERVICAL vertebrae ,VERTEBRAL fractures ,ARTIFICIAL neural networks ,SPINE ,DEEP learning ,MAGNETIC resonance imaging - Abstract
The first seven vertebrae of the spine are referred to as the cervical spine. X-rays, Computed Tomography scans, Magnetic Resonance Imaging, and physical examinations methods are used to identify spine fractures which are complex and non-interpretable. To overcome these drawbacks, automated systems are required. In this paper, the fifth version of You Only Look Once (YOLO) and deep neural network have combined to detect dislocations in vertebral columns. YOLO v5 is used to detect major and minute fractures of C1 to C7 vertebrae and deep neural network is used to classify normal and fractured vertebrate. Training dataset provided by ASNR and ASSR spine radiology specialists is used to train model. Experimental results show that the proposed model offers classification accuracy of 97.7% and 89% on training and validation respectively. The proposed system is able to show exact location of fracture in Cervical Spine and perform better than existing systems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Ensemble Deep Learning Technique for Detecting MRI Brain Tumor.
- Author
-
Jader, Rasool Fakhir, Kareem, Shahab Wahhab, Awla, Hoshang Qasim, and Ashraf, Imran
- Subjects
MACHINE learning ,ARTIFICIAL neural networks ,MAGNETIC resonance imaging ,DEEP learning ,EMPLOYEE handbooks - Abstract
The classification process of MRI (magnetic resonance imaging) is frequently used for making medical diagnoses for conditions including pituitary, glioma, meningioma, and no tumor. For this reason, determining the type of MRI and its quantity are significant and valuable measurements that reveal the brain's state of health. To segment and classify brain analysis, laboratory personnel employ manual examination via screen; this requires a lot of labour and time. On the other hand, the devices used by specialists are not practical or inexpensive for every doctor or institution. In recent years, a variety of computational algorithms for segmentation and classification have been developed with improved results to get around the issue. Artificial neural networks (ANNs) have the capability and promise to classify in this regard. The purpose of this paper is to create and put into practice a system for classifying different types of MRI images of brain tumor samples. As a result, this paper concentrated on the tasks of segmentation, feature extraction, classifier building, and classification into four categories using various machine learning algorithms. The authors used VGG‐16, ResNet‐50, and AlexNet models based on the transfer learning algorithm for three models to classify images as an ensemble model. As a result, MRI brain tumor segmentation is more precise because each spatial feature point can now refer to all other contextual data. In the specifics, our models outperformed every other published modern ensemble model in the official deep learning challenge without any postprocessing. The ensemble model achieved an accuracy of 99.16%, a sensitivity of 98.47%, a specificity of 98.57%, a precision of 98.74%, a recall of 98.49%, and an F1‐score of 98.18%. These results significantly surpass the accuracy of other methods such as Naive Bayes, decision tree classifier, random forest, and DNN models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. Generative Adversarial Networks for SAR Automatic Target Recognition and Classification Models Enhanced Explainability: Perspectives and Challenges.
- Author
-
Remusati, Héloïse, Le Caillec, Jean-Marc, Schneider, Jean-Yves, Petit-Frère, Jacques, and Merlet, Thomas
- Subjects
ARTIFICIAL neural networks ,GENERATIVE adversarial networks ,AUTOMATIC target recognition ,SYNTHETIC aperture radar ,DEEP learning - Abstract
Generative adversarial networks (or GANs) are a specific deep learning architecture often used for different usages, such as data generation or image-to-image translation. In recent years, this structure has gained increased popularity and has been used in different fields. One area of expertise currently in vogue is the use of GANs to produce synthetic aperture radar (SAR) data, and especially expand training datasets for SAR automatic target recognition (ATR). In effect, the complex SAR image formation makes these kind of data rich in information, leading to the use of deep networks in deep learning-based methods. Yet, deep networks also require sufficient data for training. However, contrary to optical images, we generally do not have a substantial number of available SAR images because of their acquisition and labelling cost; GANs are then an interesting tool. Concurrently, how to improve explainability for SAR ATR deep neural networks and how to make their reasoning more transparent have been increasingly explored as model opacity deteriorates trust of users. This paper aims at reviewing how GANs are used with SAR images, but also giving perspectives on how GANs could be used to improve interpretability and explainability of SAR classifiers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. Short-Term Photovoltaic Power Generation Based on MVMD Feature Extraction and Informer Model.
- Author
-
Xu, Ruilin, Zheng, Jianyong, Mei, Fei, Yang, Xie, Wu, Yue, and Zhang, Heng
- Subjects
ARTIFICIAL neural networks ,PHOTOVOLTAIC power generation ,DEEP learning ,POWER series ,STATISTICAL correlation - Abstract
Photovoltaic (PV) power fluctuates with weather changes, and traditional forecasting methods typically decompose the power itself to study its characteristics, ignoring the impact of multidimensional weather conditions on the power decomposition. Therefore, this paper proposes a short-term PV power generation method based on MVMD (multivariate variational mode decomposition) feature extraction and the Informer model. First, MIC correlation analysis is used to extract weather features most related to PV power. Next, to more comprehensively describe the relationship between PV power and environmental conditions, MVMD is used for time–frequency synchronous analysis of the PV power time series combined with the highest MIC correlation weather data, obtaining frequency-aligned multivariate intrinsic modes. These modes incorporate multidimensional weather factors into the data-decomposition-based forecasting method. Finally, to enhance the model's learning capability, the Informer neural network model is employed in the prediction phase. Based on the input PV IMF time series and associated weather mode components, the Informer prediction model is constructed for training and forecasting. The predicted results of different PV IMF modes are then superimposed to obtain the total PV power generation. Experiments show that this method improves PV power generation accuracy, with an MAPE value of 4.31%, demonstrating good robustness. In terms of computational efficiency, the Informer model's ability to handle long sequences with sparse attention mechanisms reduces training and prediction times by approximately 15%, making it faster than conventional deep learning models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Triple-0: Zero-shot denoising and dereverberation on an end-to-end frozen anechoic speech separation network.
- Author
-
Gul, Sania, Khan, Muhammad Salman, and Ur-Rehman, Ata
- Subjects
REVERBERATION time ,ARTIFICIAL neural networks ,SPEECH ,MACHINE learning ,SPEECH enhancement ,DEEP learning - Abstract
Speech enhancement is crucial both for human and machine listening applications. Over the last decade, the use of deep learning for speech enhancement has resulted in tremendous improvement over the classical signal processing and machine learning methods. However, training a deep neural network is not only time-consuming; it also requires extensive computational resources and a large training dataset. Transfer learning, i.e. using a pretrained network for a new task, comes to the rescue by reducing the amount of training time, computational resources, and the required dataset, but the network still needs to be fine-tuned for the new task. This paper presents a novel method of speech denoising and dereverberation (SD&D) on an end-to-end frozen binaural anechoic speech separation network. The frozen network requires neither any architectural change nor any fine-tuning for the new task, as is usually required for transfer learning. The interaural cues of a source placed inside noisy and echoic surroundings are given as input to this pretrained network to extract the target speech from noise and reverberation. Although the pretrained model used in this paper has never seen noisy reverberant conditions during its training, it performs satisfactorily for zero-shot testing (ZST) under these conditions. It is because the pretrained model used here has been trained on the direct-path interaural cues of an active source and so it can recognize them even in the presence of echoes and noise. ZST on the same dataset on which the pretrained network was trained (homo-corpus) for the unseen class of interference, has shown considerable improvement over the weighted prediction error (WPE) algorithm in terms of four objective speech quality and intelligibility metrics. Also, the proposed model offers similar performance provided by a deep learning SD&D algorithm for this dataset under varying conditions of noise and reverberations. Similarly, ZST on a different dataset has provided an improvement in intelligibility and almost equivalent quality as provided by the WPE algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Modified receiver architecture in software-defined radio for real-time modulation classification.
- Author
-
Le, Quoc Nam, Huynh, Tan Quoc, Ta, Hien Quang, Tan, Phuoc Vo, and Nguyen, Lap Luat
- Subjects
SOFTWARE radio ,ELECTRONIC modulation ,ARTIFICIAL neural networks ,DEEP learning ,SPECTRUM allocation ,TELECOMMUNICATION systems - Abstract
Automatic modulation classification (AMC) is an important process for future communication systems with prominent applications from spectrum management, and secure communication, to cognitive radio. The requirement for an efficient AMC classifier is due to its capability in blind modulation recognition, which is a difficult task in real scenarios where the limitations of traditional hardware and the complexity of channel impairments are involved. Therefore, this paper proposes a complete real-time AMC system based on software-defined radio and deep learning architecture. The system demodulation performance is verified through simulations and real channel impairment conditions to ensure reliability. With at most 6 times reduced number of parameters, two proposed models convolutional long short-term memory deep neural network and residual long short-term memory neural network also show a general improvement in classification accuracy compared with reference studies. The performance of these models at real-time AMC is tested with suitable processing time for practical applications. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. Multi-Directional Long-Term Recurrent Convolutional Network for Road Situation Recognition.
- Author
-
Dofitas Jr., Cyreneo, Gil, Joon-Min, and Byun, Yung-Cheol
- Subjects
ARTIFICIAL neural networks ,RECURRENT neural networks ,PEDESTRIANS ,ROAD safety measures ,CONVOLUTIONAL neural networks ,DEEP learning - Abstract
Understanding road conditions is essential for implementing effective road safety measures and driving solutions. Road situations encompass the day-to-day conditions of roads, including the presence of vehicles and pedestrians. Surveillance cameras strategically placed along streets have been instrumental in monitoring road situations and providing valuable information on pedestrians, moving vehicles, and objects within road environments. However, these video data and information are stored in large volumes, making analysis tedious and time-consuming. Deep learning models are increasingly utilized to monitor vehicles and identify and evaluate road and driving comfort situations. However, the current neural network model requires the recognition of situations using time-series video data. In this paper, we introduced a multi-directional detection model for road situations to uphold high accuracy. Deep learning methods often integrate long short-term memory (LSTM) into long-term recurrent network architectures. This approach effectively combines recurrent neural networks to capture temporal dependencies and convolutional neural networks (CNNs) to extract features from extensive video data. In our proposed method, we form a multi-directional long-term recurrent convolutional network approach with two groups equipped with CNN and two layers of LSTM. Additionally, we compare road situation recognition using convolutional neural networks, long short-term networks, and long-term recurrent convolutional networks. The paper presents a method for detecting and recognizing multi-directional road contexts using a modified LRCN. After balancing the dataset through data augmentation, the number of video files increased, resulting in our model achieving 91% accuracy, a significant improvement from the original dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. A Deep-Learning-Based Method for Spectrum Sensing with Multiple Feature Combination.
- Author
-
Zhang, Yixuan and Luo, Zhongqiang
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,DEEP learning ,COGNITIVE radio ,STATISTICAL learning ,RADIO networks - Abstract
Cognitive radio networks enable the detection and opportunistic access to an idle spectrum through spectrum-sensing technologies, thus providing services to secondary users. However, at a low signal-to-noise ratio (SNR), existing spectrum-sensing methods, such as energy statistics and cyclostationary detection, tend to fail or become overly complex, limiting their sensing accuracy in complex application scenarios. In recent years, the integration of deep learning with wireless communications has shown significant potential. Utilizing neural networks to learn the statistical characteristics of signals can effectively adapt to the changing communication environment. To enhance spectrum-sensing performance under low-SNR conditions, this paper proposes a deep-learning-based spectrum-sensing method that combines multiple signal features, including energy statistics, power spectrum, cyclostationarity, and I/Q components. The proposed method used these combined features to form a specific matrix, which was then efficiently learned and detected through the designed 'SenseNet' network. Experimental results showed that at an SNR of −20 dB, the SenseNet model achieved a 58.8% spectrum-sensing accuracy, which is a 3.3% improvement over the existing convolutional neural network model. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Data modeling analysis of GFRP tubular filled concrete column based on small sample deep meta learning method.
- Author
-
Deng, Tianyi, Xue, Chengqi, and Zhang, Gengpei
- Subjects
ARTIFICIAL neural networks ,CONCRETE columns ,DATA modeling ,DATA analysis ,COMPOSITE columns ,DEEP learning ,DATA augmentation - Abstract
The meta-learning method proposed in this paper addresses the issue of small-sample regression in the application of engineering data analysis, which is a highly promising direction for research. By integrating traditional regression models with optimization-based data augmentation from meta-learning, the proposed deep neural network demonstrates excellent performance in optimizing glass fiber reinforced plastic (GFRP) for wrapping concrete short columns. When compared with traditional regression models, such as Support Vector Regression (SVR), Gaussian Process Regression (GPR), and Radial Basis Function Neural Networks (RBFNN), the meta-learning method proposed here performs better in modeling small data samples. The success of this approach illustrates the potential of deep learning in dealing with limited amounts of data, offering new opportunities in the field of material data analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. A Comprehensive Analysis Of Diverse Image Processing Techniques In Agriculture.
- Author
-
Goyal, Punam and Gill, Jasmeen
- Subjects
AGRICULTURAL technology ,IMAGE processing ,IMAGE analysis ,TECHNOLOGICAL progress ,DEEP learning ,ARTIFICIAL neural networks ,TECHNOLOGICAL innovations ,PLANT diseases - Abstract
Agriculture plays a crucial role in fostering sustainable growth through the integration of various technological advancements such as image processing, artificial intelligence, deep learning, and the Internet of Things (IoT). The global population is increasing on a daily basis. The increasing demand within the agriculture industry has necessitated the collective enhancement of plant cultivation and field productivity. This paper emphasizes the significance of effectively managing the crop during its initial growth phase as well as during the harvesting era. Image processing and artificial neural networks are employed as distinct methodologies for detecting illnesses on leaves. When capturing images using drones, the resulting images undergo a process of segmentation and transformation, resulting in the identification of three distinct vectors that represent diseases. These vectors include colour, texture, and morphology. This paper reviews on various disease classification strategies that can be utilized for the detection of plant diseases. [ABSTRACT FROM AUTHOR]
- Published
- 2024
43. Hybrid-Gird: 遥感图像细粒度分类可解释方法.
- Author
-
朱, 凯雯, 尤, 亚楠, 曹, 婧宜, 孟, 钢, 乔, 媛媛, and 杨, 洁
- Subjects
ARTIFICIAL neural networks ,MATHEMATICAL models ,REMOTE sensing ,DEEP learning ,COOPERATIVE game theory - Abstract
Copyright of Journal of Remote Sensing is the property of Editorial Office of Journal of Remote Sensing & Science Publishing Co. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2024
- Full Text
- View/download PDF
44. A Review of Deep Learning-Based Vulnerability Detection Tools for Ethernet Smart Contracts.
- Author
-
Huaiguang Wu, Yibo Peng, Yaqiong He, and Jinlin Fan
- Subjects
BLOCKCHAINS ,ARTIFICIAL neural networks ,DEEP learning ,ETHERNET ,FEATURE extraction ,CONTRACTS - Abstract
In recent years, the number of smart contracts deployed on blockchain has exploded. However, the issue of vulnerability has caused incalculable losses. Due to the irreversible and immutability of smart contracts, vulnerability detection has become particularly important. With the popular use of neural network model, there has been a growing utilization of deep learning-based methods and tools for the identification of vulnerabilities within smart contracts. This paper commences by providing a succinct overview of prevalent categories of vulnerabilities found in smart contracts. Subsequently, it categorizes and presents an overview of contemporary deep learning-based tools developed for smart contract detection. These tools are categorized based on their open-source status, the data format and the type of feature extraction they employ. Then we conduct a comprehensive comparative analysis of these tools, selecting representative tools for experimental validation and comparing them with traditional tools in terms of detection coverage and accuracy. Finally, Based on the insights gained from the experimental results and the current state of research in the field of smart contract vulnerability detection tools, we suppose to provide a reference standard for developers of contract vulnerability detection tools.Meanwhile, forward-looking research directions are also proposed for deep learning-based smart contract vulnerability detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
45. Cross-Parallel Attention and Efficient Match Transformer for Aerial Tracking.
- Author
-
Deng, Anping, Han, Guangliang, Zhang, Zhongbo, Chen, Dianbing, Ma, Tianjiao, and Liu, Zhichao
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,TRACKING radar ,TRACKING algorithms ,DRONE aircraft ,ARTIFICIAL intelligence - Abstract
Visual object tracking is a key technology that is used in unmanned aerial vehicles (UAVs) to achieve autonomous navigation. In recent years, with the rapid development of deep learning, tracking algorithms based on Siamese neural networks have received widespread attention. However, because of complex and diverse tracking scenarios, as well as limited computational resources, most existing tracking algorithms struggle to ensure real-time stable operation while improving tracking performance. Therefore, studying efficient and fast-tracking frameworks, and enhancing the ability of algorithms to respond to complex scenarios has become crucial. Therefore, this paper proposes a cross-parallel attention and efficient match transformer for aerial tracking (SiamEMT). Firstly, we carefully designed the cross-parallel attention mechanism to encode global feature information and to achieve cross-dimensional interaction and feature correlation aggregation via parallel branches, highlighting feature saliency and reducing global redundancy information, as well as improving the tracking algorithm's ability to distinguish between targets and backgrounds. Meanwhile, we implemented an efficient match transformer to achieve feature matching. This network utilizes parallel, lightweight, multi-head attention mechanisms to pass template information to the search region features, better matching the global similarity between the template and search regions, and improving the algorithm's ability to perceive target location and feature information. Experiments on multiple drone public benchmark tests verified the accuracy and robustness of the proposed tracker in drone tracking scenarios. In addition, on the embedded artificial intelligence (AI) platform AGX Xavier, our algorithm achieved real-time tracking speed, indicating that our algorithm can be effectively applied to UAV tracking scenarios. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. A Comparison of Neural-Network-Based Intrusion Detection against Signature-Based Detection in IoT Networks.
- Author
-
Schrötter, Max, Niemann, Andreas, and Schnor, Bettina
- Subjects
ARTIFICIAL neural networks ,INTERNET of things ,MACHINE learning - Abstract
Over the last few years, a plethora of papers presenting machine-learning-based approaches for intrusion detection have been published. However, the majority of those papers do not compare their results with a proper baseline of a signature-based intrusion detection system, thus violating good machine learning practices. In order to evaluate the pros and cons of the machine-learning-based approach, we replicated a research study that uses a deep neural network model for intrusion detection. The results of our replicated research study expose several systematic problems with the used datasets and evaluation methods. In our experiments, a signature-based intrusion detection system with a minimal setup was able to outperform the tested model even under small traffic changes. Testing the replicated neural network on a new dataset recorded in the same environment with the same attacks using the same tools showed that the accuracy of the neural network dropped to 54%. Furthermore, the often-claimed advantage of being able to detect zero-day attacks could not be seen in our experiments. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Multimodal Deep Neural Networks for Digitized Document Classification.
- Author
-
Baimakhanova, Aigerim, Zhumadillayeva, Ainur, Mukhametzhanova, Bigul, Glazyrina, Natalya, Niyazova, Rozamgul, Zhunissov, Nurseit, and Sambetbayeva, Aizhan
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,DIGITIZATION ,DOCUMENT classification (Electronic documents) ,DIGITAL technology ,MACHINE learning - Abstract
As digital technologies have advanced more rapidly, the number of paper documents recently converted into a digital format has exponentially increased. To respond to the urgent need to categorize the growing number of digitized documents, the classification of digitized documents in real time has been identified as the primary goal of our study. A paper classification is the first stage in automating document control and efficient knowledge discovery with no or little human involvement. Artificial intelligence methods such as Deep Learning are now combined with segmentation to study and interpret those traits, which were not conceivable ten years ago. Deep learning aids in comprehending input patterns so that object classes may be predicted. The segmentation process divides the input image into separate segments for a more thorough image study. This study proposes a deep learning-enabled framework for automated document classification, which can be implemented in higher education. To further this goal, a dataset was developed that includes seven categories: Diplomas, Personal documents, Journal of Accounting of higher education diplomas, Service letters, Orders, Production orders, and Student orders. Subsequently, a deep learning model based on Conv2D layers is proposed for the document classification process. In the final part of this research, the proposed model is evaluated and compared with other machine-learning techniques. The results demonstrate that the proposed deep learning model shows high results in document categorization overtaking the other machine learning models by reaching 94.84%, 94.79%, 94.62%, 94.43%, 94.07% in accuracy, precision, recall, F-score, and AUC-ROC, respectively. The achieved results prove that the proposed deep model is acceptable to use in practice as an assistant to an office worker. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. Enhancing Deep Learning and Computer Image Analysis in Petrography through Artificial Self-Awareness Mechanisms.
- Author
-
Dell'Aversana, Paolo
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,IMAGE analysis ,SELF-consciousness (Awareness) ,IMAGE recognition (Computer vision) ,PETROLOGY - Abstract
In this paper, we discuss the implementation of artificial self-awareness mechanisms and self-reflection abilities in deep neural networks. While the current limitations of research prevent achieving cognitive capabilities on par with natural biological entities, the incorporation of basic self-awareness and self-reflection mechanisms in deep learning architectures offers substantial advantages in tackling specific problems across various scientific fields, including geosciences. In the first section, we outline the foundational architecture of our deep learning approach termed Self-Aware Learning (SAL). The subsequent part of the paper highlights the practical benefits of this machine learning methodology through synthetic tests and applications addressed to automatic classification and image analysis of real petrological data sets. We show how Self-Aware Learning allows enhanced accuracy, reduced overfitting problems, and improved performances compared to other existing methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. Deep Learning-based DSM Generation from Dual-Aspect SAR Data.
- Author
-
Recla, Michael and Schmitt, Michael
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,SYNTHETIC aperture radar ,DATA mining ,REMOTE sensing ,GEOMETRIC modeling - Abstract
Rapid mapping demands efficient methods for a fast extraction of information from satellite data while minimizing data requirements. This paper explores the potential of deep learning for the generation of high-resolution urban elevation data from Synthetic Aperture Radar (SAR) imagery. In order to mitigate occlusion effects caused by the side-looking nature of SAR remote sensing, two SAR images from opposing aspects are leveraged and processed in an end-to-end deep neural network. The presented approach is the first of its kind to implicitly handle the transition from the SAR-specific slant range geometry to a ground-based mapping geometry within the model architecture. Comparative experiments demonstrate the superiority of the dual-aspect fusion over single-image methods in terms of reconstruction quality and geolocation accuracy. Notably, the model exhibits robust performance across diverse acquisition modes and geometries, showcasing its generalizability and suitability for height mapping applications. The study's findings underscore the potential of deep learning-driven SAR techniques in generating high-quality urban surface models efficiently and economically. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. A Unique Identification-Oriented Black-Box Watermarking Scheme for Deep Classification Neural Networks.
- Author
-
Mo, Mouke, Wang, Chuntao, and Bian, Shan
- Subjects
ARTIFICIAL neural networks ,DIGITAL watermarking ,DISCRETE cosine transforms ,SINGULAR value decomposition ,DEEP learning ,IDENTIFICATION ,WATERMARKS - Abstract
Given the substantial value and considerable training costs associated with deep neural network models, the field of deep neural network model watermarking has come to the forefront. While black-box model watermarking has made commendable strides, the current methodology for constructing poisoned images in the existing literature is simplistic and susceptible to forgery. Notably, there is a scarcity of black-box model watermarking techniques capable of discerning a unique user in a multi-user model distribution setting. For this reason, this paper proposes a novel black-box model watermarking method for unique identity identification, which is denoted as the ID watermarking of neural networks (IDwNet). Specifically, to enhance the distinguishability of deep neural network models in multi-user scenarios and mitigate the likelihood of poisoned image counterfeiting, this study develops a discrete cosine transform (DCT) and singular value decomposition (SVD)-based symmetrical embedding method to form the poisoned image. As this ID embedding method leads to indistinguishable deep features, the study constructs a poisoned adversary training strategy by simultaneously inputting clean images, poisoned images with the correct ID, and poisoned adversary images with incorrect IDs to train a deep neural network. Extensive simulation experiments show that the proposed scheme achieves excellent invisibility for the concealed ID, surpassing remarkably the state-of-the-art. In addition, the proposed scheme obtains a validation success rate exceeding 99% for the poisoned images at the cost of a marginal classification accuracy reduction of less than 0.5%. Moreover, even though there is only a 1-bit discrepancy between IDs, the proposed scheme still results in an accurate validation of user copyright. These results indicate that the proposed scheme is promising. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.