83 results
Search Results
2. Guest Editorial: Special issue on computational methods and artificial intelligence applications in low‐carbon energy systems.
- Author
-
Wang, Yishen, Zhou, Fei, Guerrero, Josep M., Baker, Kyri, Chen, Yize, Wang, Hao, Xu, Bolun, Xu, Qianwen, Zhu, Hong, and Agwan, Utkarsha
- Subjects
ARTIFICIAL intelligence ,ARTIFICIAL neural networks ,MACHINE learning ,REINFORCEMENT learning ,DEEP reinforcement learning ,DEEP learning - Abstract
This document is a guest editorial for a special issue on computational methods and artificial intelligence applications in low-carbon energy systems. The editorial highlights the urgent need for advanced computing and artificial intelligence in the clean energy transition to improve system reliability, economics, and sustainability. The special issue includes 19 original research articles covering topics such as energy forecasting, situational awareness, multi-energy system dispatch, and power system operation. The articles present state-of-the-art methods and techniques in these areas, including wind power forecasting, demand-side flexibility, fault diagnosis of photovoltaic strings, and energy management strategies. The authors express their gratitude to the participating authors and anonymous reviewers for their contributions to the special section. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
3. Guest Editorial: Special issue on advances in representation learning for computer vision.
- Author
-
Teoh, Andrew Beng Jin, Song Ong, Thian, Lim, Kian Ming, and Lee, Chin Poo
- Subjects
COMPUTER vision ,DEEP learning ,ARTIFICIAL neural networks ,IMAGE representation ,CONVOLUTIONAL neural networks ,IMAGE recognition (Computer vision) ,DATA privacy - Abstract
This document is a guest editorial for a special issue of the CAAI Transactions on Intelligence Technology journal. The special issue focuses on advances in representation learning for computer vision. The editorial highlights the success of deep learning methods in deriving powerful representations from visual data, but also acknowledges the challenges of conducting representation learning with deep models, especially with large and noisy datasets. The document provides summaries of several research papers included in the special issue, covering topics such as cancellable biometrics, medical image analysis, watermarking for medical images, facial pattern description, multi-biometric strategies, semantic segmentation, image enhancement, image classification, and hyperspectral image super-resolution. The authors express their hope that these papers will enhance readers' understanding of current trends and guide future research in the field. The document also includes brief biographies of the authors. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
4. Slip Tendency Analysis From Sparse Stress and Satellite Data Using Physics‐Guided Deep Neural Networks.
- Author
-
Poulet, Thomas and Behnoudfar, Pouria
- Subjects
ARTIFICIAL neural networks ,GLOBAL Positioning System ,DEEP learning ,GEOLOGICAL statistics ,FEMORAL epiphysis ,CONTINENTAL drift ,DISPLACEMENT (Psychology) - Abstract
The significant risk associated with fault reactivation often necessitates slip tendency analyses for effective risk assessment. However, such analyses are challenging, particularly in large areas with limited or absent reliable stress measurements and where the cost of extensive geomechanical analyses or simulations is prohibitive. In this paper, we propose a novel approach using a physics‐informed neural network that integrates stress orientation and satellite displacement observations in a top‐down multi‐scale framework to estimate two‐dimensional slip tendency analyses even in regions lacking comprehensive stress data. Our study demonstrates that velocities derived from a continental scale analysis, combined with reliable stress orientation averages, can effectively guide models at smaller scales to generate qualitative slip tendency maps. By offering customizable data selection and stress resolution options, this method presents a robust solution to address data scarcity issues, as exemplified through a case study of the South Australian Eyre Peninsula. Plain Language Summary: Fault reactivation poses significant risks, often requiring slip tendency analyses for thorough risk assessment. Yet, such analyses face challenges, especially in large areas lacking reliable stress measurements or where extensive geomechanical analyses are too costly. Our paper suggests a new method using a physics‐based neural network. This approach combines compressive direction and satellite displacement observations to estimate slip tendencies in two dimensions, even where stress data is lacking. Our study shows that by using displacements from a continental scale analysis and reliable averages of compressive directions, we can guide models to create smaller‐scale maps indicating where faults are more likely to reactivate. This method allows for customizable data selection and stress resolution, offering a strong solution to data scarcity issues. We demonstrate its effectiveness through a case study of South Australia's Eyre Peninsula. Key Points: Physics‐based neural networks allow two‐dimensional slip tendency analyses without prior full‐stress informationA multi‐scale approach provides required displacement constraints when inferring full stresses from global navigation satellite system (GNSS) and stress orientation dataWe present a new application for GNSS data that would welcome more stations, even in seismically stable areas [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Mixed‐decomposed convolutional network: A lightweight yet efficient convolutional neural network for ocular disease recognition.
- Author
-
Zhang, Xiaoqing, Wu, Xiao, Xiao, Zunjie, Hu, Lingxi, Qiu, Zhongxi, Sun, Qingyang, Higashita, Risa, and Liu, Jiang
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,OPTICAL coherence tomography ,IMAGE recognition (Computer vision) ,ARTIFICIAL intelligence ,EYE tracking - Abstract
Eye health has become a global health concern and attracted broad attention. Over the years, researchers have proposed many state‐of‐the‐art convolutional neural networks (CNNs) to assist ophthalmologists in diagnosing ocular diseases efficiently and precisely. However, most existing methods were dedicated to constructing sophisticated CNNs, inevitably ignoring the trade‐off between performance and model complexity. To alleviate this paradox, this paper proposes a lightweight yet efficient network architecture, mixed‐decomposed convolutional network (MDNet), to recognise ocular diseases. In MDNet, we introduce a novel mixed‐decomposed depthwise convolution method, which takes advantage of depthwise convolution and depthwise dilated convolution operations to capture low‐resolution and high‐resolution patterns by using fewer computations and fewer parameters. We conduct extensive experiments on the clinical anterior segment optical coherence tomography (AS‐OCT), LAG, University of California San Diego, and CIFAR‐100 datasets. The results show our MDNet achieves a better trade‐off between the performance and model complexity than efficient CNNs including MobileNets and MixNets. Specifically, our MDNet outperforms MobileNets by 2.5% of accuracy by using 22% fewer parameters and 30% fewer computations on the AS‐OCT dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. KDGAN: Knowledge distillation‐based model copyright protection for secure and communication‐efficient model publishing.
- Author
-
Xie, Bingyi, Xu, Honghui, Seo, Daehee, Shin, DongMyung, and Cai, Zhipeng
- Subjects
ARTIFICIAL neural networks ,GENERATIVE adversarial networks ,COPYRIGHT ,INTELLECTUAL property ,NATURAL language processing ,DEEP learning - Abstract
Deep learning‐based models have become ubiquitous across a wide range of applications, including computer vision, natural language processing, and robotics. Despite their efficacy, one of the significant challenges associated with deep neural network (DNN) models is the potential risk of copyright leakage due to the inherent vulnerability of the entire model architecture and the communication burden of the large models during publishing. So far, it is still challenging for us to safeguard the intellectual property rights of these DNN models while reducing the communication time during model publishing. To this end, this paper introduces a novel approach using knowledge distillation techniques aimed at training a surrogate model to stand in for the original DNN model. To be specific, a knowledge distillation generative adversarial network (KDGAN) model is proposed to train a student model capable of achieving remarkable performance levels while simultaneously safeguarding the copyright integrity of the original large teacher model and improving communication efficiency during model publishing. Herein, comprehensive experiments are conducted to showcase the efficacy of model copyright protection, communication‐efficient model publishing, and the superiority of the proposed KDGAN model over other copyright protection mechanisms. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Comment on "Prediction of the SYM‐H Index Using a Bayesian Deep Learning Method With Uncertainty Quantification" by Abduallah et al. (2024).
- Author
-
Collado‐Villaverde, Armando, Muñoz, Pablo, and Cid, Consuelo
- Subjects
ARTIFICIAL neural networks ,GRAPH neural networks ,DEEP learning ,MAGNETIC storms ,PREDICTION models - Abstract
Abduallah et al. (2024b, https://doi.org/10.1029/2023sw003824) proposed a novel approach using a deep neural network model, which includes a graph neural network and a bidirectional LSTM layer, named SYMHnet, to forecast the SYM‐H index one and 2 hr in advance. Additionally, the network also provides an uncertainty quantification of the predictions. While the approach is innovative, there are some areas where the model's design and implementation may not align with best practices in uncertainty quantification and predictive modeling. We focus on discrepancies in the input and output of the model, which can limit the applicability in real‐world forecasting scenarios. This comment aims to clarify these issues, offering detailed insights into how such discrepancies could compromise the model's interpretability and reliability, thereby contributing to the advancement of predictive modeling in space weather research. Plain Language Summary: The use of Machine learning to predict geomagnetic storms is becoming a trend. A recent study by Abduallah et al. (2024b, https://doi.org/10.1029/2023sw003824) introduces a novel approach to forecast the SYM‐H index, which measures geomagnetic activity on a global scale, while also quantifying the uncertainty of these predictions. However, this commentary highlights methodological concerns with their approach, such as data selection issues and the reliability of uncertainty calculations. These factors could significantly affect the model's accuracy and applicability in real‐time forecasting scenarios. Key Points: Examination of the model input and output reveals oversights that could undermine the model's predictive accuracy and applicabilityThe process to estimate uncertainty has limited applicability for use in real timeThere are no coverage metrics regarding the uncertainty intervals [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. An infrared and visible image fusion network based on multi‐scale feature cascades and non‐local attention.
- Author
-
Xu, Jing, Liu, Zhenjin, and Fang, Ming
- Subjects
IMAGE fusion ,DEEP learning ,INFRARED imaging ,ARTIFICIAL neural networks ,FEATURE extraction ,IMAGE reconstruction - Abstract
In recent years, research on infrared and visible image fusion has mainly focused on deep learning‐based approaches, particularly deep neural networks with auto‐encoder architectures. However, these approaches suffer from problems such as insufficient feature extraction capability and inefficient fusion strategies. Therefore, this paper introduces a novel image fusion network to address the limitations of infrared and visible image fusion networks with auto‐encoder architectures. In the designed network, the encoder employs a multi‐branch cascade structure, and these convolution branches with different kernel sizes provide the encoder with an adaptive receptive field to extract multi‐scale features. In addition, the fusion layer incorporates a non‐local attention module that is inspired by the self‐attention mechanism. With its global receptive field, this module is used to build a non‐local attention fusion network, which works together with the l1${l}_1$‐norm spatial fusion strategy to extract, split, filter, and fuse global and local features. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state‐of‐the‐art fusion approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. A topic‐controllable keywords‐to‐text generator with knowledge base network.
- Author
-
He, Li, Shi, Kaize, Wang, Dingxian, Wang, Xianzhi, and Xu, Guandong
- Subjects
KNOWLEDGE base ,DEEP learning ,ARTIFICIAL neural networks ,NATURAL languages - Abstract
With the introduction of more recent deep learning models such as encoder‐decoder, text generation frameworks have gained a lot of popularity. In Natural Language Generation (NLG), controlling the information and style of the output produced is a crucial and challenging task. The purpose of this paper is to develop informative and controllable text using social media language by incorporating topic knowledge into a keyword‐to‐text framework. A novel Topic‐Controllable Key‐to‐Text (TC‐K2T) generator that focuses on the issues of ignoring unordered keywords and utilising subject‐controlled information from previous research is presented. TC‐K2T is built on the framework of conditional language encoders. In order to guide the model to produce an informative and controllable language, the generator first inputs unordered keywords and uses subjects to simulate prior human knowledge. Using an additional probability term, the model increases the likelihood of topic words appearing in the generated text to bias the overall distribution. The proposed TC‐K2T can produce more informative and controllable senescence, outperforming state‐of‐the‐art models, according to empirical research on automatic evaluation metrics and human annotations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
10. Scalable semantic 3D mapping of coral reefs with deep learning.
- Author
-
Sauder, Jonathan, Banc‐Prandi, Guilhem, Meibom, Anders, and Tuia, Devis
- Subjects
CORAL reefs & islands ,CORALS ,ARTIFICIAL neural networks ,DEEP-sea corals ,DEEP learning ,EFFECT of human beings on climate change - Abstract
Coral reefs are among the most diverse ecosystems on our planet, and essential to the livelihood of hundreds of millions of people who depend on them for food security, income from tourism and coastal protection. Unfortunately, most coral reefs are existentially threatened by global climate change and local anthropogenic pressures. To better understand the dynamics underlying deterioration of reefs, monitoring at high spatial and temporal resolution is key. However, conventional monitoring methods for quantifying coral cover and species abundance are limited in scale due to the extensive manual labor required. Although computer vision tools have been employed to aid in this process, in particular structure‐from‐motion (SfM) photogrammetry for 3D mapping and deep neural networks for image segmentation, analysis of the data products creates a bottleneck, effectively limiting their scalability.This paper presents a new paradigm for mapping underwater environments from ego‐motion video, unifying 3D mapping systems that use machine learning to adapt to challenging conditions under water, combined with a modern approach for semantic segmentation of images.The method is exemplified on coral reefs in the northern Gulf of Aqaba, Red Sea, demonstrating high‐precision 3D semantic mapping at unprecedented scale with significantly reduced required labor costs: given a trained model, a 100 m video transect acquired within 5 min of diving with a cheap consumer‐grade camera can be fully automatically transformed into a semantic point cloud within 5 min. We demonstrate the spatial accuracy of our method and the semantic segmentation performance (of at least 80% total accuracy), and publish a large dataset of ego‐motion videos from the northern Gulf of Aqaba, along with a dataset of video frames annotated for dense semantic segmentation of benthic classes.Our approach significantly scales up coral reef monitoring by taking a leap towards fully automatic analysis of video transects. The method advances coral reef transects by reducing the labor, equipment, logistics, and computing cost. This can help to inform conservation policies more efficiently. The underlying computational method of learning‐based Structure‐from‐Motion has broad implications for fast low‐cost mapping of underwater environments other than coral reefs. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Few‐shot segmentation framework for lung nodules via an optimized active contour model.
- Author
-
Yang, Lin, Shao, Dan, Huang, Zhenxing, Geng, Mengxiao, Zhang, Na, Chen, Long, Wang, Xi, Liang, Dong, Pang, Zhi‐Feng, and Hu, Zhanli
- Subjects
- *
ARTIFICIAL neural networks , *PULMONARY nodules , *NONSMOOTH optimization , *DEEP learning , *ACTIVE learning - Abstract
Background: Accurate segmentation of lung nodules is crucial for the early diagnosis and treatment of lung cancer in clinical practice. However, the similarity between lung nodules and surrounding tissues has made their segmentation a longstanding challenge. Purpose: Existing deep learning and active contour models each have their limitations. This paper aims to integrate the strengths of both approaches while mitigating their respective shortcomings. Methods: In this paper, we propose a few‐shot segmentation framework that combines a deep neural network with an active contour model. We introduce heat kernel convolutions and high‐order total variation into the active contour model and solve the challenging nonsmooth optimization problem using the alternating direction method of multipliers. Additionally, we use the presegmentation results obtained from training a deep neural network on a small sample set as the initial contours for our optimized active contour model, addressing the difficulty of manually setting the initial contours. Results: We compared our proposed method with state‐of‐the‐art methods for segmentation effectiveness using clinical computed tomography (CT) images acquired from two different hospitals and the publicly available LIDC dataset. The results demonstrate that our proposed method achieved outstanding segmentation performance according to both visual and quantitative indicators. Conclusion: Our approach utilizes the output of few‐shot network training as prior information, avoiding the need to select the initial contour in the active contour model. Additionally, it provides mathematical interpretability to the deep learning, reducing its dependency on the quantity of training samples. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Learning Deep Embedding with Acoustic and Phoneme Features for Speaker Recognition in FM Broadcasting.
- Author
-
Li, Xiao, Chen, Xiao, Fu, Rui, Hu, Xiao, Chen, Mintong, and Niu, Kun
- Subjects
FM broadcasting ,ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,DEEP learning ,PHONEME (Linguistics) - Abstract
Text-independent speaker verification (TI-SV) is a crucial task in speaker recognition, as it involves verifying an individual's claimed identity from speech of arbitrary content without any human intervention. The target for TI-SV is to design a discriminative network to learn deep speaker embedding for speaker idiosyncrasy. In this paper, we propose a deep speaker embedding learning approach of a hybrid deep neural network (DNN) for TI-SV in FM broadcasting. Not only acoustic features are utilized, but also phoneme features are introduced as prior knowledge to collectively learn deep speaker embedding. The hybrid DNN consists of a convolutional neural network architecture for generating acoustic features and a multilayer perceptron architecture for extracting phoneme features sequentially, which represent significant pronunciation attributes. The extracted acoustic and phoneme features are concatenated to form deep embedding descriptors for speaker identity. The hybrid DNN demonstrates not only the complementarity between acoustic and phoneme features but also the temporality of phoneme features in a sequence. Our experiments show that the hybrid DNN outperforms existing methods and delivers a remarkable performance in FM broadcasting TI-SV. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. Multi‐agent protection scheme for microgrid using deep learning.
- Author
-
Najar, Abolfazl, Kazemi Karegar, Hossein, and Esmaeilbeigi, Saman
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,FAULT location (Engineering) ,MICROGRIDS ,DISCRETE wavelet transforms ,PYTHON programming language - Abstract
Producing clean energy and feeding critical loads in islanding mode are the main reasons for interest in microgrids. Different operation topologies of microgrids make traditional protection schemes inefficient. This paper proposes a multi‐agent protection scheme in which each protection agent can detect different fault events and isolate faulty phases at a fast rate. A unique algorithm is utilized for determining fault location in microgrids and system operators are informed accordingly. Microgrids have various operation modes due to the stochastic behavior of distributed generators and different topologies. Here, a significant number of operating conditions of the studied microgrid are considered. These operation conditions are simulated in the DIgSILENT Power Factory, and different parameters are stored. Raw measured parameters need to be pre‐processed by a signal processing method in MATLAB. Discrete wavelet transform is chosen for this purpose. Deep learning is used as a machine learning technique due to the various operation modes of the microgrid. Deep neural networks are constructed using Python programming language. The proposed scheme ensures high accuracy in fault detection and fault location in the microgrid, as well as fault isolation in different operation conditions. We proposed a multi‐agent algorithm for fault detection in alternating current (AC) microgrids that can diagnose the type, phase, location, and impedance of faults. This scheme performs comprehensive case studies containing possible operation conditions such as lateral branches, high fault impedance, and different types of distributed generation (DG). The proposed scheme has higher efficiency in comparison with other state‐of‐the‐art. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. A systematic review on deep learning‐based automated cancer diagnosis models.
- Author
-
Tandon, Ritu, Agrawal, Shweta, Rathore, Narendra Pal Singh, Mishra, Abhinava K., and Jain, Sanjiv Kumar
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,CANCER diagnosis ,SIGNAL convolution ,DEEP learning ,CANCER patients - Abstract
Deep learning is gaining importance due to its wide range of applications. Many researchers have utilized deep learning (DL) models for the automated diagnosis of cancer patients. This paper provides a systematic review of DL models for automated diagnosis of cancer patients. Initially, various DL models for cancer diagnosis are presented. Five major categories of cancers such as breast, lung, liver, brain and cervical cancer are considered. As these categories of cancers have a very high percentage of occurrences with high mortality rate. The comparative analysis of different types of DL models is drawn for the diagnosis of cancer at early stages by considering the latest research articles from 2016 to 2022. After comprehensive comparative analysis, it is found that most of the researchers achieved appreciable accuracy with implementation of the convolutional neural network model. These utilized the pretrained models for automated diagnosis of cancer patients. Various shortcomings with the existing DL‐based automated cancer diagnosis models are also been presented. Finally, future directions are discussed to facilitate further research for automated diagnosis of cancer patients. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. Effective Deep Learning Seasonal Prediction Model for Summer Drought Over China.
- Author
-
Liu, Wenbo, Huang, Yanyan, and Wang, Huijun
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,DEEP learning ,PREDICTION models ,NATURAL disasters ,DROUGHTS ,SUMMER - Abstract
Drought is an important meteorological event in China and can cause severe damage to both livelihoods and socio‐ecological systems, but current seasonal prediction skill for drought is far from successful. This study used convolutional neural network (CNN) to build an effective seasonal forecast model for the summer consecutive dry days (CDD) over China. The principal components (PC) of the six leading empirical orthogonal function modes of CDD anomaly were predicted by CNN, using the previous winter precipitation, 2‐m temperature and 500 hPa geopotential height as predictors. These predicted PCs were then projected onto the observed spatial patterns to obtain the final predicted summer CDD anomaly over China. In the independent hindcast period of 2007–2018, the interannual variabilities of first six PCs were significantly predicted by CNN. The spatial patterns of CDD were then skillfully predicted with anomaly correlation coefficient skills generally higher than 0.2. The interannual variability of summer CDD over the middle and lower Yangtze River valley, northwestern China and northern China were significant predicted by our CNN model three months in advance. CNN identified that the previous winter precipitation was the important predictor for PC1–PC3, whereas the previous winter 2‐m temperature and 500 hPa geopotential height were important for the prediction of PC4–PC6. This research provides a new and effective method for the seasonal prediction of summer drought. Plain Language Summary: Drought can cause serious agricultural and ecosystem disasters, so its forecast is valuable for preventing and mitigating related natural disasters and regional socioeconomic sustainability. However, current prediction skill for drought is far from successful since its extreme feature. The gradually emerging deep learning methods offer new possibilities, but how to effectively apply deep learning models in climate prediction with a small sample size remains an open question. In this paper, we build seasonal prediction convolutional neural network model for summer consecutive dry days over China using previous winter predictors. This model achieves significant prediction skill three months in advance. The empirical orthogonal function decomposition is used to reduce the dimensionality of consecutive dry days data in our model. Our research provides a new perspective for drought prediction, and it is expected that such method will be also useful for other seasonal climate prediction problems. Key Points: Convolutional neural network (CNN) skillfully predicts summer consecutive dry days (CDD) over China three months in advanceThe principal components of CDD are predicted by CNN and then projected on the observed spatial patternsPrevious winter 2‐m temperature, geopotential height at 500 hPa and precipitation are the essential predictors in CNN [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Deep Learning Enables Rapid Identification of a New Quasicrystal from Multiphase Powder Diffraction Patterns.
- Author
-
Uryu, Hirotaka, Yamada, Tsunetomo, Kitahara, Koichi, Singh, Alok, Iwasaki, Yutaka, Kimura, Kaoru, Hiroki, Kanta, Miyao, Naoya, Ishikawa, Asuka, Tamura, Ryuji, Ohhashi, Satoshi, Liu, Chang, and Yoshida, Ryo
- Subjects
DEEP learning ,ARTIFICIAL neural networks ,DIFFRACTION patterns ,X-ray powder diffraction ,POWDERS ,QUASICRYSTALS - Abstract
Since the discovery of the quasicrystal, approximately 100 stable quasicrystals are identified. To date, the existence of quasicrystals is verified using transmission electron microscopy; however, this technique requires significantly more elaboration than rapid and automatic powder X‐ray diffraction. Therefore, to facilitate the search for novel quasicrystals, developing a rapid technique for phase‐identification from powder diffraction patterns is desirable. This paper reports the identification of a new Al–Si–Ru quasicrystal using deep learning technologies from multiphase powder patterns, from which it is difficult to discriminate the presence of quasicrystalline phases even for well‐trained human experts. Deep neural networks trained with artificially generated multiphase powder patterns determine the presence of quasicrystals with an accuracy >92% from actual powder patterns. Specifically, 440 powder patterns are screened using the trained classifier, from which the Al–Si–Ru quasicrystal is identified. This study demonstrates an excellent potential of deep learning to identify an unknown phase of a targeted structure from powder patterns even when existing in a multiphase sample. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Deep learning‐based channel estimation for OFDM‐IM systems over Rayleigh fading channels.
- Author
-
Adiguzel, Omer and Develi, Ibrahim
- Subjects
- *
ARTIFICIAL neural networks , *ORTHOGONAL frequency division multiplexing , *RAYLEIGH fading channels , *LEAST squares , *MINI-Mental State Examination - Abstract
Summary Deep learning (DL)‐based channel estimation for orthogonal frequency division multiplexing with index modulation (OFDM‐IM) under Rayleigh fading channel conditions is presented in this paper. A deep neural network (DNN) is utilized to estimate the channel response in simulations. The proposed DNN is trained using the channel coefficient derived through the least squares (LS) method. Then channel estimation is conducted using the trained DNN. Within the DNN, the long short‐term memory (LSTM) layer is included as the hidden layer. Different scenarios are handled in simulations and the proposed DNN is compared with traditional channel estimation methods. The simulations demonstrate that the DL‐based channel estimation significantly surpasses the LS and minimum mean‐square error (MMSE) techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Leveraging quantum‐inspired chimp optimization and deep neural networks for enhanced profit forecasting in financial accounting systems.
- Author
-
Zhang, Lin, Alsubai, Shtwai, Alqahtani, Abdullah, Alanazi, Abed, and Abualigah, Laith
- Subjects
- *
ARTIFICIAL neural networks , *METAHEURISTIC algorithms , *BUSINESS forecasting , *GREY Wolf Optimizer algorithm , *ACCOUNTING software , *OPTIMIZATION algorithms , *CHIMPANZEES - Abstract
Deep learning and metaheuristic algorithms have recently increased in various sciences, including financial accounting information systems (FAISs). However, the existence of large datasets has dramatically increased the complexity of these hybrid networks, so to address this shortcoming, this paper aims to develop a quantum‐behaved chimp optimization algorithm (QCHOA) and deep neural network (DNN) for the prediction of the profit based on FAISs. Considering that there is no suitable dataset for the challenge, a novel dataset is developed utilizing the 15 features from the Chinese market dataset to compare more. This work designs QCHOA and five DNN‐based predictors to forecast profit. These algorithms include the universal learning CHOA (ULCHOA), the niching CHOA (NCHOA) as the two best‐modified versions of CHOA, the quantum‐behaved whale optimization algorithm (QWOA), and the quantum‐behaved grey wolf optimizer (QGWO) as the two best quantum‐behaved optimizers as well as classic CHOA. The most effective deep learning‐based predictors for forecasting the profit, ranked from highest to lowest, are DNN‐QCHOA, DNN‐NCHOA, DNN‐QWOA, DNN‐QGWO, DNN‐ULCHOA, DNN‐CHOA, and classic DNN, with corresponding ranking scores of 42, 36, 30, 24, 18, 12, and 6. As a final suggestion for profit prediction, the DNN‐CHOA is shown to be the most accurate model. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Deep Learning‐Driven Robust Glucose Sensing and Fruit Brix Estimation Using a Single Microwave Split Ring Resonator.
- Author
-
Lee, Seokho, Kim, Kyungtae, Yang, Younghwan, Seong, Junhwa, Jung, Chunghwan, Lee, Hee‐Jo, and Rho, Junsuk
- Subjects
- *
CONVOLUTIONAL neural networks , *ARTIFICIAL neural networks , *POSITION sensors , *RANDOM measures , *RESONATORS , *DEEP learning - Abstract
Extracting the desired information from sensor data with various internal and external effects is a significant challenge in sensor applications. Difficult‐to‐control factors such as temperature, humidity, and sample position can significantly affect the stability and reliability of sensor data. In this paper, a deep learning‐based glucose sensing method that is robust to variations in sample position is proposed. It is shown that the variations in sample position affect the sensor data measured by the designed split ring resonator‐based microwave sensor. Then, artificial neural network and 1D convolutional neural network (CNN) models are evaluated for extracting glucose concentration information from the sensor data measured at random sample positions. The concentration of the glucose solution ranged from 1% to 23% (2% increments). The 1D CNN with all frequencies (0.5–18 GHz) of the and datasets outperformed the other model, with a mean absolute error (MAE) of 0.695% and a mean squared error (MSE) of 0.876 evaluated via cross‐validation. The study demonstrated that the sensor system can be applied in real life by performing fruit Brix estimation based on transfer learning of the previous 1D CNN network, and the MAE and MSE are 0.450% and 0.305, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Exploring an effective automated grading model with reliability detection for large‐scale online peer assessment.
- Author
-
Lin, Zirou, Yan, Hanbing, and Zhao, Li
- Subjects
- *
HIGH schools , *RESEARCH funding , *AFFINITY groups , *EDUCATIONAL outcomes , *HIGH school students , *EDUCATIONAL tests & measurements , *DESCRIPTIVE statistics , *TEACHERS , *MIDDLE school students , *DEEP learning , *ONLINE education , *ARTIFICIAL neural networks , *WEB development , *AUTOMATION , *COMPUTER assisted instruction , *SHORT-term memory , *MIDDLE schools , *COMPUTER assisted testing (Education) - Abstract
Background: Peer assessment has played an important role in large‐scale online learning, as it helps promote the effectiveness of learners' online learning. However, with the emergence of numerical grades and textual feedback generated by peers, it is necessary to detect the reliability of the large amount of peer assessment data, and then develop an effective automated grading model to analyse the data and predict learners' learning results. Objectives: The present study aimed to propose an automated grading model with reliability detection. Methods: A total of 109,327 instances of peer assessment from a large‐scale teacher online learning program were tested in the experiments. The reliability detection approach included three steps: recurrent convolutional neural networks (RCNN) was used to detect grade consistency, bidirectional encoder representations from transformers (BERT) was used to detect text originality, and long short‐term memory (LSTM) was used to detect grade‐text consistency. Furthermore, the automated grading was designed with the BERT‐RCNN model. Results and Conclusions: The effectiveness of the automated grading model with reliability detection was shown. For reliability detection, RCNN performed best in detecting grade consistency with an accuracy rate of 0.889, BERT performed best in detecting text originality with an improvement of 4.47% compared to the benchmark model, and LSTM performed best with an accuracy rate of 0.883. Moreover, the automated grading model with reliability detection achieved good performance, with an accuracy rate of 0.89. Compared to the absence of reliability detection, it increased by 12.1%. Implications: The results strongly suggest that the automated grading model with reliability detection for large‐scale peer assessment is effective, with the following implications: (1) The introduction of reliability detection is necessary to help filter out low reliability data in peer assessment, thus promoting effective automated grading results. (2) This solution could assist assessors in adjusting the exclusion threshold of peer assessment reliability, providing a controllable automated grading tool to reducing manual workload with high quality. (3) This solution could shift educational institutions from labour‐intensive grading procedures to a more efficient educational assessment pattern, allowing for more investment in supporting instructors and learners to improve the quality of peer feedback. Lay Description: What is already known about this topic: Peer assessment has played an important role in large‐scale online learning, as it helps promote the effectiveness of learners' online learning.Issues such as disagreement between peer assessors, rough assessment, and plagiarism in large‐scale online learning can decrease peer assessment reliabilityIncorporating extensive data into a training model may result in grading uncertainties. What this paper adds: Detecting the peer assessment reliability before grading is essential in the context of large‐scale online learning.This study aimed to propose and validate an automated grading model with reliability detection for the large‐scale online peer assessment, which will help improve the effectiveness of automated grading, combining the advantages of computer technology and human expertise. Implications for practice and/or policy: The introduction of reliability detection is necessary to help filter out low reliability data in peer assessment, thus promoting effective automated grading results.This solution could assist assessors in adjusting the exclusion threshold of peer assessment reliability, providing a controllable automated grading tool to reducing manual workload with high quality. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. A deep learning hierarchical approach to road traffic forecasting.
- Author
-
Benabdallah Benarmas, Redouane and Beghdad Bey, Kadda
- Subjects
TRAFFIC estimation ,ARTIFICIAL neural networks ,DEEP learning ,INTELLIGENT transportation systems ,TRAFFIC flow ,TIME series analysis - Abstract
Traffic forecasting is a crucial task of an Intelligent Transportation System (ITS), which remains very challenging as it is affected by the complexity and depth of the road network. Although the decision‐makers focus on the accuracy of the top‐level roads, the forecasts on the lower levels also improve the overall performance of ITS. In such a situation, a hierarchical forecasting strategy is more appropriate as well as a more accurate prediction methods to reach an efficient forecast. In this paper, we present a deep learning (DL) approach for hierarchical forecasting of traffic flow by exploring the hierarchical structure of the road network. The proposed approach is considered an improved variation on the top‐down strategy for the reconciliation process. We propose a model based on two deep neural network components to generate a coherent forecast for the total number of road segments. We use N‐BEATS, a pure deep learning forecasting method, at the highest levels for traffic time series, then disaggregate these downwards to get coherent forecasts for each series of the hierarchy using a combination of CNN and LSTM. Experiments were carried out using Beijing road traffic dataset to demonstrate the effectiveness of the approach. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Effective contact texture region aware pavement skid resistance prediction via convolutional neural network.
- Author
-
Shi, Weibo, Niu, Dongyu, Li, Zirui, and Niu, Yanhui
- Subjects
- *
CONVOLUTIONAL neural networks , *SKID resistance , *ARTIFICIAL neural networks , *PEARSON correlation (Statistics) , *FAST Fourier transforms , *PAVEMENTS , *DEEP learning , *ASPHALT pavements - Abstract
The surface texture of asphalt pavement has a significant effect on skid resistance performance. However, its contribution to the performance of skid resistance is non‐homogeneous and subjects to local validity. There are also a few deep learning models that take into account the effective contact texture region. This paper proposes a convolutional neural network model based on the effective contact texture region, containing macro‐ and micro‐scale awareness sub‐modules. In this study, the asphalt mixture with varying gradations was designed to accurately obtain the effective contact texture region. Then, the textures were disentangled into macro‐ and micro‐texture scales by applying the fast Fourier transform and fed into the model for training. Finally, the area of effective contact texture region was calculated, and the effective contact ratio parameter was then proposed using the triangulation algorithm. The results showed that the effective contact texture area of pavement varies by the asphalt mixture type. The effective contact ratio parameter exhibited a significant positive correlation (Pearson correlation coefficient is 0.901, R2= 0.8129) with skid resistance performance and was also influenced by key sieve aggregate content from 2.36 to 4.75 mm. The data of effective contact texture region following disentanglement significantly released the model performance (the relative error dropped to 1.81%). The model exhibited improved precision and performance, which can be utilized as an efficient, non‐contact alternative method for skid resistance analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Functional data analysis using deep neural networks.
- Author
-
Wang, Shuoyang, Zhang, Wanyu, Cao, Guanqun, and Huang, Yuan
- Subjects
- *
ARTIFICIAL neural networks , *FUNCTIONAL analysis , *DATA science , *STOCHASTIC processes , *DATA analysis - Abstract
Functional data analysis is an evolving field focused on analyzing data that reveals insights into curves, surfaces, or entities within a continuous domain. This type of data is typically distinguished by the inherent dependence and smoothness observed within each data curve. Traditional functional data analysis approaches have predominantly relied on linear models, which, while foundational, often fall short in capturing the intricate, nonlinear relationships within the data. This paper seeks to bridge this gap by reviewing the integration of deep neural networks into functional data analysis. Deep neural networks present a transformative approach to navigating these complexities, excelling particularly in high‐dimensional spaces and demonstrating unparalleled flexibility in managing diverse data constructs. This review aims to advance functional data regression, classification, and representation by integrating deep neural networks with functional data analysis, fostering a harmonious and synergistic union between these two fields. The remarkable ability of deep neural networks to adeptly navigate the intricate functional data highlights a wealth of opportunities for ongoing exploration and research across various interdisciplinary areas. This article is categorized under:Data: Types and Structure > Time Series, Stochastic Processes, and Functional DataStatistical Learning and Exploratory Methods of the Data Sciences > Deep LearningStatistical Learning and Exploratory Methods of the Data Sciences > Neural Networks [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Unsupervised motion artifact correction of turbo spin‐echo MRI using deep image prior.
- Author
-
Lee, Jongyeon, Seo, Hyunseok, Lee, Wonil, and Park, HyunWook
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,MAGNETIC resonance imaging - Abstract
Purpose: In MRI, motion artifacts can significantly degrade image quality. Motion artifact correction methods using deep neural networks usually required extensive training on large datasets, making them time‐consuming and resource‐intensive. In this paper, an unsupervised deep learning‐based motion artifact correction method for turbo‐spin echo MRI is proposed using the deep image prior framework. Theory and Methods: The proposed approach takes advantage of the high impedance to motion artifacts offered by the neural network parameterization to remove motion artifacts in MR images. The framework consists of parameterization of MR image, automatic spatial transformation, and motion simulation model. The proposed method synthesizes motion‐corrupted images from the motion‐corrected images generated by the convolutional neural network, where an optimization process minimizes the objective function between the synthesized images and the acquired images. Results: In the simulation study of 280 slices from 14 subjects, the proposed method showed a significant increase in the averaged structural similarity index measure by 0.2737 in individual coil images and by 0.4550 in the root‐sum‐of‐square images. In addition, the ablation study demonstrated the effectiveness of each proposed component in correcting motion artifacts compared to the corrected images produced by the baseline method. The experiments on real motion dataset has shown its clinical potential. Conclusion: The proposed method exhibited significant quantitative and qualitative improvements in correcting rigid and in‐plane motion artifacts in MR images acquired using turbo spin‐echo sequence. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. Construction personnel dress code detection based on YOLO framework.
- Author
-
Lyu, Yunkai, Yang, Xiaobing, Guan, Ai, Wang, Jingwen, and Dai, Leni
- Subjects
DRESS codes ,ARTIFICIAL neural networks ,SAFETY hats ,SPINE - Abstract
It is important for construction personnel to observe the dress code, such as the correct wearing of safety helmets and reflective vests is conducive to protecting the workers' lives and safety of construction. A YOLO network‐based detection algorithm is proposed for the construction personnel dress code (YOLO‐CPDC). Firstly, Multi‐Head Self‐Attention (MHSA) is introduced into the backbone network to build a hybrid backbone, called Convolution MHSA Network (CMNet). The CMNet gives the model a global field of view and enhances the detection capability of the model for small and obscured targets. Secondly, an efficient and lightweight convolution module is designed. It is named Ghost Shuffle Attention‐Conv‐BN‐SiLU (GSA‐CBS) and is used in the neck network. The GSANeck network reduces the model size without affecting the performance. Finally, the SIoU is used in the loss function and Soft NMS is used for post‐processing. Experimental results on the self‐constructed dataset show that YOLO‐CPDC algorithm has higher detection accuracy than current methods. YOLO‐CPDC achieves a mAP50 of 93.6%. Compared with the YOLOv5s, the number of parameters of our model is reduced by 18% and the mAP50 is improved by 1.1%. Overall, this research effectively meets the actual demand of dress code detection in construction scenes. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. 79‐4: Efficient Deep Learning‐based Backlight Extraction for Local Dimming Display.
- Author
-
Chung, Hanwook, Tarabay, Nizar, Okon, Alexandre, and Yoo, Hyunjin
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,COMPUTATIONAL complexity ,GENERALIZATION - Abstract
In this paper, we introduce deep learning (DL)‐based backlight extraction methods for local dimming display. The main objective is to better handle the trade‐off between the displayed image quality and power consumption. To this end, we propose an enhanced power regularization. Moreover, we design a smaller model with reduced computational complexity and propose an efficient post‐processing for better generalization. Experimental results show that the proposed methods reduce power consumption while better maintaining the image quality than the selected benchmarks. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. The macular retinal ganglion cell layer as a biomarker for diagnosis and prognosis in multiple sclerosis: A deep learning approach.
- Author
-
Montolío, Alberto, Cegoñino, José, Garcia‐Martin, Elena, and Pérez del Palomar, Amaya
- Subjects
- *
RETINAL ganglion cells , *ARTIFICIAL neural networks , *DEEP learning , *MULTIPLE sclerosis , *OPTICAL coherence tomography , *RETINAL blood vessels , *MACULA lutea - Abstract
Purpose: The macular ganglion cell layer (mGCL) is a strong potential biomarker of axonal degeneration in multiple sclerosis (MS). For this reason, this study aims to develop a computer‐aided method to facilitate diagnosis and prognosis in MS. Methods: This paper combines a cross‐sectional study of 72 MS patients and 30 healthy control subjects for diagnosis and a 10‐year longitudinal study of the same MS patients for the prediction of disability progression, during which the mGCL was measured using optical coherence tomography (OCT). Deep neural networks were used as an automatic classifier. Results: For MS diagnosis, greatest accuracy (90.3%) was achieved using 17 features as inputs. The neural network architecture comprised the input layer, two hidden layers and the output layer with softmax activation. For the prediction of disability progression 8 years later, accuracy of 81.9% was achieved with a neural network comprising two hidden layers and 400 epochs. Conclusion: We present evidence that by applying deep learning techniques to clinical and mGCL thickness data it is possible to identify MS and predict the course of the disease. This approach potentially constitutes a non‐invasive, low‐cost, easy‐to‐implement and effective method. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Time‐based DDoS attack detection through hybrid LSTM‐CNN model architectures: An investigation of many‐to‐one and many‐to‐many approaches.
- Author
-
Habib, Beenish and Khursheed, Farida
- Subjects
ARTIFICIAL neural networks ,DENIAL of service attacks ,CONVOLUTIONAL neural networks ,COMPUTER network traffic ,ARTIFICIAL intelligence ,DEEP learning - Abstract
Summary: Internet data thefts, intrusions and DDoS attacks are some of the big concerns for the network security today. Detection of these anomalies, is gaining tremendous impetus with the development of machine learning and artificial intelligence. Even now researchers are shifting the base from machine learning to the deep neural architectures with auto‐feature selection capabilities. We in this paper propose multiple deep neural network architectures which can select, co‐learn and teach the gradients of the neural network by itself with no human intervention. This is what we call as meta‐learning. The models are configured in both many to one and many to many design architectures. We combine long short‐term memory (LSTM), bi‐directional long short‐term memory (BiLSTM), convolutional neural network (CNN) layers along with attention mechanism to achieve the higher accuracy values among all the available deep learning model architectures. LSTMs overcomes the vanishing and exploding gradient problem of RNN and attention mechanism mimics the human cognitive attention that screens the network flow to obtain the key features for network traffic classification. In addition, we also add multiple convolutional layers to get the key features for network traffic classification. We get the time series analysis of the traffic done for the possibility of a DDoS attack without using any feature selection techniques and without balancing the dataset. The performance analysis is done based on confusion matrix scores, that is, accuracy, false alarm rate (FAR), sensitivity, specificity, false‐positive rate (FPR), F1 score, area under curve (AUC) analysis and loss functions on well‐known public benchmark KDD Cup'99 data set. The results of our experiments reveal that our models outperform existing techniques, showing their superiority in performance. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. A feed forward deep neural network model using feature selection for cloud intrusion detection system.
- Author
-
Sharma, Hidangmayum Satyajeet and Singh, Khundrakpam Johnson
- Subjects
ARTIFICIAL neural networks ,FEATURE selection ,MACHINE learning ,INTRUSION detection systems (Computer security) ,DEEP learning ,CLOUD computing - Abstract
Summary: The rapid advancement and growth of technology have rendered cloud computing services indispensable to our activities. Threats and intrusions have since multiplied exponentially across a range of industries. In such a scenario, the intrusion detection system, or simply the IDS, is deployed on the network to monitor and detect any attacks. The paper proposes a feed‐forward deep neural network (FFDNN) method based on deep learning methodology using a filter‐based feature selection model. The feature selection strategy aims to determine and select the most highly relevant subset of attributes from the feature importance score for training the deep learning model. Three benchmark data sets were used to assess the experiment: CIC‐IDS 2017, UNSW‐NB15, and NSL‐KDD. In order to justify the proposed technique, a comparison was done using other learning algorithms ranging from classical machine learning to ensemble learning methods that can detect various attacks. The experiments showed that the FFDNN model with reduced feature subsets gave the highest accuracy of 99.53% and 94.45% in the NSL‐KDD and UNSW‐NB15 data sets, while the ensemble‐based XGBoost model performed better in the CIC‐IDS 2017 data set. In addition, the results show that the overall accuracy, recall, and F1 score of the deep learning algorithm are generally better for all the data sets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Deep learning‐based response spectrum analysis method for building structures.
- Author
-
Kim, Taeyong, Kwon, Oh‐Sung, and Song, Junho
- Subjects
DEEP learning ,SPECTRUM analysis ,ARTIFICIAL neural networks ,GROUND motion ,SUM of squares ,SQUARE root - Abstract
The response spectrum method has gained widespread acceptance in practical applications owing to its favorable compromise between accuracy and practical efficiency. The method predicts the peak responses of multi‐degree‐of‐freedom (MDOF) systems by combining modal responses. The Square Root of the Sum of Squares (SRSS) and Complete Quadratic Combination (CQC) rules are commonly used for modal combinations. However, it has been widely known that these rules have limitations in accurately predicting responses influenced by higher modes and cross‐modal correlations. To improve the accuracy of the response spectrum analysis method for building structures, this paper proposes a Deep learning‐based modal Combination (DC) rule by introducing modal contribution coefficients predicted by a deep neural network (DNN) model. The DC rule enhances prediction accuracy by considering the characteristics of ground motion and the dynamic properties of a structural system. The DC rule provides more accurate predictions than the conventional rules, particularly for irregular response spectra and responses affected by higher modes. The efficiency and applicability of the DC rule are demonstrated by numerical investigations of multistory shear buildings and steel frame structures with regular and irregular shapes. The source codes, data, and trained models are available for download at https://github.com/tyongkim/ERD2. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. An adaptive weight search method based on the Grey wolf optimizer algorithm for skin lesion ensemble classification.
- Author
-
Liu, Luzhou, Zhang, Xiaoxia, and Xu, Zhinan
- Subjects
- *
GREY Wolf Optimizer algorithm , *DEEP learning , *ARTIFICIAL neural networks , *OPTIMIZATION algorithms , *MACHINE learning , *PARTICLE swarm optimization , *CLASSIFICATION algorithms , *SKIN imaging - Abstract
Skin cancer is a common type of malignant tumor that poses a serious threat to patients' lives and health, especially melanoma. It may spread to other body parts, resulting in serious complications and death. In the medical field, accurate identification of skin lesion images is crucial for diagnosing different diseases. However, due to the similarity between different skin lesions, it brings some challenges to medical diagnosis. In this paper, a novel Ensemble Learning Model (EL‐DLOA) based on deep learning and optimization algorithms is proposed, which uses four different deep neural network architectures to generate confidence levels for classes, and optimization algorithms are used to integrate these confidence levels to make the final predictions. To ensure the model's accuracy and reliability, it is first trained using three different learning rates to find the best classification performance of the model. Then, a new search method based on the grey wolf optimization algorithm is proposed to enhance the grey wolf search efficiency. The method improves the search mechanism by changing the grey wolf's individual position through random perturbation or adaptive mutation, which solves the problem that the grey wolf algorithm is easy to fall into local optimum. Finally, four different ensemble strategies are used to reduce individual model bias in the classification process. The proposed model is trained and evaluated using the publicly available dataset HAM10000. The experimental results show that the improved grey wolf optimization algorithm effectively avoids the premature convergence problem and improves the search combination efficiency. Furthermore, in the ensemble methods, the adaptive weight average ensemble strategy effectively improves the classification performance, yielding accuracy, precision, recall, and F1 scores of 0.888, 0.837, 0.897, and 0.862, respectively. These metrics show varying degrees of improvement over the best performing single model. In general, the results indicate that the proposed method achieves high accuracy and practicality in skin lesion classification. Our model shows excellent performance in comparison with other existing models, which makes it significant for research and application in dermatology diagnosis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. A gradient mapping guided explainable deep neural network for extracapsular extension identification in 3D head and neck cancer computed tomography images.
- Author
-
Wang, Yibin, Rahman, Abdur, Duggar, William Neil, Thomas, Toms V., Roberts, Paul Russell, Vijayakumar, Srinivasan, Jiao, Zhicheng, Bian, Linkan, and Wang, Haifeng
- Subjects
- *
ARTIFICIAL neural networks , *DEEP learning , *HEAD & neck cancer , *MACHINE learning , *SQUAMOUS cell carcinoma , *LYMPH nodes - Abstract
Background: Diagnosis and treatment management for head and neck squamous cell carcinoma (HNSCC) is guided by routine diagnostic head and neck computed tomography (CT) scans to identify tumor and lymph node features. The extracapsular extension (ECE) is a strong predictor of patients' survival outcomes with HNSCC. It is essential to detect the occurrence of ECE as it changes staging and treatment planning for patients. Current clinical ECE detection relies on visual identification and pathologic confirmation conducted by clinicians. However, manual annotation of the lymph node region is a required data preprocessing step in most of the current machine learning‐based ECE diagnosis studies. Purpose: In this paper, we propose a Gradient Mapping Guided Explainable Network (GMGENet) framework to perform ECE identification automatically without requiring annotated lymph node region information. Methods: The gradient‐weighted class activation mapping (Grad‐CAM) technique is applied to guide the deep learning algorithm to focus on the regions that are highly related to ECE. The proposed framework includes an extractor and a classifier. In a joint training process, informative volumes of interest (VOIs) are extracted by the extractor without labeled lymph node region information, and the classifier learns the pattern to classify the extracted VOIs into ECE positive and negative. Results: In evaluation, the proposed methods are well‐trained and tested using cross‐validation. GMGENet achieved test accuracy and area under the curve (AUC) of 92.2% and 89.3%, respectively. GMGENetV2 achieved 90.3% accuracy and 91.7% AUC in the test. The results were compared with different existing models and further confirmed and explained by generating ECE probability heatmaps via a Grad‐CAM technique. The presence or absence of ECE has been analyzed and correlated with ground truth histopathological findings. Conclusions: The proposed deep network can learn meaningful patterns to identify ECE without providing lymph node contours. The introduced ECE heatmaps will contribute to the clinical implementations of the proposed model and reveal unknown features to radiologists. The outcome of this study is expected to promote the implementation of explainable artificial intelligence‐assiste ECE detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. Deep neural network aided cohesive zone parameter identifications through die shear test in electronic packaging.
- Author
-
Zhao, Libo, Dai, Yanwei, Wei, Jiahui, and Qin, Fei
- Subjects
- *
ARTIFICIAL neural networks , *ELECTRONIC packaging , *PARAMETER identification , *COHESIVE strength (Mechanics) , *PACKAGING materials , *SHEAR strength , *DEEP learning - Abstract
The die shear test is a feasible and conventional method to characterize the shear strength of die‐attaching layer materials in electronic packaging. A new method for determining cohesive zone model (CZM) parameters using deep neural networks (DNN) and die shear tests is proposed, different from classical fracture framework or lap shear test‐based methods. With the sintered nano‐silver die shear test, the results show that the bilinear CZM inversion results agree well with the experimental results. It is found that the DNN model has high accuracy in predicting and identifying the maximum shear traction strength τmax, separation displacement of the interface δf, and the interface stiffness k1 of CZM parameters for sintered nano‐silver adhesive layer through die shear test load versus displacement curves. The presented DNN‐aided inverse identifying method through the die shear test in this paper could provide an alternative and convenient method for extracting CZM parameters of various kinds of adhesive materials in electronic packaging. Highlights: Die shear tests were used for the inverse identification of CZM parameters.The die shear test P–δ curves were established as the dataset.A DNN‐aided CZM inverse identification method was proposed.The DNN‐aided model can accurately identify the CZM parameters. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Investigating fault injection techniques in hardware‐based deep neural networks and mutation‐based fault localization.
- Author
-
Le Traon, Yves and Xie, Tao
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,SOFTWARE reliability ,SOFTWARE localization - Abstract
This article discusses two papers that examine different aspects of software reliability using fault injection techniques. The first paper investigates the impact of transient hardware faults on deep learning neural network inference, particularly in safety-critical applications like autonomous vehicles and healthcare systems. The authors enhance fault injection techniques to reveal the significant influence of hardware faults on these applications. The second paper addresses the challenges of fault localization in software debugging and presents a novel approach, Delta4Ms, that mitigates mutant bias and improves fault localization accuracy. These papers provide valuable insights into ensuring software reliability and resilience in different contexts. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
35. LDCNN: A new arrhythmia detection technique with ECG signals using a linear deep convolutional neural network.
- Author
-
Bayani, Ali and Kargar, Masoud
- Subjects
ARTIFICIAL neural networks ,CONVOLUTIONAL neural networks ,MUSCLE contraction ,DEEP learning ,MYOCARDIUM - Abstract
The electrocardiogram (ECG) is a fundamental and widely used tool for diagnosing cardiovascular diseases. It involves recording cardiac electrical signals using electrodes, which illustrate the functioning of cardiac muscles during contraction and relaxation phases. ECG is instrumental in identifying abnormal cardiac activity, heart attacks, and various cardiac conditions. Arrhythmia detection, a critical aspect of ECG analysis, entails accurately classifying heartbeats. However, ECG signal analysis demands a high level of expertise, introducing the possibility of human errors in interpretation. Hence, there is a clear need for robust automated detection techniques. Recently, numerous methods have emerged for arrhythmia detection from ECG signals. In our research, we developed a novel one‐dimensional deep neural network technique called linear deep convolutional neural network (LDCNN) to identify arrhythmias from ECG signals. We compare our suggested method with several state‐of‐the‐art algorithms for arrhythmia detection. We evaluate our methodology using benchmark datasets, including the PTB Diagnostic ECG and MIT‐BIH Arrhythmia databases. Our proposed method achieves high accuracy rates of 99.24% on the PTB Diagnostic ECG dataset and 99.38% on the MIT‐BIH Arrhythmia dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. A novel approach to inverse design of wind turbine airfoils using tandem neural networks.
- Author
-
Anand, Apurva, Marepally, Koushik, Muneeb Safdar, M, and Baeder, James D.
- Subjects
WIND turbine efficiency ,WIND turbine blades ,ARTIFICIAL neural networks ,FLUID dynamics ,WIND turbines ,DEEP learning ,AEROFOILS - Abstract
The performance of a wind turbine and its efficiency majorly depends on wind‐to‐rotor efficiency. The aerodynamic design of the wind turbine blades using high‐fidelity tools such as adjoint‐computational fluid dynamics (CFD) is accurate but computationally expensive. It becomes impractical when the number of design variables increases for multidisciplinary optimization (MDO). Low‐fidelity tools are computationally cheaper but are not accurate, especially in regions of adverse pressure gradient and reverse flows. Surrogate modeling has been used in many aerodynamic problems. We develop and apply a recent architecture of the deep learning module, tandem neural networks (T‐NNs) for the inverse design of wind turbine airfoils. The T‐NNs trained on CFD data for fully turbulent cases predict not only the performance parameters for the given airfoil geometry but also the airfoil geometry for a given design objective. This framework uses the entire performance polar for inverse design which ensures that the airfoil optimization is not a single‐point optimization problem which is essential for practical design problems. The T‐NNs are also optimized to include multiple constraints like maximum thickness and trailing edge (TE) thickness which is a novel contribution in the field of inverse design using surrogate models. A statistical analysis is also performed to predict a family of airfoil geometries. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Fast Pure Shift NMR Spectroscopy Using Attention‐Assisted Deep Neural Network.
- Author
-
Zhan, Haolin, Liu, Jiawei, Fang, Qiyuan, Chen, Xinyu, Ni, Yang, and Zhou, Lingling
- Subjects
ARTIFICIAL neural networks ,NUCLEAR magnetic resonance spectroscopy ,DEEP learning ,ARTIFICIAL intelligence ,MOLECULAR structure ,CHEMICAL shift (Nuclear magnetic resonance) - Abstract
Pure shift NMR spectroscopy enables the robust probing on molecular structure and dynamics, benefiting from great resolution enhancements. Despite extensive application landscapes in various branches of chemistry, the long experimental times induced by the additional time dimension generally hinder its further developments and practical deployments, especially for multi‐dimensional pure shift NMR. Herein, this study proposes and implements the fast, reliable, and robust reconstruction for accelerated pure shift NMR spectroscopy with lightweight attention‐assisted deep neural network. This deep learning protocol allows one to regain high‐resolution signals and suppress undersampling artifacts, as well as furnish high‐fidelity signal intensities along with the accelerated pure shift acquisition, benefitting from the introduction of the attention mechanism to highlight the spectral feature and information of interest. Extensive results of simulated and experimental NMR data demonstrate that this attention‐assisted deep learning protocol enables the effective recovery of weak signals that are almost drown in the serious undersampling artifacts, and the distinction and recognition of close chemical shifts even though using merely 5.4% data, highlighting its huge potentials on fast pure shift NMR spectroscopy. As a result, this study affords a promising paradigm for the AI‐assisted NMR protocols toward broader applications in chemistry, biology, materials, and life sciences, and among others. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Reinforcement learning layout‐based optimal energy management in smart home: AI‐based approach.
- Author
-
Afroosheh, Sajjad, Esapour, Khodakhast, Khorram‐Nia, Reza, and Karimi, Mazaher
- Subjects
ARTIFICIAL neural networks ,RENEWABLE energy sources ,REINFORCEMENT learning ,ENERGY storage ,SMART homes ,DEEP learning - Abstract
This research addresses the pressing need for enhanced energy management in smart homes, motivated by the inefficiencies of current methods in balancing power usage optimization with user comfort. By integrating reinforcement learning and a unique column‐and‐constraint generation strategy, the study aims to fill this gap and offer a comprehensive solution. Furthermore, the increasing adoption of renewable energy sources like solar panels underscores the importance of developing advanced energy management techniques, driving the exploration of innovative approaches such as the one proposed herein. The constraint coordination game (CCG) method is designed to efficiently manage the power usage of each appliance, including the charging and discharging of the energy storage system. Additionally, a deep learning model, specifically a deep neural network, is employed to forecast indoor temperatures, which significantly influence the energy demands of the air conditioning system. The synergistic combination of the CCG method with deep learning‐based indoor temperature forecasting promises significant reductions in homeowner energy expenses while maintaining optimal appliance performance and user satisfaction. Testing conducted in simulated environments demonstrates promising results, showcasing a 12% reduction in energy costs compared to conventional energy management strategies. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. Improved organs at risk segmentation based on modified U‐Net with self‐attention and consistency regularisation.
- Author
-
Manko, Maksym, Popov, Anton, Gorriz, Juan Manuel, and Ramirez, Javier
- Subjects
CHEST (Anatomy) ,ARTIFICIAL neural networks ,COMPUTED tomography ,RETINAL blood vessels ,IMAGE segmentation ,HEART ,ESOPHAGUS - Abstract
Cancer is one of the leading causes of death in the world, with radiotherapy as one of the treatment options. Radiotherapy planning starts with delineating the affected area from healthy organs, called organs at risk (OAR). A new approach to automatic OAR segmentation in the chest cavity in Computed Tomography (CT) images is presented. The proposed approach is based on the modified U‐Net architecture with the ResNet‐34 encoder, which is the baseline adopted in this work. The new two‐branch CS‐SA U‐Net architecture is proposed, which consists of two parallel U‐Net models in which self‐attention blocks with cosine similarity as query‐key similarity function (CS‐SA) blocks are inserted between the encoder and decoder, which enabled the use of consistency regularisation. The proposed solution demonstrates state‐of‐the‐art performance for the problem of OAR segmentation in CT images on the publicly available SegTHOR benchmark dataset in terms of a Dice coefficient (oesophagus—0.8714, heart—0.9516, trachea—0.9286, aorta—0.9510) and Hausdorff distance (oesophagus—0.2541, heart—0.1514, trachea—0.1722, aorta—0.1114) and significantly outperforms the baseline. The current approach is demonstrated to be viable for improving the quality of OAR segmentation for radiotherapy planning. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. Tjong: A transformer‐based Mahjong AI via hierarchical decision‐making and fan backward.
- Author
-
Li, Xiali, Liu, Bo, Wei, Zhi, Wang, Zhaoqi, and Wu, Licheng
- Subjects
ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,REINFORCEMENT learning ,DECISION making ,DEEP learning - Abstract
Mahjong, a complex game with hidden information and sparse rewards, poses significant challenges. Existing Mahjong AIs require substantial hardware resources and extensive datasets to enhance AI capabilities. The authors propose a transformer‐based Mahjong AI (Tjong) via hierarchical decision‐making. By utilising self‐attention mechanisms, Tjong effectively captures tile patterns and game dynamics, and it decouples the decision process into two distinct stages: action decision and tile decision. This design reduces decision complexity considerably. Additionally, a fan backward technique is proposed to address the sparse rewards by allocating reversed rewards for actions based on winning hands. Tjong consists of 15M parameters and is trained using approximately 0.5 M data over 7 days of supervised learning on a single server with 2 GPUs. The action decision achieved an accuracy of 94.63%, while the claim decision attained 98.55% and the discard decision reached 81.51%. In a tournament format, Tjong outperformed AIs (CNN, MLP, RNN, ResNet, VIT), achieving scores up to 230% higher than its opponents. Furthermore, after 3 days of reinforcement learning training, it ranked within the top 1% on the leaderboard on the Botzone platform. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Improved 5G network slicing for enhanced QoS against attack in SDN environment using deep learning.
- Author
-
Abood, Mohammed Salah, Wang, Hua, Virdee, Bal S., He, Dongxuan, Fathy, Maha, Yusuf, Abdulganiyu Abdu, Jamal, Omar, Elwi, Taha A., Alibakhshikenari, Mohammad, Kouhalvandi, Lida, and Ahmad, Ashfaq
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,OPENFLOW (Computer network protocol) ,DEEP learning ,COMPUTER network security ,5G networks - Abstract
Within the evolving landscape of fifth‐generation (5G) wireless networks, the introduction of network‐slicing protocols has become pivotal, enabling the accommodation of diverse application needs while fortifying defences against potential security breaches. This study endeavours to construct a comprehensive network‐slicing model integrated with an attack detection system within the 5G framework. Leveraging software‐defined networking (SDN) along with deep learning techniques, this approach seeks to fortify security measures while optimizing network performance. This undertaking introduces network slicing predicated on SDN with the OpenFlow protocol and Ryu control technology, complemented by a neural network model for attack detection using deep learning methodologies. Additionally, the proposed convolutional neural networks‐long short‐term memory approach demonstrates superiority over conventional ML algorithms, signifying its potential for real‐time attack detection. Evaluation of the proposed system using a 5G dataset showcases an impressive accuracy of 99%, surpassing previous studies, and affirming the efficacy of the approach. Moreover, network slicing significantly enhances quality of service by segmenting services based on bandwidth. Future research will concentrate on real‐world implementation, encompassing diverse dataset evaluations, and assessing the model's adaptability across varied scenarios. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Using photographs and deep neural networks to understand flowering phenology and diversity in mountain meadows.
- Author
-
John, Aji, Theobald, Elli J., Cristea, Nicoleta, Tan, Amanda, and Hille Ris Lambers, Janneke
- Subjects
ARTIFICIAL neural networks ,MOUNTAIN meadows ,PLANT phenology ,PHENOLOGY ,FLOWERING of plants ,TIMBERLINE ,DEEP learning ,FLOWERING time - Abstract
Mountain meadows are an essential part of the alpine–subalpine ecosystem; they provide ecosystem services like pollination and are home to diverse plant communities. Changes in climate affect meadow ecology on multiple levels, for example, by altering growing season dynamics. Tracking the effects of climate change on meadow diversity through the impacts on individual species and overall growing season dynamics is critical to conservation efforts. Here, we explore how to combine crowd‐sourced camera images with machine learning to quantify flowering species richness across a range of elevations in alpine meadows located in Mt. Rainier National Park, Washington, USA. We employed three machine‐learning techniques (Mask R‐CNN, RetinaNet and YOLOv5) to detect wildflower species in images taken during two flowering seasons. We demonstrate that deep learning techniques can detect multiple species, providing information on flowering richness in photographed meadows. The results indicate higher richness just above the tree line for most of the species, which is comparable with patterns found using field studies. We found that the two‐stage detector Mask R‐CNN was more accurate than single‐stage detectors like RetinaNet and YOLO, with the Mask R‐CNN network performing best overall with mean average precision (mAP) of 0.67 followed by RetinaNet (0.5) and YOLO (0.4). We found that across the methods using anchor box variations in multiples of 16 led to enhanced accuracy. We also show that detection is possible even when pictures are interspersed with complex backgrounds and are not in focus. We found differential detection rates depending on species abundance, with additional challenges related to similarity in flower characteristics, labeling errors and occlusion issues. Despite these potential biases and limitations in capturing flowering abundance and location‐specific quantification, accuracy was notable considering the complexity of flower types and picture angles in this dataset. We, therefore, expect that this approach can be used to address many ecological questions that benefit from automated flower detection, including studies of flowering phenology and floral resources, and that this approach can, therefore, complement a wide range of ecological approaches (e.g., field observations, experiments, community science, etc.). In all, our study suggests that ecological metrics like floral richness can be efficiently monitored by combining machine learning with easily accessible publicly curated datasets (e.g., Flickr, iNaturalist). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. Machine learning applications in vadose zone hydrology: A review.
- Author
-
Li, Xiang, Nieber, John L., and Kumar, Vipin
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,DEEP learning ,HYDROLOGY ,RANDOM forest algorithms - Abstract
Machine learning (ML) has been broadly applied for vadose zone applications in recent years. This article provides a comprehensive review of such developments. ML applications for variables corresponding to different complex vadose zone processes are summarized mostly in a prediction context. By analyzing and assessing these applications, we discovered extensive usages of classic ML models with relatively limited applications of deep learning (DL) approaches in general. We also recognized a lack of benchmark datasets for soil property research as well as limited integration of physics‐based vadose zone principles into the ML approaches. To facilitate this interdisciplinary research of ML in vadose zone characterization and processes, a paradigm of knowledge‐guided machine learning is suggested along with other data‐driven and ML model‐based research suggestions to advance future research. Core Ideas: Random forest and artificial neural network are two widely applied machine learning options for predicting vadose zone studies.A benchmark dataset is missing in soil property studies.We suggest vadose zone scientists explore more deep learning options and expand knowledge‐guided machine learning implementations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. Editorial for special issue on "Edge computing accelerated deep learning: Technologies and applications".
- Author
-
Liu, Xiao
- Subjects
EDGE computing ,DEEP learning ,ARTIFICIAL neural networks ,MACHINE learning ,DISTRIBUTED computing - Abstract
This document is an editorial for a special issue on "Edge computing accelerated deep learning: Technologies and applications." The traditional centralized approach to implementing machine learning (ML) and deep learning (DL) applications has limitations such as high latency, large bandwidth usage, and privacy concerns. Edge computing, which integrates mobile/wireless infrastructure and cloud datacenters, has emerged as a paradigm to address these issues. The special issue aims to promote innovative technologies and applications that accelerate DL in the distributed edge computing environment. The editorial highlights five selected papers that focus on using DL and other ML methods to address challenges in edge computing-based application scenarios and designing efficient and lightweight ML and DL models for the edge computing environment. The Lead Guest Editor expresses gratitude to the authors and reviewers for their contributions. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
45. Improving Explainability of Deep Learning for Polarimetric Radar Rainfall Estimation.
- Author
-
Li, Wenyuan, Chen, Haonan, and Han, Lei
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,RAIN gauges ,RADAR ,RAINDROP size ,GROUND penetrating radar ,RAINFALL - Abstract
Machine learning‐based approaches demonstrate a significant potential in radar quantitative precipitation estimation (QPE) applications. In contrast to conventional methods that depend on local raindrop size distributions, deep learning (DL) can establish an effective mapping from three‐dimensional radar observations to ground rain rates. However, the lack of transparency in DL models poses challenges toward understanding the underlying physical mechanisms that drive their outcomes. This study aims to develop a DL‐based QPE system and provide a physical explanation of radar precipitation estimation process. This research is designed by employing a deep neural network consisting of two modules. The first module is a quantitative precipitation estimation network that has the capability to learn precipitation patterns and spatial distribution from multidimensional polarimetric radar observations. The second module introduces a quantitative precipitation estimation shapley additive explanations method to quantify the influence of each radar observable on the model estimate across various precipitation intensities. Plain Language Summary: Ground radars can provide continuous spatial observations over large areas with high spatiotemporal resolutions, so they form the infrastructure for precipitation monitoring and observation in many countries. Recently, deep learning (DL) techniques have shown great potential for use in polarimetric radar‐based precipitation estimates. Nevertheless, the black‐box and turn‐key characteristics of DL models make it difficult for researchers to understand the model decision‐making process and cast doubt on the reliability of the model results. This study introduces a physically explainable polarization radar‐based quantitative precipitation estimation (QPE) system built on DL technology that can explain the causes of the precipitation estimates provided by deep learning models under different rainfall amounts. An experiment indicates that our model achieves better estimates than the conventional methods. Furthermore, the explainability methodology allows for visualization of the microphysical precipitation information. Being the initial attempt to apply explainability learning in the QPE domain, the explainability results may offer valuable guidance for rainfall estimation. Key Points: A polarimetric radar‐based rainfall estimation system is developed using deep neural networksThe deep learning‐based rainfall estimates generally outperform products derived from traditional parametric relationsThe proposed deep learning interpretation method can provide physical and statistical explanations of the model decision‐making process [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. 'cito': an R package for training neural networks using 'torch'.
- Author
-
Amesöder, Christian, Hartig, Florian, and Pichler, Maximilian
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,AFRICAN elephant ,GRAPHICS processing units ,SPECIES distribution ,TORCHES - Abstract
Deep neural networks (DNN) have become a central method in ecology. To build and train DNNs in deep learning (DL) applications, most users rely on one of the major deep learning frameworks, in particular PyTorch or TensorFlow. Using these frameworks, however, requires substantial experience and time. Here, we present 'cito', a user‐friendly R package for DL that allows specifying DNNs in the familiar formula syntax used by many R packages. To fit the models, 'cito' takes advantage of the numerically optimized 'torch' library, including the ability to switch between training models on the CPU or the graphics processing unit (GPU) which allows the efficient training of large DNNs. Moreover, 'cito' includes many user‐friendly functions for model plotting and analysis, including explainable AI (xAI) metrics for effect sizes and variable importance. All xAI metrics as well as predictions can optionally be bootstrapped to generate confidence intervals, including p‐values. To showcase a typical analysis pipeline using 'cito', with its built‐in xAI features, we built a species distribution model of the African elephant. We hope that by providing a user‐friendly R framework to specify, deploy and interpret DNNs, 'cito' will make this interesting class of models more accessible to ecological data analysis. A stable version of 'cito' can be installed from the comprehensive R archive network (CRAN). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Neural dynamics for improving optimiser in deep learning with noise considered.
- Author
-
Su, Dan, Stanimirović, Predrag S., Han, Ling Bo, and Jin, Long
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,NOISE ,SOURCE code - Abstract
As deep learning evolves, neural network structures become increasingly sophisticated, bringing a series of new optimisation challenges. For example, deep neural networks (DNNs) are vulnerable to a variety of attacks. Training neural networks under privacy constraints is a method to alleviate privacy leakage, and one way to do this is to add noise to the gradient. However, the existing optimisers suffer from weak convergence in the presence of increased noise during training, which leads to a low robustness of the optimiser. To stabilise and improve the convergence of DNNs, the authors propose a neural dynamics (ND) optimiser, which is inspired by the zeroing neural dynamics originated from zeroing neural networks. The authors first analyse the relationship between DNNs and control systems. Then, the authors construct the ND optimiser to update network parameters. Moreover, the proposed ND optimiser alleviates the non‐convergence problem that may be suffered by adding noise to the gradient from different scenarios. Furthermore, experiments are conducted on different neural network structures, including ResNet18, ResNet34, Inception‐v3, MobileNet, and long and short‐term memory network. Comparative results using CIFAR, YouTube Faces, and R8 datasets demonstrate that the ND optimiser improves the accuracy and stability of DNNs under noise‐free and noise‐polluted conditions. The source code is publicly available at https://github.com/LongJin‐lab/ND. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. RGB‐guided hyperspectral image super‐resolution with deep progressive learning.
- Author
-
Zhang, Tao, Fu, Ying, Huang, Liwei, Li, Siyuan, You, Shaodi, and Yan, Chenggang
- Subjects
HIGH resolution imaging ,DEEP learning ,ARTIFICIAL neural networks ,SUPERVISED learning ,COMPUTER vision ,IMAGE processing - Abstract
Due to hardware limitations, existing hyperspectral (HS) camera often suffer from low spatial/temporal resolution. Recently, it has been prevalent to super‐resolve a low resolution (LR) HS image into a high resolution (HR) HS image with a HR RGB (or multispectral) image guidance. Previous approaches for this guided super‐resolution task often model the intrinsic characteristic of the desired HR HS image using hand‐crafted priors. Recently, researchers pay more attention to deep learning methods with direct supervised or unsupervised learning, which exploit deep prior only from training dataset or testing data. In this article, an efficient convolutional neural network‐based method is presented to progressively super‐resolve HS image with RGB image guidance. Specifically, a progressive HS image super‐resolution network is proposed, which progressively super‐resolve the LR HS image with pixel shuffled HR RGB image guidance. Then, the super‐resolution network is progressively trained with supervised pre‐training and unsupervised adaption, where supervised pre‐training learns the general prior on training data and unsupervised adaptation generalises the general prior to specific prior for variant testing scenes. The proposed method can effectively exploit prior from training dataset and testing HS and RGB images with spectral‐spatial constraint. It has a good generalisation capability, especially for blind HS image super‐resolution. Comprehensive experimental results show that the proposed deep progressive learning method outperforms the existing state‐of‐the‐art methods for HS image super‐resolution in non‐blind and blind cases. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. Position‐aware pushing and grasping synergy with deep reinforcement learning in clutter.
- Author
-
Zhao, Min, Zuo, Guoyu, Yu, Shuangyue, Gong, Daoxiong, Wang, Zihao, and Sie, Ouattara
- Subjects
DEEP reinforcement learning ,PREHENSION (Physiology) ,ARTIFICIAL neural networks ,REINFORCEMENT learning ,IMAGE segmentation - Abstract
The positional information of objects is crucial to enable robots to perform grasping and pushing manipulations in clutter. To effectively perform grasping and pushing manipulations, robots need to perceive the position information of objects, including the coordinates and spatial relationship between objects (e.g., proximity, adjacency). The authors propose an end‐to‐end position‐aware deep Q‐learning framework to achieve efficient collaborative pushing and grasping in clutter. Specifically, a pair of conjugate pushing and grasping attention modules are proposed to capture the position information of objects and generate high‐quality affordance maps of operating positions with features of pushing and grasping operations. In addition, the authors propose an object isolation metric and clutter metric based on instance segmentation to measure the spatial relationships between objects in cluttered environments. To further enhance the perception capacity of position information of the objects, the authors associate the change in the object isolation metric and clutter metric in cluttered environment before and after performing the action with reward function. A series of experiments are carried out in simulation and real‐world which indicate that the method improves sample efficiency, task completion rate, grasping success rate and action efficiency compared to state‐of‐the‐art end‐to‐end methods. Noted that the authors' system can be robustly applied to real‐world use and extended to novel objects. Supplementary material is available at https://youtu.be/NhG\_k5v3NnM}{https://youtu.be/NhG\_k5v3NnM. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. Deep learning‐based skin care product recommendation: A focus on cosmetic ingredient analysis and facial skin conditions.
- Author
-
Lee, Jinhee, Yoon, Huisu, Kim, Semin, Lee, Chanhyeok, Lee, Jongha, and Yoo, Sangwook
- Subjects
ARTIFICIAL neural networks ,SKIN care products ,DEEP learning ,RECOMMENDER systems ,SIGNAL convolution ,ARTIFICIAL intelligence - Abstract
Background: Recommendations for cosmetics are gaining popularity, but they are not being made with consideration of the analysis of cosmetic ingredients, which customers consider important when selecting cosmetics. Aims: This article aims to propose a method for estimating the efficacy of cosmetics based on their ingredients and introduces a system that recommends personalized products for consumers, combined with AI skin analysis. Methods: We constructed a deep neural network architecture to analyze sequentially arranged cosmetic ingredients in the product and incorporated skin analysis models to get the precise skin status of users from frontal face images. Our recommendation system makes decisions based on the results optimized for the individual. Results: Our cosmetic recommendation system has shown its effectiveness through reliable evaluation metrics, and numerous examples have demonstrated its ability to make reasonable recommendations for various skin problems. Conclusion: The result shows that deep learning methods can be used to predict the effects of products based on their cosmetic ingredients and are available for use in personalized cosmetic recommendations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.