28 results on '"Xu, Lu"'
Search Results
2. High-Throughput Classification and Counting of Vegetable Soybean Pods Based on Deep Learning
- Author
-
Chenxi Zhang, Xu Lu, Huimin Ma, Yuhao Hu, Shuainan Zhang, Xiaomei Ning, Jianwei Hu, and Jun Jiao
- Subjects
auxiliary breeding ,crop phenotype ,deep learning ,high throughput ,pod identification ,Agriculture - Abstract
Accurate identification of soybean pods is an important prerequisite for obtaining phenotypic traits such as effective pod number and seed number per plant. However, traditional image-processing methods are sensitive to light intensity, and feature-extraction methods are complex and unstable, which are not suitable for pod multi-classification tasks. In the context of smart agriculture, many experts and scholars use deep learning algorithm methods to obtain the phenotype of soybean pods, but empty pods and aborted seeds are often ignored in pod classification, resulting in certain errors in counting results. Therefore, a new classification method based on the number of effective and abortive seeds in soybean pods is proposed in this paper, and the non-maximum suppression parameters are adjusted. Finally, the method is verified. The results show that our classification counting method can effectively reduce the errors in pod and seed counting. At the same time, this paper designs a pod dataset based on multi-device capture, in which the training dataset after data augmentation has a total of 3216 images, and the distortion image test dataset, the high-density pods image test dataset, and the low-pixel image test dataset include 90 images, respectively. Finally, four object-detection models, Faster R-CNN, YOLOv3, YOLOv4, and YOLOX, are trained on the training dataset, and the recognition performance on the three test datasets is compared to select the best model. Among them, YOLOX has the best comprehensive performance, with a mean average accuracy (mAP) of 98.24%, 91.80%, and 90.27%, respectively. Experimental results show that our algorithm can quickly and accurately achieve the high-throughput counting of pods and seeds, and improve the efficiency of indoor seed testing of soybeans.
- Published
- 2023
- Full Text
- View/download PDF
3. ST-CenterNet: Small Target Detection Algorithm with Adaptive Data Enhancement
- Author
-
Yujie Guo and Xu Lu
- Subjects
small target detection ,deep learning ,selective oversampling ,adaptive data enhancement ,Science ,Astrophysics ,QB460-466 ,Physics ,QC1-999 - Abstract
General target detection with deep learning has made tremendous strides in the past few years. However, small target detection sometimes is associated with insufficient sample size and difficulty in extracting complete feature information. For safety during autonomous driving, remote signs and pedestrians need to be detected from driving scenes photographed by car cameras. In the early period of a medical lesion, because of the small area of the lesion, target detection is of great significance to detect masses and tumors for accurate diagnosis and treatment. To deal with these problems, we propose a novel deep learning model, named CenterNet for small targets (ST-CenterNet). First of all, due to the lack of visual information on small targets in the dataset, we extracted less discriminative features. To overcome this shortcoming, the proposed selective small target replication algorithm (SSTRA) was used to realize increasing numbers of small targets by selectively oversampling them. In addition, the difficulty of extracting shallow semantic information for small targets results in incomplete target feature information. Consequently, we developed a target adaptation feature extraction module (TAFEM), which was used to conduct bottom-up and top-down bidirectional feature extraction by combining ResNet with the adaptive feature pyramid network (AFPN). The improved new network model, AFPN, was added to solve the problem of the original feature extraction module, which can only extract the last layer of the feature information. The experimental results demonstrate that the proposed method can accurately detect the small-scale image of distributed targets and simultaneously, at the pixel level, classify whether a subject is wearing a safety helmet. Compared with the detection effect of the original algorithm on the safety helmet wearing dataset (SHWD), we achieved mean average precision (mAP) of 89.06% and frames per second (FPS) of 28.96, an improvement of 18.08% mAP over the previous method.
- Published
- 2023
- Full Text
- View/download PDF
4. Landslide Hazard Analysis Combining BGA-Net-Based Landslide Susceptibility Perception and Small Baseline Subset Interferometric Synthetic Aperture Radar in the Baige Section in the Upper Reaches of Jinsha River.
- Author
-
Su, Leyi, Zhang, Liang, Gui, Yuannan, Li, Yan, Zhang, Zhi, Xu, Lu, and Ming, Dongping
- Subjects
LANDSLIDE hazard analysis ,SYNTHETIC aperture radar ,CONVOLUTIONAL neural networks ,SYNTHETIC apertures ,LANDSLIDES ,LANDSLIDE prediction - Abstract
The geological and topographic conditions in the upper reaches of the Jinsha River are intricate, with frequent occurrences of landslides. Landslide Susceptibility Prediction (LSP) in this area is a crucial aspect of geological disaster risk management. This study constructs an LSP model using a Convolutional Neural Network (CNN) based on a Bilateral Aggregation Guidance (BAG) strategy, termed BGA-Net. A comprehensive landslide hazard analysis, integrating static landslide susceptibility zonation with dynamic surface deformation monitoring, was therefore conducted. The study area selected was the upper reaches of the Jinsha River, particularly the site of the Baige landslide. The BGA-Net model was first proposed for LSP generation, achieving an accuracy exceeding 85%, while the Small Baseline Subset Interferometric Synthetic Aperture Radar (SBAS-InSAR) technology was jointly applied to comprehensively analyze the dynamic geological hazard risk at a regional scale. The final results were presented in a lookup table format and mapped to delineate and grade each risk level. The results show the method is practical, with high feasibility. Compared with traditional machine learning methods, the BGA-strategy-oriented CNN model more effectively differentiated the extremely low- and extremely high-susceptibility areas, enhancing decision-making processes. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Rain-Induced Landslide Hazard Assessment Using Inception Model and Interpretability Method—A Case Study of Zayu County, Tibet.
- Author
-
Su, Leyi, Gui, Yuannan, Xu, Lu, and Ming, Dongping
- Subjects
LANDSLIDE hazard analysis ,LANDSLIDES ,RAINSTORMS ,CONVOLUTIONAL neural networks ,BACK propagation ,HAZARD mitigation ,EARTHQUAKES ,INFRASTRUCTURE (Economics) - Abstract
Geological landslide disasters significantly threaten the safety of people's lives and property. Landslides are a significant threat in Zayu County, Tibet, resulting in numerous geological disasters, including the 1950 earthquake that caused significant casualties and river blockages. More recent landslides have caused substantial economic losses and infrastructure damage, posing ongoing risks to the local population and their property. Landslide hazard assessment is a critical task in geological disaster prevention and mitigation. This study applied the Inception model to assess landslide hazard in the Zayu area. The Inception model excels at capturing multi-scale features efficiently through its architecture. Fifteen disaster-causing factors were selected as the primary indicators for landslide susceptibility assessment. On this basis, the Inception model was used for landslide susceptibility assessment. Combined with daily precipitation data in the Zayu area, the landslide hazard assessment of the "25 April 2010, heavy rainstorm in Zayu, Tibet" was completed. Back Propagation Neural Network (BPNN), Residual Neural Network (ResNet), Convolutional Neural Network (CNN), and Visual Geometry Group-16 (VGG-16) were introduced for comparison of the fitting effects, and SHapley Additive exPlanations (SHAP) was used for interpretability analysis. The comparative experimental results show that the Inception model performed best in landslide susceptibility assessment and is feasible in practical use. The results also show that the most critical factors in the model were topographic wetness index (TWI), normalized difference water index (NDWI), and road density. This study is significant for assessing landslide hazard in geological landslide disaster prevention and mitigation. It provides a reference for further research and response to similar disasters. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. M-DDC: MRI based demyelinative diseases classification with U-Net segmentation and convolutional network.
- Author
-
Zhou, Deyang, Xu, Lu, Wang, Tianlei, Wei, Shaonong, Gao, Feng, Lai, Xiaoping, and Cao, Jiuwen
- Subjects
- *
NOSOLOGY , *POSTVACCINAL encephalitis , *MAGNETIC resonance imaging , *CHILDREN'S hospitals , *WHITE matter (Nerve tissue) , *PIXELS , *MULTISPECTRAL imaging , *NEUROMYELITIS optica - Abstract
Childhood demyelinative diseases classification (DDC) with brain magnetic resonance imaging (MRI) is crucial to clinical diagnosis. But few attentions have been paid to DDC in the past. How to accurately differentiate pediatric-onset neuromyelitis optica spectrum disorder (NMOSD) from acute disseminated encephalomyelitis (ADEM) based on MRI is challenging in DDC. In this paper, a novel architecture M-DDC based on joint U-Net segmentation network and deep convolutional network is developed. The U-Net segmentation can provide pixel-level structure information, that helps the lesion areas location and size estimation. The classification branch in DDC can detect the regions of interest inside MRIs, including the white matter regions where lesions appear. The performance of the proposed method is evaluated on MRIs of 201 subjects recorded from the Children's Hospital of Zhejiang University School of Medicine. The comparisons show that the proposed DDC achieves the highest accuracy of 99.19% and dice of 71.1% for ADEM and NMOSD classification and segmentation, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Remain useful life forecasting for roller bearings using sparse auto-encoder.
- Author
-
Tang, Yifeng, Xu, Fan, Xu, Lu, Zhou, Chao, and Deng, Yaling
- Subjects
ROLLER bearings ,STANDARD deviations ,BACK propagation ,DEEP learning ,SUPPORT vector machines - Abstract
A method based on sparse auto-encoder (SAE) in deep learning (DL) for roller bearings remain useful life (RUL) prediction is presented in this paper. Firstly, the roller bearings vibration signals were calculated by different time and frequency domain factors, in which reflect the vibration signals information well. Therefore, the time and frequency domain features were regarded as the input of SAE, then the SAE model in deep learning was used to extract the features through several hidden layers and the sigmoid function was selected as the output function for calculate the prediction value. Finally, compared with other different prediction methods, such as support vector machine (SVM), back propagation (BP) neural network and random forest (RF), the performance of SAE is better than that those models by using mean absolute error (MAE) and root mean square error (RMSE) these two indicators. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
8. Fourier Single-Pixel Imaging Based on Online Modulation Pattern Binarization.
- Author
-
Jiang, Xinding, Tong, Ziyi, Yu, Zhongyang, Jiang, Pengfei, Xu, Lu, Wu, Long, Chen, Mingsheng, Zhang, Yong, Zhang, Jianlong, and Yang, Xu
- Subjects
PIXELS ,DIGITAL technology ,IMAGE reconstruction ,GRAYSCALE model ,MICROMIRROR devices ,SPEED limits - Abstract
Down-sampling Fourier single-pixel imaging is typically achieved by truncating the Fourier spectrum, where exclusively the low-frequency Fourier coefficients are extracted while discarding the high-frequency components. However, the truncation of the Fourier spectrum can lead to an undesired ringing effect in the reconstructed result. Moreover, the original Fourier single-pixel imaging necessitated grayscale Fourier basis patterns for illumination. This requirement limits imaging speed because digital micromirror devices (DMDs) generate grayscale patterns at a lower refresh rate. In order to solve the above problem, a fast and high-quality Fourier single-pixel imaging reconstruction method is proposed in the paper. In the method, the threshold binarization of the Fourier base pattern is performed online to improve the DMD refresh rate, and the reconstruction quality of Fourier single-pixel imaging at a low-sampling rate is improved by generating an adversarial network. This method enables fast reconstruction of target images with higher quality despite low-sampling rates. Compared with conventional Fourier single-pixel imaging, numerical simulation and experimentation demonstrate the effectiveness of the proposed method. Notably, this method is particularly significant for fast Fourier single-pixel imaging applications. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. Deep electron cloud‐activity and field‐activity relationships.
- Author
-
Xu, Lu and Yang, Qin
- Subjects
- *
DENSITY functionals , *MOLECULAR structure , *STRUCTURE-activity relationships , *CHEMICAL properties , *SUPPORT vector machines - Abstract
Chemists have been pursuing general mathematical laws to explain and predict molecular properties for a long time. However, most of the traditional quantitative structure‐activity relationship (QSAR) models have limited application domains; for example, they tend to have poor generalization performance when applied to molecules with parent structures different from those of the trained molecules. This paper attempts to develop a new QSAR method that is theoretically possible to predict various properties of molecules with diverse structures. The proposed deep electron cloud‐activity relationships (DECAR) and deep field‐activity relationships (DFAR) methods consist of three essentials: (1) a large number of molecule entities with activity data as training objects and responses; (2) three‐dimensional electron cloud density (ECD) or related field data by the accurate density functional theory methods as input descriptors; and (3) a deep learning model that is sufficiently flexible and powerful to learn the large data described above. DECAR and DFAR are used to distinguish 977 sweet and 1965 non‐sweet molecules (with 6‐fold data augmentation), and the classification performance is demonstrated to be significantly better than the traditional least squares support vector machine (LS‐SVM) models using traditional descriptors. DECAR and DFAR would provide a possible way to establish a widely applicable, cumulative, and shareable artificial intelligence‐driven QSAR system. They are likely to promote the development of an interactive platform to collect and share the accurate ECD and field data of millions of molecules with annotated activities. With enough input data, we envision the appearance of several deep networks trained for various molecular activities. Finally, we could anticipate a single DECAR or DFAR network to learn and infer various properties of interest for chemical molecules, which will become an open and shared learning and inference tool for chemists. Deep electron cloud‐activity relationships (DECAR) and deep field‐activity relationships (DFAR) methods are quantitative structure‐activity relationship (QSAR) models that involve deep learning of thousands (or hundreds of thousands) of molecule entities using augmented and high‐quality 3D electron cloud or related field data as molecular structure descriptors. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Period Estimation of Spread Spectrum Codes Based on ResNet.
- Author
-
Gu, Han-Qing, Liu, Xia-Xia, Xu, Lu, Zhang, Yi-Jia, and Lu, Zhe-Ming
- Subjects
CONVOLUTIONAL neural networks ,TIME delay estimation ,SIGNAL-to-noise ratio - Abstract
In order to more effectively monitor and interfere with enemy signals, it is particularly important to accurately and efficiently identify the intercepted signals and estimate their parameters in the increasingly complex electromagnetic environment. Therefore, in non-cooperative situations, it is of great practical significance to study how to accurately detect direct sequence spread spectrum (DSSS) signals in real time and estimate their parameters. The traditional time-delay correlation algorithm encounters the challenges such as peak energy leakage and false peak interference. As an alternative, this paper introduces a Pseudo-Noise (PN) code period estimation method utilizing a one-dimensional (1D) convolutional neural network based on the residual network (CNN-ResNet). This method transforms the problem of spread spectrum code period estimation into a multi-classification problem of spread spectrum code length estimation. Firstly, the In-phase/Quadrature(I/Q) two-way of the received DSSS signals is directly input into the CNN-ResNet model, which will automatically learn the characteristics of the DSSS signal with different PN code lengths and then estimate the PN code length. Simulation experiments are conducted using a data set with DSSS signals ranging from −20 to 10 dB in terms of signal-to-noise ratios (SNRs). Upon training and verifying the model using BPSK modulation, it is then put to the test with QPSK-modulated signals, and the estimation performance was analyzed through metrics such as loss function, accuracy rate, recall rate, and confusion matrix. The results demonstrate that the 1D CNN-ResNet proposed in this paper is capable of effectively estimating the PN code period of the non-cooperative DSSS signal, exhibiting robust generalization abilities. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
11. DSSS Signal Detection Based on CNN.
- Author
-
Gu, Han-Qing, Liu, Xia-Xia, Xu, Lu, Zhang, Yi-Jia, and Lu, Zhe-Ming
- Subjects
SIGNAL detection ,CONVOLUTIONAL neural networks ,AUTOCORRELATION (Statistics) ,DEEP learning ,MILITARY electronics ,SIGNAL processing ,SIGNAL-to-noise ratio - Abstract
With the wide application of direct sequence spread spectrum (DSSS) signals, the comprehensive performance of DSSS communication systems has been continuously improved, making the electronic reconnaissance link in communication countermeasures more difficult. Electronic reconnaissance technology, as the fundamental means of modern electronic warfare, mainly includes signal detection, recognition, and parameter estimation. At present, research on DSSS detection algorithms is mostly based on the correlation characteristics of DSSS signals, and autocorrelation algorithm is the most mature and widely used method in practical engineering. With the continuous development of deep learning, deep-learning-based methods have gradually been introduced to replace traditional algorithms in the field of signal processing. This paper proposes a spread spectrum signal detection method based on convolutional neural network (CNN). Through experimental analysis, the detection performance of the CNN model proposed in this paper on DSSS signals in various situations has been compared and analyzed with traditional autocorrelation detection methods for different signal-to-noise ratios. The experiments verified the estimation performance of the model in this paper under different signal-to-noise ratios, different spreading code lengths, different spreading code types, and different modulation methods and compared it with the autocorrelation detection algorithm. It was found that the detection performance of the model in this paper was higher than that of the autocorrelation detection method, and the overall performance was improved by 4 dB. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
12. Interpretable Deep Learning Method Combining Temporal Backscattering Coefficients and Interferometric Coherence for Rice Area Mapping.
- Author
-
Ge, Ji, Zhang, Hong, Xu, Lu, Sun, Chun-Ling, and Wang, Chao
- Abstract
Reliable and accurate rice mapping using synthetic aperture radar (SAR) in cloudy and rainy areas is essential for achieving the United Nations Sustainable Development Goal 2 of 2030. An interpretable deep learning SAR rice area mapping method is proposed in this letter to suppress the interference of wetlands and other land covers to multitemporal SAR rice area mapping and improve the accuracy and confidence of the “black box” deep learning model results. Combining the temporal backscattering coefficients and interferometric coherence, three interpretable temporal features are extracted to effectively distinguish rice. Then, the explainable feature-aware network (XFANet), which can provide the learned importance weights of the normalization methods as self-interpretation, is constructed, and the pixel-wise gradient-weighted class activation mapping (PGCAM) post-hoc interpretation method is introduced to interpret the feature variation within the model. The experimental results in the Kampong Chhang and Kampong Chham provinces of Cambodia show that the proposed three interpretable features well suppressed the wetland disturbance to rice. With high interpretability, the overall accuracy (OA) of XFANet reaches 93.43%. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
13. TSDSR: Temporal–Spatial Domain Denoise Super-Resolution Photon-Efficient 3D Reconstruction by Deep Learning.
- Author
-
Tong, Ziyi, Jiang, Xinding, Hu, Jiemin, Xu, Lu, Wu, Long, Yang, Xu, and Zou, Bo
- Subjects
GENERATIVE adversarial networks ,DEEP learning ,AVALANCHE diodes ,HIGH resolution imaging ,SPATIAL resolution ,ROBOTICS - Abstract
The combination of a single-photon avalanche diode detector with a high-sensitivity and photon-efficient reconstruction algorithm can realize the reconstruction of target range image from weak light signal conditions. The limited spatial resolution of the detector and the substantial background noise remain significant challenges in the actual detection process, hindering the accuracy of 3D reconstruction techniques. To address this challenge, this paper proposes a denoising super-resolution reconstruction network based on generative adversarial network (GAN) design. Soft thresholding is incorporated into the deep architecture as a nonlinear transformation layer to effectively filter out noise. Moreover, the Unet-based discriminator is introduced to complete the high-precision detail reconstruction. The experimental results show that the proposed network can achieve high-quality super-resolution range imaging. This approach has the potential to enhance the accuracy and quality of long-range imaging in weak light signal conditions, with broad applications in fields such as robotics, autonomous vehicles, and biomedical imaging. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. ESDSCNet: an enhanced shallow feature difference and semantic context network for remote sensing change detection: with building change detection as a case.
- Author
-
Du, Tongyao, Ming, Dongping, Gu, Haiyan, Fang, Kun, Xu, Lu, Dong, Dehui, Zhang, Yu, and Liu, Luheng
- Subjects
STATISTICAL learning ,DEEP learning ,LAND cover - Abstract
Change detection (CD) is a challenging task considering land surface cover changes. Recently, deep learning has been introduced into CD. However, these methods suffer from shortcomings such as insufficient utilization of shallow features and inadequate aggregation of semantic context, which result in pseudo changes and poor integrity of change objects. To address this, an enhanced shallow feature difference and semantic context network (ESDSCNet) is proposed in this study for remote sensing CD. It uses HRNet to extract multi-scale features from bi-temporal images to obtain shallow and deep features. To fully exploit the shallow features, they are input into the difference statistical texture learning (DSTL) module to extract more discriminative features. Subsequently, the features enhanced by DSTL are fed into the change object contextual representation (COCR) module along with the deep difference features extracted by HRNet to characterize the contextual information of the change object. To verify the performance of ESDSCNet in different scenes, this paper takes building change detection as a case and conducts experiments based on three datasets. The experimental results reveal that ESDSCNet outperforms six other advanced methods, regardless of the intersection over union or the harmonic mean of precision rate and recall rate, which further confirms the effectiveness of the proposed network. In addition, another speciality of this paper is that ESDSCNet can identify not only 'Where has changed' but also 'What has changed what', so it has potential application in semantic change detection besides building change detection. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
15. Cropland Data Extraction in Mekong Delta Based on Time Series Sentinel-1 Dual-Polarized Data.
- Author
-
Jiang, Jingling, Zhang, Hong, Ge, Ji, Sun, Chunling, Xu, Lu, and Wang, Chao
- Subjects
FARMS ,SYNTHETIC aperture radar ,DATA extraction ,DECOMPOSITION method ,AGRICULTURE ,REMOTE sensing ,TIME series analysis - Abstract
In recent years, synthetic aperture radar (SAR) has been a widely used data source in the remote sensing field due to its ability to work all day and in all weather conditions. Among SAR satellites, Sentinel-1 is frequently used to monitor large-scale ground objects. The Mekong Delta is a major agricultural region in Southeast Asia, so monitoring its cropland is of great importance. However, it is a challenge to distinguish cropland from other ground objects, such as aquaculture and wetland, in this region. To address this problem, the study proposes a statistical feature combination from the Sentinel-1 dual-polarimetric (dual-pol) data time series based on the m/χ decomposition method. Then the feature combination is put into the proposed Omni-dimensional Dynamic Convolution Residual Segmentation Model (ODCRS Model) of high fitting speed and classification accuracy to realize the cropland extraction of the Mekong Delta region. Experiments show that the ODCRS model achieves an overall accuracy of 93.85%, a MIoU of 88.04%, and a MPA of 93.70%. The extraction results show that our method can effectively distinguish cropland from aquaculture areas and wetlands. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
16. Centroid Optimization of DNN Classification in DOA Estimation for UAV.
- Author
-
Wu, Long, Zhang, Zidan, Yang, Xu, Xu, Lu, Chen, Shuyu, Zhang, Yong, and Zhang, Jianlong
- Subjects
CONVOLUTIONAL neural networks ,ARTIFICIAL neural networks ,CENTROID ,DIRECTION of arrival estimation ,DEEP learning ,DRONE aircraft - Abstract
Classifications based on deep learning have been widely applied in the estimation of the direction of arrival (DOA) of signal. Due to the limited number of classes, the classification of DOA cannot satisfy the required prediction accuracy of signals from random azimuth in real applications. This paper presents a Centroid Optimization of deep neural network classification (CO-DNNC) to improve the estimation accuracy of DOA. CO-DNNC includes signal preprocessing, classification network, and Centroid Optimization. The DNN classification network adopts a convolutional neural network, including convolutional layers and fully connected layers. The Centroid Optimization takes the classified labels as the coordinates and calculates the azimuth of received signal according to the probabilities of the Softmax output. The experimental results show that CO-DNNC is capable of acquiring precise and accurate estimation of DOA, especially in the cases of low SNRs. In addition, CO-DNNC requires lower numbers of classes under the same condition of prediction accuracy and SNR, which reduces the complexity of the DNN network and saves training and processing time. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
17. A Physically Interpretable Rice Field Extraction Model for PolSAR Imagery.
- Author
-
Ge, Ji, Zhang, Hong, Xu, Lu, Sun, Chunling, Duan, Haoxuan, Guo, Zihuan, and Wang, Chao
- Subjects
PADDY fields ,SYNTHETIC aperture radar ,DEEP learning ,WEIGHT (Physics) ,PRODUCTION control ,PHASE coding - Abstract
Reliable and timely rice distribution information is of great value for real-time, quantitative, and localized control of rice production information. Synthetic aperture radar (SAR) has all-weather and all-day observation capability to monitor rice distribution in tropical and subtropical areas. To improve the physical interpretability and spatial interpretability of the deep learning model for SAR rice field extraction, a new SHapley Additive exPlanation (SHAP) value-guided explanation model (SGEM) for polarimetric SAR (PolSAR) data was proposed. First, a rice sample set was produced based on field survey and optical data, and the physical characteristics were extracted using decomposition of polarimetric scattering. Then a SHAP-based Physical Feature Interpretable Module (SPFIM) combing the long short-term memory (LSTM) model and SHAP values was designed to analyze the importance of physical characteristics, a credible physical interpretation associated with rice phenology was provided, and the weight of physical interpretation was combined with the weight of original PolSAR data. Moreover, a SHAP-guided spatial interpretation network (SSEN) was constructed to internalize the spatial interpretation values into the network layer to optimize the spatial refinement of the extraction results. Shanwei City, Guangdong Province, China, was chosen as the study area. The experimental results showed that the physical explanation provided by the proposed method had a high correlation with the rice phenology, and spatial self-interpretation for finer extraction results. The overall accuracy of the rice mapping results was 95.73%, and the kappa coefficient reached 0.9143. The proposed method has a high interpretability and practical value compared with other methods. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Built-Up Area Extraction from GF-3 SAR Data Based on a Dual-Attention Transformer Model.
- Author
-
Li, Tianyang, Wang, Chao, Wu, Fan, Zhang, Hong, Tian, Sirui, Fu, Qiaoyan, and Xu, Lu
- Subjects
DEEP learning ,TRANSFORMER models ,SYNTHETIC aperture radar ,URBAN research - Abstract
Built-up area (BA) extraction using synthetic aperture radar (SAR) data has emerged as a potential method in urban research. Currently, typical deep-learning-based BA extractors show high false-alarm rates in the layover areas and subsurface bedrock, which ignore the surrounding information and cannot be directly applied to large-scale BA mapping. To solve the above problems, a novel transformer-based BA extraction framework for SAR images is proposed. Inspired by SegFormer, we designed a BA extractor with multi-level dual-attention transformer encoders. First, the hybrid dilated convolution (HDC) patch-embedding module keeps the surrounding information of the input patches. Second, the channel self-attention module is designed for dual-attention transformer encoders and global modeling. The multi-level structure is employed to produce the coarse-to-fine semantic feature map of BAs. About 1100 scenes of Gaofen-3 (GF-3) data and 200 scenes of Sentinel-1 data were used in the experiment. Compared to UNet, PSPNet, and SegFormer, our model achieved an 85.35% mean intersection over union (mIoU) and 94.75% mean average precision (mAP) on the test set. The proposed framework achieved the best results in both mountainous and plain terrains. The experiments using Sentinel-1 shows that the proposed method has a good generalization ability with different SAR data sources. Finally, the BA map of China for 2020 was obtained with an overall accuracy of about 86%, which shows high consistency with the global urban footprint. The above experiments proved the effectiveness and robustness of the proposed framework in large-scale BA mapping. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
19. Identifying the kind behind SMILES—anatomical therapeutic chemical classification using structure-only representations.
- Author
-
Cao, Yi, Yang, Zhen-Qun, Zhang, Xu-Lu, Fan, Wenqi, Wang, Yaowei, Shen, Jiajun, Wei, Dong-Qing, Li, Qing, and Wei, Xiao-Yong
- Subjects
SMILING ,DRUG development ,CLASSIFICATION ,SOURCE code ,MOLECULAR structure ,PYRAMIDS - Abstract
Anatomical Therapeutic Chemical (ATC) classification for compounds/drugs plays an important role in drug development and basic research. However, previous methods depend on interactions extracted from STITCH dataset which may make it depend on lab experiments. We present a pilot study to explore the possibility of conducting the ATC prediction solely based on the molecular structures. The motivation is to eliminate the reliance on the costly lab experiments so that the characteristics of a drug can be pre-assessed for better decision-making and effort-saving before the actual development. To this end, we construct a new benchmark consisting of 4545 compounds which is with larger scale than the one used in previous study. A light-weight prediction model is proposed. The model is with better explainability in the sense that it is consists of a straightforward tokenization that extracts and embeds statistically and physicochemically meaningful tokens, and a deep network backed by a set of pyramid kernels to capture multi-resolution chemical structural characteristics. Its efficacy has been validated in the experiments where it outperforms the state-of-the-art methods by 15.53% in accuracy and by 69.66% in terms of efficiency. We make the benchmark dataset, source code and web server open to ease the reproduction of this study. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
20. Deblurring Ghost Imaging Reconstruction Based on Underwater Dataset Generated by Few-Shot Learning.
- Author
-
Yang, Xu, Yu, Zhongyang, Jiang, Pengfei, Xu, Lu, Hu, Jiemin, Wu, Long, Zou, Bo, Zhang, Yong, and Zhang, Jianlong
- Subjects
IMAGE reconstruction ,DEEP learning ,BACKSCATTERING ,PROBLEM solving - Abstract
Underwater ghost imaging based on deep learning can effectively reduce the influence of forward scattering and back scattering of water. With the help of data-driven methods, high-quality results can be reconstructed. However, the training of the underwater ghost imaging requires enormous paired underwater datasets, which are difficult to obtain directly. Although the Cycle-GAN method solves the problem to some extent, the blurring degree of the fuzzy class of the paired underwater datasets generated by Cycle-GAN is relatively unitary. To solve this problem, a few-shot underwater image generative network method is proposed. Utilizing the proposed few-shot learning image generative method, the generated paired underwater datasets are better than those obtained by the Cycle-GAN method, especially under the condition of few real underwater datasets. In addition, to reconstruct high-quality results, an underwater deblurring ghost imaging method is proposed. The reconstruction method consists of two parts: reconstruction and deblurring. The experimental and simulation results show that the proposed reconstruction method has better performance in deblurring at a low sampling rate, compared with existing underwater ghost imaging methods based on deep learning. The proposed reconstruction method can effectively increase the clarity degree of the underwater reconstruction target at a low sampling rate and promotes the further applications of underwater ghost imaging. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
21. Semi-supervised regression with manifold: A Bayesian deep kernel learning approach.
- Author
-
Xu, Lu, Hu, Chen, and Mei, Kuizhi
- Subjects
- *
DEEP learning , *TIKHONOV regularization , *SMOOTHNESS of functions , *MACHINE learning , *DATA distribution , *PROBABILISTIC generative models - Abstract
[Display omitted] • Tackle the under-studied semi-supervised image regression problem to relief the heavy workload of manually annotation for regression tasks. • Applying manifold smoothness to image regression tasks that transfers the SSL problem to kernel learning tasks. • A novel and efficient algorithm, ManiDKL, learns the regression parameters in a Bayesian manner. • Outperforms all existing semi-supervised methods for image regression tasks. Semi-supervised learning (SSL) aims at utilizing the vast unlabeled data to help the supervised training. While existing SSL methods have shown promising results on image classification tasks, most of them rely on the cluster assumption that does not apply to image regression tasks. In this paper, we address the under-studied semi-supervised image regression problem, of which the outputs are continuous values instead of categorical distributions. To tackle this challenging task, we propose an algorithm, called ManiDKL, with the idea that the prediction function should be smooth with respect to the intrinsic manifold of data distribution and behave similarly on both labeled and unlabeled data. In particular, we propose a framework that implements the Tikhonov regularization with generative manifold learning to ensure manifold smoothness of regression function and also reduces the problem to kernel learning. Then a semi-supervised non-parametric Bayesian based deep kernel learning algorithm is proposed, in which unlabeled data are incorporated through posterior regularization. We show the effectiveness of ManiDKL with extensive experiments. It shows that ManiDKL performs comparatively with state-of-the-art SSL image classification methods. Most importantly, we show the superiority of ManiDKL over all existing SSL regression methods on public image datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
22. Rice Mapping in a Subtropical Hilly Region Based on Sentinel-1 Time Series Feature Analysis and the Dual Branch BiLSTM Model.
- Author
-
Sun, Chunling, Zhang, Hong, Ge, Ji, Wang, Chao, Li, Liutong, and Xu, Lu
- Subjects
RICE ,TIME series analysis ,SYNTHETIC aperture radar ,SUSTAINABLE agriculture ,SUSTAINABLE development - Abstract
Timely and accurate information on rice cultivation makes important contributions to the profound reform of the global food and agricultural system, and promotes the development of global sustainable agriculture. With all-day and all-weather observing ability, synthetic aperture radar (SAR) can monitor the distribution of rice in tropical and subtropical areas. To solve the problem of misclassification of rice with no marked signal during the flooding period in subtropical hilly areas, this paper proposes a new feature combination and dual branch bi-directional long short-term memory (DB-BiLSTM) model to achieve high-precision rice mapping using Sentinel-1 time series data. Based on field investigation data, the backscatter time series curves of the rice area were analyzed, and a characteristic index (VV − VH)/(VV + VH) (VV: vertical emission and vertical receipt of polarization, VH: vertical emission and horizontal receipt of polarization) for small areas of hilly land was proposed to effectively distinguish rice and non-rice crops with no marked flooding period. The DB-BiLSTM model was designed, ensuring the independent learning of multiple features and effectively combining the time series information of both (VV − VH)/(VV + VH) and VH features. The city of Shanwei, Guangdong Province, China, was selected as the study area. Experimental results showed that the overall accuracy of the rice mapping results was 97.29%, and the kappa coefficient reached 0.9424. Compared to other methods, the rice mapping results obtained by the proposed method maintained good integrity and had less misclassification, which demonstrated the proposed method's practical value in accurate and effective rice mapping tasks. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. Deep Collocative Learning for Immunofixation Electrophoresis Image Analysis.
- Author
-
Wei, Xiao-Yong, Yang, Zhen-Qun, Zhang, Xu-Lu, Liao, Ga, Sheng, Ai-Lin, Zhou, S. Kevin, Wu, Yongkang, and Du, Liang
- Subjects
DEEP learning ,IMAGE analysis ,ELECTROPHORESIS ,MULTIPLE myeloma ,DIAGNOSIS - Abstract
Immunofixation Electrophoresis (IFE) analysis is of great importance to the diagnosis of Multiple Myeloma, which is among the top-9 cancer killers in the United States, but has rarely been studied in the context of deep learning. Two possible reasons are: 1) the recognition of IFE patterns is dependent on the co-location of bands that forms a binary relation, different from the unary relation (visual features to label) that deep learning is good at modeling; 2) deep classification models may perform with high accuracy for IFE recognition but is not able to provide firm evidence (where the co-location patterns are) for its predictions, rendering difficulty for technicians to validate the results. We propose to address these issues with collocative learning, in which a collocative tensor has been constructed to transform the binary relations into unary relations that are compatible with conventional deep networks, and a location-label-free method that utilizes the Grad-CAM saliency map for evidence backtracking has been proposed for accurate localization. In addition, we have proposed Coached Attention Gates that can regulate the inference of the learning to be more consistent with human logic and thus support the evidence backtracking. The experimental results show that the proposed method has obtained a performance gain over its base model ResNet18 by 741.30% in IoU and also outperformed popular deep networks of DenseNet, CBAM, and Inception-v3. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
24. Delineation of cultivated land parcels based on deep convolutional networks and geographical thematic scene division of remotely sensed images.
- Author
-
Xu, Lu, Ming, Dongping, Du, Tongyao, Chen, Yangyang, Dong, Dehui, and Zhou, Chenghu
- Subjects
- *
AGRICULTURAL remote sensing , *DEEP learning , *THEMATIC mapper satellite , *WAVELENGTH division multiplexing , *REMOTE sensing , *SPATIAL resolution - Abstract
• A stratified framework is proposed to extract information from large area of image. • The theoretical concept of geographical thematic scene is firstly presented. • A deep learning network is proposed to fast divide the geographical thematic scene. • A well-performed boundary delineation accuracy assessment method is proposed. Extraction of cultivated land information from high spatial resolution remote sensing images is increasingly becoming an important approach to digitization and informatization in modern agriculture. The continuous development of deep learning technology has made it possible to extract information of cultivated land parcels by an intelligent way. Aiming at fine extraction of cultivated land parcels within large areas, this article builds a framework of geographical thematic scene division according to the rule of territorial differentiation in geography. A deep learning semantic segmentation network, improved U-net with depthwise separable convolution (DSCUnet), is proposed to achieve the division of the whole image. Then, an extended multichannel richer convolutional features (RCF) network is involved to delineate the boundaries of cultivated land parcels from agricultural functional scenes obtained by the former step. In order to testify the feasibility and effectiveness of the proposed methods, this article implemented experiments using Gaofen-2 images with different spatial resolution. The results show an outstanding performance using methods proposed in this article in both dividing agricultural functional scenes and delineating cultivated land parcels compared with other commonly used methods. Meanwhile, the extraction results have the highest accuracy in both the traditional evaluation indices (like Precision, Recall, F 1 , and IoU) and geometric boundary precision of cultivated land parcels. The methods in this article can provide a feasible solution to the problem of finely extracting cultivated land parcels information within large areas and complex landscape conditions in practical applications. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
25. Paddy Rice Mapping in Thailand Using Time-Series Sentinel-1 Data and Deep Learning Model.
- Author
-
Xu, Lu, Zhang, Hong, Wang, Chao, Wei, Sisi, Zhang, Bo, Wu, Fan, and Tang, Yixian
- Subjects
- *
DEEP learning , *PADDY fields , *PROBLEM solving , *FOOD supply , *FEATURE selection , *RANDOM fields - Abstract
The elimination of hunger is the top concern for developing countries and is the key to maintain national stability and security. Paddy rice occupies an essential status in food supply, whose accurate monitoring is of great importance for human sustainable development. As one of the most important paddy rice production countries in the world, Thailand has a favorable hot and humid climate for paddy rice growing, but the growth patterns of paddy rice are too complicated to construct promising growth models for paddy rice discrimination. To solve this problem, this study proposes a large-scale paddy rice mapping scheme, which uses time-series Sentinel-1 data to generate a convincing annual paddy rice map of Thailand. The proposed method extracts temporal statistical features of the time-series SAR images to overcome the intra-class variability due to different management practices and modifies the U-Net model with the fully connected Conditional Random Field (CRF) to maintain the edge of the fields. In this study, 758 Sentinel-1 images that covered the whole country from the end of 2018 to 2019 were acquired to generate the annual paddy rice map. The accuracy, precision, and recall of the resultant paddy rice map reached 91%, 87%, and 95%, respectively. Compared to SVM classifier and the U-Net model based on feature selection strategy (FS-U-Net), the proposed scheme achieved the best overall performance, which demonstrated the capability of overcoming the complex cultivation conditions and accurately identifying the fragmented paddy rice fields in Thailand. This study provides a promising tool for large-scale paddy rice monitoring in tropical production regions and has great potential in the global sustainable development of food and environment management. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
26. Rice Mapping Using a BiLSTM-Attention Model from Multitemporal Sentinel-1 Data.
- Author
-
Sun, Chunling, Zhang, Hong, Xu, Lu, Wang, Chao, and Li, Liutong
- Subjects
SYNTHETIC aperture radar ,RICE ,DEEP learning ,LAND cover ,FOOD production - Abstract
Timely and accurate rice distribution information is needed to ensure the sustainable development of food production and food security. With its unique advantages, synthetic aperture radar (SAR) can monitor the rice distribution in tropical and subtropical areas under any type of weather condition. This study proposes an accurate rice extraction and mapping framework that can solve the issues of low sample production efficiency and fragmented rice plots when prior information on rice distribution is insufficient. The experiment was carried out using multitemporal Sentinel-1A Data in Zhanjiang, China. First, the temporal characteristic map was used for the visualization of rice distribution to improve the efficiency of rice sample production. Second, rice classification was carried out based on the BiLSTM-Attention model, which focuses on learning the key information of rice and non-rice in the backscattering coefficient curve and gives different types of attention to rice and non-rice features. Finally, the rice classification results were optimized based on the high-precision global land cover classification map. The experimental results showed that the classification accuracy of the proposed framework on the test dataset was 0.9351, the kappa coefficient was 0.8703, and the extracted plots maintained good integrity. Compared with the statistical data, the consistency reached 94.6%. Therefore, the framework proposed in this study can be used to extract rice distribution information accurately and efficiently. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
27. Machine vision and novel attention mechanism TCN for enhanced prediction of future deposition height in directed energy deposition.
- Author
-
Yu, Miao, Zhu, Lida, Ning, Jinsheng, Yang, Zhichao, Jiang, Zongze, Xu, Lu, Wang, Yiqi, Meng, Guiru, and Huang, Yiming
- Subjects
- *
COMPUTER vision , *PROCESS capability , *DEEP learning , *PHENOMENOLOGICAL theory (Physics) , *LASER weapons , *OPTICAL scanners - Abstract
[Display omitted] • A novel machine vision method for real-time monitoring of deposition height in noise conditions is proposed. • A novel self-attention temporal convolutional network (SA-TCN) is proposed for predicting future deposition height. • Quantifying complex physical phenomena with specific data has significantly contributed to enhancing the prediction accuracy of a deep learning model. Laser Directed Energy Deposition (L-DED) has garnered significant attention due to its high flexibility and rapid processing capabilities. However, complex physical flow fields and drastic temperature variations are present during L-DED processing, leading to variations in deposition height at different layers and positions under the same processing parameters. Therefore, real-time monitoring of deposition height and timely knowledge of future deposition height are crucial for controlling geometries and arranging processing time effectively. To address this issue, a machine vision method for real-time monitoring of deposition height in noisy environments is proposed, demonstrating a remarkable similarity of 99.22% compared to values measured by a laser scanner. Addressing the complex physical phenomena during processing, specific data quantification was performed. A novel self-attention temporal convolutional network (SA-TCN) was then introduced as a data-driven model to replace physical models for predicting future deposition height, achieving an impressive accuracy of 99.71%. Experiments show that quantifying different physical phenomena with specific data to some extent improves the model prediction accuracy, providing significant support for future deposition height prediction and processing time control of parts in actual production. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. S2O-FSPI: Fourier single pixel imaging via sampling strategy optimization.
- Author
-
Yang, Xu, Jiang, Xinding, Jiang, Pengfei, Xu, Lu, Wu, Long, Hu, Jiemin, Zhang, Yong, Zhang, Jianlong, and Zou, Bo
- Subjects
- *
GEOMETRIC topology , *IMAGE reconstruction algorithms , *IMAGE reconstruction , *PIXELS , *DIAGNOSTIC imaging , *PROBLEM solving - Abstract
• An efficient sampling strategy optimization method with high-quality FSPI reconstruction is implemented. • Numerical Simulations and experiments proved the effectiveness, which has advantages over deterministic sampling strategies. • The geometric topologies of the optimal sampling strategy for different categories of datasets are analyzed. • The geometric topologies of the optimal sampling strategy for different categories of datasets are summarized. • It provides subsequent for sampling strategy choice of Fourier single pixel imaging. It is necessary to artificially set the sampling strategy of Fourier single-pixel imaging (FSPI) to obtain more useful information of the image under the assumption that the image information is mainly concentrated in the low-frequency part of the spatial frequency domain. However, due to the empiricism sampling strategy, it is easy to sample the spatial frequencies with low contribution to image reconstruction, resulting in a waste of sampling resources. To solve this problem, a one-stage FSPI reconstruction network with sampling strategy optimization is designed to obtain the efficient sampling strategy and improve the reconstruction quality. The proposed one-stage FSPI reconstruction network contains the sampling strategy optimization module and reconstruction module, which are jointly trained to obtain high-quality reconstructed results and efficient sampling strategy simultaneously. Experiments and numerical simulations demonstrate the effectiveness of the proposed method. Moreover, the geometric topology of the optimal sampling strategy for FSPI reconstruction trained on various categories of datasets is summarized. It is found that the dataset with similar categories have the similar geometric topology of the optimal sampling strategy, which provides support for sampling strategy selection in subsequent Fourier single-pixel imaging studies. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.