3,622 results
Search Results
2. Advanced Machine Learning and Deep Learning Approaches for Remote Sensing II.
- Author
-
Jeon, Gwanggil
- Subjects
REMOTE sensing ,MACHINE learning ,ARTIFICIAL neural networks ,DEEP learning ,ARTIFICIAL intelligence ,DISTANCE education - Abstract
This document is a summary of a special issue on advanced machine learning and deep learning techniques for remote sensing. The issue includes 16 research papers that cover a range of topics, including hyperspectral image classification, moving point target detection, radar echo extrapolation, and remote sensing object detection. Each paper introduces a novel approach or model and provides extensive testing and evaluation to demonstrate its effectiveness. The insights shared in this special issue are expected to contribute to future advancements in artificial intelligence-based remote sensing research. [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF
3. Remote Sensing of Forests in Bavaria: A Review.
- Author
-
Coleman, Kjirsten, Müller, Jörg, and Kuenzer, Claudia
- Subjects
REMOTE sensing ,BARK beetles ,FOREST monitoring ,FOREST management ,FOREST reserves ,SPACE-based radar ,PLANT phenology ,DROUGHTS - Abstract
In recent decades, climatic pressures have altered the forested landscape of Bavaria. Widespread loss of trees has unevenly impacted the entire state, of which 37% is covered by forests (5% more than the national average). In 2018 and 2019—due in large part to drought and subsequent insect infestations—more tree-covered areas were lost in Bavaria than in any other German state. Moreover, the annual crown condition survey of Bavaria has revealed a decreasing trend in tree vitality since 1998. We conducted a systematic literature review regarding the remote sensing of forests in Bavaria. In total, 146 scientific articles were published between 2008 and 2023. While 88 studies took place in the Bavarian Forest National Park, only five publications covered the whole of Bavaria. Outside of the national park, the remaining 2.5 million hectares of forest in Bavaria are understudied. The most commonly studied topics were related to bark beetle infestations (24 papers); however, few papers focused on the drivers of infestations. The majority of studies utilized airborne data, while publications utilizing spaceborne data focused on multispectral; other data types were under-utilized- particularly thermal, lidar, and hyperspectral. We recommend future studies to both spatially broaden investigations to the state or national scale and to increase temporal data acquisitions together with contemporaneous in situ data. Especially in understudied topics regarding forest response to climate, catastrophic disturbances, regrowth and species composition, phenological timing, and in the sector of forest management. The utilization of remote sensing data in the forestry sector and the uptake of scientific results among stakeholders remains a challenge compared to other heavily forested European countries. An integral part of the Bavarian economy and the tourism sector, forests are also vital for climate regulation via atmospheric carbon reduction and land surface cooling. Therefore, forest monitoring remains centrally important to attaining more resilient and productive forests. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. A Complex Background SAR Ship Target Detection Method Based on Fusion Tensor and Cross-Domain Adversarial Learning.
- Author
-
Chan, Haopeng, Qiu, Xiaolan, Gao, Xin, and Lu, Dongdong
- Subjects
SYNTHETIC aperture radar ,RADAR targets ,LEARNING modules ,GENERALIZATION ,SHIPS - Abstract
Synthetic Aperture Radar (SAR) ship target detection has been extensively researched. However, most methods use the same dataset division for both training and validation. In practical applications, it is often necessary to quickly adapt to new loads, new modes, and new data to detect targets effectively. This presents a cross-domain detection problem that requires further study. This paper proposes a method for detecting SAR ships in complex backgrounds using fusion tensor and cross-domain adversarial learning. The method is designed to address the cross-domain detection problem of SAR ships with large differences between the training and test sets. Specifically, it can be used for the cross-domain detection task from the fully polarised medium-resolution ship dataset (source domain) to the high-resolution single-polarised dataset (target domain). This method proposes a channel fusion module (CFM) based on the YOLOV5s model. The CFM utilises the correlation between polarised channel images during training to enrich the feature information of single-polarised images extracted by the model during inference. This article proposes a module called the cross-domain adversarial learning module (CALM) to reduce overfitting and achieve adaptation between domains. Additionally, this paper introduces the anti-interference head (AIH) which decouples the detection head to reduce the conflict of classification and localisation problems. This improves the anti-interference and generalisation ability in complex backgrounds. This paper conducts cross-domain experiments using the constructed medium-resolution SAR full polarisation dataset (SFPD) as the source domain and the high-resolution single-polarised ship detection dataset (HRSID) as the target domain. Compared to the best-performing YOLOV8s model among typical mainstream models, this model improves precision by 4.9%, recall by 3.3%, AP by 2.4%, and F1 by 3.9%. This verifies the effectiveness of the method and provides a useful reference for improving cross-domain learning and model generalisation capability in the field of target detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Remote Sensing and Landsystems in the Mountain Domain: FAIR Data Accessibility and Landform Identification in the Digital Earth.
- Author
-
Whalley, W. Brian
- Subjects
GLACIAL melting ,LANDFORMS ,REMOTE-sensing images ,ROCK concerts ,REMOTE sensing ,ROCK glaciers - Abstract
Satellite imagery has become a major source for identifying and mapping terrestrial and planetary landforms. However, interpretating landforms and their significance, especially in changing environments, may still be questionable. Consequently, ground truth to check training models, especially in mountainous areas, can be problematic. This paper outlines a decimal format, [dLL], for latitude and longitude geolocation that can be used for model interpretation and validation and in data sets. As data have positions in space and time, [dLL] defined points, as for images, can be associated with metadata as nodes. Together with vertices, metadata nodes help build 'information surfaces' as part of the Digital Earth. This paper examines aspects of the Critical Zone and data integration via the FAIR data principles, data that are; findable, accessible, interoperable and re-usable. Mapping and making inventories of rock glacier landforms are examined in the context of their geomorphic and environmental significance and the need for geolocated ground truth. Terrestrial examination of rock glaciers shows them to be predominantly glacier-derived landforms and not indicators of permafrost. Remote-sensing technologies used to track developing rock glacier surface features show them to be climatically melting glaciers beneath rock debris covers. Distinguishing between glaciers, debris-covered glaciers and rock glaciers over time is a challenge for new remote sensing satellites and technologies and shows the necessity for a common geolocation format to report many Earth surface features. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Efficient Path Planning Algorithm Based on Laser SLAM and an Optimized Visibility Graph for Robots.
- Author
-
Hu, Yunjie, Xie, Fei, Yang, Jiquan, Zhao, Jing, Mao, Qi, Zhao, Fei, and Liu, Xixiang
- Subjects
MOBILE robots ,GRIDS (Cartography) ,POINT cloud ,SEARCH algorithms ,SCHEDULING ,POTENTIAL field method (Robotics) - Abstract
Mobile robots' efficient path planning has long been a challenging task due to the complexity and dynamism of environments. If an occupancy grid map is used in path planning, the number of grids is determined by grid resolution and the size of the actual environment. Excessively high resolution increases the number of traversed grid nodes and thus prolongs path planning time. To address this challenge, this paper proposes an efficient path planning algorithm based on laser SLAM and an optimized visibility graph for mobile robots, which achieves faster computation of the shortest path using the optimized visibility graph. Firstly, the laser SLAM algorithm is used to acquire the undistorted LiDAR point cloud data, which are converted into a visibility graph. Secondly, a bidirectional A* path search algorithm is combined with the Minimal Construct algorithm, enabling the robot to only compute heuristic paths to the target node during path planning in order to reduce search time. Thirdly, a filtering method based on edge length and the number of vertices of obstacles is proposed to reduce redundant vertices and edges in the visibility graph. Additionally, the bidirectional A* search method is implemented for pathfinding in the efficient path planning algorithm proposed in this paper to reduce unnecessary space searches. Finally, simulation and field tests are conducted to validate the algorithm and compare its performance with classic algorithms. The test results indicate that the method proposed in this paper exhibits superior performance in terms of path search time, navigation time, and distance compared to D* Lite, FAR, and FPS algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Sugarcane Yield Estimation Using Satellite Remote Sensing Data in Empirical or Mechanistic Modeling: A Systematic Review.
- Author
-
de França e Silva, Nildson Rodrigues, Chaves, Michel Eustáquio Dantas, Luciano, Ana Cláudia dos Santos, Sanches, Ieda Del'Arco, de Almeida, Cláudia Maria, and Adami, Marcos
- Subjects
REMOTE sensing ,SCIENCE databases ,SUGARCANE ,SUGARCANE growing ,DECISION making ,SUPPLY chains ,TEXT mining - Abstract
The sugarcane crop has great socioeconomic relevance because of its use in the production of sugar, bioelectricity, and ethanol. Mainly cultivated in tropical and subtropical countries, such as Brazil, India, and China, this crop presented a global harvested area of 17.4 million hectares (Mha) in 2021. Thus, decision making in this activity needs reliable information. Obtaining accurate sugarcane yield estimates is challenging, and in this sense, it is important to reduce uncertainties. Currently, it can be estimated by empirical or mechanistic approaches. However, the model's peculiarities vary according to the availability of data and the spatial scale. Here, we present a systematic review to discuss state-of-the-art sugarcane yield estimation approaches using remote sensing and crop simulation models. We consulted 1398 papers, and we focused on 72 of them, published between January 2017 and June 2023 in the main scientific databases (e.g., AGORA-FAO, Google Scholar, Nature, MDPI, among others), using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) methodology. We observed how the models vary in space and time, presenting the potential, challenges, limitations, and outlooks for enhancing decision making in the sugarcane crop supply chain. We concluded that remote sensing data assimilation both in mechanistic and empirical models is promising and will be enhanced in the coming years, due to the increasing availability of free Earth observation data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. 3D Reconstruction of Ancient Buildings Using UAV Images and Neural Radiation Field with Depth Supervision.
- Author
-
Ge, Yingwei, Guo, Bingxuan, Zha, Peishuai, Jiang, San, Jiang, Ziyu, and Li, Demin
- Subjects
BUILDING repair ,RADIATION ,SIGNAL-to-noise ratio ,POINT cloud ,DATA visualization - Abstract
The 3D reconstruction of ancient buildings through inclined photogrammetry finds a wide range of applications in surveying, visualization and heritage conservation. Unlike indoor objects, reconstructing ancient buildings presents unique challenges, including the slow speed of 3D reconstruction using traditional methods, the complex textures of ancient structures and geometric issues caused by repeated textures. Additionally, there is a hash conflict problem when rendering outdoor scenes using neural radiation fields. To address these challenges, this paper proposes a 3D reconstruction method based on depth-supervised neural radiation fields. To enhance the representation of the geometric neural network, the addition of a truncated signed distance function (TSDF) supplements the existing signed distance function (SDF). Furthermore, the neural network's training is supervised using depth information, leading to improved geometric accuracy in the reconstruction model through depth data obtained from sparse point clouds. This study also introduces a progressive training strategy to mitigate hash conflicts, allowing the hash table to express important details more effectively while reducing feature overlap. The experimental results demonstrate that our method, under the same number of iterations, produces images with clearer structural details, resulting in an average 15% increase in the Peak Signal-to-Noise Ratio (PSNR) value and a 10% increase in the Structural Similarity Index Measure (SSIM) value. Moreover, our reconstruction model produces higher-quality surface models, enabling the fast and highly geometrically accurate 3D reconstruction of ancient buildings. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Active Wildland Fires in Central Chile and Local Winds (Puelche).
- Author
-
Hayasaka, Hiroshi
- Subjects
METEOROLOGICAL charts ,FIRE weather ,METEOROLOGICAL stations ,JET streams ,WEATHER - Abstract
Central Chile (CC, latitudes 32–40°S) experienced very active fires in 2017 and 2023. These fires burned large areas and killed many people. These unprecedented fires for CC presented a need for more defined fire weather conditions on the synoptic scale. In this paper, fire weather conditions were analyzed using various satellite-derived fire data (hotspots, HSs), wind streamlines, distribution maps of wind flow and temperature, and various synoptic-scale weather maps. Results showed that local winds, known as Puelche, blew on the peak fire days (26 January 2017 and 3 February 2023). The number of HSs on these days was 2676 and 2746, respectively, about 90 times the average (30). The occurrence of Puelche winds was confirmed by streamlines from high-pressure systems offshore of Argentina to the study area in CC. The formation of strong winds and high-temperature areas associated with Puelche winds were identified on the Earth survey satellite maps. Strong winds of about 38 km h
−1 and high temperatures above 32 °C with low relative humidity below 33% were actually observed at the weather station near the fire-prone areas. Lastly, some indications for Puelche winds outbreaks are summarized. This paper's results will be used to prevent future active fire occurrences in the CC. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
10. Multiple-Band Electric Field Response to the Geomagnetic Storm on 4 November 2021.
- Author
-
Zheng, Jie, Huang, Jianping, Li, Zhong, Li, Wenjing, Han, Ying, Lu, Hengxin, and Zhima, Zeren
- Subjects
MAGNETIC storms ,GEOMAGNETISM ,ELECTRIC fields ,SPACE vehicles - Abstract
This paper investigates the impact characteristics of the 4 November 2021 magnetic storm across different frequency bands based on the electric field data (EFD) from the China Seismo-Electromagnetic Satellite (CSES), categorized into four frequency bands: ULF (Ultra-Low-Frequency, DC to 16 Hz), ELF (Extremely Low-Frequency, 6 Hz to 2.2 kHz), VLF (Very Low-Frequency, 1.8 to 20 kHz), and HF (High-Frequency, 18 kHz to 3.5 MHz). The study reveals that in the ULF band, magnetic storm-induced electric field disturbances are primarily in the range of 0 to 5 Hz, with a significant disturbance frequency at 3.9 ± 1.0 Hz. Magnetic storms also enhance Schumann waves in the ULF band, with 8 Hz Schumann waves dominating in the southern hemisphere and 13 Hz Schumann waves dominating in the northern hemisphere. In the ELF band, the more pronounced anomalies occur at 300 Hz–900 Hz and above 1.8 kHz, with the 300 Hz–900 Hz band anomalies around 780 Hz being the most significant. In the VLF band, the electric field anomalies are mainly concentrated in the 3–15 kHz range. The ELF and VLF bands exhibit lower absolute and relative disturbance increments compared to the ULF band, with the relative perturbation growth rate in the ULF band being approximately 10% higher than in the ELF and VLF bands. Magnetic storm-induced electric field disturbances predominantly occur in the ULF, ELF, and VLF bands, with the most significant disturbances in the ULF band. The electric field perturbations in these three frequency bands exhibit hemispheric asymmetry, with strong perturbations in the northern hemisphere occurring earlier than in the southern hemisphere, corresponding to different Dst minima. No electric field disturbances were observed in the HF band (above 18 kHz). The conclusions of this paper are highly significant for future anti-jamming designs in spacecraft and communication equipment, as well as for the further study of magnetic storms. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. Combining KAN with CNN: KonvNeXt's Performance in Remote Sensing and Patent Insights.
- Author
-
Cheon, Minjong and Mun, Changbae
- Subjects
REMOTE sensing ,MACHINE learning ,DEEP learning ,WEATHER forecasting ,DISTANCE education - Abstract
Rapid advancements in satellite technology have led to a significant increase in high-resolution remote sensing (RS) images, necessitating the use of advanced processing methods. Additionally, patent analysis revealed a substantial increase in deep learning and machine learning applications in remote sensing, highlighting the growing importance of these technologies. Therefore, this paper introduces the Kolmogorov-Arnold Network (KAN) model to remote sensing to enhance efficiency and performance in RS applications. We conducted several experiments to validate KAN's applicability, starting with the EuroSAT dataset, where we combined the KAN layer with multiple pre-trained CNN models. Optimal performance was achieved using ConvNeXt, leading to the development of the KonvNeXt model. KonvNeXt was evaluated on the Optimal-31, AID, and Merced datasets for validation and achieved accuracies of 90.59%, 94.1%, and 98.1%, respectively. The model also showed fast processing speed, with the Optimal-31 and Merced datasets completed in 107.63 s each, while the bigger and more complicated AID dataset took 545.91 s. This result is meaningful since it achieved faster speeds and comparable accuracy compared to the existing study, which utilized VIT and proved KonvNeXt's applicability for remote sensing classification tasks. Furthermore, we investigated the model's interpretability by utilizing Occlusion Sensitivity, and by displaying the influential regions, we validated its potential use in a variety of domains, including medical imaging and weather forecasting. This paper is meaningful in that it is the first to use KAN in remote sensing classification, proving its adaptability and efficiency. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Compensating Acquisition Footprint for Amplitude-Preserving Angle Domain Common Image Gathers Based on 3D Reverse Time Migration.
- Author
-
Liu, Hongwei, Fu, Liyun, Li, Qingqing, and Liu, Lu
- Subjects
LINEAR velocity ,SEISMIC prospecting ,ANGLES ,LIGHTING ,VELOCITY - Abstract
Angle domain common image gathers (ADCIGs) play a crucial role in seismic exploration, offering prestack underground illumination information that aids in validating migration velocity and conducting prestack amplitude versus angle (AVA) analysis for reservoir characterization. This paper introduces an innovative approach for compensating amplitude errors caused by irregular seismic acquisition geometries in ADCIGs. By incorporating an angle domain illumination compensation factor, the proposed method effectively modifies these errors, preserving the amplitude of seismic reflectivity in the prestack angle domain. The effectiveness of the proposed approach is validated through comprehensive tests conducted on synthetic and field data examples. The results demonstrate the capability of the method to enhance the quality of ADCIGs derived from 3D reverse time migration (RTM), yielding accurate and reliable amplitude preservation. While the illumination compensation factor assumes a vertically linear velocity model, the method holds promise for extension to more complex media and diverse migration techniques. This suggests its applicability and adaptability beyond the specific assumptions considered in this study. In conclusion, this paper presents an innovative angle domain illumination compensation factor that significantly improves the quality of ADCIGs by addressing amplitude errors arising from irregular seismic acquisition geometries. The experimental validation using synthetic and field data confirms the effectiveness of the proposed method within the context of 3D RTM. Furthermore, the technique holds potential for broader application in more complex subsurface scenarios and various migration methodologies. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. Intelligent Tracking Method for Aerial Maneuvering Target Based on Unscented Kalman Filter.
- Author
-
Dong, Yunlong, Li, Weiqi, Li, Dongxue, Liu, Chao, and Xue, Wei
- Subjects
TRACKING algorithms ,DESIGN techniques ,DYNAMIC models ,REGRESSION analysis ,RECURRENT neural networks - Abstract
This paper constructs a nonlinear iterative filtering framework based on a neural network prediction model. It uses recurrent neural networks (RNNs) to achieve accurate regression of complex maneuvering target dynamic models and integrates them into the nonlinear iterative filtering system via Unscented Transformation (UT). In constructing the neural network prediction model, the Temporal Convolutional Network (TCN) modules that capture long-term dependencies and the Long Short-Term Memory (LSTM) modules that selectively forget non-essential information were utilized to achieve accurate regression of the maneuvering models. When embedding the neural network prediction model, this paper proposes a method for extracting Sigma points using the UT transformation by 'unfolding' multi-sequence vectors and explores design techniques for the time sliding window length of recurrent neural networks. Ultimately, an intelligent tracking algorithm based on unscented filtering, called TCN-LSTM-UKF, was developed, effectively addressing the difficulties of constructing models and transition delays under high-maneuvering conditions and significantly improving the tracking performance of highly maneuvering targets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Soil Organic Carbon Estimation via Remote Sensing and Machine Learning Techniques: Global Topic Modeling and Research Trend Exploration.
- Author
-
Li, Tong, Cui, Lizhen, Wu, Yu, McLaren, Timothy I., Xia, Anquan, Pandey, Rajiv, Liu, Hongdou, Wang, Weijin, Xu, Zhihong, Song, Xiufang, Dalal, Ram C., and Dang, Yash P.
- Subjects
NATURAL language processing ,CARBON cycle ,SYNTHETIC aperture radar ,AGRICULTURE ,CARBON sequestration ,SYNTHETIC apertures - Abstract
Understanding and monitoring soil organic carbon (SOC) stocks is crucial for ecosystem carbon cycling, services, and addressing global environmental challenges. This study employs the BERTopic model and bibliometric trend analysis exploration to comprehensively analyze global SOC estimates. BERTopic, a topic modeling technique based on BERT (bidirectional encoder representatives from transformers), integrates recent advances in natural language processing. The research analyzed 1761 papers on SOC and remote sensing (RS), in addition to 490 related papers on machine learning (ML) techniques. BERTopic modeling identified nine research themes for SOC estimation using RS, emphasizing spectral prediction models, carbon cycle dynamics, and agricultural impacts on SOC. In contrast, for the literature on RS and ML it identified five thematic clusters: spatial forestry analysis, hyperspectral soil analysis, agricultural deep learning, the multitemporal imaging of farmland SOC, and RS platforms (Sentinel-2 and synthetic aperture radar, SAR). From 1991 to 2023, research on SOC estimation using RS and ML has evolved from basic mapping to topics like carbon sequestration and modeling with Sentinel-2A and big data. In summary, this study traces the historical growth and thematic evolution of SOC research, identifying synergies between RS and ML and focusing on SOC estimation with advanced ML techniques. These findings are critical to global ecosystem SOC assessments and environmental policy formulation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
15. SPNet: Dual-Branch Network with Spatial Supplementary Information for Building and Water Segmentation of Remote Sensing Images.
- Author
-
Zhao, Wenyu, Xia, Min, Weng, Liguo, Hu, Kai, Lin, Haifeng, Zhang, Youke, and Liu, Ziheng
- Subjects
REMOTE sensing ,FEATURE extraction ,URBAN planning ,LAND use planning ,GOAL (Psychology) - Abstract
Semantic segmentation is primarily employed to generate accurate prediction labels for each pixel of the input image, and then classify the images according to the generated labels. Semantic segmentation of building and water in remote sensing images helps us to conduct reasonable land planning for a city. However, many current mature networks face challenges in simultaneously attending to both contextual and spatial information when performing semantic segmentation on remote sensing imagery. This often leads to misclassifications and omissions. Therefore, this paper proposes a Dual-Branch Network with Spatial Supplementary Information (SPNet) to address the aforementioned issues. We introduce a Context-aware Spatial Feature-Extractor Unit (CSF) to extract contextual and spatial information, followed by the Feature-Interaction Module (FIM) to supplement contextual semantic information with spatial details. Additionally, incorporating the Goal-Oriented Attention Mechanism helps in handling noise. Finally, to obtain more detailed branches, a Multichannel Deep Feature-Extraction Module (MFM) is introduced to extract features from shallow-level network layers. This branch guides the fusion of low-level semantic information with high-level semantic information. Experiments were conducted on building and water datasets, respectively. The results indicate that the segmentation accuracy of the model proposed in this paper surpasses that of other existing mature models. On the building dataset, the mIoU reaches 87.57, while on the water dataset, the mIoU achieves 96.8, which means that the model introduced in this paper demonstrates strong generalization capabilities. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Continuous Wavelet Transform Peak-Seeking Attention Mechanism Conventional Neural Network: A Lightweight Feature Extraction Network with Attention Mechanism Based on the Continuous Wave Transform Peak-Seeking Method for Aero-Engine Hot Jet Fourier Transform Infrared Classification
- Author
-
Du, Shuhan, Han, Wei, Kang, Zhenping, Lu, Xiangning, Liao, Yurong, and Li, Zhaoming
- Subjects
FOURIER transform spectrometers ,WAVELET transforms ,FEATURE extraction ,DEEP learning ,WAVENUMBER ,RUNNING speed - Abstract
Focusing on the problem of identifying and classifying aero-engine models, this paper measures the infrared spectrum data of aero-engine hot jets using a telemetry Fourier transform infrared spectrometer. Simultaneously, infrared spectral data sets with the six different types of aero-engines were created. For the purpose of classifying and identifying infrared spectral data, a CNN architecture based on the continuous wavelet transform peak-seeking attention mechanism (CWT-AM-CNN) is suggested. This method calculates the peak value of middle wave band by continuous wavelet transform, and the peak data are extracted by the statistics of the wave number locations with high frequency. The attention mechanism was used for the peak data, and the attention mechanism was weighted to the feature map of the feature extraction block. The training set, validation set and prediction set were divided in the ratio of 8:1:1 for the infrared spectral data sets. For three different data sets, the CWT-AM-CNN proposed in this paper was compared with the classical classifier algorithm based on CO
2 feature vector and the popular AE, RNN and LSTM spectral processing networks. The prediction accuracy of the proposed algorithm in the three data sets was as high as 97%, and the lightweight network structure design not only guarantees high precision, but also has a fast running speed, which can realize the rapid and high-precision classification of the infrared spectral data of the aero-engine hot jets. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
17. Radiation Anomaly Detection of Sub-Band Optical Remote Sensing Images Based on Multiscale Deep Dynamic Fusion and Adaptive Optimization.
- Author
-
Ci, Jinlong, Tan, Hai, Zhai, Haoran, and Tang, Xinming
- Subjects
TRANSFORMER models ,IMAGE sensors ,REMOTE sensing ,RECORDS management ,DATA transmission systems - Abstract
Radiation anomalies in optical remote sensing images frequently occur due to electronic issues within the image sensor or data transmission errors. These radiation anomalies can be categorized into several types, including CCD, StripeNoise, RandomCode1, RandomCode2, ImageMissing, and Tap. To ensure the retention of image data with minimal radiation issues as much as possible, this paper adopts a self-made radiation dataset and proposes a FlexVisionNet-YOLO network to detect radiation anomalies more accurately. Firstly, RepViT is used as the backbone network with a vision transformer architecture to better capture global and local features. Its multiscale feature fusion mechanism efficiently handles targets of different sizes and shapes, enhancing the detection ability for radiation anomalies. Secondly, a feature depth fusion network is proposed in the Feature Fusion part, which significantly improves the flexibility and accuracy of feature fusion and thus enhances the detection and classification performance of complex remote sensing images. Finally, Inner-CIoU is used in the Head part for edge regression, which significantly improves the localization accuracy by finely adjusting the target edges; Slide-Loss is used for classification loss, which enhances the classification robustness by dynamically adjusting the category probabilities and markedly improves the classification accuracy, especially in the sample imbalance dataset. Experimental results show that, compared to YOLOv8, the proposed FlexVisionNet-YOLO method improves precision, recall, mAP0.5, and mAP0.5:0.9 by 3.5%, 7.1%, 4.4%, and 13.6%, respectively. Its effectiveness in detecting radiation anomalies surpasses that of other models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Leveraging Visual Language Model and Generative Diffusion Model for Zero-Shot SAR Target Recognition.
- Author
-
Wang, Junyu, Sun, Hao, Tang, Tao, Sun, Yuli, He, Qishan, Lei, Lin, and Ji, Kefeng
- Subjects
LANGUAGE models ,OPTICAL remote sensing ,COMPUTATIONAL electromagnetics ,KNOWLEDGE base ,SYNTHETIC aperture radar ,PRIOR learning ,IMAGE recognition (Computer vision) - Abstract
Simulated data play an important role in SAR target recognition, particularly under zero-shot learning (ZSL) conditions caused by the lack of training samples. The traditional SAR simulation method is based on manually constructing target 3D models for electromagnetic simulation, which is costly and limited by the target's prior knowledge base. Also, the unavoidable discrepancy between simulated SAR and measured SAR makes the traditional simulation method more limited for target recognition. This paper proposes an innovative SAR simulation method based on a visual language model and generative diffusion model by extracting target semantic information from optical remote sensing images and transforming it into a 3D model for SAR simulation to address the challenge of SAR target recognition under ZSL conditions. Additionally, to reduce the domain shift between the simulated domain and the measured domain, we propose a domain adaptation method based on dynamic weight domain loss and classification loss. The effectiveness of semantic information-based 3D models has been validated on the MSTAR dataset and the feasibility of the proposed framework has been validated on the self-built civilian vehicle dataset. The experimental results demonstrate that the first proposed SAR simulation method based on a visual language model and generative diffusion model can effectively improve target recognition performance under ZSL conditions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. A Multi-Level Cross-Attention Image Registration Method for Visible and Infrared Small Unmanned Aerial Vehicle Targets via Image Style Transfer.
- Author
-
Jiang, Wen, Pan, Hanxin, Wang, Yanping, Li, Yang, Lin, Yun, and Bi, Fukun
- Subjects
IMAGE fusion ,INFRARED imaging ,TRACKING algorithms ,VISIBLE spectra ,DEEP learning ,IMAGE registration ,PIXELS - Abstract
Small UAV target detection and tracking based on cross-modality image fusion have gained widespread attention. Due to the limited feature information available from small UAVs in images, where they occupy a minimal number of pixels, the precision required for detection and tracking algorithms is particularly high in complex backgrounds. Image fusion techniques can enrich the detailed information for small UAVs, showing significant advantages under extreme lighting conditions. Image registration is a fundamental step preceding image fusion. It is essential to achieve accurate image alignment before proceeding with image fusion to prevent severe ghosting and artifacts. This paper specifically focused on the alignment of small UAV targets within infrared and visible light imagery. To address this issue, this paper proposed a cross-modality image registration network based on deep learning, which includes a structure preservation and style transformation network (SPSTN) and a multi-level cross-attention residual registration network (MCARN). Firstly, the SPSTN is employed for modality transformation, transferring the cross-modality task into a single-modality task to reduce the information discrepancy between modalities. Then, the MCARN is utilized for single-modality image registration, capable of deeply extracting and fusing features from pseudo infrared and visible images to achieve efficient registration. To validate the effectiveness of the proposed method, comprehensive experimental evaluations were conducted on the Anti-UAV dataset. The extensive evaluation results validate the superiority and universality of the cross-modality image registration framework proposed in this paper, which plays a crucial role in subsequent image fusion tasks for more effective target detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. A Two-Component Polarimetric Target Decomposition Algorithm with Grassland Application.
- Author
-
Huang, Pingping, Chen, Yalan, Li, Xiujuan, Tan, Weixian, Chen, Yuejuan, Yang, Xiangli, Dong, Yifan, Lv, Xiaoqi, and Li, Baoyu
- Subjects
PLANT anatomy ,GRASSLAND plants ,REMOTE sensing ,GRASSLANDS ,ANALYTICAL solutions - Abstract
The study of the polarimetric target decomposition algorithm with physical scattering models has contributed to the development of the field of remote sensing because of its simple and clear physical meaning with a small computational effort. However, most of the volume scattering models in these algorithms are for forests or crops, and there is a lack of volume scattering models for grasslands. In order to improve the accuracy of the polarimetric target decomposition algorithm adapted to grassland data, in this paper, a novel volume scattering model is derived considering the characteristics of real grassland plant structure and combined with the backward scattering coefficients of grass, which is abstracted as a rotatable ellipsoid of variable shape. In the process of rotation, the possibility of rotation is considered in two dimensions, the tilt angle and canting angle; for particle shape, the anisotropy degree A is directly introduced as a parameter to describe and expand the applicability of the model at the same time. After obtaining the analytical solution of the parameters and using the principle of least negative power to determine the optimal solution of the model, the algorithm is validated by applying it to the C-band AirBorne dataset of Hunshandak grassland in Inner Mongolia and the X-band Cosmos-Skymed dataset of Xiwuqi grassland in Inner Mongolia. The performance of the algorithm with five polarimetric target decomposition algorithms is studied comparatively. The experimental results show that the algorithm proposed in this paper outperforms the other algorithms in terms of grassland decomposition accuracy on different bands of data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
21. Evaluation and Improvement of a CALIPSO-Based Algorithm for Cloud Base Height in China.
- Author
-
Li, Ruolin and Ma, Xiaoyan
- Subjects
CLOUD computing ,LIDAR ,ALGORITHMS ,AEROSOLS ,ALTITUDES ,TROPOSPHERIC aerosols - Abstract
Clouds are crucial in regulating the Earth's energy budget. Global cloud top heights have been easily retrieved from satellite measurements, but there are few methods for determining cloud base height (CBH) from satellite measurements. The Cloud Base Altitude Spatial Extrapolator (CBASE) algorithm was proposed to derive the height of the lower-troposphere liquid cloud base by using the Cloud-Aerosol Lidar with Orthogonal polarization cloud aerosol LiDAR (CALIOP) profiles and weather observations at airports from aviation routine and special weather report (METARs and SPECIs, called METAR) observation data in the United States. A modification to the CBASE algorithm over China (CNMETAR-CBASE) is presented in this paper. In this paper, the ability of the CBASE algorithm to calculate CBH in China is evaluated, and METAR observations over China (CNMETAR) were then used to modify the CBASE algorithm. The results including CNMETAR observation data in China can better retrieve CBH over China compared with the results using the original CBASE algorithm, and the accuracy of the global CBH results has been improved. Overestimations of CBH with the original algorithm range from 500 to 800 m in China, which have been reduced to about 300 m with an improved algorithm. The deviations calculated by the algorithm also have a significant reduction, from 480 m (CBASE) to 420 m (CNMETAR-CBASE). In conclusion, the modified CBASE algorithm not only calculates the CBH more accurately in China but also improves the results of the global CBH retrieved from satellites. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
22. Ionograms Trace Extraction Method Based on Multiscale Transformer Network.
- Author
-
Han, Sijia, Guo, Wei, and Wang, Caiyun
- Subjects
TRANSFORMER models ,ELECTRON density ,NETWORK performance ,SIGNAL processing ,IONOSPHERE ,DEEP learning - Abstract
The echo traces in the ionograms contain key information about the ionosphere. Therefore, the accurate extraction of these traces is crucial for the subsequent work. This paper transforms the original signal processing problem into a semantic segmentation task, combines it with the currently popular deep learning techniques, and proposes a multiscale Transformer network to achieve pixel-level trace extraction. To train the proposed model, we built a dataset by discretizing the original echo data, labeling, and other preprocessing work. A series of advanced semantic segmentation networks are utilized for comparative experiments. The analysis of the results indicates that the proposed network excels in performance, achieving the highest scores on key semantic segmentation evaluation metrics, including mIoU, Kappa, Dice, and AUC-ROC. In addition, this paper also designs a series of ablation experiments to observe the changes in network performance and to evaluate the rationality of the network design. The experimental results demonstrate the effectiveness of the network in the trace extraction task, which plays a positive role in the subsequent electron density reversal work. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
23. Improved Cycle-Consistency Generative Adversarial Network-Based Clutter Suppression Methods for Ground-Penetrating Radar Pipeline Data.
- Author
-
Lin, Yun, Wang, Jiachun, Ma, Deyun, Wang, Yanping, and Ye, Shengbo
- Subjects
GROUND penetrating radar ,GENERATIVE adversarial networks ,DEEP learning - Abstract
Ground-penetrating radar (GPR) is a widely used technology for pipeline detection due to its fast detection speed and high resolution. However, the presence of complex underground media often results in strong ground clutter interference in the collected B-scan echoes, significantly impacting detection performance. To address this issue, this paper proposes an improved clutter suppression network based on a cycle-consistency generative adversarial network (CycleGAN). By employing the concept of style transfer, the network aims to convert clutter images into clutter-free images. This paper introduces multiple residual blocks into the generator and discriminator, respectively, to improve the feature expression ability of the deep learning model. Additionally, the discriminator incorporates the squeeze and excitation (SE) module, a channel attention mechanism, to further enhance the model's ability to extract features from clutter-free images. To evaluate the effectiveness of the proposed network in clutter suppression, both simulation and measurement data are utilized to compare and analyze its performance against traditional clutter suppression methods and deep learning-based methods, respectively. From the result of the measured data, it can be found that the improvement factor ( I m ) of the proposed method has reached 40.68 dB, which is a significant improvement compared to the previous network. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
24. Radargrammetric 3D Imaging through Composite Registration Method Using Multi-Aspect Synthetic Aperture Radar Imagery.
- Author
-
Luo, Yangao, Deng, Yunkai, Xiang, Wei, Zhang, Heng, Yang, Congrui, and Wang, Longxiang
- Subjects
SYNTHETIC aperture radar ,THREE-dimensional imaging ,SYNTHETIC apertures ,SPECKLE interference ,DIGITAL elevation models ,IMAGE registration ,RADIO telescopes - Abstract
Interferometric synthetic aperture radar (InSAR) and tomographic SAR measurement techniques are commonly used for the three-dimensional (3D) reconstruction of complex areas, while the effectiveness of these methods relies on the interferometric coherence among SAR images with minimal angular disparities. Radargrammetry exploits stereo image matching to determine the spatial coordinates of corresponding points in two SAR images and acquire their 3D properties. The performance of the image matching process directly impacts the quality of the resulting digital surface model (DSM). However, the presence of speckle noise, along with dissimilar geometric and radiometric distortions, poses considerable challenges in achieving accurate stereo SAR image matching. To address these aforementioned challenges, this paper proposes a radargrammetric method based on the composite registration of multi-aspect SAR images. The proposed method combines coarse registration using scale invariant feature transform (SIFT) with precise registration using normalized cross-correlation (NCC) to achieve accurate registration between multi-aspect SAR images with large disparities. Furthermore, the multi-aspect 3D point clouds are merged using the proposed radargrammetric 3D imaging method, resulting in the 3D imaging of target scenes based on multi-aspect SAR images. For validation purposes, this paper presents a comprehensive 3D reconstruction of the Five-hundred-meter Aperture Spherical radio Telescope (FAST) using Ka-band airborne SAR images. It does not necessitate prior knowledge of the target and is applicable to the detailed 3D imaging of large-scale areas with complex structures. In comparison to other SAR 3D imaging techniques, it reduces the requirements for orbit control and radar system parameters. To sum up, the proposed 3D imaging method with composite registration guarantees imaging efficiency, while enhancing the imaging accuracy of crucial areas with limited data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. Remote Data for Mapping and Monitoring Coastal Phenomena and Parameters: A Systematic Review.
- Author
-
Cavalli, Rosa Maria
- Subjects
COASTAL mapping ,GEOGRAPHIC names ,DATA mapping ,LAND cover ,URBAN growth ,COASTS - Abstract
Since 1971, remote sensing techniques have been used to map and monitor phenomena and parameters of the coastal zone. However, updated reviews have only considered one phenomenon, parameter, remote data source, platform, or geographic region. No review has offered an updated overview of coastal phenomena and parameters that can be accurately mapped and monitored with remote data. This systematic review was performed to achieve this purpose. A total of 15,141 papers published from January 2021 to June 2023 were identified. The 1475 most cited papers were screened, and 502 eligible papers were included. The Web of Science and Scopus databases were searched using all possible combinations between two groups of keywords: all geographical names in coastal areas and all remote data and platforms. The systematic review demonstrated that, to date, many coastal phenomena (103) and parameters (39) can be mapped and monitored using remote data (e.g., coastline and land use and land cover changes, climate change, and coastal urban sprawl). Moreover, the authors validated 91% of the retrieved parameters, retrieved from remote data 39 parameters that were mapped or monitored 1158 times (88% of the parameters were combined together with other parameters), monitored 75% of the parameters over time, and retrieved 69% of the parameters from several remote data and compared the results with each other and with available products. They obtained 48% of the parameters using different methods, and their results were compared with each other and with available products. They combined 17% of the parameters that were retrieved with GIS and model techniques. In conclusion, the authors addressed the requirements needed to more effectively analyze coastal phenomena and parameters employing integrated approaches: they retrieved the parameters from different remote data, merged different data and parameters, compared different methods, and combined different techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Precise Motion Compensation of Multi-Rotor UAV-Borne SAR Based on Improved PTA.
- Author
-
Cheng, Yao, Qiu, Xiaolan, and Meng, Dadi
- Subjects
IMAGE stabilization ,SYNTHETIC aperture radar ,GROUND motion ,NUMERICAL calculations ,ELECTRONIC data processing - Abstract
In recent years, with the miniaturization of high-precision position and orientation systems (POS), precise motion errors during SAR data collection can be calculated based on high-precision POS. However, compensating for these errors remains a significant challenge for multi-rotor UAV-borne SAR systems. Compared with large aircrafts, multi-rotor UAVs are lighter, slower, have more complex flight trajectories, and have larger squint angles, which result in significant differences in motion errors between building targets and ground targets. If the motion compensation is based on ground elevation, the motion error of the ground target will be fully compensated, but the building target will still have a large residual error; as a result, although the ground targets can be well-focused, the building targets may be severely defocused. Therefore, it is necessary to further compensate for the residual motion error of building targets based on the actual elevation on the SAR image. However, uncompensated errors will affect the time–frequency relationship; furthermore, the ω-k algorithm will further change these errors, resulting in errors in SAR images becoming even more complex and difficult to compensate for. To solve this problem, this paper proposes a novel improved precise topography and aperture-dependent (PTA) method that can precisely compensate for motion errors in the UAV-borne SAR system. After motion compensation and imaging processing based on ground elevation, a secondary focus is applied to defocused buildings. The improved PTA fully considers the coupling of the residual error with the time–frequency relationship and ω-k algorithm, and the precise errors in the two-dimensional frequency domain are determined through numerical calculations without any approximations. Simulation and actual data processing verify the effectiveness of the method, and the experimental results show that the proposed method in this paper is better than the traditional method. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. An Improved Remote Sensing Retrieval Method for Elevated Duct in the South China Sea.
- Author
-
Cheng, Yinhe, Zha, Mengling, Qiao, Wenli, He, Hongjian, Wang, Shuwen, Wang, Shengxiang, Li, Xiaoran, and He, Weiye
- Subjects
MODIS (Spectroradiometer) ,STANDARD deviations ,REMOTE sensing ,ELECTROMAGNETIC waves ,REFRACTIVE index - Abstract
Elevated duct is an atmospheric structure characterized by abnormal refractive index gradients, which can significantly affect the performance of radar, communication, and other systems by capturing a portion of electromagnetic waves. The South China Sea (SCS) is a high-incidence area for elevated duct, so conducting detection and forecasts of the elevated duct in the SCS holds important scientific significance and practical value. This paper attempts to utilize remote sensing techniques for extracting elevated duct information. Based on GPS sounding data, a lapse rate formula (LRF) model and an empirical formula (EF) model for the estimation of the cloud top height of Stratocumulus were obtained, and then remote sensing retrieval methods of elevated duct were established based on the Moderate Resolution Imaging Spectroradiometer (MODIS) remote sensing data. The results of these two models were compared with results from the elevated duct remote sensing retrieval model developed by the United States Naval Postgraduate School. It is shown that the probability of elevated duct events was 79.1% when the presence of Stratocumulus identified using GPS sounding data, and the trapping layer bottom height of elevated duct well with the cloud top height of Stratocumulus, with a correlation coefficient of 0.79, a mean absolute error of 289 m, and a root mean square error of 598 m. Among the different retrieval models applied to MODIS satellite data, the LRF model emerged as the optimal remote sensing retrieval method for elevated duct in the SCS, showing a correlation coefficient of 0.51, a mean absolute error of 447 m, and a root mean square error of 658 m between the trapping layer bottom height and the cloud top height. Consequently, the encouraging validation results demonstrate that the LRF model proposed in this paper offers a novel method for diagnosing and calculating elevated ducts information over large-scale marine areas from remote sensing data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Evolution of the Floe Size Distribution in Arctic Summer Based on High-Resolution Satellite Imagery.
- Author
-
Li, Zongxing, Lu, Peng, Zhou, Jiaru, Zhang, Hang, Huo, Puzhen, Yu, Miao, Wang, Qingkai, and Li, Zhijun
- Subjects
DISTRIBUTION (Probability theory) ,WEIBULL distribution ,IMAGE processing ,REMOTE-sensing images ,FRACTAL dimensions - Abstract
In this paper, based on high-resolution satellite images near an ice bridge in the Canadian Basin, we extracted floe size parameters and analyzed the temporal and spatial variations in the parameters through image processing techniques. The floe area shows a decreasing trend over time, while the perimeter and mean clamped diameter (MCD) exhibit no obvious pattern of change. In addition, the roundness of floes, reflected by shape parameters, generally decreases initially and then increases, and the average roundness of small floes is smaller than that of large floes. To correct the deviations from power law behaviour when assessing the floe size distribution (FSD) with the traditional power law function, the upper-truncated power law distribution function and the Weibull function are selected. The four parameters of the two functions are important parameters for describing the floe size distribution, and L r and L 0 are roughly equal to the maximum calliper diameter and the average calliper diameter of the floes in the region. D in the upper-truncated power law distribution function represents the fractal dimension of the floes, and r in the Weibull function represents the shape parameter of the floes, both of which increase and then decrease with time. In this paper, we investigate the response of the rate of change in the FSD parameter to the differences in the monthly average temperature and find that D , r and air temperature are positively correlated, which verifies the influence of air temperature on the floe size distribution. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. Enhanced Strapdown Inertial Navigation System (SINS)/LiDAR Tightly Integrated Simultaneous Localization and Mapping (SLAM) for Urban Structural Feature Weaken Occasions in Vehicular Platform.
- Author
-
Xu, Xu, Guan, Lianwu, Gao, Yanbin, Chen, Yufei, and Liu, Zhejun
- Subjects
INERTIAL navigation systems ,KALMAN filtering ,POINT cloud ,LIDAR ,TRACKING algorithms - Abstract
LiDAR-based simultaneous localization and mapping (SLAM) offer robustness against illumination changes, but the inherent sparsity of LiDAR point clouds poses challenges for continuous tracking and navigation, especially in feature-deprived scenarios. This paper proposes a novel LiDAR/SINS tightly integrated SLAM algorithm designed to address the localization challenges in urban environments characterized in sparse structural features. Firstly, the method extracts edge points from the LiDAR point cloud using a traditional segmentation method and clusters them to form distinctive edge lines. Then, a rotation-invariant feature—line distance—is calculated based on the edge line properties that were inspired by the traditional tightly integrated navigation system. This line distance is utilized as the observation in a Kalman filter that is integrated into a tightly coupled LiDAR/SINS system. This system tracks the same edge lines across multiple frames for filtering and correction instead of tracking points or LiDAR odometry results. Meanwhile, for loop closure, the method modifies the common SCANCONTEXT algorithm by designating all bins that do not reach the maximum height as special loop keys, which reduce false matches. Finally, the experimental validation conducted in urban environments with sparse structural features demonstrated a 17% improvement in positioning accuracy when compared to the conventional point-based methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. Underwater Side-Scan Sonar Target Detection: YOLOv7 Model Combined with Attention Mechanism and Scaling Factor.
- Author
-
Wen, Xin, Wang, Jian, Cheng, Chensheng, Zhang, Feihu, and Pan, Guang
- Subjects
SONAR ,ARTIFICIAL neural networks ,SONAR imaging ,OBJECT recognition (Computer vision) ,UNDERWATER exploration - Abstract
Side-scan sonar plays a crucial role in underwater exploration, and the autonomous detection of side-scan sonar images is vital for detecting unknown underwater environments. However, due to the complexity of the underwater environment, the presence of a few highlighted areas on the targets, blurred feature details, and difficulty in collecting data from side-scan sonar, achieving high-precision autonomous target recognition in side-scan sonar images is challenging. This article addresses this problem by improving the You Only Look Once v7 (YOLOv7) model to achieve high-precision object detection in side-scan sonar images. Firstly, given that side-scan sonar images contain large areas of irrelevant information, this paper introduces the Swin-Transformer for dynamic attention and global modeling, which enhances the model's focus on the target regions. Secondly, the Convolutional Block Attention Module (CBAM) is utilized to further improve feature representation and enhance the neural network model's accuracy. Lastly, to address the uncertainty of geometric features in side-scan sonar target features, this paper innovatively incorporates a feature scaling factor into the YOLOv7 model. The experiment initially verified the necessity of attention mechanisms in the public dataset. Subsequently, experiments on our side-scan sonar (SSS) image dataset show that the improved YOLOv7 model has 87.9% and 49.23% in its average accuracy ( m A P 0.5 ) and ( m A P 0.5:0.95), respectively. These results are 9.28% and 8.41% higher than the YOLOv7 model. The improved YOLOv7 algorithm proposed in this paper has great potential for object detection and the recognition of side-scan sonar images. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
31. A Throughput Performance Analysis Method for Multimode Underwater Acoustic Communication Network Based on Markov Decision Process.
- Author
-
Wang, Chao, Du, Pengyu, Wang, Zhongkang, and Li, Dong
- Subjects
UNDERWATER acoustic communication ,MARKOV processes ,TELECOMMUNICATION systems ,ARCHITECTURAL acoustics ,SUBMERGED structures - Abstract
The multimode underwater acoustic communication network is a novel form of underwater acoustic communication that adjusts its communication mode to enhance overall performance. Current performance analysis methods are primarily applied to single-mode networks and assume uniform communication capability across all nodes, making them unsuitable for multimode networks. This paper investigates the multimode communication of the physical layer, considering factors such as the marine environment, the node transmitting sound source level, and the transmitting distance. A decoding conflict model is proposed to support multimode concurrent transmission scenarios. The communication mode is designed to be compatible with the channel and node characteristics. Additionally, using a Markov decision process, this paper establishes a performance evaluation and analysis model for multimode underwater acoustic networks to determine throughput performance limits in real underwater environments. Simulations across various scenarios validate that the throughput performance limits obtained by this method are more accurate under multimode networks, with an improvement in accuracy of over 67.5% compared to existing methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
32. Attention Guide Axial Sharing Mixed Attention (AGASMA) Network for Cloud Segmentation and Cloud Shadow Segmentation.
- Author
-
Gu, Guowei, Wang, Zhongchen, Weng, Liguo, Lin, Haifeng, Zhao, Zikai, and Zhao, Liling
- Subjects
IMAGE fusion ,PARALLEL processing ,REMOTE sensing ,IMAGE processing - Abstract
Segmenting clouds and their shadows is a critical challenge in remote sensing image processing. The shape, texture, lighting conditions, and background of clouds and their shadows impact the effectiveness of cloud detection. Currently, architectures that maintain high resolution throughout the entire information-extraction process are rapidly emerging. This parallel architecture, combining high and low resolutions, produces detailed high-resolution representations, enhancing segmentation prediction accuracy. This paper continues the parallel architecture of high and low resolution. When handling high- and low-resolution images, this paper employs a hybrid approach combining the Transformer and CNN models. This method facilitates interaction between the two models, enabling the extraction of both semantic and spatial details from the images. To address the challenge of inadequate fusion and significant information loss between high- and low-resolution images, this paper introduces a method based on ASMA (Axial Sharing Mixed Attention). This approach establishes pixel-level dependencies between high-resolution and low-resolution images, aiming to enhance the efficiency of image fusion. In addition, to enhance the effective focus on critical information in remote sensing images, the AGM (Attention Guide Module) is introduced, to integrate attention elements from original features into ASMA, to alleviate the problem of insufficient channel modeling of the self-attention mechanism. Our experimental results on the Cloud and Cloud Shadow dataset, the SPARCS dataset, and the CSWV dataset demonstrate the effectiveness of our method, surpassing the state-of-the-art techniques for cloud and cloud shadow segmentation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
33. LOD2-Level+ Low-Rise Building Model Extraction Method for Oblique Photography Data Using U-NET and a Multi-Decision RANSAC Segmentation Algorithm.
- Author
-
He, Yufeng, Wu, Xiaobian, Pan, Weibin, Chen, Hui, Zhou, Songshan, Lei, Shaohua, Gong, Xiaoran, Xu, Hanzeyu, and Sheng, Yehua
- Subjects
ARCHITECTURAL details ,DIGITAL elevation models ,POINT cloud ,PHOTOGRAPHY ,ALGORITHMS - Abstract
Oblique photography is a regional digital surface model generation technique that can be widely used for building 3D model construction. However, due to the lack of geometric and semantic information about the building, these models make it difficult to differentiate more detailed components in the building, such as roofs and balconies. This paper proposes a deep learning-based method (U-NET) for constructing 3D models of low-rise buildings that address the issues. The method ensures complete geometric and semantic information and conforms to the LOD2 level. First, digital orthophotos are used to perform building extraction based on U-NET, and then a contour optimization method based on the main direction of the building and the center of gravity of the contour is used to obtain the regular building contour. Second, the pure building point cloud model representing a single building is extracted from the whole point cloud scene based on the acquired building contour. Finally, the multi-decision RANSAC algorithm is used to segment the building detail point cloud and construct a triangular mesh of building components, followed by a triangular mesh fusion and splicing method to achieve monolithic building components. The paper presents experimental evidence that the building contour extraction algorithm can achieve a 90.3% success rate and that the resulting single building 3D model contains LOD2 building components, which contain detailed geometric and semantic information. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
34. Design of Scanning Units for the Underwater Circumferential-Scanning LiDAR Based on Pyramidal-Shaped Reflectors and a Rapid Detection Method for Target Orientation.
- Author
-
Zha, Bingting, Xu, Guangbo, Chen, Zhuo, Tan, Yayun, Qin, Jianxin, and Zhang, He
- Subjects
DOPPLER lidar ,LIDAR ,MAGNETIC control ,DESIGN - Abstract
Challenges have been observed in the traditional circumferential-scanning LiDAR underwater to balance between the detection range and the sealing performance. To tackle these challenges, a new scanning unit is presented in this paper, employing a pyramidal-shaped reflector for enhanced performance. Furthermore, an innovative magneto–electric detection module comprising Hall switches and magnetic rings is introduced. It can facilitate the accurate identification of the reflector's edge, thereby enhancing the precision of the target-orientation detection. A rapid target orientation coding method based on split-frequency clocks is proposed on FPGAs. It can output the target's initial and termination orientation codes immediately after capturing it, exhibiting a significantly low output delay of 20 ns and a high detection resolution of 15°. Finally, a prototype is fabricated to validate the design in this paper. The experimental results demonstrate that the scanning unit enables reliable scanning and orientation recognition of the target. In addition, it is trustworthy in receiving echo signals when the laser passes through glass and then an aqueous medium. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
35. A GPU-Based Integration Method from Raster Data to a Hexagonal Discrete Global Grid.
- Author
-
Zheng, Senyuan, Zhou, Liangchen, Lu, Chengshuai, and Lv, Guonian
- Subjects
MOBILE geographic information systems ,DATABASE design ,RESEARCH personnel ,DATA transmission systems ,DATA conversion - Abstract
This paper proposes an algorithm for the conversion of raster data to hexagonal DGGSs in the GPU by redevising the encoding and decoding mechanisms. The researchers first designed a data structure based on rhombic tiles to convert the hexagonal DGGS to a texture format acceptable for GPUs, thus avoiding the irregularity of the hexagonal DGGS. Then, the encoding and decoding methods of the tile data based on space-filling curves were designed, respectively, so as to reduce the amount of data transmission from the CPU to the GPU. Finally, the researchers improved the algorithmic efficiency through thread design. To validate the above design, raster integration experiments were conducted based on the global Aster 30 m digital elevation dataDEM, and the experimental results showed that the raster integration accuracy of this algorithms was around 1 m, while its efficiency could be improved to more than 600 times that of the algorithm for integrating the raster data to the hexagonal DGGS data, executed in the CPU. Therefore, the researchers believe that this study will provide a feasible method for the efficient and stable integration of massive raster data based on a hexagonal grid, which may well support the organization of massive raster data in the field of GIS. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
36. A Method for Underwater Acoustic Target Recognition Based on the Delay-Doppler Joint Feature.
- Author
-
Du, Libin, Wang, Zhengkai, Lv, Zhichao, Han, Dongyue, Wang, Lei, Yu, Fei, and Lan, Qing
- Subjects
CONVOLUTIONAL neural networks ,ARCHITECTURAL acoustics ,OBJECT recognition (Computer vision) ,FOURIER transforms - Abstract
With the aim of solving the problem of identifying complex underwater acoustic targets using a single signal feature in the Time–Frequency (TF) feature, this paper designs a method that recognizes the underwater targets based on the Delay-Doppler joint feature. First, this method uses symplectic finite Fourier transform (SFFT) to extract the Delay-Doppler features of underwater acoustic signals, analyzes the Time–Frequency features at the same time, and combines the Delay-Doppler (DD) feature and Time–Frequency feature to form a joint feature (TF-DD). This paper uses three types of convolutional neural networks to verify that TF-DD can effectively improve the accuracy of target recognition. Secondly, this paper designs an object recognition model (TF-DD-CNN) based on joint features as input, which simplifies the neural network's overall structure and improves the model's training efficiency. This research employs ship-radiated noise to validate the efficacy of TF-DD-CNN for target identification. The results demonstrate that the combined characteristic and the TF-DD-CNN model introduced in this study can proficiently detect ships, and the model notably enhances the precision of detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
37. Oxygen and Air Density Retrieval Method for Single-Band Stellar Occultation Measurement.
- Author
-
Li, Zheng, Wu, Xiaocheng, Tu, Cui, Yang, Junfeng, Hu, Xiong, and Yan, Zhaoai
- Subjects
OCCULTATIONS (Astronomy) ,HYDROSTATIC equilibrium ,ATMOSPHERIC density ,STELLAR spectra ,ATMOSPHERIC layers ,IDEAL gases - Abstract
The stellar occultation technique is capable of atmospheric trace gas detection using the molecule absorption characteristics of the stellar spectra. In this paper, the non-iterative and iterative retrieval methods for oxygen and air density detection by stellar occultation are investigated. For the single-band average transmission data in the oxygen 761 nm A-band, an onion-peeling algorithm is used to calculate the effective optical depth of each atmospheric layer, and then the optical depth is used to retrieve the oxygen number density. The iteration method introduces atmospheric hydrostatic equilibrium and the ideal gas equation of state, and it achieves a more accurate retrieval of the air density under the condition of a priori temperature deviation. Finally, this paper analyzes the double solution problem in the iteration process and the ideas to improve the problem. This paper provides a theoretical basis for the development of a new type of atmospheric density detection method. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
38. Cooperative Jamming Resource Allocation with Joint Multi-Domain Information Using Evolutionary Reinforcement Learning.
- Author
-
Xin, Qi, Xin, Zengxian, and Chen, Tao
- Subjects
RADAR interference ,REINFORCEMENT learning ,PARTICLE swarm optimization ,RESOURCE allocation ,REINFORCEMENT (Psychology) ,MACHINE learning ,DUNG beetles - Abstract
Addressing the formidable challenges posed by multiple jammers jamming multiple radars, which arise from spatial discretization, many degrees of freedom, numerous model input parameters, and the complexity of constraints, along with a multi-peaked objective function, this paper proposes a cooperative jamming resource allocation method, based on evolutionary reinforcement learning, that uses joint multi-domain information. Firstly, an adversarial scenario model is established, characterizing the interaction between multiple jammers and radars based on a multi-beam jammer model and a radar detection model. Subsequently, considering real-world scenarios, this paper analyzes the constraints and objective function involved in cooperative jamming resource allocation by multiple jammers. Finally, accounting for the impact of spatial, frequency, and energy domain information on jamming resource allocation, matrices representing spatial condition constraints, jamming beam allocation, and jamming power allocation are formulated to characterize the cooperative jamming resource allocation problem. Based on this foundation, the joint allocation of the jamming beam and jamming power is optimized under the constraints of jamming resources. Through simulation experiments, it was determined that, compared to the dung beetle optimizer (DBO) algorithm and the particle swarm optimization (PSO) algorithm, the proposed evolutionary reinforcement learning algorithm based on DBO and Q-Learning (DBO-QL) offers 3.03% and 6.25% improvements in terms of jamming benefit and 26.33% and 50.26% improvements in terms of optimization success rate, respectively. In terms of algorithm response time, the proposed hybrid DBO-QL algorithm has a response time of 0.11 s, which is 97.35% and 96.57% lower than the response times of the DBO and PSO algorithms, respectively. The results show that the method proposed in this paper has good convergence, stability, and timeliness. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
39. Analysis of Road Surface Texture for Asphalt Pavement Adhesion Assessment Using 3D Laser Technology.
- Author
-
Liang, Haimei, Pagano, Rosa Giovanna, Oddone, Stefano, Cong, Lin, and De Blasiis, Maria Rosaria
- Subjects
ASPHALT pavements ,SURFACE texture ,PAVEMENTS ,SURFACE analysis ,LASERS ,OPTICAL scanners - Abstract
Pavement adhesion plays a crucial role in driving safety, while traditional test methods exhibit some limitations. To improve the efficiency and accuracy of asphalt pavement texture characterization and adhesion assessments, this paper uses three-dimensional (3D) laser technology to detect the continuous point cloud data of road surface and reconstruct the 3D topography of pavement texture. On this basis, a volume parameter Volume of peak materials (Vmp) is innovatively proposed to comprehensively characterize the 3D spatial characteristics of road surface texture. The correlation analysis between the proposed Vmp and the traditional adhesion evaluation index Transversal Adhesion Coefficient (CAT) is conducted, and then refined graded adhesion prediction models based on the proposed Vmp are proposed. Results show that the proposed volume parameter Vmp can reliably and accurately characterize the asphalt pavement texture by considering more structural properties of the road surface texture. According to the research findings of this paper, it is feasible to achieve rapid and correct assessment of asphalt pavement adhesion using 3D laser detection technology by comprehensively considering the 3D characteristics of the road surface texture. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
40. A Building Point Cloud Extraction Algorithm in Complex Scenes.
- Author
-
Su, Zhonghua, Peng, Jing, Feng, Dajian, Li, Shihua, Yuan, Yi, and Zhou, Guiyun
- Subjects
POINT cloud ,ALGORITHMS ,URBAN renewal ,CITIES & towns ,THREE-dimensional modeling - Abstract
Buildings are significant components of digital cities, and their precise extraction is essential for the three-dimensional modeling of cities. However, it is difficult to accurately extract building features effectively in complex scenes, especially where trees and buildings are tightly adhered. This paper proposes a highly accurate building point cloud extraction method based solely on the geometric information of points in two stages. The coarsely extracted building point cloud in the first stage is iteratively refined with the help of mask polygons and the region growing algorithm in the second stage. To enhance accuracy, this paper combines the Alpha Shape algorithm with the neighborhood expansion method to generate mask polygons, which help fill in missing boundary points caused by the region growing algorithm. In addition, this paper performs mask extraction on the original points rather than non-ground points to solve the problem of incorrect identification of facade points near the ground using the cloth simulation filtering algorithm. The proposed method has shown excellent extraction accuracy on the Urban-LiDAR and Vaihingen datasets. Specifically, the proposed method outperforms the PointNet network by 20.73% in precision for roof extraction of the Vaihingen dataset and achieves comparable performance with the state-of-the-art HDL-JME-GGO network. Additionally, the proposed method demonstrated high accuracy in extracting building points, even in scenes where buildings were closely adjacent to trees. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
41. Conditional Diffusion Model for Urban Morphology Prediction.
- Author
-
Shi, Tiandong, Zhao, Ling, Liu, Fanfan, Zhang, Ming, Li, Mengyao, Peng, Chengli, and Li, Haifeng
- Subjects
URBAN morphology ,GENERATIVE adversarial networks ,DISTRIBUTION (Probability theory) ,URBAN research - Abstract
Predicting urban morphology based on local attributes is an important issue in urban science research. The deep generative models represented by generative adversarial network (GAN) models have achieved impressive results in this area. However, in such methods, the urban morphology is assumed to follow a specific probability distribution and be able to directly approximate the distribution via GAN models, which is not a realistic strategy. As demonstrated by the score-based model, a better strategy is to learn the gradient of the probability distribution and implicitly approximate the distribution. Therefore, in this paper, an urban morphology prediction method based on the conditional diffusion model is proposed. Implementing this approach results in the decomposition of the attribute-based urban morphology prediction task into two subproblems: estimating the gradient of the conditional distribution, and gradient-based sampling. During the training stage, the gradient of the conditional distribution is approximated by using a conditional diffusion model to predict the noise added to the original urban morphology. In the generation stage, the corresponding conditional distribution is parameterized based on the noise predicted by the conditional diffusion model, and the final prediction result is generated through iterative sampling. The experimental results showed that compared with GAN-based methods, our method demonstrated improvements of 5.5%, 5.9%, and 13.2% in the metrics of low-level pixel features, shallow structural features, and deep structural features, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. A Semantic Spatial Structure-Based Loop Detection Algorithm for Visual Environmental Sensing.
- Author
-
Cheng, Xina, Zhang, Yichi, Kang, Mengte, Wang, Jialiang, Jiao, Jianbin, Dong, Le, and Jiao, Licheng
- Subjects
ALGORITHMS ,SEMANTIC computing - Abstract
Loop closure detection is an important component of the Simultaneous Localization and Mapping (SLAM) algorithm, which is utilized in environmental sensing. It helps to reduce drift errors during long-term operation, improving the accuracy and robustness of localization. Such improvements are sorely needed, as conventional visual-based loop detection algorithms are greatly affected by significant changes in viewpoint and lighting conditions. In this paper, we present a semantic spatial structure-based loop detection algorithm. In place of feature points, robust semantic features are used to cope with the variation in the viewpoint. In consideration of the semantic features, which are region-based, we provide a corresponding matching algorithm. Constraints on semantic information and spatial structure are used to determine the existence of loop-back. A multi-stage pipeline framework is proposed to systematically leverage semantic information at different levels, enabling efficient filtering of potential loop closure candidates. To validate the effectiveness of our algorithm, we conducted experiments using the uHumans2 dataset. Our results demonstrate that, even when there are significant changes in viewpoint, the algorithm exhibits superior robustness compared to that of traditional loop detection methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. Soil Salinity Inversion in Yellow River Delta by Regularized Extreme Learning Machine Based on ICOA.
- Author
-
Wang, Jiajie, Wang, Xiaopeng, Zhang, Jiahua, Shang, Xiaodi, Chen, Yuyi, Feng, Yiping, and Tian, Bingbing
- Subjects
SOIL salinity ,MACHINE learning ,SOIL salinization ,OPTIMIZATION algorithms ,BACK propagation - Abstract
Soil salinization has seriously affected agricultural production and ecological balance in the Yellow River Delta region. Rapid and accurate monitoring of soil salinity has become an urgent need. Traditional machine learning models tend to fall into local optimal values during the learning process, which reduces their accuracy. This paper introduces Circle map to enhance the crayfish optimization algorithm (COA), which is then integrated with the regularized extreme learning machine (RELM) model, aiming to improve the accuracy of soil salinity content (SSC) inversion in the Yellow River Delta region. We employed Landsat5 TM remote sensing images and measured salinity data to develop spectral indices, such as the band index, salinity index, vegetation index, and comprehensive index, selecting the optimal modeling variable group through Pearson correlation analysis and variable projection importance analysis. The back propagation neural network (BPNN), RELM, and improved crayfish optimization algorithm–regularized extreme learning machine (ICOA-RELM) models were constructed using measured data and selected variable groups for SSC inversion. The results indicate that the ICOA-RELM model enhances the R 2 value by an average of about 0.1 compared to other models, particularly those using groups of variables filtered by variable projection importance analysis as input variables, which showed the best inversion effect (test set R 2 value of 0.75, MAE of 0.198, RMSE of 0.249). The SSC inversion results indicate a higher salinization degree in the coastal regions of the Yellow River Delta and a lower degree in the inland areas, with moderate saline soil and severe saline soil comprising 48.69% of the total area. These results are consistent with the actual sampling results, which verify the practicability of the model. This paper's methods and findings introduce an innovative and practical tool for monitoring and managing salinized soils in the Yellow River Delta, offering significant theoretical and practical benefits. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
44. A Long-Duration Glacier Change Analysis for the Urumqi River Valley, a Representative Region of Central Asia.
- Author
-
Wang, Lin, Yang, Shujing, Chen, Kangning, Liu, Shuangshuang, Jin, Xiang, and Xie, Yida
- Subjects
GLACIERS ,ALPINE glaciers ,GLOBAL warming ,AGRICULTURAL productivity ,TIME series analysis ,HIGH temperatures ,CLIMATE change - Abstract
The increasing global warming trend has resulted in the mass loss of most glaciers. The Urumqi Vally, located in the dry and cold zone of China, and its widely dispersed glaciers are significant to the regional ecological environment, oasis economic development, and industrial and agricultural production. This is representative of glaciers in Middle Asia and represents one of the world's longest observed time series of glaciers, beginning in 1959. The Urumqi Headwater Glacier No. 1 (UHG-1) has a dominant presence in the World Glacier Monitoring Service (WGMS). This paper supplies a comprehensive analysis of past studies and future modeling of glacier changes in the Urumqi Valley. It has received insufficient attention in the past, and the mass balance of UHG-1 was used to verify that the geodetic results and the OGGM model simulation results are convincing. The main conclusions are: The area of 48.68 ± 4.59 km
2 delineated by 150 glaciers in 1958 decreased to 21.61 ± 0.27 km2 delineated by 108 glaciers in 2022, with a reduction of 0.47 ± 0.04 km2 ·a−1 (0.96% a−1 in 1958–2022). The glacier mass balance by geodesy is −0.69 ± 0.11 m w.e.a−1 in 2000–2022, which is just deviating from the measured result (−0.66 m w.e.a−1 ), but the geodetic result in this paper can be enough to reflect the glacier changes (−0.65 ± 0.11 m w.e.a−1 ) of the URB in 2000–2022. The future loss rate of area and volume will undergo a rapid and then decelerating process, with the fastest and slowest inflection points occurring around 2035 and 2070, respectively. High temperatures and large precipitation in summer accelerate glacier loss, and the corresponding lag period of glacier change to climate is about 2–3 years. [ABSTRACT FROM AUTHOR]- Published
- 2024
- Full Text
- View/download PDF
45. Remote Sensing Image Retrieval Algorithm for Dense Data.
- Author
-
Li, Xin, Liu, Shibin, and Liu, Wei
- Subjects
IMAGE retrieval ,GREEDY algorithms ,INFORMATION retrieval ,ALGORITHMS ,DATA quality - Abstract
With the rapid development of remote sensing technology, remote sensing products have found increasingly widespread applications across various fields. Nevertheless, as the volume of remote sensing image data continues to grow, traditional data retrieval techniques have encountered several challenges such as substantial query results, data overlap, and variations in data quality. Users need to manually browse and filter a large number of remote sensing datasets, which is a cumbersome and inefficient process. In order to cope with these problems of traditional remote sensing image retrieval methods, this paper proposes a remote sensing image retrieval algorithm for dense data (DD-RSIRA). The algorithm establishes evaluation metrics based on factors like imaging time, cloud coverage, and image coverage. The algorithm utilizes the global grids to create an ensemble coverage relation between images and grids. A locally optimal initial solution is obtained by a greedy algorithm, and then a local search is performed to search for the optimal solution by combining the strategies of weighted gain-loss scheme and novel mechanism. Ultimately, it achieves an optimal coverage of remote sensing images within the region of interest. In this paper, it is shown that the method obtains a smaller number of datasets with lower redundancy and higher data utilization and ensures the data quality to a certain extent in order to accurately meet the requirements of the regional coverage of remote sensing images. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
46. A Novel Method for Simplifying the Distribution Envelope of Green Tide for Fast Drift Prediction in the Yellow Sea, China.
- Author
-
Ding, Yi, Gao, Song, Huang, Guoman, Wu, Lingjuan, Wang, Zhiyong, Yuan, Chao, and Yu, Zhigang
- Subjects
REMOTE sensing ,AQUATIC sports ,MARINE ecology ,TOURISM impact ,AZIMUTH - Abstract
Since 2008, annual outbreaks of green tides in the Yellow Sea have had severe impacts on tourism, fisheries, water sports, and marine ecology, necessitating effective interception and removal measures. Satellite remote sensing has emerged as a promising tool for monitoring large-scale green tides due to its wide coverage and instantaneous imaging capabilities. Additionally, drift prediction techniques can forecast the location of future green tides based on remote sensing monitoring information. This monitoring and prediction information is crucial for developing an effective plan to intercept and remove green tides. One key aspect of this monitoring information is the green tide distribution envelope, which can be generated automatically and quickly using buffer analysis methods. However, this method produces a large number of envelope vertices, resulting in significant computational burden during prediction calculations. To address this issue, this paper proposes a simplification method based on azimuth difference and side length (SM-ADSL). Compared to the isometric and Douglas–Peucker methods with the same simplification rate, SM-ADSL exhibits better performance in preserving shape and area. The simplified distribution envelope can shorten prediction times and enhance the efficiency of emergency decision-making for green tide disasters. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
47. Lunar Exploration Based on Ground-Based Radar: Current Research Progress and Future Prospects.
- Author
-
Xu, Jiangwan, Ding, Chunyu, Su, Yan, Ding, Zonghua, Yang, Song, Li, Jiawei, Dong, Zehua, Sharma, Ravi, Qiu, Xiaohang, Lei, Zhonghan, Chen, Haoyu, Jiang, Changzhi, Chen, Wentao, Cheng, Qi, and Liang, Zihang
- Subjects
LUNAR exploration ,GROUND penetrating radar ,LUNAR soil ,SPACE flight to the moon ,LUNAR surface - Abstract
Lunar exploration is of significant importance in the development and utilization of in situ lunar resources, water ice exploration, and astronomical science. In recent years, ground-based radar (GBR) has gained increasing attention in the field of lunar exploration due to its flexibility, low cost, and penetrating capabilities. This paper reviews the scientific research on lunar exploration using GBR, outlining the basic principles of GBR and the progress made in lunar exploration studies. Our paper introduces the fundamental principles of lunar imaging using GBR and systematically reviews studies on lunar surface/subsurface detection, the dielectric properties inversion of the lunar regolith, and polar water ice detection using GBR. In particular, the paper summarizes the current development status of the Chinese GBR and forecasts future development trends in China. This review will enhance the understanding of lunar exploration results using GBR radar, systematically demonstrate the main applications and scientific achievements of GBR in lunar exploration, and provide a reference for GBR radar in future lunar exploration missions. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
48. AFMSFFNet: An Anchor-Free-Based Feature Fusion Model for Ship Detection.
- Author
-
Zhang, Yuxin, Dong, Chunlei, Guo, Lixin, Meng, Xiao, Liu, Yue, and Wei, Qihao
- Subjects
SYNTHETIC aperture radar ,SHIP models ,NETWORK performance ,PYRAMIDS ,SHIPS - Abstract
This paper aims to improve a small-scale object detection model to achieve detection accuracy matching or even surpassing that of complex models. Efforts are made in the module design phase to minimize parameter count as much as possible, thereby providing the potential for rapid detection of maritime targets. Here, this paper introduces an innovative Anchor-Free-based Multi-Scale Feature Fusion Network (AFMSFFNet), which improves the problems of missed detection and false positives, particularly in inshore or small target scenarios. Leveraging the YOLOX tiny as the foundational architecture, our proposed AFMSFFNet incorporates a novel Adaptive Bidirectional Fusion Pyramid Network (AB-FPN) for efficient multi-scale feature fusion, enhancing the saliency representation of targets and reducing interference from complex backgrounds. Simultaneously, the designed Multi-Scale Global Attention Detection Head (MGAHead) utilizes a larger receptive field to learn object features, generating high-quality reconstructed features for enhanced semantic information integration. Extensive experiments conducted on publicly available Synthetic Aperture Radar (SAR) image ship datasets demonstrate that AFMSFFNet outperforms the traditional baseline models in detection performance. The results indicate an improvement of 2.32% in detection accuracy compared to the YOLOX tiny model. Additionally, AFMSFFNet achieves a Frames Per Second (FPS) of 78.26 in SSDD, showcasing superior efficiency compared to the well-established performance networks, such as faster R-CNN and CenterNet, with efficiency improvement ranging from 4.7 to 6.7 times. This research provides a valuable solution for efficient ship detection in complex backgrounds, demonstrating the efficacy of AFMSFFNet through quantitative improvements in accuracy and efficiency compared to existing models. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
49. Sea–Land Segmentation of Remote-Sensing Images with Prompt Mask-Attention.
- Author
-
Ji, Yingjie, Wu, Weiguo, Nie, Shiqiang, Wang, Jinyu, and Liu, Song
- Subjects
REMOTE-sensing images ,IMAGE segmentation ,REMOTE sensing ,DEEP learning ,QUANTITATIVE research - Abstract
Remote-sensing technology has gradually become one of the most important ways to extract sea–land boundaries due to its large scale, high efficiency, and low cost. However, sea–land segmentation (SLS) is still a challenging problem because of data diversity and inconsistency, "different objects with the same spectrum" or "the same object with different spectra", and noise and interference problems, etc. In this paper, a new sea–land segmentation method (PMFormer) for remote-sensing images is proposed. The contributions are mainly two points. First, based on Mask2Former architecture, we introduce the prompt mask by normalized difference water index (NDWI) of the target image and prompt encoder architecture. The prompt mask provides more reasonable constraints for attention so that the segmentation errors are alleviated in small region boundaries and small branches, which are caused by insufficiency of prior information by large data diversity or inconsistency. Second, for the large intra-class difference problem in the foreground–background segmentation in sea–land scenes, we use deep clustering to simplify the query vectors and make them more suitable for binary segmentation. Then, traditional NDWI and eight other deep-learning methods are thoroughly compared with the proposed PMFormer on three open sea–land datasets. The efficiency of the proposed method is confirmed, after the quantitative analysis, qualitative analysis, time consumption, error distribution, etc. are presented by detailed contrast experiments. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
50. SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing.
- Author
-
Guo, Xiaomeng and Xu, Baoyi
- Subjects
SPECKLE interference ,SYNTHETIC aperture radar ,SPECKLE interferometry ,FEATURE extraction ,SPINE - Abstract
Monitoring aircraft using synthetic aperture radar (SAR) images is a very important task. Given its coherent imaging characteristics, there is a large amount of speckle interference in the image. This phenomenon leads to the scattering information of aircraft targets being masked in SAR images, which is easily confused with background scattering points. Therefore, automatic detection of aircraft targets in SAR images remains a challenging task. For this task, this paper proposes a framework for speckle reduction preprocessing of SAR images, followed by the use of an improved deep learning method to detect aircraft in SAR images. Firstly, to improve the problem of introducing artifacts or excessive smoothing in speckle reduction using total variation (TV) methods, this paper proposes a new nonconvex total variation (NTV) method. This method aims to ensure the effectiveness of speckle reduction while preserving the original scattering information as much as possible. Next, we present a framework for aircraft detection based on You Only Look Once v8 (YOLOv8) for SAR images. Therefore, the complete framework is called SAR-NTV-YOLOv8. Meanwhile, a high-resolution small target feature head is proposed to mitigate the impact of scale changes and loss of depth feature details on detection accuracy. Then, an efficient multi-scale attention module was proposed, aimed at effectively establishing short-term and long-term dependencies between feature grouping and multi-scale structures. In addition, the progressive feature pyramid network was chosen to avoid information loss or degradation in multi-level transmission during the bottom-up feature extraction process in Backbone. Sufficient comparative experiments, speckle reduction experiments, and ablation experiments are conducted on the SAR-Aircraft-1.0 and SADD datasets. The results have demonstrated the effectiveness of SAR-NTV-YOLOv8, which has the most advanced performance compared to other mainstream algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.