Descriptor: "Geometric transformation" / Publisher: mdpi ag - Searchworks@Jio Institute Digital Library Search Results

Search for "Geometric transformation" returned 30 results.

1. Investigating Effective Geometric Transformation for Image Augmentation to Improve Static Hand Gestures with a Pre-Trained Convolutional Neural Network

Author: Baiti-Ahmad Awaluddin, Chun-Tang Chao, and Juing-Shian Chiou
Subjects: hand gesture recognition, image augmentation, geometric transformation, ResNet, MobileNet, inception, Mathematics, QA1-939
Abstract: Hand gesture recognition (HGR) is a challenging and fascinating research topic in computer vision with numerous daily life applications. In HGR, computers aim to identify and classify hand gestures. The limited diversity of the dataset used in HGR is due to the limited number of hand gesture demonstrators, acquisition environments, and hand pose variations despite previous efforts. Geometric image augmentations are commonly used to address these limitations. These augmentations include scaling, translation, rotation, flipping, and image shearing. However, research has yet to focus on identifying the best geometric transformations for augmenting the HGR dataset. This study employed three commonly utilized pre-trained models for image classification tasks, namely ResNet50, MobileNetV2, and InceptionV3. The system’s performance was evaluated on five static HGR datasets: DLSI, HG14, ArabicASL, MU HandImages ASL, and Sebastian Marcell. The experimental results demonstrate that many geometric transformations are unnecessary for HGR image augmentation. Image shearing and horizontal flipping are the most influential transformations for augmenting the HGR dataset and achieving better classification performance. Moreover, ResNet50 outperforms MobileNetV2 and InceptionV3 for static HGR.
Published: 2023
Full Text: View/download PDF
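Note on record 1: the abstract lists scaling, translation, rotation, flipping, and shearing as the geometric augmentations it studied. As an illustrative sketch only (not the authors' code), each of these can be written as a 3x3 homogeneous matrix acting on pixel coordinates. The function names below, and the convention that a flip mirrors about the vertical centre line of an image `width` pixels wide, are assumptions made for this example.

```python
import math

def matmul(a, b):
    """Multiply two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def scaling(sx, sy):
    """Scale x by sx and y by sy about the origin."""
    return [[sx, 0.0, 0.0], [0.0, sy, 0.0], [0.0, 0.0, 1.0]]

def translation(tx, ty):
    """Shift by (tx, ty)."""
    return [[1.0, 0.0, tx], [0.0, 1.0, ty], [0.0, 0.0, 1.0]]

def rotation(degrees):
    """Rotate about the origin; positive angles turn the x-axis toward the y-axis."""
    t = math.radians(degrees)
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def horizontal_flip(width):
    """Mirror x about the vertical centre line (assumed pixel columns 0..width-1)."""
    return [[-1.0, 0.0, width - 1.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

def shear_x(k):
    """Shear along x: each point moves sideways by k times its y coordinate."""
    return [[1.0, k, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

def apply(m, point):
    """Apply a 3x3 homogeneous affine matrix to a 2D point."""
    x, y = point
    return (m[0][0] * x + m[0][1] * y + m[0][2],
            m[1][0] * x + m[1][1] * y + m[1][2])
```

Transformations compose by matrix multiplication, with the rightmost matrix applied first: `matmul(translation(1, 2), scaling(2, 2))` scales a point and then shifts it. An image library would resample pixel values through the inverse of such a matrix; that step is left out here.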

2. Fittings Detection Method Based on Multi-Scale Geometric Transformation and Attention-Masking Mechanism

Author: Ning Wang, Ke Zhang, Jinwei Zhu, Liuqi Zhao, Zhenlin Huang, Xing Wen, Yuheng Zhang, and Wenshuo Lou
Subjects: geometric transformation, fittings, object detection, transformer, Chemical technology, TP1-1185
Abstract: Overhead transmission lines are important lifelines in power systems, and the research and application of their intelligent patrol technology is one of the key technologies for building smart grids. The main reason for the low detection performance of fittings is the wide range of some fittings’ scale and large geometric changes. In this paper, we propose a fittings detection method based on multi-scale geometric transformation and attention-masking mechanism. Firstly, we design a multi-view geometric transformation enhancement strategy, which models geometric transformation as a combination of multiple homomorphic images to obtain image features from multiple views. Then, we introduce an efficient multiscale feature fusion method to improve the detection performance of the model for targets with different scales. Finally, we introduce an attention-masking mechanism to reduce the computational burden of model-learning multiscale features, thereby further improving model performance. In this paper, experiments have been conducted on different datasets, and the experimental results show that the proposed method greatly improves the detection accuracy of transmission line fittings.
Published: 2023
Full Text: View/download PDF

3. Image Processing and QR Code Application Method for Construction Safety Management

Author: Joon-Soo Kim, Chang-Yong Yi, and Young-Jun Park
Subjects: safety management, AI based, QR code, image processing, geometric transformation, yolo, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: Construction safety accidents occur due to a combination of factors. Even a minor accident that could have been treated as a simple injury can lead to a serious accident or death, depending on when and where it occurred. Currently, methods for tracking worker behavior to manage such construction safety accidents are being studied. However, applying the methods to the construction site, various additional elements (e.g., sensors, transmitters, wearing equipment, and control systems) that must be additionally installed and managed are required. The cost of installation and management of these factors increases in proportion to the size of the site and the number of targets to be managed. In addition, the application of new equipment and new rules lowers the work efficiency of workers. In this paper, the following contents are described: (1) system overview, (2) image processing-QR code-based safety management target recognition methodology, and (3) object location discrimination technique applying the geometric transformation. Finally, the proposed methodology was tested to confirm the operation in the field, and the experimental results and conclusions were described in the paper.
Published: 2021
Full Text: View/download PDF

4. Data Augmentation Method by Applying Color Perturbation of Inverse PSNR and Geometric Transformations for Object Recognition Based on Deep Learning

Author: Eun Kyeong Kim, Hansoo Lee, Jin Yong Kim, and Sungshin Kim
Subjects: image data augmentation, data deficiency, deep learning, small dataset, color perturbation, geometric transformation, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: Deep learning is applied in various manufacturing domains. To train a deep learning network, we must collect a sufficient amount of training data. However, it is difficult to collect image datasets required to train the networks to perform object recognition, especially because target items that are to be classified are generally excluded from existing databases, and the manual collection of images poses certain limitations. Therefore, to overcome the data deficiency that is present in many domains including manufacturing, we propose a method of generating new training images via image pre-processing steps, background elimination, target extraction while maintaining the ratio of the object size in the original image, color perturbation considering the predefined similarity between the original and generated images, geometric transformations, and transfer learning. Specifically, to demonstrate color perturbation and geometric transformations, we compare and analyze the experiments of each color space and each geometric transformation. The experimental results show that the proposed method can effectively augment the original data, correctly classify similar items, and improve the image classification accuracy. In addition, it also demonstrates that the effective data augmentation method is crucial when the amount of training data is small.
Published: 2020
Full Text: View/download PDF

5. An SRTM-Aided Epipolar Resampling Method for Multi-Source High-Resolution Satellite Stereo Observation

Author: Jingwen Hu, Gui-Song Xia, and Hong Sun
Subjects: epipolar resampling, high-resolution satellite image, stereo observation, SRTM, dense image-matching, RFM, geometric transformation, Science
Abstract: Binocular stereo observation with multi-source satellite images used to be challenging and impractical, but is now a valuable research issue with the introduction of powerful deep-learning-based stereo matching approaches. However, epipolar resampling, which is critical for binocular stereo observation, has rarely been studied with multi-source satellite images. The main problem is that, under the multi-source stereo mode, the epipolar-line-direction (ELD) at an image location may vary when computed with different elevations. Thus, a novel SRTM (Shuttle Radar Topography Mission)-aided approach is proposed, where a point is transformed from the original image-space to the epipolar image-space through a global rotation, followed by a block-wise homography transformation. The global rotation transfers the ELDs at the center of the overlapping area to the x-axis, and then block-wise transformation shifts the ELDs of all grid-points to the x-axis and eliminates the y-disparities between the virtual corresponding points. Experiments with both single-source and multi-source stereo images showed that the proposed method is obviously more accurate than the previous methods that do not use SRTM. Moreover, with some of the multi-source image pairs, only the proposed method ensured the y-disparities remained within ±1 pixel.
Published: 2019
Full Text: View/download PDF

6. Faster R-CNN and Geometric Transformation-Based Detection of Driver’s Eyes Using Multiple Near-Infrared Camera Sensors

Author: Sung Ho Park, Hyo Sik Yoon, and Kang Ryoung Park
Subjects: gaze tracking, driver’s eye detection, shallow CNN, faster R-CNN, geometric transformation, Chemical technology, TP1-1185
Abstract: Studies are being actively conducted on camera-based driver gaze tracking in a vehicle environment for vehicle interfaces and analyzing forward attention for judging driver inattention. In existing studies on the single-camera-based method, there are frequent situations in which the eye information necessary for gaze tracking cannot be observed well in the camera input image owing to the turning of the driver’s head during driving. To solve this problem, existing studies have used multiple-camera-based methods to obtain images to track the driver’s gaze. However, this method has the drawback of an excessive computation process and processing time, as it involves detecting the eyes and extracting the features of all images obtained from multiple cameras. This makes it difficult to implement it in an actual vehicle environment. To solve these limitations of existing studies, this study proposes a method that uses a shallow convolutional neural network (CNN) for the images of the driver’s face acquired from two cameras to adaptively select camera images more suitable for detecting eye position; faster R-CNN is applied to the selected driver images, and after the driver’s eyes are detected, the eye positions of the camera image of the other side are mapped through a geometric transformation matrix. Experiments were conducted using the self-built Dongguk Dual Camera-based Driver Database (DDCD-DB1) including the images of 26 participants acquired from inside a vehicle and the Columbia Gaze Data Set (CAVE-DB) open database. The results confirmed that the performance of the proposed method is superior to those of the existing methods.
Published: 2019
Full Text: View/download PDF

7. Ground-Based MIMO-SAR Fast Imaging Algorithm Based on Geometric Transformation

Author: Qihong Dan, Chunrui Yu, Shisheng Huang, Tao Lai, Haifeng Huang, Wu Chen, and Duojie Weng
Subjects: ground-based synthetic aperture radar, multiple-input multiple-output, phase center approximation error, subaperture imaging, geometric transformation, Computer Networks and Communications, Hardware and Architecture, Control and Systems Engineering, Signal Processing, Electrical and Electronic Engineering
Abstract: Ground-based multiple-input multiple-output synthetic aperture radar (MIMO-SAR) is a new type of deformation monitoring sensor that has the advantages of no mechanical motion and fast echo acquisition. Ground-based MIMO-SAR can significantly improve the data rate of deformation monitoring. In this paper, a fast imaging algorithm tailored for ground-based MIMO-SAR data is proposed, which can be applied in both far-field and near-field scenarios. First, the phase center approximation error of the non-collinear array in the near field is analyzed. Then, a fast imaging algorithm based on geometric transformation for the coherent synthesis of subimages is put forward. The algorithm uses the geometric transformation to convert the subaperture imaging results into the full aperture coordinate system, which avoids the point-by-point interpolation calculation and further reduces the computational cost of the subimage coherent synthesis algorithm. Simulations and experiments show that the algorithm can achieve high-precision focusing imaging, and its operation efficiency is significantly improved compared with the algorithm based on interpolation.
Published: 2023
Full Text: View/download PDF

8. Reconstruction of Cylindrical Surfaces Using Digital Image Correlation

Author: Adilson Berveglieri and Antonio M. G. Tommaselli
Subjects: cylinder, geometric transformation, LSM, optical measurement, reconstruction, Chemical technology, TP1-1185
Abstract: A technique for the reconstruction of cylindrical surfaces using optical images with an extension of least squares matching is presented. This technique is based on stereo-image acquisition of a cylindrical object, and it involves displacing the camera following the object length. The basic concept behind this technique is that variations in the camera viewpoint over a cylindrical object produce perspective effects similar to a conic section in an image sequence. Such parallax changes are continuous and can be modelled by a second-order function, which is combined with an adaptive least squares matching (ALSM) for the 3D object reconstruction. Using this concept, a photogrammetric intersection with only two image patches can be used to model a cylindrical object with high accuracy. Experiments were conducted with a cylinder on a panel with coded targets to assess the 3D reconstruction accuracy. The accuracy assessment was based on a comparison between the estimated diameter and the diameter directly measured over the cylinder. The difference between the diameters indicated an accuracy of 1/10 mm, and the cylindrical surface was entirely reconstructed.
Published: 2018
Full Text: View/download PDF

9. Image Processing and QR Code Application Method for Construction Safety Management

Author: Chang-Yong Yi, Young-Jun Park, and Joon-Soo Kim
Subjects: Technology, Computer science, QH301-705.5, QC1-999, 0211 other engineering and technologies, 020101 civil engineering, Image processing, 02 engineering and technology, Field (computer science), 0201 civil engineering, 021105 building & construction, Code (cryptography), General Materials Science, Biology (General), Instrumentation, QD1-999, Application methods, Fluid Flow and Transfer Processes, Process Chemistry and Technology, Physics, Geometric transformation, General Engineering, yolo, QR code, Object (computer science), Engineering (General). Civil engineering (General), Computer Science Applications, Construction site safety, Reliability engineering, image processing, geometric transformation, Chemistry, Control system, AI based, safety management, TA1-2040
Abstract: Construction safety accidents occur due to a combination of factors. Even a minor accident that could have been treated as a simple injury can lead to a serious accident or death, depending on when and where it occurred. Currently, methods for tracking worker behavior to manage such construction safety accidents are being studied. However, applying the methods to the construction site, various additional elements (e.g., sensors, transmitters, wearing equipment, and control systems) that must be additionally installed and managed are required. The cost of installation and management of these factors increases in proportion to the size of the site and the number of targets to be managed. In addition, the application of new equipment and new rules lowers the work efficiency of workers. In this paper, the following contents are described: (1) system overview, (2) image processing-QR code-based safety management target recognition methodology, and (3) object location discrimination technique applying the geometric transformation. Finally, the proposed methodology was tested to confirm the operation in the field, and the experimental results and conclusions were described in the paper.
Published: 2021

10. Toward Autonomous UAV Localization via Aerial Image Registration

Author: Christopher Gilliam, Wenchao Li, Samantha Le May, Xuezhi Wang, Allison Kealy, Beth Jelfs, and Bill Moran
Subjects: 010504 meteorology & atmospheric sciences, Computer Networks and Communications, Computer science, landmark detection, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Image registration, lcsh:TK7800-8360, 02 engineering and technology, 01 natural sciences, UAV localization, Robustness (computer science), 0202 electrical engineering, electronic engineering, information engineering, Computer vision, Electrical and Electronic Engineering, Aerial image, 0105 earth and related environmental sciences, business.industry, Geometric transformation, SURF, lcsh:Electronics, image registration, Hardware and Architecture, Control and Systems Engineering, GNSS applications, Signal Processing, 020201 artificial intelligence & image processing, Artificial intelligence, UAV control, business
Abstract: Absolute localization of a flying UAV on its own in a global-navigation-satellite-system (GNSS)-denied environment is always a challenge. In this paper, we present a landmark-based approach where a UAV is automatically locked into the landmark scene shown in a georeferenced image via a feedback control loop, which is driven by the output of an aerial image registration. To pursue a real-time application, we design and implement a speeded-up-robust-features (SURF)-based image registration algorithm that focuses on efficiency and robustness under a 2D geometric transformation. A linear UAV controller with signals of four degrees of freedom is derived from the estimated transformation matrix. The approach is validated in a virtual simulation environment, with experimental results demonstrating the effectiveness and robustness of the proposed UAV self-localization system.
Published: 2021

11. Vein Pattern Verification and Identification Based on Local Geometric Invariants Constructed from Minutia Points and Augmented with Barcoded Local Feature

Author: Yutthana Pititheeraphab, Hisayuki Aoyama, Chuchart Pintavirooj, and Nuntachai Thongpance
Subjects: Biometrics, Computer science, dorsal hand vein recognition, Feature extraction, 0211 other engineering and technologies, Word error rate, 02 engineering and technology, lcsh:Technology, hybrid feature, Image (mathematics), Set (abstract data type), lcsh:Chemistry, biometric, 0202 electrical engineering, electronic engineering, information engineering, General Materials Science, affine invariant, Instrumentation, lcsh:QH301-705.5, contactless, Fluid Flow and Transfer Processes, Minutiae, 021110 strategic, defence & security studies, business.industry, lcsh:T, Process Chemistry and Technology, Geometric transformation, General Engineering, Pattern recognition, lcsh:QC1-999, Computer Science Applications, non-contact, lcsh:Biology (General), lcsh:QD1-999, Feature (computer vision), lcsh:TA1-2040, identification, 020201 artificial intelligence & image processing, vein pattern, Artificial intelligence, business, verification, lcsh:Engineering (General). Civil engineering (General), lcsh:Physics
Abstract: This paper presents the development of a hybrid feature (dorsal hand vein and dorsal geometry) modality for human recognition. Our proposed hybrid feature extraction method exploits two types of features: dorsal hand geometric-related and local vein pattern. Using geometric affine invariants, the peg-free system extracts minutia points and vein termination and bifurcation and constructs a set of geometric invariants, which are then used to establish the correspondence between two sets of minutiae, one for the query vein image and the other for the reference vein image. When the correspondence is established, geometric transformation parameters are computed to align the query with the reference image. Once aligned, hybrid features are extracted for identification. In this study, the algorithm was tested on a database of 140 subjects, in which ten different dorsal hand geometric-related images were taken for each individual, and yielded promising results. In this regard, we have achieved an equal error rate (EER) of 0.243%, indicating that our method is feasible and effective for dorsal vein recognition with high accuracy. This hierarchical scheme significantly improves the performance of personal verification and/or identification.
Published: 2020

12. Improving Positioning Accuracy via Map Matching Algorithm for Visual–Inertial Odometer

Author: Pu Wang, Jitong Zhang, Yuman Mou, Juan Meng, and Mingrong Ren
Subjects: Inertial frame of reference, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Map matching, conditional random field, lcsh:Chemical technology, 01 natural sciences, Biochemistry, Odometer, Article, Analytical Chemistry, indoor positioning system, Indoor positioning system, 0202 electrical engineering, electronic engineering, information engineering, lcsh:TP1-1185, Electrical and Electronic Engineering, Instrumentation, 010401 analytical chemistry, Geometric transformation, 020206 networking & telecommunications, Atomic and Molecular Physics, and Optics, 0104 chemical sciences, visual–inertial odometer, Feature (computer vision), map matching, Noise (video), Algorithm
Abstract: A visual–inertial odometer is used to fuse the image information obtained by a vision sensor with the data measured by an inertial sensor and recover the motion track online in a global frame. However, in an indoor environment, geometric transformation, sparse features, illumination changes, blurring, and noise will occur, which will either cause a reduction in or failure of the positioning accuracy. To solve this problem, a map matching algorithm based on an indoor plane structure map is proposed to improve the positioning accuracy of the system; this algorithm was implemented using a conditional random field model. The output of the attitude information from the visual–inertial odometer was used as the input of the conditional random field model. The feature function between the attitude information and the expected value was established, and the maximum probabilistic value of the attitude was estimated. Finally, the closed-loop feedback correction of the visual–inertial system was carried out with the probabilistic attitude value. A number of experiments were designed to verify the feasibility and reliability of the positioning method proposed in this paper.
Published: 2020

13. Point Cloud Registration Based on Multiparameter Functional

Author: Artyom Makovetskii, Aleksei Voronin, Sergei Voronin, and Vitaly Kober
Subjects: orthogonal transformations, neural network, Computer science, General Mathematics, MathematicsofComputing_GENERAL, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Point cloud, closed-form solution, iterative closest points (ICP), variational functionals, Computer Science::Robotics, DCP (deep closest points), surface reconstruction, Physics::Plasma Physics, computer geometry, QA1-939, Computer Science (miscellaneous), Point (geometry), Engineering (miscellaneous), Global optimization, normal ICP (NICP), Artificial neural network, Covariance matrix, global optimization, Geometric transformation, affine transformations, Maxima and minima, Computer Science::Programming Languages, Affine transformation, Algorithm, Mathematics
Abstract: The registration of point clouds in a three-dimensional space is an important task in many areas of computer vision, including robotics and autonomous driving. The purpose of registration is to find a rigid geometric transformation to align two point clouds. The registration problem can be affected by noise and partiality (two point clouds only have a partial overlap). The Iterative Closest Point (ICP) algorithm is a common method for solving the registration problem. Recently, artificial neural networks have begun to be used in the registration of point clouds. The drawback of ICP and other registration algorithms is the possible convergence to a local minimum. Thus, an important characteristic of a registration algorithm is the ability to avoid local minima. In this paper, we propose an ICP-type registration algorithm (λ-ICP) that uses a multiparameter functional (λ-functional). The proposed λ-ICP algorithm generalizes the NICP algorithm (normal ICP). The application of the λ-functional requires a consistent choice of the eigenvectors of the covariance matrix of two point clouds. The paper also proposes an algorithm for choosing the directions of eigenvectors. The performance of the proposed λ-ICP algorithm is compared with that of a standard point-to-point ICP and neural network Deep Closest Points (DCP).
Published: 2021
Full Text: View/download PDF
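Note on record 13: the abstract describes registration as finding a rigid transformation that aligns two point clouds. The sketch below is not the paper's λ-ICP. It shows only the least-squares alignment step for points whose correspondences are already known, reduced to 2D so that the rotation has a closed form without a matrix decomposition. A point-to-point ICP would alternate nearest-neighbour matching with a step of this kind (in 3D). The function names are hypothetical.

```python
import math

def rigid_align_2d(src, dst):
    """Least-squares rotation angle and translation mapping src[i] onto dst[i].

    src and dst are equal-length lists of (x, y) pairs, assumed to be in
    correspondence. Returns (theta_radians, (tx, ty)).
    """
    n = len(src)
    cx_s = sum(p[0] for p in src) / n
    cy_s = sum(p[1] for p in src) / n
    cx_d = sum(p[0] for p in dst) / n
    cy_d = sum(p[1] for p in dst) / n

    # Accumulate dot and cross products of the centred point pairs.
    s_dot = 0.0
    s_cross = 0.0
    for (xs, ys), (xd, yd) in zip(src, dst):
        ax, ay = xs - cx_s, ys - cy_s
        bx, by = xd - cx_d, yd - cy_d
        s_dot += ax * bx + ay * by
        s_cross += ax * by - ay * bx

    theta = math.atan2(s_cross, s_dot)
    c, s = math.cos(theta), math.sin(theta)
    # The translation carries the rotated source centroid onto the target centroid.
    tx = cx_d - (c * cx_s - s * cy_s)
    ty = cy_d - (s * cx_s + c * cy_s)
    return theta, (tx, ty)

def transform(theta, t, point):
    """Apply the rotation followed by the translation to one point."""
    c, s = math.cos(theta), math.sin(theta)
    x, y = point
    return (c * x - s * y + t[0], s * x + c * y + t[1])
```

With exact, noise-free correspondences this recovers the generating rotation and translation. With wrong correspondences it returns only the best fit for those pairs, which is how ICP-type methods can settle in the local minima the abstract mentions.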

14. RAHC_GAN: A Data Augmentation Method for Tomato Leaf Disease Recognition

Author: Zhangwei Chang, Xiaofeng Yang, Haifang Li, Hongxia Deng, and Dongsheng Luo
Subjects: Physics and Astronomy (miscellaneous), Artificial neural network, insufficient training data, Computer science, business.industry, General Mathematics, generative adversarial network, Geometric transformation, Pattern recognition, Residual, Data set, tomato disease recognition, Chemistry (miscellaneous), Test set, Classifier (linguistics), QA1-939, Computer Science (miscellaneous), Artificial intelligence, business, data augmentation, Mathematics, Block (data storage), Generator (mathematics)
Abstract: Accurate recognition of tomato diseases is of great significance for agricultural production. Sufficient and insufficient training data of supervised recognition neural network training are symmetry problems. A high precision neural network needs a large number of labeled data, and the difficulty of data sample acquisition is the main challenge to improving the performance of disease recognition. The tomato leaf data augmented by the traditional data augmentation methods based on geometric transformation usually contain less information, and the generalization is not strong. Therefore, a new data augmentation method, RAHC_GAN, based on generative adversarial networks is proposed in this paper, which is used to expand tomato leaf data and identify diseases. In this method, continuous hidden variables are added at the input of the generator, and the purpose is to continuously control the size of the generated disease area and to supplement the intra class information of the same disease. Additionally, the residual attention block is added to the generator to make it pay more attention to the disease region in the leaf image; a multi-scale discriminator is also used to enrich the detailed texture of the generated image and finally generate leaves with obvious disease features. Then, we use the images generated by RAHC_GAN and the original training images to build an expanded data set, which is used to train four kinds of recognition networks, AlexNet, VGGNet, GoogLeNet, and ResNet, and the performance is evaluated through the test set. Experimental results show that RAHC_GAN can generate leaves with obvious disease features, and the generated expanded data set can significantly improve the recognition performance of the classifier. Furthermore, the results of the apple, grape, and corn data set show that RAHC_GAN can also be used as a method to solve the problem of insufficient data in other plant research tasks.
Published: 2021
Full Text: View/download PDF

15. Evaluation of Data Augmentation Techniques for Facial Expression Recognition Systems

Author: Alessandro Floris, Luigi Atzori, and Simone Porcu
Subjects: Reflection (computer programming), Computer Networks and Communications, Computer science, convolutional neural network, lcsh:TK7800-8360, 02 engineering and technology, Translation (geometry), Convolutional neural network, 0202 electrical engineering, electronic engineering, information engineering, facial expression recognition, Electrical and Electronic Engineering, Transformation geometry, business.industry, generative adversarial network, lcsh:Electronics, Geometric transformation, 020206 networking & telecommunications, Pattern recognition, synthetic image database, machine learning, Facial expression recognition, Hardware and Architecture, Control and Systems Engineering, Signal Processing, 020201 artificial intelligence & image processing, Artificial intelligence, business, data augmentation
Abstract: Most Facial Expression Recognition (FER) systems rely on machine learning approaches that require large databases for an effective training. As these are not easily available, a good solution is to augment the databases with appropriate data augmentation (DA) techniques, which are typically based on either geometric transformation or oversampling augmentations (e.g., generative adversarial networks (GANs)). However, it is not always easy to understand which DA technique may be more convenient for FER systems because most state-of-the-art experiments use different settings which makes the impact of DA techniques not comparable. To advance in this respect, in this paper, we evaluate and compare the impact of using well-established DA techniques on the emotion recognition accuracy of a FER system based on the well-known VGG16 convolutional neural network (CNN). In particular, we consider both geometric transformations and GAN to increase the amount of training images. We performed cross-database evaluations: training with the "augmented" KDEF database and testing with two different databases (CK+ and ExpW). The best results were obtained combining horizontal reflection, translation and GAN, bringing an accuracy increase of approximately 30%. This outperforms alternative approaches, except for the one technique that could however rely on a quite bigger database.
Published: 2020
Full Text: View/download PDF

16. A Full-Spectrum Registration Method for Zhuhai-1 Satellite Hyperspectral Imagery

Author: Zhang Qiang, Meng Jinjun, Jiaqi Wu, Suyun Feng, Linlin Lu, Jun Yan, and Qingting Li
Subjects: hyperspectral image, 010504 meteorology & atmospheric sciences, Computer science, 0211 other engineering and technologies, Scale-invariant feature transform, Delaunay triangulation, 02 engineering and technology, RANSAC, lcsh:Chemical technology, 01 natural sciences, Biochemistry, Least squares, Article, Analytical Chemistry, lcsh:TP1-1185, Electrical and Electronic Engineering, scale-invariant feature transform, Instrumentation, 021101 geological & geomatics engineering, 0105 earth and related environmental sciences, business.industry, Geometric transformation, Hyperspectral imaging, Pattern recognition, Spectral bands, Atomic and Molecular Physics, and Optics, full-spectrums registration, Feature (computer vision), Artificial intelligence, Affine transformation, Zhuhai-1 satellite, business
Abstract: Accurate registration is an essential prerequisite for analysis and applications involving remote sensing imagery. It is usually difficult to extract enough matching points for inter-band registration in hyperspectral imagery due to the different spectral responses for land features in different image bands. This is especially true for non-adjacent bands. The inconsistency in geometric distortion caused by topographic relief also makes it inappropriate to use a single affine transformation relationship for the geometric transformation of the entire image. Currently, accurate registration between spectral bands of Zhuhai-1 satellite hyperspectral imagery remains challenging. In this paper, a full-spectrum registration method was proposed to address this problem. The method combines the transfer strategy based on the affine transformation relationship between adjacent spectrums with the differential correction from dense Delaunay triangulation. Firstly, the scale-invariant feature transform (SIFT) extraction method was used to extract and match feature points of adjacent bands. The RANdom SAmple Consensus (RANSAC) algorithm and the least squares method are then used to eliminate mismatching point pairs to obtain fine matching point pairs. Secondly, a dense Delaunay triangulation was constructed based on fine matching point pairs. The affine transformation relation for non-adjacent bands was established for each triangle using the affine transformation relation transfer strategy. Finally, the affine transformation relation was used to perform differential correction for each triangle. Three Zhuhai-1 satellite hyperspectral images covering different terrains were used as experiment data. The evaluation results showed that the adjacent band registration accuracy ranged from 0.2 to 0.6 pixels. The structural similarity measure and cosine similarity measure between non-adjacent bands were both greater than 0.80. Moreover, the full-spectrum registration accuracy was less than 1 pixel. These registration results can meet the needs of Zhuhai-1 hyperspectral imagery applications in various fields.
Published: 2020
Full Text: View/download PDF
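
Note on record 16: the per-triangle step mentioned in the abstract can be pictured with a small sketch. The code below is not the authors' implementation; it only shows, under simple assumptions, how one affine transformation is determined exactly by the three matched vertices of a single triangle. The function name `affine_from_triangle` and the sample coordinates are hypothetical.

```python
import numpy as np

def affine_from_triangle(src, dst):
    """Return the 2x3 affine matrix A such that A @ [x, y, 1] maps src to dst.

    src, dst: three non-collinear matched points each, shape (3, 2).
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    # Homogeneous source coordinates: one row [x, y, 1] per vertex.
    H = np.hstack([src, np.ones((3, 1))])
    # Solve H @ A.T = dst; three point pairs determine the six parameters exactly.
    return np.linalg.solve(H, dst).T

# Hypothetical triangle: scaled by 2 along x, then shifted by (2, 3).
src = [(0, 0), (1, 0), (0, 1)]
dst = [(2, 3), (4, 3), (2, 4)]
A = affine_from_triangle(src, dst)

# Map an arbitrary point inside the triangle with the recovered transform.
mapped = A @ np.array([0.25, 0.25, 1.0])
```

In a triangulated correction, one such matrix would be computed per Delaunay triangle and applied only to the pixels inside that triangle.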

17. Robust Hand Shape Features for Dynamic Hand Gesture Recognition Using Multi-Level Feature LSTM

Author: Guee-Sang Lee, Soo-Hyung Kim, Hyung-Jeong Yang, and Nhu-Tai Do
Subjects: Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, lcsh:Technology, lcsh:Chemistry, human-computer interaction, 020204 information systems, Component (UML), 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), General Materials Science, Segmentation, Pyramid (image processing), Representation (mathematics), lcsh:QH301-705.5, Instrumentation, ComputingMethodologies_COMPUTERGRAPHICS, Block (data storage), Fluid Flow and Transfer Processes, lcsh:T, business.industry, Dynamic Hand Gesture Recognition, Process Chemistry and Technology, Geometric transformation, General Engineering, Pattern recognition, lcsh:QC1-999, Computer Science Applications, lcsh:Biology (General), lcsh:QD1-999, lcsh:TA1-2040, Gesture recognition, hand shape features, 020201 artificial intelligence & image processing, Artificial intelligence, lcsh:Engineering (General). Civil engineering (General), business, lcsh:Physics
Abstract: This study builds robust hand shape features from the two modalities of depth and skeletal data for the dynamic hand gesture recognition problem. For the hand skeleton shape approach, we use the movement, the rotations of the hand joints with respect to their neighbors, and the skeletal point-cloud to learn the 3D geometric transformation. For the hand depth shape approach, we use the feature representation from the hand component segmentation model. Finally, we propose a multi-level feature LSTM with Conv1D, the Conv2D pyramid, and the LSTM block to deal with the diversity of hand features. Therefore, we propose a novel method by exploiting robust skeletal point-cloud features from skeletal data, as well as depth shape features from the hand component segmentation model in order for the multi-level feature LSTM model to benefit from both. Our proposed method achieves the best result on the Dynamic Hand Gesture Recognition (DHG) dataset with 14 and 28 classes for both depth and skeletal data with accuracies of 96.07% and 94.40%, respectively.
Published: 2020
Full Text: View/download PDF

18. Simultaneous Smoothing and Untangling of 2D Meshes Based on Explicit Element Geometric Transformation and Element Stitching

Author: Mingguang Geng, Zhihong Gou, and Shuli Sun
Subjects: Computer science, 010103 numerical & computational mathematics, explicit element geometric transformation, lcsh:Technology, 01 natural sciences, mesh smoothing, lcsh:Chemistry, Image stitching, parallel algorithm, mesh untangling, General Materials Science, Polygon mesh, 0101 mathematics, lcsh:QH301-705.5, Instrumentation, ComputingMethodologies_COMPUTERGRAPHICS, Fluid Flow and Transfer Processes, lcsh:T, Process Chemistry and Technology, Geometric transformation, General Engineering, lcsh:QC1-999, Computer Science Applications, 010101 applied mathematics, Transformation (function), lcsh:Biology (General), lcsh:QD1-999, lcsh:TA1-2040, Polygon, Element (category theory), lcsh:Engineering (General). Civil engineering (General), Unit (ring theory), Algorithm, lcsh:Physics, Smoothing
Abstract: Mesh quality can affect both the accuracy and efficiency of numerical solutions. This paper proposes a geometry-based smoothing and untangling method for 2D meshes based on explicit element geometric transformation and element stitching. A new explicit element geometric transformation (EEGT) operation for polygonal elements is first presented. The transformation, if applied iteratively to an arbitrary polygon (even an inverted one), will improve its regularity and quality. Then a well-designed element stitching scheme is introduced, which is achieved by carefully choosing appropriate element weights to average the temporary nodes obtained by the above individual element transformation. Based on the explicit element geometric transformation and element stitching, a new mesh smoothing and untangling approach for 2D meshes is proposed. The proper choice of averaging weights for element stitching ensures that the elements can be transitioned smoothly and uniformly throughout the calculation domain. Numerical results show that the proposed method is able to produce high-quality meshes with no inverted elements for highly tangled meshes. In addition, the inherent regularity and fine-grained parallelism make it suitable for implementation on a graphics processing unit (GPU).
Published: 2020
Full Text: View/download PDF

19. Morphological Band Registration of Multispectral Cameras for Water Quality Analysis with Unmanned Aerial Vehicle

Author: Wonkook Kim, Stephen C. Mangum, Sunghun Jung, and Yongseon Moon
Subjects: 0106 biological sciences, Micasense Rededge-M, 010504 meteorology & atmospheric sciences, Image quality, Science, Multispectral image, water quality, 01 natural sciences, Displacement (vector), law.invention, band registration, law, Image resolution, 0105 earth and related environmental sciences, Remote sensing, Pixel, 010604 marine biology & hydrobiology, Geometric transformation, multispectral camera, morphological registration, Lens (optics), General Earth and Planetary Sciences, Common spatial pattern, Geology
Abstract: Multispectral imagery contains abundant spectral information on terrestrial and oceanic targets, and retrieval of the geophysical variables of the targets is possible when the radiometric integrity of the data is secured. Multispectral cameras typically require the registration of individual band images because their lens locations for individual bands are often displaced from each other, thereby generating images of different viewing angles. Although this type of displacement can be corrected through a geometric transformation of the image coordinates, a mismatch or misregistration between the bands still remains, owing to the image acquisition timing that differs by bands. Even a short time difference is critical for the image quality of fast-moving targets, such as water surfaces, and this type of deformation cannot be compensated for with a geometric transformation between the bands. This study proposes a novel morphological band registration technique, based on the quantile matching method, for which the correspondence between the pixels of different bands is not sought by their geometric relationship, but by the radiometric distribution constructed in the vicinity of the pixel. In this study, a Micasense Rededge-M camera was operated on an unmanned aerial vehicle and multispectral images of coastal areas were acquired at various altitudes to examine the performance of the proposed method for different spatial scales. To assess the impact of the correction on a geophysical variable, the performance of the proposed method was evaluated for the chlorophyll-a concentration estimation. The results showed that the proposed method successfully removed the noisy spatial pattern caused by misregistration while maintaining the original spatial resolution for both homogeneous scenes and an episodic scene with a red tide outbreak.
Published: 2020
Full Text: View/download PDF

20. LiDAR and Camera Fusion Approach for Object Distance Estimation in Self-Driving Vehicles

Author: Soon Kwon, Jongrak Hwang, G Ajay Kumar, Sung Hoon Youn, Jaehyeong Park, and Jin-Hee Lee
Subjects: Physics and Astronomy (miscellaneous), Computer science, General Mathematics, projection, 01 natural sciences, 010309 optics, computational geometry transformation, 0103 physical sciences, Computer Science (miscellaneous), Computer vision, Projection (set theory), sensor fusion, business.industry, lcsh:Mathematics, autonomous vehicle, 010401 analytical chemistry, Geometric transformation, Robotics, Ranging, lcsh:QA1-939, Sensor fusion, Automation, 0104 chemical sciences, depth sensing, Transformation (function), Lidar, Chemistry (miscellaneous), self-driving vehicle, sensor calibration, point cloud to image mapping, Artificial intelligence, business
Abstract: The fusion of light detection and ranging (LiDAR) and camera data in real-time is known to be a crucial process in many applications, such as in autonomous driving, industrial automation, and robotics. Especially in the case of autonomous vehicles, the efficient fusion of data from these two types of sensors is important for estimating the depth of objects as well as detecting objects at short and long distances. As both sensors are capable of capturing different attributes of the environment simultaneously, the integration of those attributes with an efficient fusion approach greatly benefits the reliable and consistent perception of the environment. This paper presents a method to estimate the distance (depth) between a self-driving car and other vehicles, objects, and signboards on its path using an accurate fusion approach. Based on the geometrical transformation and projection, low-level sensor fusion was performed between a camera and LiDAR using a 3D marker. Further, the fusion information is utilized to estimate the distance of objects detected by the RefineDet detector. Finally, the accuracy and performance of the sensor fusion and distance estimation approach were evaluated in terms of quantitative and qualitative analysis by considering real road and simulation environment scenarios. Thus, the proposed low-level sensor fusion, based on the computational geometric transformation and projection for object distance estimation, proves to be a promising solution for enabling reliable and consistent environment perception for autonomous vehicles.
Published: 2020
Full Text: View/download PDF
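
Note on record 20: the "geometrical transformation and projection" in the abstract can be illustrated with a generic pinhole-camera sketch. This is not the authors' calibration or code. The rotation `R`, translation `t`, intrinsic matrix `K`, and the function name `project_lidar_to_image` are all hypothetical, and lens distortion is ignored.

```python
import numpy as np

def project_lidar_to_image(points, R, t, K):
    """Project Nx3 LiDAR points to pixel coordinates with a pinhole model.

    R (3x3) and t (3,) map LiDAR coordinates into the camera frame;
    K (3x3) is the camera intrinsic matrix. Returns (pixels, depths)
    for the points that lie in front of the camera.
    """
    points = np.asarray(points, dtype=float)
    cam = points @ R.T + t             # rigid transform into the camera frame
    cam = cam[cam[:, 2] > 0]           # keep only points with positive depth
    uvw = cam @ K.T                    # homogeneous image coordinates
    pixels = uvw[:, :2] / uvw[:, 2:3]  # perspective divide gives (u, v)
    return pixels, cam[:, 2]

# Hypothetical calibration: aligned axes, no offset, 500 px focal length.
K = np.array([[500.0, 0.0, 320.0],
              [0.0, 500.0, 240.0],
              [0.0, 0.0, 1.0]])
R = np.eye(3)
t = np.zeros(3)
pixels, depths = project_lidar_to_image(
    [[1.0, 0.5, 10.0], [0.0, 0.0, -5.0]], R, t, K
)
```

Once LiDAR points are mapped to pixels this way, the depth of a detected object can be read from the points that fall inside its bounding box.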

21. Optimal Tracking of QR Inspired LEA Using Particle Filter for Secured Visual MIMO Communication Based Vehicular Network

Author: Manikandan Chinnusamy, Rakesh Kumar Sidharthan, Sai Siva Satwik Kommi, Venkatesh Sivanandam, Chandrasekar Mallari Rao, and Neelamegam Periasamy
Subjects: lcsh:Applied optics. Photonics, adaptive threshold technique, Computer science, Real-time computing, MIMO, 02 engineering and technology, 01 natural sciences, 010309 optics, 020210 optoelectronics & photonics, Robustness (computer science), 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, Radiology, Nuclear Medicine and imaging, Randomness tests, Instrumentation, particle filter, Key generation, QR-Code, Vehicular ad hoc network, visual MIMO, LFSR based random key generation, Geometric transformation, lcsh:TA1501-1820, Atomic and Molecular Physics, and Optics, Bit error rate, Particle filter
Abstract: Line of sight (LoS) and security are the fundamental requirements for any secure visual MIMO communication based vehicular network. Uneven speed, irregular terrains, and the uncertain appearance of obstacles degrade LoS between the visual multiple-input multiple-output (MIMO) transmitter and receiver. This paper proposes optimal tracking techniques to maintain LoS using a novel quick response (QR) inspired light emitting array (LEA) pattern. Automatic geometric transformation of the QR code in conjunction with the particle filter is used to track the dynamically varying LEA position. In addition, an adaptive threshold technique (ATT) has been proposed to minimize errors caused by ambient light variations and interference. A novel key generation algorithm is also proposed to improve the security of the visual MIMO communication system. It utilizes a customized linear feedback shift register (LFSR) and synthetic color image to generate a stronger key. The randomness test demonstrates the robustness of the proposed key generation technique with a minimal probability value of 0.1223 (> 0.01). Simulation and real-time studies have been carried out in the presence of geometric distortions, and the bit error rate (BER) performance is evaluated. Results illustrate that the proposed techniques make the visual MIMO system more secure and reliable for vehicular networks.
Published: 2019
Full Text: View/download PDF
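
Note on record 21: for readers unfamiliar with linear feedback shift registers, the sketch below shows a plain 16-bit Fibonacci LFSR producing key bytes. It is a generic textbook construction, not the customized LFSR or the image-based scheme from the paper, and a bare LFSR is not cryptographically secure on its own. The seed value and the function names `lfsr16_bits` and `key_bytes` are hypothetical.

```python
def lfsr16_bits(seed, count):
    """Yield `count` bits from a 16-bit Fibonacci LFSR.

    Taps follow the polynomial x^16 + x^14 + x^13 + x^11 + 1,
    a standard maximal-length choice (period 2^16 - 1).
    """
    state = seed & 0xFFFF
    if state == 0:
        raise ValueError("seed must be non-zero")
    for _ in range(count):
        # XOR the tapped bits to form the feedback bit.
        feedback = (state ^ (state >> 2) ^ (state >> 3) ^ (state >> 5)) & 1
        out = state & 1                          # emit the lowest bit
        state = (state >> 1) | (feedback << 15)  # shift right, insert feedback
        yield out

def key_bytes(seed, n_bytes):
    """Pack LFSR output bits into bytes (first bit is the most significant)."""
    bits = list(lfsr16_bits(seed, 8 * n_bytes))
    return bytes(
        int("".join(map(str, bits[i:i + 8])), 2)
        for i in range(0, len(bits), 8)
    )

key = key_bytes(0xACE1, 16)  # hypothetical seed, 128-bit key
```

The same seed always reproduces the same key, which is why a scheme of this kind needs a seed source that an attacker cannot predict.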

22. The Extension of Phase Correlation to Image Perspective Distortions Based on Particle Swarm Optimization

Author: Chenhui Wang, Xue Wan, and Shengyang Li
Subjects: Similarity (geometry), Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 0211 other engineering and technologies, perspective, Image registration, Image processing, 02 engineering and technology, lcsh:Chemical technology, Translation (geometry), Biochemistry, Article, Analytical Chemistry, phase correlation, 0202 electrical engineering, electronic engineering, information engineering, lcsh:TP1-1185, Computer vision, Electrical and Electronic Engineering, Instrumentation, 021101 geological & geomatics engineering, business.industry, Geometric transformation, Motion blur, Particle swarm optimization, Atomic and Molecular Physics, and Optics, Computer Science::Computer Vision and Pattern Recognition, Phase correlation, 020201 artificial intelligence & image processing, Artificial intelligence, business, optimization
Abstract: Phase correlation is one of the widely used image registration methods in medical image processing and remote sensing. One of the main limitations of the phase correlation-based registration method is that it can only cope with Euclidean transformations, such as translation, rotation and scale, which constrains its application in wider fields, such as multi-view image matching, image-based navigation, etc. In this paper, we extend phase correlation to perspective transformation by combining it with particle swarm optimization. Inspired by optic lens alignment based on interference, we propose to use the quality of PC fringes as the similarity measure, so that the aim of registration is to search for the optimized geometric transformation operator, which obtains the maximum value of the PC-based similarity function through the particle swarm optimization approach. The proposed method is validated by image registration experiments using simulated terrain shading, texture and natural landscape images containing different challenges, including illumination variation, lack of texture, motion blur, occlusion and geometric distortions. Further, image-based navigation experiments are carried out to demonstrate that the proposed method is able to correctly recover the trajectory of the camera using multimodal target and reference images. Even under great radiometric and geometric distortions, the proposed method is able to achieve 0.1 sub-pixel matching accuracy on average while other methods fail to find the correspondence.
Published: 2019
Full Text: View/download PDF
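
Note on record 22: the baseline that this paper extends, classical phase correlation for pure translation, can be sketched in a few lines. The code below covers only integer translation between two equally sized images; it does not include the perspective extension or the particle swarm search. The function name `phase_correlation_shift` and the random test image are hypothetical.

```python
import numpy as np

def phase_correlation_shift(reference, moved):
    """Estimate the integer (row, col) shift d with moved == np.roll(reference, d)."""
    F_ref = np.fft.fft2(reference)
    F_mov = np.fft.fft2(moved)
    # Normalized cross-power spectrum keeps only the phase difference.
    cross = F_mov * np.conj(F_ref)
    cross /= np.maximum(np.abs(cross), 1e-12)
    # Its inverse transform peaks at the translation (modulo the image size).
    corr = np.fft.ifft2(cross).real
    peak = np.unravel_index(np.argmax(corr), corr.shape)
    # Peaks past the midpoint correspond to negative shifts.
    return tuple(int(p) if p <= s // 2 else int(p) - s
                 for p, s in zip(peak, corr.shape))

# Hypothetical test image, circularly shifted by 5 rows and -3 columns.
rng = np.random.default_rng(0)
image = rng.random((64, 64))
shifted = np.roll(image, shift=(5, -3), axis=(0, 1))
estimated = phase_correlation_shift(image, shifted)
```

Real image pairs are not circular shifts of each other, so practical implementations usually apply a window function and fit the peak to reach sub-pixel precision.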

23. Automatic Annotation of Airborne Images by Label Propagation Based on a Bayesian-CRF Model

Author: Franz Kurz, Peter Reinartz, Friedrich Fraundorfer, and Xiangyu Zhuo
Subjects: Conditional random field, 010504 meteorology & atmospheric sciences, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, 01 natural sciences, Convolutional neural network, automatic image annotation, 0202 electrical engineering, electronic engineering, information engineering, lcsh:Science, 0105 earth and related environmental sciences, label propagation, Photogrammetrie und Bildanalyse, Pixel, Contextual image classification, business.industry, Deep learning, Geometric transformation, Cognitive neuroscience of visual object recognition, Pattern recognition, Convolutional Neural Network (CNN), Automatic image annotation, General Earth and Planetary Sciences, lcsh:Q, 020201 artificial intelligence & image processing, Artificial intelligence, business, Conditional Random Field (CRF)
Abstract: The tremendous advances in deep neural networks have demonstrated the superiority of deep learning techniques for applications such as object recognition or image classification. Nevertheless, deep learning-based methods usually require a large amount of training data, which mainly comes from manual annotation and is quite labor-intensive. In order to reduce the amount of manual work required for generating enough training data, we hereby propose to leverage existing labeled data to generate image annotations automatically. Specifically, the pixel labels are first transferred from one image modality to another image modality via geometric transformation to create initial image annotations, and then additional information (e.g., height measurements) is incorporated for Bayesian inference to update the labeling beliefs. Finally, the updated label assignments are optimized with a fully connected conditional random field (CRF), yielding refined labeling for all pixels in the image. The proposed approach is tested on two different scenarios, i.e., (1) label propagation from annotated aerial imagery to unmanned aerial vehicle (UAV) imagery and (2) label propagation from map database to aerial imagery. In each scenario, the refined image labels are used as pseudo-ground truth data for training a convolutional neural network (CNN). Results demonstrate that our model is able to produce accurate label assignments even around complex object boundaries. In addition, the generated image labels can be effectively leveraged for training CNNs and achieve classification accuracy comparable to that of manual image annotations. More specifically, the per-class classification accuracies of the networks trained on the manual image annotations and on the generated image labels differ by no more than ±5%.
Published: 2019
Full Text: View/download PDF

24. An Improved InSAR Image Co-Registration Method for Pairs with Relatively Big Distortions or Large Incoherent Areas

Author: Guo Zhang, Lei Zhang, and Zhenwei Chen
Subjects: Polynomial, 010504 meteorology & atmospheric sciences, Computer science, 0211 other engineering and technologies, Scale (descriptive set theory), 02 engineering and technology, lcsh:Chemical technology, 01 natural sciences, Biochemistry, non-translation relation, Article, Analytical Chemistry, InSAR, co-registration, Distortion, Interferometric synthetic aperture radar, Computer vision, lcsh:TP1-1185, Electrical and Electronic Engineering, registration point, Instrumentation, Transformation geometry, 021101 geological & geomatics engineering, 0105 earth and related environmental sciences, business.industry, Geometric transformation, Atomic and Molecular Physics, and Optics, Computer Science::Computer Vision and Pattern Recognition, Artificial intelligence, business, Rotation (mathematics)
Abstract: Co-registration is one of the most important steps in interferometric synthetic aperture radar (InSAR) data processing. The standard offset-measurement method based on cross-correlating uniformly distributed patches takes no account of specific geometric transformation between images or characteristics of ground scatterers. Hence, it is inefficient and difficult to obtain satisfying co-registration results for image pairs with relatively big distortion or large incoherent areas. Given this, an improved co-registration strategy is proposed in this paper which takes both the geometric features and image content into consideration. Firstly, some geometric transformations including scale, flip, rotation, and shear between images were eliminated based on the geometrical information, and the initial co-registration polynomial was obtained. Then the registration points were automatically detected by integrating the signal-to-clutter-ratio (SCR) thresholds and the amplitude information, and a further co-registration process was performed to refine the polynomial. Several comparison experiments were carried out using 2 TerraSAR-X data from the Hong Kong airport and 21 PALSAR data from the Donghai Bridge. Experiment results demonstrate that the proposed method brings accuracy and efficiency improvements for co-registration and processing abilities in the cases of big distortion between images or large incoherent areas in the images. For most co-registrations, the proposed method can enhance the reliability and applicability of co-registration and thus promote the automation to a higher level.
Published: 2016

25. Automatic and Accurate Conflation of Different Road-Network Vector Data towards Multi-Modal Navigation

Author: Liqiu Meng, Wei Yao, and Meng Zhang
Subjects: Matching (statistics), Geospatial analysis, 010504 meteorology & atmospheric sciences, Geography, Planning and Development, 0211 other engineering and technologies, lcsh:G1-922, 02 engineering and technology, Pedestrian, Machine learning, computer.software_genre, 01 natural sciences, Earth and Planetary Sciences (miscellaneous), Computers in Earth Sciences, 021101 geological & geomatics engineering, 0105 earth and related environmental sciences, Atlas (topology), business.industry, Geometric transformation, data conflation, pedestrian ways, multi-modal navigation, Conflation, ddc, Identification (information), Modal, Geography, Artificial intelligence, Data mining, business, computer, lcsh:Geography (General)
Abstract: With the rapid improvement of geospatial data acquisition and processing techniques, a variety of geospatial databases from public or private organizations have become available. Quite often, one dataset may be superior to other datasets in one, but not all aspects. In Germany, for instance, there were three major road network vector data, viz. Tele Atlas (which is now “TOMTOM”), NAVTEQ (which is now “here”), and ATKIS. However, none of them was qualified for the purpose of multi-modal navigation (e.g., driving + walking): Tele Atlas and NAVTEQ consist of comprehensive routing-relevant information, but many pedestrian ways are missing; ATKIS covers more pedestrian areas but the road objects are not fully attributed. To satisfy the requirements of multi-modal navigation, an automatic approach has been proposed to conflate different road networks together, which involves five routines: (a) road-network matching between datasets; (b) identification of the pedestrian ways; (c) geometric transformation to eliminate geometric inconsistency; (d) topologic remodeling of the conflated road network; and (e) error checking and correction. The proposed approach demonstrates high performance in a number of large test areas and therefore has been successfully utilized for the real-world data production in the whole region of Germany. As a result, the conflated road network allows the multi-modal navigation of “driving + walking”.
Published: 2016

26. An Alignment Method for the Integration of Underwater 3D Data Captured by a Stereovision System and an Acoustic Camera

Author: Maurizio Muzzupappa, Fabio Bruno, Antonio Lagudi, and Gianfranco Bianco
Subjects: Engineering, Point cloud, 02 engineering and technology, optical and acoustic integration, Remotely operated underwater vehicle, lcsh:Chemical technology, 01 natural sciences, Biochemistry, Article, Analytical Chemistry, opto-acoustic vision, 0202 electrical engineering, electronic engineering, information engineering, Computer vision, underwater 3D imaging, lcsh:TP1-1185, Electrical and Electronic Engineering, Visibility, Instrumentation, Rigid transformation, Optimal estimation, business.industry, 010401 analytical chemistry, Geometric transformation, Atomic and Molecular Physics, and Optics, 0104 chemical sciences, ROV navigation, Noise, Acoustic camera, 020201 artificial intelligence & image processing, Artificial intelligence, business
Abstract: The integration of underwater 3D data captured by acoustic and optical systems is a promising technique in various applications such as mapping or vehicle navigation. It allows for compensating the drawbacks of the low resolution of acoustic sensors and the limitations of optical sensors in bad visibility conditions. Aligning these data is a challenging problem, as it is hard to make a point-to-point correspondence. This paper presents a multi-sensor registration for the automatic integration of 3D data acquired from a stereovision system and a 3D acoustic camera in close-range acquisition. An appropriate rig has been used in the laboratory tests to determine the relative position between the two sensor frames. The experimental results show that our alignment approach, based on the acquisition of a rig in several poses, can be adopted to estimate the rigid transformation between the two heterogeneous sensors. A first estimation of the unknown geometric transformation is obtained by a registration of the two 3D point clouds, but it ends up being strongly affected by noise and data dispersion. A robust and optimal estimation is obtained by a statistical processing of the transformations computed for each pose. The effectiveness of the method has been demonstrated in this first experimentation of the proposed 3D opto-acoustic camera.
Published: 2016
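
Note on record 26: to make "rigid transformation between two point clouds" concrete, the sketch below uses the standard SVD-based (Kabsch) least-squares solution. It assumes point-to-point correspondences are already known, which the abstract says is hard to obtain in the opto-acoustic setting, so it illustrates the underlying geometry only and is not the authors' registration or their per-pose statistical processing. The function name `estimate_rigid_transform` and the synthetic data are hypothetical.

```python
import numpy as np

def estimate_rigid_transform(src, dst):
    """Least-squares rotation R and translation t with dst ~ src @ R.T + t.

    src, dst: corresponding 3D points, shape (N, 3), N >= 3, not collinear.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    # Cross-covariance of the centered point sets.
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t

# Synthetic check: rotate 30 degrees about z, then translate.
rng = np.random.default_rng(1)
cloud = rng.random((10, 3))
angle = np.deg2rad(30.0)
R_true = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                   [np.sin(angle),  np.cos(angle), 0.0],
                   [0.0, 0.0, 1.0]])
t_true = np.array([0.5, -1.0, 2.0])
R_est, t_est = estimate_rigid_transform(cloud, cloud @ R_true.T + t_true)
```

With noisy data, one estimate of this kind per rig pose could then be combined statistically, which is the role the abstract assigns to its final processing step.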

27. A Rapid Coordinate Transformation Method Applied in Industrial Robot Calibration Based on Characteristic Line Coincidence

Author: Xinghua Qu, Xiaojia Shi, Fumin Zhang, and Bailing Liu
Subjects: 0209 industrial biotechnology, Robot calibration, Computer science, photogrammetric system, Coordinate system, Point cloud, Arm solution, 02 engineering and technology, Kinematics, robot calibration, lcsh:Chemical technology, Biochemistry, Article, Analytical Chemistry, law.invention, multi-sensor measurement system, Matrix (mathematics), Industrial robot, 020901 industrial engineering & automation, Transformation matrix, Control theory, Position (vector), law, 0202 electrical engineering, electronic engineering, information engineering, lcsh:TP1-1185, Electrical and Electronic Engineering, Instrumentation, 020208 electrical & electronic engineering, Geometric transformation, coordinate transformation, Atomic and Molecular Physics, and Optics, Line (geometry)
Abstract: Coordinate transformation plays an indispensable role in industrial measurements, including photogrammetry, geodesy, laser 3-D measurement and robotics. The widely applied methods of coordinate transformation are generally based on solving the equations of point clouds. Despite the high accuracy, this might result in no solution due to the use of ill-conditioned matrices. In this paper, a novel coordinate transformation method is proposed, based not on the equation solution but on the geometric transformation. We construct characteristic lines to represent the coordinate systems. According to the space geometry relation, the characteristic lines can be made to coincide by a series of rotations and translations. The transformation matrix can be obtained using matrix transformation theory. Experiments are designed to compare the proposed method with other methods. The results show that the proposed method has the same high accuracy, but the operation is more convenient and flexible. A multi-sensor combined measurement system is also presented to improve the position accuracy of a robot with the calibration of the robot kinematic parameters. Experimental verification shows that the position accuracy of the robot manipulator is improved by 45.8% with the proposed method and robot calibration.
Published: 2016

28. Measurement of Surface Displacement and Deformation of Mass Movements Using Least Squares Matching of Repeat High Resolution Satellite and Aerial Images

Author: Misganu Debella-Gilo and Andreas Kääb
Subjects: displacement, velocity, strain rate, Mass movement, mass movement, Science, Geometric transformation, deformation, image matching, Deformation (meteorology), Strain rate, Geodesy, Least squares, Displacement (vector), Shear stress, General Earth and Planetary Sciences, least-squares, Rotation (mathematics), Geology
Abstract: Displacement and deformation are fundamental measures of Earth surface mass movements such as glacier flow, rockglacier creep and rockslides. Ground-based methods of monitoring such mass movements can be costly, time consuming and limited in spatial and temporal coverage. Remote sensing techniques, here matching of repeat optical images, are increasingly used to obtain displacement and deformation fields. Strain rates are usually computed in a post-processing step based on the gradients of the measured velocity field. This study explores the potential of automatically and directly computing velocity, rotation and strain rates on Earth surface mass movements simultaneously from the matching positions and the parameters of the geometric transformation models using the least squares matching (LSM) approach. The procedures are exemplified using bi-temporal high resolution satellite and aerial images of glacier flow, rockglacier creep and land sliding. The results show that LSM matches the images and computes longitudinal strain rates, transverse strain rates and shear strain rates reliably with mean absolute deviations in the order of 10⁻⁴ (one level of significance below the measured values) as evaluated on stable grounds. The LSM also improves the accuracy of displacement estimation of the pixel-precision normalized cross-correlation by over 90% under ideal (simulated) circumstances and by about 25% for real multi-temporal images of mass movements.
Published: 2012

29. Reconstruction of Cylindrical Surfaces Using Digital Image Correlation

Author: Antonio Maria Garcia Tommaselli, Adilson Berveglieri, and Universidade Estadual Paulista (Unesp)
Subjects: cylinder, Digital image correlation, reconstruction, Computer science, Cylinder, 02 engineering and technology, lcsh:Chemical technology, Biochemistry, Article, Analytical Chemistry, 0203 mechanical engineering, Intersection, 0202 electrical engineering, electronic engineering, information engineering, lcsh:TP1-1185, LSM, Computer vision, optical measurement, Electrical and Electronic Engineering, Instrumentation, business.industry, 3D reconstruction, Perspective (graphical), Geometric transformation, Optical measurement, Atomic and Molecular Physics, and Optics, geometric transformation, 020303 mechanical engineering & transports, Photogrammetry, Conic section, Least squares matching, 020201 artificial intelligence & image processing, Artificial intelligence, Reconstruction, Parallax, business
Abstract: A technique for the reconstruction of cylindrical surfaces using optical images with an extension of least squares matching is presented. This technique is based on stereo-image acquisition of a cylindrical object, and it involves displacing the camera following the object length. The basic concept behind this technique is that variations in the camera viewpoint over a cylindrical object produce perspective effects similar to a conic section in an image sequence. Such parallax changes are continuous and can be modelled by a second-order function, which is combined with an adaptive least squares matching (ALSM) for the 3D object reconstruction. Using this concept, a photogrammetric intersection with only two image patches can be used to model a cylindrical object with high accuracy. Experiments were conducted with a cylinder on a panel with coded targets to assess the 3D reconstruction accuracy. The accuracy assessment was based on a comparison between the estimated diameter and the diameter directly measured over the cylinder. The difference between the diameters indicated an accuracy of 1/10 mm, and the cylindrical surface was entirely reconstructed. Affiliations: Department of Statistics, São Paulo State University UNESP, 305; Department of Cartography, São Paulo State University UNESP, 305. Funding: Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP): 2013/50426-4, 2014/05033-7.
Published: 2018
Full Text: View/download PDF

30. Two-Dimensional (2D) Slices Encryption-Based Security Solution for Three-Dimensional (3D) Printing Industry

Author: Oh-Heum Kwon, Suk-Hwan Lee, Ki-Ryong Kwon, and Giao N. Pham
Subjects: Computer Networks and Communications, Computer science, TK7800-8360, 3D printing, Engineering and technology, Encryption, 3D printing data, 3D printing security, discrete cosine transform, geometric transformation, Electrical engineering, electronic engineering, information engineering, Electrical and Electronic Engineering, Electronics, Process (computing), Nanoscience & nanotechnology, Transmission (telecommunications), Hardware and Architecture, Control and Systems Engineering, Signal Processing, Key (cryptography), Artificial intelligence & image processing, Computer hardware
Abstract: Three-dimensional (3D) printing technology is now applied in many areas of life and is changing the world by enabling the creation of complex structures and shapes that were not feasible in the past. However, 3D printing data is often attacked during storage and transmission. Security must therefore be ensured in the 3D printing manufacturing process, and in particular the 3D printing data must be protected against attacks from hackers. This paper presents a security solution for 3D printing based on two-dimensional (2D) slice encryption. The 2D slices of the 3D printing data are encrypted with a secret key, in either the frequency domain or the spatial domain, to generate the encrypted 3D printing data. We implemented the proposed solution in both the frequency domain, based on the Discrete Cosine Transform, and the spatial domain, based on a geometric transform. All of the 2D slices of the 3D printing data are altered and secured by the encryption process. The proposed solution meets the security requirements for secure storage and transmission. Experimental results also verified that the proposed solution is effective for 3D printing data and is independent of the format of the 3D printing models. Compared with conventional works, the proposed solution also offers better security and performance.
Published: 2018
Full Text: View/download PDF


