Descriptor: "I.4.1" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"I.4.1"' showing total 260 results

Start Over Descriptor "I.4.1"

260 results on '"I.4.1"'

51. Rethinking Generalization in American Sign Language Prediction for Edge Devices with Extremely Low Memory Footprint

Author: Paul, Aditya Jyoti, Mohan, Puranjay, and Sehgal, Stuti
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Human-Computer Interaction, 68T45, 68T10, 68T07, 68U10, I.2.10, I.4.8, I.5.1, J.3, I.4.1, K.4.2
Abstract: Due to the boom in technical compute in the last few years, the world has seen massive advances in artificially intelligent systems solving diverse real-world problems. But a major roadblock in the ubiquitous acceptance of these models is their enormous computational complexity and memory footprint. Hence efficient architectures and training techniques are required for deployment on extremely low resource inference endpoints. This paper proposes an architecture for detection of alphabets in American Sign Language on an ARM Cortex-M7 microcontroller having just 496 KB of framebuffer RAM. Leveraging parameter quantization is a common technique that might cause varying drops in test accuracy. This paper proposes using interpolation as augmentation amongst other techniques as an efficient method of reducing this drop, which also helps the model generalize well to previously unseen noisy data. The proposed model is about 185 KB post-quantization and inference speed is 20 frames per second., Comment: 6 pages, Published in IEEE RAICS 2020, see https://raics.in
Published: 2020
Full Text: View/download PDF

52. Efficient Robust Watermarking Based on Quaternion Singular Value Decomposition and Coefficient Pair Selection

Author: Chen, Yong, Jia, Zhi-Gang, Peng, Ya-Xin, and Peng, Yan
Subjects: Computer Science - Computer Vision and Pattern Recognition, Mathematics - Numerical Analysis, 65F55, I.4.1
Abstract: Quaternion singular value decomposition (QSVD) is a robust technique of digital watermarking which can extract high quality watermarks from watermarked images with low distortion. In this paper, QSVD technique is further investigated and an efficient robust watermarking scheme is proposed. The improved algebraic structure-preserving method is proposed to handle the problem of "explosion of complexity" occurred in the conventional QSVD design. Secret information is transmitted blindly by incorporating in QSVD two new strategies, namely, coefficient pair selection and adaptive embedding. Unlike conventional QSVD which embeds watermarks in a single imaginary unit, we propose to adaptively embed the watermark into the optimal hiding position using the Normalized Cross-Correlation (NC) method. This avoids the selection of coefficient pair with less correlation, and thus, it reduces embedding impact by decreasing the maximum modification of coefficient values. In this way, compared with conventional QSVD, the proposed watermarking strategy avoids more modifications to a single color image layer and a better visual quality of the watermarked image is observed. Meanwhile, adaptive QSVD resists some common geometric attacks, and it improves the robustness of conventional QSVD. With these improvements, our method outperforms conventional QSVD. Its superiority over other state-of-the-art methods is also demonstrated experimentally., Comment: 11 figures, 3 tables
Published: 2020

53. Search and Rescue with Airborne Optical Sectioning

Author: Schedl, David C., Kurmi, Indrajit, and Bimber, Oliver
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning, 68T07, 68T45, I.2.10, I.4.1
Abstract: We show that automated person detection under occlusion conditions can be significantly improved by combining multi-perspective images before classification. Here, we employed image integration by Airborne Optical Sectioning (AOS)---a synthetic aperture imaging technique that uses camera drones to capture unstructured thermal light fields---to achieve this with a precision/recall of 96/93%. Finding lost or injured people in dense forests is not generally feasible with thermal recordings, but becomes practical with use of AOS integral images. Our findings lay the foundation for effective future search and rescue technologies that can be applied in combination with autonomous or manned aircraft. They can also be beneficial for other fields that currently suffer from inaccurate classification of partially occluded people, animals, or objects., Comment: 11 pages, 5 figures, 3 tables, Nature Machine Intelligence (under review)
Published: 2020
Full Text: View/download PDF

54. Single-shot Hyperspectral-Depth Imaging with Learned Diffractive Optics

Author: Baek, Seung-Hwan, Ikoma, Hayato, Jeon, Daniel S., Li, Yuqi, Heidrich, Wolfgang, Wetzstein, Gordon, and Kim, Min H.
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, I.2.10, I.4.1, I.5
Abstract: Imaging depth and spectrum have been extensively studied in isolation from each other for decades. Recently, hyperspectral-depth (HS-D) imaging emerges to capture both information simultaneously by combining two different imaging systems; one for depth, the other for spectrum. While being accurate, this combinational approach induces increased form factor, cost, capture time, and alignment/registration problems. In this work, departing from the combinational principle, we propose a compact single-shot monocular HS-D imaging method. Our method uses a diffractive optical element (DOE), the point spread function of which changes with respect to both depth and spectrum. This enables us to reconstruct spectrum and depth from a single captured image. To this end, we develop a differentiable simulator and a neural-network-based reconstruction that are jointly optimized via automatic differentiation. To facilitate learning the DOE, we present a first HS-D dataset by building a benchtop HS-D imager that acquires high-quality ground truth. We evaluate our method with synthetic and real experiments by building an experimental prototype and achieve state-of-the-art HS-D imaging results.
Published: 2020

55. Blind Inverse Gamma Correction with Maximized Differential Entropy

Author: Lee, Yong, Zhang, Shaohua, Li, Miao, and He, Xiaoyu
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, I.4.1, I.4.3, I.4.9
Abstract: Unwanted nonlinear gamma distortion frequently occurs in a great diversity of images during the procedures of image acquisition, processing, and/or display. And the gamma distortion often varies with capture setup change and luminance variation. Blind inverse gamma correction, which automatically determines a proper restoration gamma value from a given image, is of paramount importance to attenuate the distortion. For blind inverse gamma correction, an adaptive gamma transformation method (AGT-ME) is proposed directly from a maximized differential entropy model. And the corresponding optimization has a mathematical concise closed-form solution, resulting in efficient implementation and accurate gamma restoration of AGT-ME. Considering the human eye has a non-linear perception sensitivity, a modified version AGT-ME-VISUAL is also proposed to achieve better visual performance. Tested on variable datasets, AGT-ME could obtain an accurate estimation of a large range of gamma distortion (0.1 to 3.0), outperforming the state-of-the-art methods. Besides, the proposed AGT-ME and AGT-ME-VISUAL were applied to three typical applications, including automatic gamma adjustment, natural/medical image contrast enhancement, and fringe projection profilometry image restoration. Furthermore, the AGT-ME/ AGT-ME-VISUAL is general and can be seamlessly extended to the masked image, multi-channel (color or spectrum) image, or multi-frame video, and free of the arbitrary tuning parameter. Besides, the corresponding Python code (https://github.com/yongleex/AGT-ME) is also provided for interested users., Comment: 12 pages, 8 figures
Published: 2020
Full Text: View/download PDF

56. A Bayesian Multilevel Random-Effects Model for Estimating Noise in Image Sensors

Author: Riutort-Mayol, Gabriel, Gómez-Rubio, Virgilio, Marqués-Mateu, Ángel, Lerma, José Luis, and López-Quílez, Antonio
Subjects: Statistics - Applications, Electrical Engineering and Systems Science - Image and Video Processing, 62P30, 62P35, 62F15, 62J05, C.4, G.3, I.4.1
Abstract: Sensor noise sources cause differences in the signal recorded across pixels in a single image and across multiple images. This paper presents a Bayesian approach to decomposing and characterizing the sensor noise sources involved in imaging with digital cameras. A Bayesian probabilistic model based on the (theoretical) model for noise sources in image sensing is fitted to a set of a time-series of images with different reflectance and wavelengths under controlled lighting conditions. The image sensing model is a complex model, with several interacting components dependent on reflectance and wavelength. The properties of the Bayesian approach of defining conditional dependencies among parameters in a fully probabilistic model, propagating all sources of uncertainty in inference, makes the Bayesian modeling framework more attractive and powerful than classical methods for approaching the image sensing model. A feasible correspondence of noise parameters to their expected theoretical behaviors and well calibrated posterior predictive distributions with a small root mean square error for model predictions have been achieved in this study, thus showing that the proposed model accurately approximates the image sensing model. The Bayesian approach could be extended to formulate further components aimed at identifying even more specific parameters of the imaging process.
Published: 2020
Full Text: View/download PDF

57. AvatarMe: Realistically Renderable 3D Facial Reconstruction 'in-the-wild'

Author: Lattas, Alexandros, Moschoglou, Stylianos, Gecer, Baris, Ploumpis, Stylianos, Triantafyllou, Vasileios, Ghosh, Abhijeet, and Zafeiriou, Stefanos
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, I.2.10, I.3.7, I.4.1
Abstract: Over the last years, with the advent of Generative Adversarial Networks (GANs), many face analysis tasks have accomplished astounding performance, with applications including, but not limited to, face generation and 3D face reconstruction from a single "in-the-wild" image. Nevertheless, to the best of our knowledge, there is no method which can produce high-resolution photorealistic 3D faces from "in-the-wild" images and this can be attributed to the: (a) scarcity of available data for training, and (b) lack of robust methodologies that can successfully be applied on very high-resolution data. In this paper, we introduce AvatarMe, the first method that is able to reconstruct photorealistic 3D faces from a single "in-the-wild" image with an increasing level of detail. To achieve this, we capture a large dataset of facial shape and reflectance and build on a state-of-the-art 3D texture and shape reconstruction method and successively refine its results, while generating the per-pixel diffuse and specular components that are required for realistic rendering. As we demonstrate in a series of qualitative and quantitative experiments, AvatarMe outperforms the existing arts by a significant margin and reconstructs authentic, 4K by 6K-resolution 3D faces from a single low-resolution image that, for the first time, bridges the uncanny valley., Comment: Accepted to CVPR2020. Project page: github.com/lattas/AvatarMe with high resolution results, data and more. 10 pages, 9 figures
Published: 2020

58. Fisheye Distortion Rectification from Deep Straight Lines

Author: Xue, Zhu-Cun, Xue, Nan, and Xia, Gui-Song
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: This paper presents a novel line-aware rectification network (LaRecNet) to address the problem of fisheye distortion rectification based on the classical observation that straight lines in 3D space should be still straight in image planes. Specifically, the proposed LaRecNet contains three sequential modules to (1) learn the distorted straight lines from fisheye images; (2) estimate the distortion parameters from the learned heatmaps and the image appearance; and (3) rectify the input images via a proposed differentiable rectification layer. To better train and evaluate the proposed model, we create a synthetic line-rich fisheye (SLF) dataset that contains the distortion parameters and well-annotated distorted straight lines of fisheye images. The proposed method enables us to simultaneously calibrate the geometric distortion parameters and rectify fisheye images. Extensive experiments demonstrate that our model achieves state-of-the-art performance in terms of both geometric accuracy and image quality on several evaluation metrics. In particular, the images rectified by LaRecNet achieve an average reprojection error of 0.33 pixels on the SLF dataset and produce the highest peak signal-to-noise ratio (PSNR) and structure similarity index (SSIM) compared with the groundtruth.
Published: 2020

59. JPEG Steganography and Synchronization of DCT Coefficients for a Given Development Pipeline

Author: Taburet, Théo, Bas, Patrick, Sawaya, Wadih, and Cogranne, Remi
Subjects: Computer Science - Multimedia, Computer Science - Cryptography and Security, 68U10, I.4.1
Abstract: This short paper proposes to use the statistical analysis of the correlation between DCT coefficients to design a new synchronization strategy that can be used for cost-based steganographic schemes in the JPEG domain. First, an analysis is performed on the covariance matrix of DCT coefficients of neighboring blocks after a development similar to the one used to generate BossBase. This analysis exhibits groups of uncorrelated coefficients: 4 groups per block and 2 groups of uncorrelated diagonal neighbors together with groups of mutually correlated coefficients groups of 6 coefficients per blocs and 8 coefficients between 2 adjacent blocks. Using the uncorrelated groups, an embedding scheme can be designed using only 8 disjoint lattices. The cost map for each lattice is updated firstly by using an implicit underlying Gaussian distribution with a variance directly computed from the embedding costs, and secondly by deriving conditional distributions from multivariate distributions. The covariance matrix of these distributions takes into account both the correlations exhibited by the analysis of the covariance matrix and the variance derived from the cost. This synchronization scheme enables to obtain a gain of PE of 5% at QF 95 for an embedding rate close to 0.3 bnzac coefficient using DCTR feature sets.
Published: 2020

60. Towards Evaluating Gaussian Blurring in Perceptual Hashing as a Facial Image Filter

Author: Alparslan, Yigit, Alparslan, Ken, Kshettry, Mannika, and Kratz, Louis
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1, I.4.9
Abstract: With the growth in social media, there is a huge amount of images of faces available on the internet. Often, people use other people's pictures on their own profile. Perceptual hashing is often used to detect whether two images are identical. Therefore, it can be used to detect whether people are misusing others' pictures. In perceptual hashing, a hash is calculated for a given image, and a new test image is mapped to one of the existing hashes if duplicate features are present. Therefore, it can be used as an image filter to flag banned image content or adversarial attacks --which are modifications that are made on purpose to deceive the filter-- even though the content might be changed to deceive the filters. For this reason, it is critical for perceptual hashing to be robust enough to take transformations such as resizing, cropping, and slight pixel modifications into account. In this paper, we would like to propose to experiment with effect of gaussian blurring in perceptual hashing for detecting misuse of personal images specifically for face images. We hypothesize that use of gaussian blurring on the image before calculating its hash will increase the accuracy of our filter that detects adversarial attacks which consist of image cropping, adding text annotation, and image rotation., Comment: 5 pages, fixed typos, added references in Introduction section, added co-author due to post-publication contributions
Published: 2020

61. Registration made easy -- standalone orthopedic navigation with HoloLens

Author: Liebmann, Florentin, Roner, Simon, von Atzigen, Marco, Wanivenhaus, Florian, Neuhaus, Caroline, Spirig, José, Scaramuzza, Davide, Sutter, Reto, Snedeker, Jess, Farshad, Mazda, and Fürnstahl, Philipp
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: In surgical navigation, finding correspondence between preoperative plan and intraoperative anatomy, the so-called registration task, is imperative. One promising approach is to intraoperatively digitize anatomy and register it with the preoperative plan. State-of-the-art commercial navigation systems implement such approaches for pedicle screw placement in spinal fusion surgery. Although these systems improve surgical accuracy, they are not gold standard in clinical practice. Besides economical reasons, this may be due to their difficult integration into clinical workflows and unintuitive navigation feedback. Augmented Reality has the potential to overcome these limitations. Consequently, we propose a surgical navigation approach comprising intraoperative surface digitization for registration and intuitive holographic navigation for pedicle screw placement that runs entirely on the Microsoft HoloLens. Preliminary results from phantom experiments suggest that the method may meet clinical accuracy requirements., Comment: 6 pages, 5 figures, accepted at CVPR 2019 workshop on Computer Vision Applications for Mixed Reality Headsets (https://docs.microsoft.com/en-us/windows/mixed-reality/cvpr-2019)
Published: 2020

62. Natural Steganography in JPEG Domain with a Linear Development Pipeline

Author: Théo, Taburet, Patrick, Bas, Wadih, Sawaya, and Fridrich, Jessica
Subjects: Computer Science - Multimedia, Computer Science - Cryptography and Security, I.4.1
Abstract: In order to achieve high practical security, Natural Steganography (NS) uses cover images captured at ISO sensitivity $ISO_{1}$ and generates stego images mimicking ISO sensitivity $ISO_{2}>ISO_{1}$. This is achieved by adding a stego signal to the cover that mimics the sensor photonic noise. This paper proposes an embedding mechanism to perform NS in the JPEG domain after linear developments by explicitly computing the correlations between DCT coefficients before quantization. In order to compute the covariance matrix of the photonic noise in the DCT domain, we first develop the matrix representation of demosaicking, luminance averaging, pixel section, and 2D-DCT. A detailed analysis of the resulting covariance matrix is done in order to explain the origins of the correlations between the coefficients of $3\times3$ DCT blocks. An embedding scheme is then presented that takes in order to take into account all the correlations. It employs 4 sub-lattices and 64 lattices per sub-lattices. The modification probabilities of each DCT coefficient are then derived by computing conditional probabilities from the multivariate Gaussian distribution using the Cholesky decomposition of the covariance matrix. This derivation is also used to compute the embedding capacity of each image. Using a specific database called E1 Base, we show that in the JPEG domain NS (J-Cov-NS) enables to achieve high capacity (more than 2 bits per non-zero AC DCT) and with high practical security ($P_{\mathrm{E}}\simeq40\%$ using DCTR from QF 75 to QF 100)., Comment: 13 pages, 14 figures
Published: 2020

63. ViPR: Visual-Odometry-aided Pose Regression for 6DoF Camera Localization

Author: Ott, Felix, Feigl, Tobias, Löffler, Christoffer, and Mutschler, Christopher
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Robotics, Electrical Engineering and Systems Science - Image and Video Processing, I.2.9, I.2.10, I.4.1, I.4.10, I.5.4
Abstract: Visual Odometry (VO) accumulates a positional drift in long-term robot navigation tasks. Although Convolutional Neural Networks (CNNs) improve VO in various aspects, VO still suffers from moving obstacles, discontinuous observation of features, and poor textures or visual information. While recent approaches estimate a 6DoF pose either directly from (a series of) images or by merging depth maps with optical flow (OF), research that combines absolute pose regression with OF is limited. We propose ViPR, a novel modular architecture for long-term 6DoF VO that leverages temporal information and synergies between absolute pose estimates (from PoseNet-like modules) and relative pose estimates (from FlowNet-based modules) by combining both through recurrent layers. Experiments on known datasets and on our own Industry dataset show that our modular design outperforms state of the art in long-term navigation tasks., Comment: Conf. on Computer Vision and Pattern Recognition (CVPR): Joint Workshop on Long-Term Visual Localization, Visual Odometry and Geometric and Learning-based SLAM 2020
Published: 2019
Full Text: View/download PDF

64. A Statistical View on Synthetic Aperture Imaging for Occlusion Removal

Author: Kurmi, Indrajit, Schedl, David C., and Bimber, Oliver
Subjects: Computer Science - Graphics, Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing, I.4.1, I.4.3
Abstract: Synthetic apertures find applications in many fields, such as radar, radio telescopes, microscopy, sonar, ultrasound, LiDAR, and optical imaging. They approximate the signal of a single hypothetical wide aperture sensor with either an array of static small aperture sensors or a single moving small aperture sensor. Common sense in synthetic aperture sampling is that a dense sampling pattern within a wide aperture is required to reconstruct a clear signal. In this article we show that there exists practical limits to both, synthetic aperture size and number of samples for the application of occlusion removal. This leads to an understanding on how to design synthetic aperture sampling patterns and sensors in a most optimal and practically efficient way. We apply our findings to airborne optical sectioning which uses camera drones and synthetic aperture imaging to computationally remove occluding vegetation or trees for inspecting ground surfaces., Comment: 10 pages, 11 figures, IEEE Sensors Jounral (accepted)
Published: 2019
Full Text: View/download PDF

65. High Flux Passive Imaging with Single-Photon Sensors

Author: Ingle, Atul, Velten, Andreas, and Gupta, Mohit
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Physics - Instrumentation and Detectors, I.3.3, I.4.1
Abstract: Single-photon avalanche diodes (SPADs) are an emerging technology with a unique capability of capturing individual photons with high timing precision. SPADs are being used in several active imaging systems (e.g., fluorescence lifetime microscopy and LiDAR), albeit mostly limited to low photon flux settings. We propose passive free-running SPAD (PF-SPAD) imaging, an imaging modality that uses SPADs for capturing 2D intensity images with unprecedented dynamic range under ambient lighting, without any active light source. Our key observation is that the precise inter-photon timing measured by a SPAD can be used for estimating scene brightness under ambient lighting conditions, even for very bright scenes. We develop a theoretical model for PF-SPAD imaging, and derive a scene brightness estimator based on the average time of darkness between successive photons detected by a PF-SPAD pixel. Our key insight is that due to the stochastic nature of photon arrivals, this estimator does not suffer from a hard saturation limit. Coupled with high sensitivity at low flux, this enables a PF-SPAD pixel to measure a wide range of scene brightness, from very low to very high, thereby achieving extreme dynamic range. We demonstrate an improvement of over 2 orders of magnitude over conventional sensors by imaging scenes spanning a dynamic range of 1,000,000:1., Comment: 28 pages, 15 figures, addressed reviewers's comments, fixed some errors and typos, official peer reviewed version to appear in IEEE CVPR 2019
Published: 2019

66. Optimal conditions for connectedness of discretized sets

Author: Brimkov, Boris and Brimkov, Valentin E.
Subjects: Computer Science - Discrete Mathematics, Computer Science - Computer Vision and Pattern Recognition, 52c99, 68c99, G.2.m, I.4.1
Abstract: Constructing a discretization of a given set is a major problem in various theoretical and applied disciplines. An offset discretization of a set $X$ is obtained by taking the integer points inside a closed neighborhood of $X$ of a certain radius. In this note we determine a minimum threshold for the offset radius, beyond which the discretization of a disconnected set is always connected. The results hold for a broad class of disconnected and unbounded subsets of $R^n$, and generalize several previous results. Algorithmic aspects and possible applications are briefly discussed., Comment: 9 pages, 1 figure with 2 subfigures
Published: 2018

67. Single-shot thermal ghost imaging using wavelength-division multiplexing

Author: Deng, Chao, Wang, Yuwang, Suo, Jinli, Zhang, Zhili, and Dai, Qionghai
Subjects: Physics - Optics, Experimental work, I.4.1
Abstract: Ghost imaging (GI) is a potential imaging technique that reconstructs the target scene from its correlated measurements with a sequential of patterns. Restricted by the multi-shot principle, GI usually requires long acquisition time and is limited in observation of dynamic scenes. To handle this problem, this paper proposes a single-shot thermal ghost imaging scheme via wavelength-division multiplexing technique. Specifically, we generate thousands of patterns simultaneously by modulating a broadband light source with a wavelength dependent diffuser. These patterns carry the scene's spatial information and then the correlated measurements are coupled into a spectrometer for the final reconstruction. This technique accelerates the ghost imaging speed significantly and promotes the applications in dynamic ghost imaging., Comment: 10 pages, 4 figures
Published: 2017
Full Text: View/download PDF

68. Automatic Extrinsic Calibration for Lidar-Stereo Vehicle Sensor Setups

Author: Guindel, Carlos, Beltrán, Jorge, Martín, David, and García, Fernando
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Robotics, 68T45, I.4.8, I.2.9, I.4.1
Abstract: Sensor setups consisting of a combination of 3D range scanner lasers and stereo vision systems are becoming a popular choice for on-board perception systems in vehicles; however, the combined use of both sources of information implies a tedious calibration process. We present a method for extrinsic calibration of lidar-stereo camera pairs without user intervention. Our calibration approach is aimed to cope with the constraints commonly found in automotive setups, such as low-resolution and specific sensor poses. To demonstrate the performance of our method, we also introduce a novel approach for the quantitative assessment of the calibration results, based on a simulation environment. Tests using real devices have been conducted as well, proving the usability of the system and the improvement over the existing approaches. Code is available at http://wiki.ros.org/velo2cam_calibration, Comment: Accepted to IEEE International Conference on Intelligent Transportation Systems 2017 (ITSC)
Published: 2017

69. Discrete Spectrum Reconstruction using Integral Approximation Algorithm

Author: Sizikov, Valery and Sidorov, Denis
Subjects: Mathematics - Numerical Analysis, 45B05, 45Q05, G.1.9, I.4.1
Abstract: An inverse problem in spectroscopy is considered. The objective is to restore the discrete spectrum from observed spectrum data, taking into account the spectrometer's line spread function. The problem is reduced to solution of a system of linear-nonlinear equations (SLNE) with respect to intensities and frequencies of the discrete spectral lines. The SLNE is linear with respect to lines' intensities and nonlinear with respect to the lines' frequencies. The integral approximation algorithm is proposed for the solution of this SLNE. The algorithm combines solution of linear integral equations with solution of a system of linear algebraic equations and avoids nonlinear equations. Numerical examples of the application of the technique, both to synthetic and experimental spectra, demonstrate the efficacy of the proposed approach in enabling an effective enhancement of the spectrometer's resolution., Comment: submitted to Applied Spectroscopy Journal
Published: 2017

70. High Dimensional Consistent Digital Segments

Author: Chiu, Man-Kwun and Korman, Matias
Subjects: Computer Science - Computational Geometry, I.3.5, I.4.1
Abstract: We consider the problem of digitalizing Euclidean line segments from $\mathbb{R}^d$ to $\mathbb{Z}^d$. Christ {\em et al.} (DCG, 2012) showed how to construct a set of {\em consistent digital segment} (CDS) for $d=2$: a collection of segments connecting any two points in $\mathbb{Z}^2$ that satisfies the natural extension of the Euclidean axioms to $\mathbb{Z}^d$. In this paper we study the construction of CDSs in higher dimensions. We show that any total order can be used to create a set of {\em consistent digital rays} CDR in $\mathbb{Z}^d$ (a set of rays emanating from a fixed point $p$ that satisfies the extension of the Euclidean axioms). We fully characterize for which total orders the construction holds and study their Hausdorff distance, which in particular positively answers the question posed by Christ {\em et al.}.
Published: 2016

71. Learning-Based View Synthesis for Light Field Cameras

Author: Kalantari, Nima Khademi, Wang, Ting-Chun, and Ramamoorthi, Ravi
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, I.4.1
Abstract: With the introduction of consumer light field cameras, light field imaging has recently become widespread. However, there is an inherent trade-off between the angular and spatial resolution, and thus, these cameras often sparsely sample in either spatial or angular domain. In this paper, we use machine learning to mitigate this trade-off. Specifically, we propose a novel learning-based approach to synthesize new views from a sparse set of input views. We build upon existing view synthesis techniques and break down the process into disparity and color estimation components. We use two sequential convolutional neural networks to model these two components and train both networks simultaneously by minimizing the error between the synthesized and ground truth images. We show the performance of our approach using only four corner sub-aperture views from the light fields captured by the Lytro Illum camera. Experimental results show that our approach synthesizes high-quality images that are superior to the state-of-the-art techniques on a variety of challenging real-world scenes. We believe our method could potentially decrease the required angular resolution of consumer light field cameras, which allows their spatial resolution to increase., Comment: in ACM Transactions on Graphics 2016
Published: 2016
Full Text: View/download PDF

72. Reflections on Shannon Information: In search of a natural information-entropy for images

Author: Larkin, Kieran G.
Subjects: Computer Science - Information Theory, Computer Science - Computer Vision and Pattern Recognition, I.4.1, I.4.2, I.4.5, I.4.10
Abstract: It is not obvious how to extend Shannon's original information entropy to higher dimensions, and many different approaches have been tried. We replace the English text symbol sequence originally used to illustrate the theory by a discrete, bandlimited signal. Using Shannon's later theory of sampling we derive a new and symmetric version of the second order entropy in 1D. The new theory then naturally extends to 2D and higher dimensions, where by naturally we mean simple, symmetric, isotropic and parsimonious. Simplicity arises from the direct application of Shannon's joint entropy equalities and inequalities to the gradient (del) vector field image embodying the second order relations of the scalar image. Parsimony is guaranteed by halving of the vector data rate using Papoulis' generalized sampling expansion. The new 2D entropy measure, which we dub delentropy, is underpinned by a computable probability density function we call deldensity. The deldensity captures the underlying spatial image structure and pixel co-occurrence. It achieves this because each scalar image pixel value is nonlocally related to the entire gradient vector field. Both deldensity and delentropy are highly tractable and yield many interesting connections and useful inequalities. The new measure explicitly defines a realizable encoding algorithm and a corresponding reconstruction. Initial tests show that delentropy compares favourably with the conventional intensity-based histogram entropy and the compressed data rates of lossless image encoders (GIF, PNG, WEBP, JP2K-LS and JPG-LS) for a selection of images. The symmetric approach may have applications to higher dimensions and problems concerning image complexity measures., Comment: 47 pages,9 figures, preprint for submission to an image science and optics journal
Published: 2016

73. Absolute Pose Estimation from Line Correspondences using Direct Linear Transformation

Author: Přibyl, Bronislav, Zemčík, Pavel, and Čadík, Martin
Subjects: Computer Science - Computer Vision and Pattern Recognition, 68T45, I.4.8, I.4.1
Abstract: This work is concerned with camera pose estimation from correspondences of 3D/2D lines, i. e. with the Perspective-n-Line (PnL) problem. We focus on large line sets, which can be efficiently solved by methods using linear formulation of PnL. We propose a novel method "DLT-Combined-Lines" based on the Direct Linear Transformation (DLT) algorithm, which benefits from a new combination of two existing DLT methods for pose estimation. The method represents 2D structure by lines, and 3D structure by both points and lines. The redundant 3D information reduces the minimum required line correspondences to 5. A cornerstone of the method is a combined projection matri xestimated by the DLT algorithm. It contains multiple estimates of camera rotation and translation, which can be recovered after enforcing constraints of the matrix. Multiplicity of the estimates is exploited to improve the accuracy of the proposed method. For large line sets (10 and more), the method is comparable to the state-of-theart in accuracy of orientation estimation. It achieves state-of-the-art accuracy in estimation of camera position and it yields the smallest reprojection error under strong image noise. The method achieves top-3 results on real world data. The proposed method is also highly computationally effective, estimating the pose of 1000 lines in 12 ms on a desktop computer., Comment: 37 pages, 6 figures, 4 tables. Accepted for publication in Computer Vision and Image Understanding
Published: 2016
Full Text: View/download PDF

74. Camera Pose Estimation from Lines using Pl\'ucker Coordinates

Author: Přibyl, Bronislav, Zemčík, Pavel, and Čadík, Martin
Subjects: Computer Science - Computer Vision and Pattern Recognition, 68T45, I.4.8, I.4.1
Abstract: Correspondences between 3D lines and their 2D images captured by a camera are often used to determine position and orientation of the camera in space. In this work, we propose a novel algebraic algorithm to estimate the camera pose. We parameterize 3D lines using Pl\"ucker coordinates that allow linear projection of the lines into the image. A line projection matrix is estimated using Linear Least Squares and the camera pose is then extracted from the matrix. An algebraic approach to handle mismatched line correspondences is also included. The proposed algorithm is an order of magnitude faster yet comparably accurate and robust to the state-of-the-art, it does not require initialization, and it yields only one solution. The described method requires at least 9 lines and is particularly suitable for scenarios with 25 and more lines, as also shown in the results., Comment: 12 pages, 5 figures, In Proceedings of the British Machine Vision Conference (BMVC 2015), pages 45.1-45.12. BMVA Press, September 2015
Published: 2016
Full Text: View/download PDF

75. Hierarchical Modeling of Multidimensional Data in Regularly Decomposed Spaces: Applications in Image Analysis

Author: Guye, Olivier
Subjects: Computer Science - Computer Vision and Pattern Recognition, H.2.8, H.3.1, H.3.3, I.2.10, I.4.1, I.4.6, I.4.7, I.4.8, I.4.9, I.4.10, I.5.2, I.5.3, I.5.4
Abstract: This last document is showing the gradual introduction of hierarchical modeling techniques in image analysis. The first chapter is dealing with the first works carried out in the field of industrial applications of pattern recognition. The second chapter is focusing on the usage of these techniques in satellite imagery and on the development of a satellite data archiving system in the aim of using it in digital geography. The third chapter is about face recognition based on planar image analysis and about the recognition of partially hidden patterns. The present publication is ending with the description of a future system of self-descriptive coding of still or moving pictures in relation with the current video coding standards. As in the previous documents, it will be found in annex algorithms targeted on image analysis according two complementary approaches: - boundary-based approach for the industrial applications of artificial vision; - region-based approach for satellite image analysis., Comment: 172 pages, 52 figures, research report
Published: 2016

76. Manifolds of Projective Shapes

Author: Hotz, Thomas, Kelma, Florian, and Kent, John T.
Subjects: Mathematics - Statistics Theory, Computer Science - Computer Vision and Pattern Recognition, Mathematics - Geometric Topology, 51N15 (primary), 62H11, 62H35 (secondary), I.4.1, I.4.7
Abstract: The projective shape of a configuration of k points or "landmarks" in RP(d) consists of the information that is invariant under projective transformations and hence is reconstructable from uncalibrated camera views. Mathematically, the space of projective shapes for these k landmarks can be described as the quotient space of k copies of RP(d) modulo the action of the projective linear group PGL(d). Using homogeneous coordinates, such configurations can be described as real k-times-(d+1)-dimensional matrices given up to left-multiplication of non-singular diagonal matrices, while the group PGL(d) acts as GL(d+1) from the right. The main purpose of this paper is to give a detailed examination of the topology of projective shape space, and, using matrix notation, it is shown how to derive subsets that are in a certain sense maximal, differentiable Hausdorff manifolds which can be provided with a Riemannian metric. A special subclass of the projective shapes consists of the Tyler regular shapes, for which geometrically motivated pre-shapes can be defined, thus allowing for the construction of a natural Riemannian metric.
Published: 2016

77. Robust Large-Scale Localization in 3D Point Clouds Revisited

Author: Tschopp, Fabian and Zorzi, Marco
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.2.10, I.3.5, I.4.1, I.4.8
Abstract: We tackle the problem of getting a full 6-DOF pose estimation of a query image inside a given point cloud. This technical report re-evaluates the algorithms proposed by Y. Li et al. "Worldwide Pose Estimation using 3D Point Cloud". Our code computes poses from 3 or 4 points, with both known and unknown focal length. The results can easily be displayed and analyzed with Meshlab. We found both advantages and shortcomings of the methods proposed. Furthermore, additional priors and parameters for point selection, RANSAC and pose quality estimate (inlier test) are proposed and applied., Comment: 6 pages; technical report
Published: 2015

78. BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components

Author: Je, Changsoo and Park, Hyung-Min
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Physics - Optics, I.2.10, I.3.7, I.4.1, I.4.8
Abstract: We propose a novel reflection color model consisting of body essence and (mixed) neuter, and present an effective method for separating dichromatic reflection components using a single image. Body essence is an entity invariant to interface reflection, and has two degrees of freedom unlike hue and maximum chromaticity. As a result, the proposed method is insensitive to noise and proper for colors around CMY (cyan, magenta, and yellow) as well as RGB (red, green, and blue), contrary to the maximum chromaticity-based methods. Interface reflection is separated by using a Gaussian function, which removes a critical thresholding problem. Furthermore, the method does not require any region segmentation. Experimental results show the efficacy of the proposed model and method., Comment: 4 pages, 4 figures
Published: 2015
Full Text: View/download PDF

79. Separable and non-separable data representation for pattern discrimination

Author: Miszczak, Jarosław Adam
Subjects: Quantum Physics, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Learning, I.5.2, I.4.1
Abstract: We provide a complete work-flow, based on the language of quantum information theory, suitable for processing data for the purpose of pattern recognition. The main advantage of the introduced scheme is that it can be easily implemented and applied to process real-world data using modest computation resources. At the same time it can be used to investigate the difference in the pattern recognition resulting from the utilization of the tensor product structure of the space of quantum states. We illustrate this difference by providing a simple example based on the classification of 2D data., Comment: 11 pages, 2 figures
Published: 2015

80. A survey of modern optical character recognition techniques

Author: Borovikov, Eugene
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.7.5, I.4.1, I.5.4, I.4.3, I.4.6, I.4.7
Abstract: This report explores the latest advances in the field of digital document recognition. With the focus on printed document imagery, we discuss the major developments in optical character recognition (OCR) and document image enhancement/restoration in application to Latin and non-Latin scripts. In addition, we review and discuss the available technologies for hand-written document recognition. In this report, we also provide some company-accumulated benchmark results on available OCR engines., Comment: Technical report surveying OCR/ICR and document understanding methods as of 2004.It contains 38 pages, numerous figures, 93 references, and provides a table of contents
Published: 2014

81. Detection of Non-Stationary Photometric Perturbations on Projection Screens

Author: Castañeda-Garay, Miguel, Belmonte-Fernández, Oscar, Pérez-Rosés, Hebert, and Diaz-Tula, Antonio
Subjects: Computer Science - Computer Vision and Pattern Recognition, H.5.2, I.4.1
Abstract: Interfaces based on projection screens have become increasingly more popular in recent years, mainly due to the large screen size and resolution that they provide, as well as their stereo-vision capabilities. This work shows a local method for real-time detection of non-stationary photometric perturbations in projected images by means of computer vision techniques. The method is based on the computation of differences between the images in the projector's frame buffer and the corresponding images on the projection screen observed by the camera. It is robust under spatial variations in the intensity of light emitted by the projector on the projection surface and also robust under stationary photometric perturbations caused by external factors. Moreover, we describe the experiments carried out to show the reliability of the method., Comment: 20 pages, Journal of Research and Practice in Information Technology, vol. 44, num. 4, 2012
Published: 2014

82. Hyperspectral Imaging and Analysis for Sparse Reconstruction and Recognition

Author: Khan, Zohaib
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1, I.4.5, I.4.7, I.4.10, I.5.4, I.7.5
Abstract: This thesis proposes spatio-spectral techniques for hyperspectral image analysis. Adaptive spatio-spectral support and variable exposure hyperspectral imaging is demonstrated to improve spectral reflectance recovery from hyperspectral images. Novel spectral dimensionality reduction techniques have been proposed from the perspective of spectral only and spatio-spectral information preservation. It was found that the joint sparse and joint group sparse hyperspectral image models achieve lower reconstruction error and higher recognition accuracy using only a small subset of bands. Hyperspectral image databases have been developed and made publicly available for further research in compressed hyperspectral imaging, forensic document analysis and spectral reflectance recovery., Comment: PhD Thesis, School of Computer Science and Software Engineering, The University of Western Australia
Published: 2014

83. Dense Scattering Layer Removal

Author: Yan, Qiong, Xu, Li, and Jia, Jiaya
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: We propose a new model, together with advanced optimization, to separate a thick scattering media layer from a single natural image. It is able to handle challenging underwater scenes and images taken in fog and sandstorm, both of which are with significantly reduced visibility. Our method addresses the critical issue -- this is, originally unnoticeable impurities will be greatly magnified after removing the scattering media layer -- with transmission-aware optimization. We introduce non-local structure-aware regularization to properly constrain transmission estimation without introducing the halo artifacts. A selective-neighbor criterion is presented to convert the unconventional constrained optimization problem to an unconstrained one where the latter can be efficiently solved., Comment: 10 pages, 10 figures, Siggraph Asia 2013 Technial Briefs
Published: 2013

84. Patch-based Probabilistic Image Quality Assessment for Face Selection and Improved Video-based Face Recognition

Author: Wong, Yongkang, Chen, Shaokang, Mau, Sandra, Sanderson, Conrad, and Lovell, Brian C.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Statistics - Applications, I.4.1, I.4.6, I.4.7, I.4.9, I.4.10, I.5.1, I.5.4, G.3
Abstract: In video based face recognition, face images are typically captured over multiple frames in uncontrolled conditions, where head pose, illumination, shadowing, motion blur and focus change over the sequence. Additionally, inaccuracies in face localisation can also introduce scale and alignment variations. Using all face images, including images of poor quality, can actually degrade face recognition performance. While one solution it to use only the "best" subset of images, current face selection techniques are incapable of simultaneously handling all of the abovementioned issues. We propose an efficient patch-based face image quality assessment algorithm which quantifies the similarity of a face image to a probabilistic face model, representing an "ideal" face. Image characteristics that affect recognition are taken into account, including variations in geometric alignment (shift, rotation and scale), sharpness, head pose and cast shadows. Experiments on FERET and PIE datasets show that the proposed algorithm is able to identify images which are simultaneously the most frontal, aligned, sharp and well illuminated. Further experiments on a new video surveillance dataset (termed ChokePoint) show that the proposed method provides better face subsets than existing face selection techniques, leading to significant improvements in recognition accuracy.
Published: 2013
Full Text: View/download PDF

85. Performance Evaluation of Edge-Directed Interpolation Methods for Images

Author: Yu, Shaode, Zhu, Qingsong, Wu, Shibin, and Xie, Yaoqin
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: Many interpolation methods have been developed for high visual quality, but fail for inability to preserve image structures. Edges carry heavy structural information for detection, determination and classification. Edge-adaptive interpolation approaches become a center of focus. In this paper, performance of four edge-directed interpolation methods comparing with two traditional methods is evaluated on two groups of images. These methods include new edge-directed interpolation (NEDI), edge-guided image interpolation (EGII), iterative curvature-based interpolation (ICBI), directional cubic convolution interpolation (DCCI) and two traditional approaches, bi-linear and bi-cubic. Meanwhile, no parameters are mentioned to measure edge-preserving ability of edge-adaptive interpolation approaches and we proposed two. One evaluates accuracy and the other measures robustness of edge-preservation ability. Performance evaluation is based on six parameters. Objective assessment and visual analysis are illustrated and conclusions are drawn from theoretical backgrounds and practical results., Comment: 9 pages, 5 figures, 2 tables
Published: 2013

86. Morphological Reconstruction for Word Level Script Identification

Author: Dhandra, B. V. and Hangarge, Mallikarjun
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: A line of a bilingual document page may contain text words in regional language and numerals in English. For Optical Character Recognition (OCR) of such a document page, it is necessary to identify different script forms before running an individual OCR system. In this paper, we have identified a tool of morphological opening by reconstruction of an image in different directions and regional descriptors for script identification at word level, based on the observation that every text has a distinct visual appearance. The proposed system is developed for three Indian major bilingual documents, Kannada, Telugu and Devnagari containing English numerals. The nearest neighbour and k-nearest neighbour algorithms are applied to classify new word images. The proposed algorithm is tested on 2625 words with various font styles and sizes. The results obtained are quite encouraging, Comment: 11 Pages, 8 Figures,5 Tables; Revised: 15-06-2007,Published: 30-06-2007
Published: 2011

87. Improving the Performance of K-Means for Color Quantization

Author: Celebi, M. Emre
Subjects: Computer Science - Graphics, I.4.1
Abstract: Color quantization is an important operation with many applications in graphics and image processing. Most quantization methods are essentially based on data clustering algorithms. However, despite its popularity as a general purpose clustering algorithm, k-means has not received much respect in the color quantization literature because of its high computational requirements and sensitivity to initialization. In this paper, we investigate the performance of k-means as a color quantizer. We implement fast and exact variants of k-means with several initialization schemes and then compare the resulting quantizers to some of the most popular quantizers in the literature. Experiments on a diverse set of images demonstrate that an efficient implementation of k-means with an appropriate initialization strategy can in fact serve as a very effective color quantizer., Comment: 26 pages, 4 figures, 13 tables
Published: 2011
Full Text: View/download PDF

88. Fast Color Quantization Using Weighted Sort-Means Clustering

Author: Celebi, M. Emre
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, I.4.1
Abstract: Color quantization is an important operation with numerous applications in graphics and image processing. Most quantization methods are essentially based on data clustering algorithms. However, despite its popularity as a general purpose clustering algorithm, k-means has not received much respect in the color quantization literature because of its high computational requirements and sensitivity to initialization. In this paper, a fast color quantization method based on k-means is presented. The method involves several modifications to the conventional (batch) k-means algorithm including data reduction, sample weighting, and the use of triangle inequality to speed up the nearest neighbor search. Experiments on a diverse set of images demonstrate that, with the proposed modifications, k-means becomes very competitive with state-of-the-art color quantization methods in terms of both effectiveness and efficiency., Comment: 30 pages, 2 figures, 4 tables
Published: 2010
Full Text: View/download PDF

89. Surface Curvature Effects on Reflectance from Translucent Materials

Author: Kolchin, Konstantin
Subjects: Computer Science - Graphics, Physics - Optics, I.4.1, I.4.8
Abstract: Most of the physically based techniques for rendering translucent objects use the diffusion theory of light scattering in turbid media. The widely used dipole diffusion model (Jensen et al. 2001) applies the diffusion-theory formula derived for a planar interface to objects of arbitrary shapes. This paper presents first results of our investigation of how surface curvature affects the diffuse reflectance from translucent materials., Comment: 10 pages, 2 figures. The first version of this paper was published in the Communication Papers Proceedings of 18th International Conference on Computer Graphics, Visualization and Computer Vision 2010 - WSCG2010
Published: 2010

90. Generalized Semimagic Squares for Digital Halftoning

Author: Kawamura, Akitoshi
Subjects: Computer Science - Computational Geometry, Mathematics - Combinatorics, 68U10, 65D18, 97A20, I.4.1, F.2.2, G.2.1
Abstract: Completing Aronov et al.'s study on zero-discrepancy matrices for digital halftoning, we determine all (m, n, k, l) for which it is possible to put mn consecutive integers on an m-by-n board (with wrap-around) so that each k-by-l region holds the same sum. For one of the cases where this is impossible, we give a heuristic method to find a matrix with small discrepancy., Comment: 6 pages, 6 figures
Published: 2010
Full Text: View/download PDF

91. Parallelization of the LBG Vector Quantization Algorithm for Shared Memory Systems

Author: Annaji, Rajashekar and Rao, Shrisha
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Distributed, Parallel, and Cluster Computing, I.4.1, I.4.2, D.1.3
Abstract: This paper proposes a parallel approach for the Vector Quantization (VQ) problem in image processing. VQ deals with codebook generation from the input training data set and replacement of any arbitrary data with the nearest codevector. Most of the efforts in VQ have been directed towards designing parallel search algorithms for the codebook, and little has hitherto been done in evolving a parallelized procedure to obtain an optimum codebook. This parallel algorithm addresses the problem of designing an optimum codebook using the traditional LBG type of vector quantization algorithm for shared memory systems and for the efficient usage of parallel processors. Using the codebook formed from a training set, any arbitrary input data is replaced with the nearest codevector from the codebook. The effectiveness of the proposed algorithm is indicated., Comment: 14 pages
Published: 2009

92. A Family of Simplified Geometric Distortion Models for Camera Calibration

Author: Ma, Lili, Chen, YangQuan, and Moore, Kevin L.
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: The commonly used radial distortion model for camera calibration is in fact an assumption or a restriction. In practice, camera distortion could happen in a general geometrical manner that is not limited to the radial sense. This paper proposes a simplified geometrical distortion modeling method by using two different radial distortion functions in the two image axes. A family of simplified geometric distortion models is proposed, which are either simple polynomials or the rational functions of polynomials. Analytical geometric undistortion is possible using two of the distortion functions discussed in this paper and their performance can be improved by applying a piecewise fitting idea. Our experimental results show that the geometrical distortion models always perform better than their radial distortion counterparts. Furthermore, the proposed geometric modeling method is more appropriate for cameras whose distortion is not perfectly radially symmetric around the center of distortion., Comment: 14 pages, 11 eps figures
Published: 2003

93. Camera Calibration: a USU Implementation

Author: Ma, Lili, Chen, YangQuan, and Moore, Kevin L.
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: The task of camera calibration is to estimate the intrinsic and extrinsic parameters of a camera model. Though there are some restricted techniques to infer the 3-D information about the scene from uncalibrated cameras, effective camera calibration procedures will open up the possibility of using a wide range of existing algorithms for 3-D reconstruction and recognition. The applications of camera calibration include vision-based metrology, robust visual platooning and visual docking of mobile robots where the depth information is important., Comment: 39 pages, 19 eps figures, source codes are in the codes.m and corners.dat
Published: 2003

94. An Analytical Piecewise Radial Distortion Model for Precision Camera Calibration

Author: Ma, Lili, Chen, YangQuan, and Moore, Kevin L.
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: The common approach to radial distortion is by the means of polynomial approximation, which introduces distortion-specific parameters into the camera model and requires estimation of these distortion parameters. The task of estimating radial distortion is to find a radial distortion model that allows easy undistortion as well as satisfactory accuracy. This paper presents a new piecewise radial distortion model with easy analytical undistortion formula. The motivation for seeking a piecewise radial distortion model is that, when a camera is resulted in a low quality during manufacturing, the nonlinear radial distortion can be complex. Using low order polynomials to approximate the radial distortion might not be precise enough. On the other hand, higher order polynomials suffer from the inverse problem. With the new piecewise radial distortion function, more flexibility is obtained and the radial undistortion can be performed analytically. Experimental results are presented to show that with this new piecewise radial distortion model, better performance is achieved than that using the single function. Furthermore, a comparable performance with the conventional polynomial model using 2 coefficients can also be accomplished., Comment: 6 pages, 6 PostScript figures
Published: 2003

95. A New Analytical Radial Distortion Model for Camera Calibration

Author: Ma, Lili, Chen, YangQuan, and Moore, Kevin L.
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: Common approach to radial distortion is by the means of polynomial approximation, which introduces distortion-specific parameters into the camera model and requires estimation of these distortion parameters. The task of estimating radial distortion is to find a radial distortion model that allows easy undistortion as well as satisfactory accuracy. This paper presents a new radial distortion model with an easy analytical undistortion formula, which also belongs to the polynomial approximation category. Experimental results are presented to show that with this radial distortion model, satisfactory accuracy is achieved., Comment: 2 Postscript figures
Published: 2003

96. Rational Radial Distortion Models with Analytical Undistortion Formulae

Author: Ma, Lili, Chen, YangQuan, and Moore, Kevin L.
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: The common approach to radial distortion is by the means of polynomial approximation, which introduces distortion-specific parameters into the camera model and requires estimation of these distortion parameters. The task of estimating radial distortion is to find a radial distortion model that allows easy undistortion as well as satisfactory accuracy. This paper presents a new class of rational radial distortion models with easy analytical undistortion formulae. Experimental results are presented to show that with this class of rational radial distortion models, satisfactory and comparable accuracy is achieved., Comment: 6 pages
Published: 2003

97. Flexible Camera Calibration Using a New Analytical Radial Undistortion Formula with Application to Mobile Robot Localization

Author: Ma, Lili, Chen, YangQuan, and Moore, Kevin L.
Subjects: Computer Science - Computer Vision and Pattern Recognition, I.4.1
Abstract: Most algorithms in 3D computer vision rely on the pinhole camera model because of its simplicity, whereas virtually all imaging devices introduce certain amount of nonlinear distortion, where the radial distortion is the most severe part. Common approach to radial distortion is by the means of polynomial approximation, which introduces distortion-specific parameters into the camera model and requires estimation of these distortion parameters. The task of estimating radial distortion is to find a radial distortion model that allows easy undistortion as well as satisfactory accuracy. This paper presents a new radial distortion model with an easy analytical undistortion formula, which also belongs to the polynomial approximation category. Experimental results are presented to show that with this radial distortion model, satisfactory accuracy is achieved. An application of the new radial distortion model is non-iterative yellow line alignment with a calibrated camera on ODIS, a robot built in our CSOIS., Comment: 6 pages, 2 Postscript figures
Published: 2003

98. 2D Electrophoresis Gel Image and Diagnosis of a Disease

Author: Kim, Gene and Kim, MyungHo
Subjects: Computer Science - Computational Complexity, Computer Science - Computer Vision and Pattern Recognition, Quantitative Biology - Quantitative Methods, I.5, J.3, I.4.1, I.4.3
Abstract: The process of diagnosing a disease from the 2D gel electrophoresis image is a challenging problem. This is due to technical difficulties of generating reproducible images with a normalized form and the effect of negative stain. In this paper, we will discuss a new concept of interpreting the 2D images and overcoming the aforementioned technical difficulties using mathematical transformation. The method makes use of 2D gel images of proteins in serums and we explain a way of representing the images into vectors in order to apply machine-learning methods, such as the support vector machine., Comment: 10 pages, DIMACS workshop of Rutgers University, on complexity in Biosystems
Published: 2003

99. AvatarMe++: Facial Shape and BRDF Inference With Photorealistic Rendering-Aware GANs

Author: Alexandros Lattas, Stylianos Moschoglou, Stefanos Zafeiriou, Stylianos Ploumpis, Baris Gecer, and Abhijeet Ghosh
Subjects: FOS: Computer and information sciences, Computer science, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Inference, Rendering (computer graphics), Computer Science - Graphics, I.4.1, I.3.7, I.2.10, Artificial Intelligence, Margin (machine learning), Computer vision, Specular reflection, business.industry, Applied Mathematics, Uncanny valley, Subsurface scattering, Graphics (cs.GR), Computational Theory and Mathematics, Face (geometry), Computer Vision and Pattern Recognition, Artificial intelligence, Bidirectional reflectance distribution function, business, Software
Abstract: Over the last years, many face analysis tasks have accomplished astounding performance, with applications including face generation and 3D face reconstruction from a single "in-the-wild" image. Nevertheless, to the best of our knowledge, there is no method which can produce render-ready high-resolution 3D faces from "in-the-wild" images and this can be attributed to the: (a) scarcity of available data for training, and (b) lack of robust methodologies that can successfully be applied on very high-resolution data. In this work, we introduce the first method that is able to reconstruct photorealistic render-ready 3D facial geometry and BRDF from a single "in-the-wild" image. We capture a large dataset of facial shape and reflectance, which we have made public. We define a fast facial photorealistic differentiable rendering methodology with accurate facial skin diffuse and specular reflection, self-occlusion and subsurface scattering approximation. With this, we train a network that disentangles the facial diffuse and specular BRDF components from a shape and texture with baked illumination, reconstructed with a state-of-the-art 3DMM fitting method. Our method outperforms the existing arts by a significant margin and reconstructs high-resolution 3D faces from a single low-resolution image, that can be rendered in various applications, and bridge the uncanny valley., Project and Dataset page: ( https://github.com/lattas/AvatarMe ). 20 pages, including supplemental materials. Accepted for publishing at IEEE Transactions on Pattern Analysis and Machine Intelligence on 13 November 2021. Copyright 2021 IEEE. Personal use of this material is permitted
Published: 2022

100. A Bayesian Reflection on Surfaces

Author: Wolf, David R.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Data Structures and Algorithms, Computer Science - Learning, Mathematics - Probability, Nonlinear Sciences - Adaptation and Self-Organizing Systems, Physics - Data Analysis, Statistics and Probability, G.3, I.2.4, I.2.6, I.2.10, I.4.1, I.4.4, I.4.5, I.4.10
Abstract: The topic of this paper is a novel Bayesian continuous-basis field representation and inference framework. Within this paper several problems are solved: The maximally informative inference of continuous-basis fields, that is where the basis for the field is itself a continuous object and not representable in a finite manner; the tradeoff between accuracy of representation in terms of information learned, and memory or storage capacity in bits; the approximation of probability distributions so that a maximal amount of information about the object being inferred is preserved; an information theoretic justification for multigrid methodology. The maximally informative field inference framework is described in full generality and denoted the Generalized Kalman Filter. The Generalized Kalman Filter allows the update of field knowledge from previous knowledge at any scale, and new data, to new knowledge at any other scale. An application example instance, the inference of continuous surfaces from measurements (for example, camera image data), is presented., Comment: 34 pages, 1 figure, abbreviated versions presented: Bayesian Statistics, Valencia, Spain, 1998; Maximum Entropy and Bayesian Methods, Garching, Germany, 1998
Published: 2000
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

260 results on '"I.4.1"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources