105 results on '"Jesús Malo"'
Search Results
2. Alignment of color discrimination in humans and image segmentation networks
- Author
-
Pablo Hernández-Cámara, Paula Daudén-Oliver, Valero Laparra, and Jesús Malo
- Subjects
vision models ,color discrimination ,image segmentation ,artificial neural networks ,U-Nets ,image statistics ,Psychology ,BF1-990 - Abstract
The experiments allowed by current machine learning models imply a revival of the debate on the causes of specific trends of human visual psychophysics. Machine learning facilitates the exploration of the effect of specific visual goals (such as image segmentation) by different neural architectures in different statistical environments in an unprecedented manner. In this way, (1) the principles behind psychophysical facts such as the non-Euclidean nature of human color discrimination and (2) the emergence of human-like behaviour in artificial systems can be explored under a new light. In this work, we show for the first time that the tolerance or invariance of image segmentation networks for natural images under changes of illuminant in the color space (a sort of insensitivity region around the white) is an ellipsoid oriented similarly to a (human) MacAdam ellipse. This striking similarity between an artificial system and human vision motivates a set of experiments checking the relevance of the statistical environment on the emergence of such insensitivity regions. Results suggest, that in this case, the statistics of the environment may be more relevant than the architecture selected to perform the image segmentation.
- Published
- 2024
- Full Text
- View/download PDF
3. Artificial psychophysics questions classical hue cancellation experiments
- Author
-
Jorge Vila-Tomás, Pablo Hernández-Cámara, and Jesús Malo
- Subjects
artificial psychophysics ,visual neuroscience ,hue cancellation experiments ,opponent color coding ,spectral sensitivity of artificial networks ,Neurosciences. Biological psychiatry. Neuropsychiatry ,RC321-571 - Abstract
We show that classical hue cancellation experiments lead to human-like opponent curves even if the task is done by trivial (identity) artificial networks. Specifically, human-like opponent spectral sensitivities always emerge in artificial networks as long as (i) the retina converts the input radiation into any tristimulus-like representation, and (ii) the post-retinal network solves the standard hue cancellation task, e.g. the network looks for the weights of the cancelling lights so that every monochromatic stimulus plus the weighted cancelling lights match a grey reference in the (arbitrary) color representation used by the network. In fact, the specific cancellation lights (and not the network architecture) are key to obtain human-like curves: results show that the classical choice of the lights is the one that leads to the best (more human-like) result, and any other choices lead to progressively different spectral sensitivities. We show this in two ways: through artificial psychophysics using a range of networks with different architectures and a range of cancellation lights, and through a change-of-basis theoretical analogy of the experiments. This suggests that the opponent curves of the classical experiment are just a by-product of the front-end photoreceptors and of a very specific experimental choice but they do not inform about the downstream color representation. In fact, the architecture of the post-retinal network (signal recombination or internal color space) seems irrelevant for the emergence of the curves in the classical experiment. This result in artificial networks questions the conventional interpretation of the classical result in humans by Jameson and Hurvich.
- Published
- 2023
- Full Text
- View/download PDF
4. Information Flow in Biological Networks for Color Vision
- Author
-
Jesús Malo
- Subjects
chromatic information ,color appearance networks ,efficient coding hypothesis ,total correlation ,mutual information ,Gaussianization ,Science ,Astrophysics ,QB460-466 ,Physics ,QC1-999 - Abstract
Biological neural networks for color vision (also known as color appearance models) consist of a cascade of linear + nonlinear layers that modify the linear measurements at the retinal photo-receptors leading to an internal (nonlinear) representation of color that correlates with psychophysical experience. The basic layers of these networks include: (1) chromatic adaptation (normalization of the mean and covariance of the color manifold); (2) change to opponent color channels (PCA-like rotation in the color space); and (3) saturating nonlinearities to obtain perceptually Euclidean color representations (similar to dimension-wise equalization). The Efficient Coding Hypothesis argues that these transforms should emerge from information-theoretic goals. In case this hypothesis holds in color vision, the question is what is the coding gain due to the different layers of the color appearance networks? In this work, a representative family of color appearance models is analyzed in terms of how the redundancy among the chromatic components is modified along the network and how much information is transferred from the input data to the noisy response. The proposed analysis is performed using data and methods that were not available before: (1) new colorimetrically calibrated scenes in different CIE illuminations for the proper evaluation of chromatic adaptation; and (2) new statistical tools to estimate (multivariate) information-theoretic quantities between multidimensional sets based on Gaussianization. The results confirm that the efficient coding hypothesis holds for current color vision models, and identify the psychophysical mechanisms critically responsible for gains in information transference: opponent channels and their nonlinear nature are more important than chromatic adaptation at the retina.
- Published
- 2022
- Full Text
- View/download PDF
5. In Praise of Artifice Reloaded: Caution With Natural Image Databases in Modeling Vision
- Author
-
Marina Martinez-Garcia, Marcelo Bertalmío, and Jesús Malo
- Subjects
natural stimuli ,artificial stimuli ,subjective image quality databases ,wavelet + divisive normalization ,contrast masking ,Neurosciences. Biological psychiatry. Neuropsychiatry ,RC321-571 - Abstract
Subjective image quality databases are a major source of raw data on how the visual system works in naturalistic environments. These databases describe the sensitivity of many observers to a wide range of distortions of different nature and intensity seen on top of a variety of natural images. Data of this kind seems to open a number of possibilities for the vision scientist to check the models in realistic scenarios. However, while these natural databases are great benchmarks for models developed in some other way (e.g., by using the well-controlled artificial stimuli of traditional psychophysics), they should be carefully used when trying to fit vision models. Given the high dimensionality of the image space, it is very likely that some basic phenomena are under-represented in the database. Therefore, a model fitted on these large-scale natural databases will not reproduce these under-represented basic phenomena that could otherwise be easily illustrated with well selected artificial stimuli. In this work we study a specific example of the above statement. A standard cortical model using wavelets and divisive normalization tuned to reproduce subjective opinion on a large image quality dataset fails to reproduce basic cross-masking. Here we outline a solution for this problem by using artificial stimuli and by proposing a modification that makes the model easier to tune. Then, we show that the modified model is still competitive in the large-scale database. Our simulations with these artificial stimuli show that when using steerable wavelets, the conventional unit norm Gaussian kernels in divisive normalization should be multiplied by high-pass filters to reproduce basic trends in masking. Basic visual phenomena may be misrepresented in large natural image datasets but this can be solved with model-interpretable stimuli. This is an additional argument in praise of artifice in line with Rust and Movshon (2005).
- Published
- 2019
- Full Text
- View/download PDF
6. Topographic Independent Component Analysis reveals random scrambling of orientation in visual space.
- Author
-
Marina Martinez-Garcia, Luis M Martinez, and Jesús Malo
- Subjects
Medicine ,Science - Abstract
Neurons at primary visual cortex (V1) in humans and other species are edge filters organized in orientation maps. In these maps, neurons with similar orientation preference are clustered together in iso-orientation domains. These maps have two fundamental properties: (1) retinotopy, i.e. correspondence between displacements at the image space and displacements at the cortical surface, and (2) a trade-off between good coverage of the visual field with all orientations and continuity of iso-orientation domains in the cortical space. There is an active debate on the origin of these locally continuous maps. While most of the existing descriptions take purely geometric/mechanistic approaches which disregard the network function, a clear exception to this trend in the literature is the original approach of Hyvärinen and Hoyer based on infomax and Topographic Independent Component Analysis (TICA). Although TICA successfully addresses a number of other properties of V1 simple and complex cells, in this work we question the validity of the orientation maps obtained from TICA. We argue that the maps predicted by TICA can be analyzed in the retinal space, and when doing so, it is apparent that they lack the required continuity and retinotopy. Here we show that in the orientation maps reported in the TICA literature it is easy to find examples of violation of the continuity between similarly tuned mechanisms in the retinal space, which suggest a random scrambling incompatible with the maps in primates. The new experiments in the retinal space presented here confirm this guess: TICA basis vectors actually follow a random salt-and-pepper organization back in the image space. Therefore, the interesting clusters found in the TICA topology cannot be interpreted as the actual cortical orientation maps found in cats, primates or humans. In conclusion, Topographic ICA does not reproduce cortical orientation maps.
- Published
- 2017
- Full Text
- View/download PDF
7. Spatio-chromatic adaptation via higher-order canonical correlation analysis of natural images.
- Author
-
Michael U Gutmann, Valero Laparra, Aapo Hyvärinen, and Jesús Malo
- Subjects
Medicine ,Science - Abstract
Independent component and canonical correlation analysis are two general-purpose statistical methods with wide applicability. In neuroscience, independent component analysis of chromatic natural images explains the spatio-chromatic structure of primary cortical receptive fields in terms of properties of the visual environment. Canonical correlation analysis explains similarly chromatic adaptation to different illuminations. But, as we show in this paper, neither of the two methods generalizes well to explain both spatio-chromatic processing and adaptation at the same time. We propose a statistical method which combines the desirable properties of independent component and canonical correlation analysis: It finds independent components in each data set which, across the two data sets, are related to each other via linear or higher-order correlations. The new method is as widely applicable as canonical correlation analysis, and also to more than two data sets. We call it higher-order canonical correlation analysis. When applied to chromatic natural images, we found that it provides a single (unified) statistical framework which accounts for both spatio-chromatic processing and adaptation. Filters with spatio-chromatic tuning properties as in the primary visual cortex emerged and corresponding-colors psychophysics was reproduced reasonably well. We used the new method to make a theory-driven testable prediction on how the neural response to colored patterns should change when the illumination changes. We predict shifts in the responses which are comparable to the shifts reported for chromatic contrast habituation.
- Published
- 2014
- Full Text
- View/download PDF
8. Orthonormal Convolutions for the Rotation Based Iterative Gaussianization.
- Author
-
Valero Laparra, Alexander Hepburn, Juan Emmanuel Johnson, and Jesús Malo
- Published
- 2022
- Full Text
- View/download PDF
9. Perceptnet: A Human Visual System Inspired Neural Network For Estimating Perceptual Distance.
- Author
-
Alexander Hepburn, Valero Laparra, Jesús Malo, Ryan McConville, and Raúl Santos-Rodríguez
- Published
- 2020
- Full Text
- View/download PDF
10. Lossless coding of hyperspectral images with principal polynomial analysis.
- Author
-
Naoufal Amrani, Valero Laparra, Gustavo Camps-Valls, Joan Serra-Sagristà, and Jesús Malo
- Published
- 2014
- Full Text
- View/download PDF
11. Complex-Valued Independent Component Analysis of Natural Images.
- Author
-
Valero Laparra, Michael Gutmann, Jesús Malo, and Aapo Hyvärinen
- Published
- 2011
- Full Text
- View/download PDF
12. Estimating biophysical variable dependences with kernels.
- Author
-
Gustavo Camps-Valls, Devis Tuia, Valero Laparra, and Jesús Malo
- Published
- 2010
- Full Text
- View/download PDF
13. PCA Gaussianization for image processing.
- Author
-
Valero Laparra, Gustavo Camps-Valls, and Jesús Malo
- Published
- 2009
- Full Text
- View/download PDF
14. Spatio-chromatic information available from different neural layers via Gaussianization
- Author
-
Jesús Malo
- Subjects
Multivariate statistics ,Computer science ,Neuroscience (miscellaneous) ,Retina–cortex pathway ,Total correlation ,law.invention ,lcsh:RC321-571 ,03 medical and health sciences ,symbols.namesake ,0302 clinical medicine ,law ,Chromatic saturation ,Chromatic scale ,lcsh:Neurosciences. Biological psychiatry. Neuropsychiatry ,030304 developmental biology ,Opponent channels ,0303 health sciences ,business.industry ,Research ,lcsh:Mathematics ,Estimator ,Chromatic adaptation ,Texture sensors ,Pattern recognition ,Mutual information ,lcsh:QA1-939 ,Chromatic and achromatic Contrast sensitivity functions (CSFs) ,Transmitted information ,Achromatic lens ,Quantitative Biology - Neurons and Cognition ,FOS: Biological sciences ,Divisive normalization ,Cones ,Jacobian matrix and determinant ,symbols ,Neurons and Cognition (q-bio.NC) ,Gaussianization ,Artificial intelligence ,business ,030217 neurology & neurosurgery - Abstract
How much visual information about the retinal images can be extracted from the different layers of the visual pathway? This question depends on the complexity of the visual input, the set of transforms applied to this multivariate input, and the noise of the sensors in the considered layer. Separate subsystems (e.g. opponent channels, spatial filters, nonlinearities of the texture sensors) have been suggested to be organized for optimal information transmission. However, the efficiency of these different layers has not been measured when they operate together on colorimetrically calibrated natural images and using multivariate information-theoretic units over the joint spatio-chromatic array of responses. In this work, we present a statistical tool to address this question in an appropriate (multivariate) way. Specifically, we propose an empirical estimate of the information transmitted by the system based on a recent Gaussianization technique. The total correlation measured using the proposed estimator is consistent with predictions based on the analytical Jacobian of a standard spatio-chromatic model of the retina–cortex pathway. If the noise at certain representation is proportional to the dynamic range of the response, and one assumes sensors of equivalent noise level, then transmitted information shows the following trends: (1) progressively deeper representations are better in terms of the amount of captured information, (2) the transmitted information up to the cortical representation follows the probability of natural scenes over the chromatic and achromatic dimensions of the stimulus space, (3) the contribution of spatial transforms to capture visual information is substantially greater than the contribution of chromatic transforms, and (4) nonlinearities of the responses contribute substantially to the transmitted information but less than the linear transforms.
- Published
- 2020
15. Contrast sensitivity functions in autoencoders
- Author
-
Qiang, Li, Alex, Gomez-Villa, Marcelo, Bertalmío, Jesús, Malo, Ministerio de Ciencia e Innovación (España), and Generalitat Valenciana
- Subjects
Chromatic adaptation ,Architectures ,Natural images ,Deblurring and denoising ,Modulation transfer function ,Retina ,Sensory Systems ,Contrast Sensitivity ,Ophthalmology ,Quantitative Biology - Neurons and Cognition ,FOS: Biological sciences ,Humans ,Neurons and Cognition (q-bio.NC) ,Neural Networks, Computer ,Spatiotemporal and chromatic contrast sensitivity ,Noisy cones ,Vision, Ocular ,Convolutional autoencoders ,Statistical goals - Abstract
45 pags., 24 figs., 4 tabs., 3 apps., The human contrast sensitivity function (CSF) characterizes the psychophysical response to visual gratings of different frequency (Campbell & Robson, 1968). Filter characterizations in the Fourier domain are complete only for linear, shift-invariant systems. Human vision certainly is more complicated than that, however, this simple measure of the bandwidth of the system is still of paramount significance in biological vision: the CSF filter is an image-computable model that roughly describes the kind of visual information that is available for humans (Watson & Ahumada, 2016). Moreover, although it is defined for threshold conditions, there are many examples that illustrate the relevance of the CSF in more general situations (Watson et al., 1986; Watson & Malo, 2002; Watson & Ahumada, 2005), so it has shaped image engineering over decades (Mannos & Sakrison, 1974; Hunt, 1975; Wallace, 1992; Taubman & Marcellin, 2001). This theoretical and practical relevance motivated the measurement of CSFs, not only for spatial gratings (Campbell & Robson, 1968), but also for moving gratings (Kelly, 1979), chromatic gratings (Mullen, 1985), spatiotemporal chromatic gratings (Díez-Ajenjo et al., 2011), at different luminance levels (Wuerger et al., 2020), and for alternative basis of the image space (Malo et al., 1997), Partially funded by these grants from GVA/AEI/FEDER/EU: MICINN DPI2017-89867-C2-2-R, MICINN PID2020-118071GB-I00, and GVA Grisolía-P/2019/035 (for JM and QL), and MICINN PGC2018-099651-B-I00 (for A.G.V. and M.B.)
- Published
- 2022
16. Remote Sensing Image Processing
- Author
-
Gustavo Camps-Valls, Devis Tuia, Luis Gómez-Chova, Sandra Jiménez, and Jesús Malo
- Published
- 2011
- Full Text
- View/download PDF
17. Geometrical and statistical properties of vision models obtained via maximum differentiation.
- Author
-
Jesús Malo and Eero P. Simoncelli
- Published
- 2015
- Full Text
- View/download PDF
18. Variable-size block matching algorithm for motion estimation using a perceptual-based splitting criterion.
- Author
-
Francesc J. Ferri, Jesús Malo, Jesús V. Albert, and J. Soret
- Published
- 1998
- Full Text
- View/download PDF
19. Computing variations of entropy and redundancy under nonlinear mappings not preserving the signal dimension: quantifying the efficiency of V1 cortex
- Author
-
José J. Esteve-Taboada, Jesús Malo, Valero Laparra, Qiang Li, and Emmanuel Johnson
- Subjects
symbols.namesake ,Wavelet ,Redundancy (information theory) ,Dimension (vector space) ,Computer science ,Jacobian matrix and determinant ,symbols ,Entropy (information theory) ,Total correlation ,Efficient coding hypothesis ,Algorithm ,Curse of dimensionality - Abstract
In computational neuroscience, the Efficient Coding Hypothesis argues that the neural organization comes from the optimization of information-theoretic goals [Barlow Proc.Nat.Phys.Lab.59]. A way to confirm this requires the analysis of the statistical performance of biological systems that have not been statistically optimized [Renart et al. Science10, Malo&Laparra Neur.Comp.10, Foster JOSA18, Gomez-Villa&Malo J.Neurophysiol.19]. However, when analyzing the information-theoretic performance, cortical magnification in the retina-cortex pathway poses a theoretical problem. Cortical magnification stands for the increase the signal dimensionality [Cowey&Rolls Exp. Brain Res.74]. Conventional models based on redundant wavelets increase the dimension of the signal by 1 order of magnitude [Watson CVGIP87, Schwartz&Simoncelli Nat.Neurosci.01]. Such increase implies a problem to quantify the efficiency of the transforms. In fact, previous accounts of the information flow along physiological networks had to do some sort of approximation to deal with magnification, e.g. (1) using orthonormal wavelets or preserving dimension [Bethge JOSA06, Malo&Laparra Neur.Comp.10] , or (2) using a reference for the relations introduced by the redundant transform [Laparra&Malo JMLR10, Gomez-Villa&Malo J.Neurophysiol.19]. In this work, we address the information theoretic analysis of such nonlinear systems that do not preserve dimension using no approximation. On the one hand we derive the theory to compute variations of entropy and total correlation under such transforms, which involves the knowledge of the Jacobian of the system wrt the input. To that end, we use the analytical results in [Martinez&Malo PLOS18]. On the other hand, we compare such predictions with a recently proposed non-parametric estimator of information-theory measures: the Rotation-Based Iterative Gaussianization [Laparra&Malo IEEE Trans.Neur.Nets11, Johnson, Laparra&Malo ICML19]. Consistency between the results validate the theory and provide new insights into the visual neural function.
- Published
- 2021
20. Information flow in Color Appearance Neural Networks
- Author
-
Jesús Malo
- Subjects
Artificial neural network ,Computer science ,Color vision ,business.industry ,Chromatic adaptation ,Normalization (image processing) ,Pattern recognition ,Chromatic scale ,Artificial intelligence ,Efficient coding hypothesis ,Color space ,Covariance ,business - Abstract
Color Appearance Models are biological networks that consist of a cascade of linear+nonlinear layers that modify the linear measurements at the retinal photo-receptors leading to an internal (nonlinear) representation of color that correlates with psychophysical experience. The basic layers of these networks include: (1) chromatic adaptation (normalization of the mean and covariance of the color manifold), (2) change to opponent color channels (PCA-like rotation in the color space), and (3) saturating nonlinearities to get perceptually Euclidean color representations (similar to dimensionwise equalization). The Efficient Coding Hypothesis argues that these transforms should emerge from information-theoretic goals. In case this hypothesis holds in color vision, the question is, what is the coding gain due to the different layers of the color appearance networks? In this work, a representative family of Color Appearance Models is analyzed in terms of how the redundancy among the chromatic components is modified along the network and how much information is transferred from the input data to the noisy response. The proposed analysis is done using data and methods that were not available before: (1) new colorimetrically calibrated scenes in different CIE illuminations for proper evaluation of chromatic adaptation, and (2) new statistical tools to estimate (multivariate) information-theoretic quantities between multidimensional sets based on Gaussianization. Results confirm that the Efficient Coding Hypothesis holds for current color vision models, and identify the psychophysical mechanisms critically responsible for gains in information transference: opponent channels and their nonlinear nature are more important than chromatic adaptation at the retina.
- Published
- 2021
21. Evidence for the intrinsically nonlinear nature of receptive fields in vision
- Author
-
Alex Gomez-Villa, Jesús Malo, David Kane, Javier Vazquez-Corral, Marcelo Bertalmío, and Adrián Martín
- Subjects
0301 basic medicine ,Visual perception ,Property (programming) ,Computer science ,media_common.quotation_subject ,lcsh:Medicine ,Basis function ,Stimulus (physiology) ,Models, Biological ,03 medical and health sciences ,0302 clinical medicine ,Artificial Intelligence ,Perception ,Animals ,Humans ,lcsh:Science ,Set (psychology) ,Vision, Ocular ,media_common ,Multidisciplinary ,Artificial neural network ,business.industry ,lcsh:R ,Vision science ,Nonlinear system ,030104 developmental biology ,Nonlinear Dynamics ,Receptive field ,Visual Perception ,lcsh:Q ,Neural Networks, Computer ,Artificial intelligence ,business ,030217 neurology & neurosurgery ,Retinal Neurons - Abstract
The responses of visual neurons, as well as visual perception phenomena in general, are highly nonlinear functions of the visual input, while most vision models are grounded on the notion of a linear receptive field (RF). The linear RF has a number of inherent problems: it changes with the input, it presupposes a set of basis functions for the visual system, and it conflicts with recent studies on dendritic computations. Here we propose to model the RF in a nonlinear manner, introducing the intrinsically nonlinear receptive field (INRF). Apart from being more physiologically plausible and embodying the efficient representation principle, the INRF has a key property of wide-ranging implications: for several vision science phenomena where a linear RF must vary with the input in order to predict responses, the INRF can remain constant under different stimuli. We also prove that Artificial Neural Networks with INRF modules instead of linear filters have a remarkably improved performance and better emulate basic human perception. Our results suggest a change of paradigm for vision science as well as for artificial intelligence.
- Published
- 2020
22. Channel Capacity in Psychovisual Deep-Nets: Gaussianization Versus Kozachenko-Leonenko
- Author
-
Jesús Malo
- Subjects
Visual Psychophysics ,Artificial neural network ,business.industry ,Estimator ,Pattern recognition ,law.invention ,Channel capacity ,Achromatic lens ,law ,Chromatic scale ,Artificial intelligence ,Representation (mathematics) ,business ,Adaptation (computer science) - Abstract
In this work, we quantify how neural networks designed from biology using no statistical training have a remarkable performance in information theoretic terms. Specifically, we address the question of the amount of information that can be extracted about the images from the different layers of psychophysically tuned deep networks. We show that analytical approaches are not possible, and we propose the use of two empirical estimators of capacity: the classical Kozachenko-Lonenko estimator and a recent estimator based on Gaussianization. Results show that networks purely based on visual psychophysics are extremely efficient in two aspects: (1) the internal representation of these networks duplicates the amount of information that can be extracted about the images with regard to the amount of information that could be obtained from the input representation assuming sensors of the same quality, and (2) the capacity of internal representation follows the PDF of natural scenes over the chromatic and achromatic dimensions of the stimulus space. This remarkable adaptation to the natural environment is an example of how imitation of biological vision may inspire architectures and save training effort in artificial vision.
- Published
- 2020
23. CREATIVE DISSEMINATION OF OPTOMETRY FOR PRIMARY EDUCATION STUDENTS
- Author
-
José J. Esteve-Taboada, Pascual Capilla, Álvaro Pons, María Del Carmen García-Domene, Dolores de Fez, Amparo Díez-Ajenjo, Paula García-Balaguer, María José Luque-Cobija, Vicent Sanchis, Rosa Vila-Andrés, Lydia Torres-Villanueva, and Jesús Malo
- Subjects
Medical education ,Primary education ,Sociology - Published
- 2020
24. OPTOMETRIC GYMKHANA FOR THE ACQUISITION OF CLINICAL PSYCHOPHYSICS CONCEPTS
- Author
-
Vicent Sanchis, Pascual Capilla, Álvaro Pons, María Del Carmen García-Domene, José J. Esteve-Taboada, Amparo Díez-Ajenjo, Dolores de Fez, María José Luque-Cobija, and Jesús Malo
- Subjects
Cognitive science ,Psychophysics ,Gymkhana ,Psychology - Published
- 2020
25. Visual information flow in Wilson-Cowan networks
- Author
-
Alexander Gomez-Villa, Marcelo Bertalmío, and Jesús Malo
- Subjects
Normalization (statistics) ,Physiology ,Computer science ,Computation ,Population ,Models, Biological ,050105 experimental psychology ,Retina ,03 medical and health sciences ,Wilson–Cowan equations ,0302 clinical medicine ,Multi-informationtotal correlation ,Humans ,0501 psychology and cognitive sciences ,Visual Pathways ,Efficient coding hypothesis ,Efficient representation principle ,education ,Visual Cortex ,education.field_of_study ,Normalization model ,General Neuroscience ,05 social sciences ,Univariate ,FOS: Biological sciences ,Quantitative Biology - Neurons and Cognition ,Divisive normalization ,Visual Perception ,Neurons and Cognition (q-bio.NC) ,Total correlation ,Neural Networks, Computer ,Nerve Net ,Algorithm ,030217 neurology & neurosurgery ,Image compression - Abstract
In this paper, we study the communication efficiency of a psychophysically tuned cascade of Wilson-Cowan and divisive normalization layers that simulate the retina-V1 pathway. This is the first analysis of Wilson-Cowan networks in terms of multivariate total correlation. The parameters of the cortical model have been derived through the relation between the steady state of the Wilson-Cowan model and the divisive normalization model. The communication efficiency has been analyzed in two ways: First, we provide an analytical expression for the reduction of the total correlation among the responses of a V1-like population after the application of the Wilson-Cowan interaction. Second, we empirically study the efficiency with visual stimuli and statistical tools that were not available before 1) we use a recent, radiometrically calibrated, set of natural scenes, and 2) we use a recent technique to estimate the multivariate total correlation in bits from sets of visual responses, which only involves univariate operations, thus giving better estimates of the redundancy. The theoretical and the empirical results show that, although this cascade of layers was not optimized for statistical independence in any way, the redundancy between the responses gets substantially reduced along the neural pathway. Specifically, we show that 1) the efficiency of a Wilson-Cowan network is similar to its equivalent divisive normalization model; 2) while initial layers (Von Kries adaptation and Weber-like brightness) contribute to univariate equalization, and the bigger contributions to the reduction in total correlation come from the computation of nonlinear local contrast and the application of local oriented filters; and 3) psychophysically tuned models are more efficient (reduce more total correlation) in the more populated regions of the luminance-contrast plane. These results are an alternative confirmation of the efficient coding hypothesis for the Wilson-Cowan systems, and, from an applied perspective, they suggest that neural field models could be an option in image coding to perform image compression.NEW & NOTEWORTHY The Wilson-Cowan interaction is analyzed in total correlation terms for the first time. Theoretical and empirical results show that this psychophysically tuned interaction achieves the biggest efficiency in the most frequent region of the image space. This is an original confirmation of the efficient coding hypothesis and suggests that neural field models can be an alternative to divisive normalization in image compression.
- Published
- 2020
26. Canonical Retina-to-Cortex Vision Model Ready for Automatic Differentiation
- Author
-
Jesús Malo and Qiang Li
- Subjects
Theoretical computer science ,Computer science ,Automatic differentiation ,business.industry ,Computation ,Deep learning ,Python (programming language) ,Task (project management) ,Nonlinear system ,Distortion ,Key (cryptography) ,Artificial intelligence ,business ,computer ,computer.programming_language - Abstract
Canonical vision models of the retina-to-V1 cortex pathway consist of cascades of several Linear+Nonlinear layers. In this setting, parameter tuning is the key to obtain a sensible behavior when putting all these multiple layers to work together. Conventional tuning of these neural models very much depends on the explicit computation of the derivatives of the response with regard to the parameters. And, in general, this is not an easy task. Automatic differentiation is a tool developed by the deep learning community to solve similar problems without the need of explicit computation of the analytic derivatives. Therefore, implementations of canonical visual neuroscience models that are ready to be used in an automatic differentiation environment are extremely needed nowadays. In this work we introduce a Python implementation of a standard multi-layer model for the retina-to-V1 pathway. Results show that the proposed default parameters reproduce image distortion psychophysics. More interestingly, given the python implementation, the parameters of this visual model are ready to be optimized with automatic differentiation tools for alternative goals.
- Published
- 2020
27. Information Flow in Biological Networks for Color Vision
- Author
-
Jesús Malo
- Subjects
FOS: Biological sciences ,Quantitative Biology - Neurons and Cognition ,General Physics and Astronomy ,Neurons and Cognition (q-bio.NC) ,chromatic information ,color appearance networks ,efficient coding hypothesis ,total correlation ,mutual information ,Gaussianization - Abstract
Biological neural networks for color vision (also known as color appearance models) consist of a cascade of linear + nonlinear layers that modify the linear measurements at the retinal photo-receptors leading to an internal (nonlinear) representation of color that correlates with psychophysical experience. The basic layers of these networks include: (1) chromatic adaptation (normalization of the mean and covariance of the color manifold); (2) change to opponent color channels (PCA-like rotation in the color space); and (3) saturating nonlinearities to obtain perceptually Euclidean color representations (similar to dimension-wise equalization). The Efficient Coding Hypothesis argues that these transforms should emerge from information-theoretic goals. In case this hypothesis holds in color vision, the question is what is the coding gain due to the different layers of the color appearance networks? In this work, a representative family of color appearance models is analyzed in terms of how the redundancy among the chromatic components is modified along the network and how much information is transferred from the input data to the noisy response. The proposed analysis is performed using data and methods that were not available before: (1) new colorimetrically calibrated scenes in different CIE illuminations for the proper evaluation of chromatic adaptation; and (2) new statistical tools to estimate (multivariate) information-theoretic quantities between multidimensional sets based on Gaussianization. The results confirm that the efficient coding hypothesis holds for current color vision models, and identify the psychophysical mechanisms critically responsible for gains in information transference: opponent channels and their nonlinear nature are more important than chromatic adaptation at the retina.
- Published
- 2019
28. Color illusions also deceive CNNs for low-level vision tasks: Analysis and implications
- Author
-
Alexander Gomez-Villa, Marcelo Bertalmío, Javier Vazquez-Corral, Adrián Martín, and Jesús Malo
- Subjects
Computer science ,media_common.quotation_subject ,Illusion ,Color space ,Convolutional neural network ,050105 experimental psychology ,03 medical and health sciences ,0302 clinical medicine ,Perception ,Humans ,0501 psychology and cognitive sciences ,Vision, Ocular ,media_common ,Artificial neural network ,business.industry ,Optical illusion ,05 social sciences ,Illusions ,Sensory Systems ,Ophthalmology ,Vision science ,Human visual system model ,Artificial intelligence ,Neural Networks, Computer ,business ,030217 neurology & neurosurgery - Abstract
The study of visual illusions has proven to be a very useful approach in vision science. In this work we start by showing that, while convolutional neural networks (CNNs) trained for low-level visual tasks in natural images may be deceived by brightness and color illusions, some network illusions can be inconsistent with the perception of humans. Next, we analyze where these similarities and differences may come from. On one hand, the proposed linear eigenanalysis explains the overall similarities: in simple CNNs trained for tasks like denoising or deblurring, the linear version of the network has center-surround receptive fields, and global transfer functions are very similar to the human achromatic and chromatic contrast sensitivity functions in human-like opponent color spaces. These similarities are consistent with the long-standing hypothesis that considers low-level visual illusions as a by-product of the optimization to natural environments. Specifically, here human-like features emerge from error minimization. On the other hand, the observed differences must be due to the behavior of the human visual system not explained by the linear approximation. However, our study also shows that more ‘flexible’ network architectures, with more layers and a higher degree of nonlinearity, may actually have a worse capability of reproducing visual illusions. This implies, in line with other works in the vision science literature, a word of caution on using CNNs to study human vision: on top of the intrinsic limitations of the L+NL formulation of artificial networks to model vision, the nonlinear behavior of flexible architectures may easily be markedly different from that of the visual system. This work has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement number 761544 (project HDR4EU) and under grant agreement number 780470 (project SAUCE), and by the Spanish government and FEDER Fund, grant Ref. PGC2018-099651-B-I00 (MCIU/AEI/FEDER, UE). The work of AM was supported by the Spanish government under Grant FJCI-2017–31758. JM has been supported by the Spanish government under the MINECO grant Ref. DPI2017-89867 and by the Generalitat Velanciana grant Ref. GrisoliaP-2019-035. We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.
- Published
- 2019
29. Visual discrimination and adaptation using non-linear unsupervised learning.
- Author
-
Sandra Jiménez, Valero Laparra, and Jesús Malo
- Published
- 2013
- Full Text
- View/download PDF
30. Chromatic induction and contrast masking: similar models, different goals?
- Author
-
Sandra Jiménez, Xavier Otazu, Valero Laparra, and Jesús Malo
- Published
- 2013
- Full Text
- View/download PDF
31. Remote Sensing Image Processing
- Author
-
Gustavo Camps-Valls, Devis Tuia, Luis Gómez-Chova, Sandra Jiménez, Jesus Malo, Gustavo Camps-Valls, Devis Tuia, Luis Gómez-Chova, Sandra Jiménez, and Jesus Malo
- Subjects
- Electrical engineering, Signal processing
- Abstract
Earth observation is the field of science concerned with the problem of monitoring and modeling the processes on the Earth surface and their interaction with the atmosphere. The Earth is continuously monitored with advanced optical and radar sensors. The images are analyzed and processed to deliver useful products to individual users, agencies and public administrations. To deal with these problems, remote sensing image processing is nowadays a mature research area, and the techniques developed in the field allow many real-life applications with great societal value. For instance, urban monitoring, fire detection or flood prediction can have a great impact on economical and environmental issues. To attain such objectives, the remote sensing community has turned into a multidisciplinary field of science that embraces physics, signal theory, computer science, electronics and communications. From a machine learning and signal/image processing point of view, all the applications are tackled under specific formalisms, such as classification and clustering, regression and function approximation, data coding, restoration and enhancement, source unmixing, data fusion or feature selection and extraction. This book covers some of the fields in a comprehensive way. Table of Contents: Remote Sensing from Earth Observation Satellites / The Statistics of Remote Sensing Images / Remote Sensing Feature Selection and Extraction / Classification / Spectral Mixture Analysis / Estimation of Physical Parameters
- Published
- 2022
32. Visual Information Fidelity with better Vision Models and better Mutual Information Estimates
- Author
-
Qiang Li, Benyamin Kheravdar, and Jesús Malo
- Subjects
Ophthalmology ,Computer science ,business.industry ,media_common.quotation_subject ,Fidelity ,Mutual information ,Artificial intelligence ,Machine learning ,computer.software_genre ,business ,computer ,Sensory Systems ,media_common - Published
- 2021
33. PerceptNet: A Human Visual System Inspired Neural Network for Estimating Perceptual Distance
- Author
-
Raul Santos-Rodriguez, Alexander Hepburn, Jesús Malo, Valero Laparra, and Ryan McConville
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Visual perception ,Computer science ,Image quality ,media_common.quotation_subject ,Feature extraction ,Machine Learning (stat.ML) ,02 engineering and technology ,01 natural sciences ,Convolutional neural network ,human visual system ,Machine Learning (cs.LG) ,010309 optics ,Statistics - Machine Learning ,Perception ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,FOS: Electrical engineering, electronic engineering, information engineering ,perceptual distance ,media_common ,Artificial neural network ,business.industry ,Deep learning ,Image and Video Processing (eess.IV) ,Pattern recognition ,Electrical Engineering and Systems Science - Image and Video Processing ,neural networks ,Human visual system model ,020201 artificial intelligence & image processing ,Artificial intelligence ,business - Abstract
Traditionally, the vision community has devised algorithms to estimate the distance between an original image and images that have been subject to perturbations. Inspiration was usually taken from the human visual perceptual system and how the system processes different perturbations in order to replicate to what extent it determines our ability to judge image quality. While recent works have presented deep neural networks trained to predict human perceptual quality, very few borrow any intuitions from the human visual system. To address this, we present PerceptNet, a convolutional neural network where the architecture has been chosen to reflect the structure and various stages in the human visual system. We evaluate PerceptNet on various traditional perception datasets and note strong performance on a number of them as compared with traditional image quality metrics. We also show that including a nonlinearity inspired by the human visual system in classical deep neural networks architectures can increase their ability to judge perceptual similarity. Compared to similar deep learning methods, the performance is similar, although our network has a number of parameters that is several orders of magnitude less.
- Published
- 2019
- Full Text
- View/download PDF
34. Regression Wavelet Analysis for Lossless Coding of Remote-Sensing Data
- Author
-
Michael W. Marcellin, Joan Serra-Sagrista, Jesús Malo, Naoufal Amrani, and Valero Laparra
- Subjects
Discrete wavelet transform ,Computational complexity theory ,business.industry ,0211 other engineering and technologies ,Hyperspectral imaging ,Pattern recognition ,Regression analysis ,02 engineering and technology ,Wavelet packet decomposition ,Wavelet ,Principal component analysis ,0202 electrical engineering, electronic engineering, information engineering ,General Earth and Planetary Sciences ,020201 artificial intelligence & image processing ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,021101 geological & geomatics engineering ,Remote sensing ,Mathematics ,Coding (social sciences) - Abstract
A novel wavelet-based scheme to increase coefficient independence in hyperspectral images is introduced for lossless coding. The proposed regression wavelet analysis (RWA) uses multivariate regression to exploit the relationships among wavelet-transformed components. It builds on our previous nonlinear schemes that estimate each coefficient from neighbor coefficients. Specifically, RWA performs a pyramidal estimation in the wavelet domain, thus reducing the statistical relations in the residuals and the energy of the representation compared to existing wavelet-based schemes. We propose three regression models to address the issues concerning estimation accuracy, component scalability, and computational complexity. Other suitable regression models could be devised for other goals. RWA is invertible, it allows a reversible integer implementation, and it does not expand the dynamic range. Experimental results over a wide range of sensors, such as AVIRIS, Hyperion, and Infrared Atmospheric Sounding Interferometer, suggest that RWA outperforms not only principal component analysis and wavelets but also the best and most recent coding standard in remote sensing, CCSDS-123.
- Published
- 2016
35. Derivatives and Inverse of Cascaded Linear+Nonlinear Neural Models
- Author
-
Praveen Cyriac, Marcelo Bertalmío, Marina Martinez-Garcia, Jesús Malo, Thomas Batard, Ministerio de Economía y Competitividad (España), Comisión Interministerial de Ciencia y Tecnología, CICYT (España), and European Research Council
- Subjects
0301 basic medicine ,Light ,Computer science ,Vision ,Sensory systems ,lcsh:Medicine ,Inverse ,Social Sciences ,Sensory perception ,law.invention ,Machine Learning ,Matrix (mathematics) ,0302 clinical medicine ,Wavelet ,law ,Signal Decoders ,Psychology ,lcsh:Science ,Bioassays and physiological analysis ,Visual Cortex ,Multidisciplinary ,Physics ,Electromagnetic Radiation ,Linear model ,Sensory Systems ,Invertible matrix ,Bioassays and Physiological Analysis ,Jacobian matrix and determinant ,Physical Sciences ,symbols ,Engineering and Technology ,Neurons and Cognition (q-bio.NC) ,Sensory Perception ,Algorithm ,Algorithms ,Neural decoding ,Research Article ,Normalization (statistics) ,Visible Light ,Models, Neurological ,Research and Analysis Methods ,03 medical and health sciences ,symbols.namesake ,Signal decoders ,Psychophysics ,Humans ,Vision, Ocular ,lcsh:R ,Neurosciences ,Biology and Life Sciences ,Nonlinear system ,030104 developmental biology ,Algebra ,Luminance ,Linear Algebra ,Nonlinear Dynamics ,FOS: Biological sciences ,Quantitative Biology - Neurons and Cognition ,Linear Models ,lcsh:Q ,Electronics ,Eigenvectors ,030217 neurology & neurosurgery ,Mathematics ,Neuroscience - Abstract
In vision science, cascades of Linear+Nonlinear transforms are very successful in modeling a number of perceptual experiences. However, the conventional literature is usually too focused on only describing the forward input-output transform. Instead, in this work we present the mathematics of such cascades beyond the forward transform, namely the Jacobian matrices and the inverse. The fundamental reason for this analytical treatment is that it offers useful analytical insight into the psychophysics, the physiology, and the function of the visual system. For instance, we show how the trends of the sensitivity (volume of the discrimination regions) and the adaptation of the receptive fields can be identified in the expression of the Jacobian w.r.t. the stimulus. This matrix also tells us which regions of the stimulus space are encoded more efficiently in multi-information terms. The Jacobian w.r.t. the parameters shows which aspects of the model have bigger impact in the response, and hence their relative relevance. The analytic inverse implies conditions for the response and model parameters to ensure appropriate decoding. From the experimental and applied perspective, (a) the Jacobian w.r.t. the stimulus is necessary in new experimental methods based on the synthesis of visual stimuli with interesting geometrical properties, (b) the Jacobian matrices w.r.t. the parameters are convenient to learn the model from classical experiments or alternative goal optimization, and (c) the inverse is a promising model-based alternative to blind machine-learning methods for neural decoding that do not include meaningful biological information. The theory is checked by building and testing a vision model that actually follows a modular Linear+Nonlinear program. Our illustrative derivable and invertible model consists of a cascade of modules that account for brightness, contrast, energy masking, and wavelet masking. To stress the generality of this modular setting we show examples where some of the canonical Divisive Normalization modules are substituted by equivalent modules such as the Wilson-Cowan interaction model (at the V1 cortex) or a tone-mapping model (at the retina)., This work was partially funded by the Spanish Ministerio de Economia y Competitividad projects CICYT TEC2013-50520-EXP and CICYT BFU2014-59776-R, by the European Research Council, Starting Grant ref. 306337, by the Spanish government and FEDER Fund, grant ref. TIN2015-71537-P(MINECO/FEDER,UE), 1021, and by the ICREA Academia Award.
- Published
- 2017
36. Topographic Independent Component Analysis reveals random scrambling of orientation in visual space
- Author
-
Jesús Malo, Marina Martinez-Garcia, and Luis M. Martinez
- Subjects
0301 basic medicine ,Computer science ,Vision ,Visual space ,Statistics as Topic ,lcsh:Medicine ,Social Sciences ,Space (mathematics) ,Scrambling ,chemistry.chemical_compound ,0302 clinical medicine ,Cognition ,Learning and Memory ,Animal Cells ,Medicine and Health Sciences ,Psychology ,lcsh:Science ,media_common ,Visual Cortex ,Neurons ,Mammals ,Object Recognition ,Coding Mechanisms ,Brain Mapping ,Multidisciplinary ,Geography ,Orientation (computer vision) ,Visual field ,medicine.anatomical_structure ,Vertebrates ,Sensory Perception ,Cellular Types ,Anatomy ,Neuronal Tuning ,Research Article ,Cartography ,Primates ,media_common.quotation_subject ,Ocular Anatomy ,Retina ,03 medical and health sciences ,Topographic Maps ,Ocular System ,Memory ,Perception ,Orientation ,Neuronal tuning ,medicine ,Animals ,Humans ,Cortical surface ,Computational Neuroscience ,business.industry ,lcsh:R ,Organisms ,Cognitive Psychology ,Biology and Life Sciences ,Computational Biology ,Retinal ,Pattern recognition ,Cell Biology ,030104 developmental biology ,Visual cortex ,chemistry ,Retinotopy ,Cellular Neuroscience ,Amniotes ,Earth Sciences ,Cognitive Science ,lcsh:Q ,Artificial intelligence ,business ,030217 neurology & neurosurgery ,Neuroscience - Abstract
Neurons at primary visual cortex (V1) in humans and other species are edge filters organized in orientation maps. In these maps, neurons with similar orientation preference are clustered together in iso-orientation domains. These maps have two fundamental properties: (1) retinotopy, i.e. correspondence between displacements at the image space and displacements at the cortical surface, and (2) a trade-off between good coverage of the visual field with all orientations and continuity of iso-orientation domains in the cortical space. There is an active debate on the origin of these locally continuous maps. While most of the existing descriptions take purely geometric/mechanistic approaches which disregard the network function, a clear exception to this trend in the literature is the original approach of Hyvärinen and Hoyer based on infomax and Topographic Independent Component Analysis (TICA). Although TICA successfully addresses a number of other properties of V1 simple and complex cells, in this work we question the validity of the orientation maps obtained from TICA. We argue that the maps predicted by TICA can be analyzed in the retinal space, and when doing so, it is apparent that they lack the required continuity and retinotopy. Here we show that in the orientation maps reported in the TICA literature it is easy to find examples of violation of the continuity between similarly tuned mechanisms in the retinal space, which suggest a random scrambling incompatible with the maps in primates. The new experiments in the retinal space presented here confirm this guess: TICA basis vectors actually follow a random salt-and-pepper organization back in the image space. Therefore, the interesting clusters found in the TICA topology cannot be interpreted as the actual cortical orientation maps found in cats, primates or humans. In conclusion, Topographic ICA does not reproduce cortical orientation maps.
- Published
- 2017
37. The Role of Spatial Information in Disentangling the Irradiance–Reflectance–Transmittance Ambiguity
- Author
-
Jesús Malo and Sandra Jiménez
- Subjects
Mathematical optimization ,MODTRAN ,Radiance ,General Earth and Planetary Sciences ,Hyperspectral imaging ,Electrical and Electronic Engineering ,Inverse problem ,Curvature ,Heuristics ,Spatial analysis ,Algorithm ,Mathematics ,Free parameter - Abstract
In the satellite hyperspectral measures, the contributions of light, surface, and atmosphere are mixed. Applications need separate access to the sources. Conventional inversion techniques usually take a pixelwise spectral-only approach. However, recent improvements in retrieving surface and atmosphere characteristics use heuristic spatial smoothness constraints. In this paper, we theoretically justify such heuristics by analyzing the impact of spatial information on the uncertainty of the solution. The proposed analysis allows to assess in advance the uniqueness (or robustness) of the solution depending on the curvature of a likelihood surface. In situations where pixel-based approaches become unreliable, it turns out that the consideration of spatial information always makes the problem to be better conditioned. With the proposed analysis, this is easily understood since the curvature is consistent with the complexity of the sources measured in terms of the number of significant eigenvalues (or free parameters in the problem). In agreement with recent results in hyperspectral image coding, spatial correlations in the sources imply that the intrinsic complexity of the spatio-spectral representation of the signal is always lower than its spectral-only counterpart. According to this, the number of free parameters in the spatio-spectral inverse problem is smaller, so the spatio-spectral approaches are always better than spectral-only approaches. Experiments using ensembles of actual reflectance values and realistic MODTRAN irradiance and atmosphere radiance and transmittance values show that the proposed analysis successfully predicts the practical difficulty of the problem and the improved quality of spatio-spectral retrieval.
- Published
- 2014
38. The Brain’s Camera. Optimal Algorithms for Wiring the Eye to the Brain Shape How We See
- Author
-
Friedrich T. Sommer, Luis M. Martinez, Jesús Malo, A. J. Valiño-Perez, Marina Martinez-Garcia, S. Sala, Manuel Molano-Mazón, and Judith A. Hirsch
- Subjects
Retina ,Computer science ,business.industry ,Function (mathematics) ,Lateral geniculate nucleus ,Retinal ganglion ,medicine.anatomical_structure ,Retinal ganglion cell ,Receptive field ,Cortex (anatomy) ,Digital image processing ,medicine ,Computer vision ,Artificial intelligence ,business ,Algorithm - Abstract
The problem of sending information at long distances, without significant attenuation and at a low cost, is common to both artificial and natural environments. In the brain, a widespread strategy to solve the cost-efficiency trade off in long distance communication is the presence of convergent pathways, or bottlenecks. In the visual system, for example, to preserve resolution, information is acquired by a first layer with a large number of neurons (the photoreceptors in the retina) and then compressed into a much smaller number of units in the output layer (the retinal ganglion cells), to send that information to the brain at the lowest possible metabolic cost. Recently, we found experimental evidence for an optimal compression-decompression algorithm in the early visual pathway that reproduces the strategies used in digital image processing. Our results bear strong consequences for our current understanding of the development and function of the visual thalamus and cortex.
- Published
- 2016
39. Nonlinearities and Adaptation of Color Vision from Sequential Principal Curves Analysis
- Author
-
Gustavo Camps-Valls, Valero Laparra, Sandra Jiménez, and Jesús Malo
- Subjects
FOS: Computer and information sciences ,Color vision ,Computer science ,Cognitive Neuroscience ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Standard illuminant ,Machine Learning (stat.ML) ,Models, Biological ,Arts and Humanities (miscellaneous) ,Statistics - Machine Learning ,Psychophysics ,Humans ,Learning ,Computer Simulation ,Chromatic scale ,Parametric statistics ,Principal Component Analysis ,Color Vision ,Nonlinear dimensionality reduction ,Adaptation, Physiological ,Nonlinear system ,Nonlinear Dynamics ,FOS: Biological sciences ,Quantitative Biology - Neurons and Cognition ,Metric (mathematics) ,A priori and a posteriori ,Neurons and Cognition (q-bio.NC) ,Algorithm ,Color Perception ,Photic Stimulation - Abstract
Mechanisms of human color vision are characterized by two phenomenological aspects: the system is nonlinear and adaptive to changing environments. Conventional attempts to derive these features from statistics use separate arguments for each aspect. The few statistical explanations that do consider both phenomena simultaneously follow parametric formulations based on empirical models. Therefore, it may be argued that the behavior does not come directly from the color statistics but from the convenient functional form adopted. In addition, many times the whole statistical analysis is based on simplified databases that disregard relevant physical effects in the input signal, as, for instance, by assuming flat Lambertian surfaces. In this work, we address the simultaneous statistical explanation of the nonlinear behavior of achromatic and chromatic mechanisms in a fixed adaptation state and the change of such behavior (i.e., adaptation) under the change of observation conditions. Both phenomena emerge directly from the samples through a single data-driven method: the sequential principal curves analysis (SPCA) with local metric. SPCA is a new manifold learning technique to derive a set of sensors adapted to the manifold using different optimality criteria. Here sequential refers to the fact that sensors (curvilinear dimensions) are designed one after the other, and not to the particular (eventually iterative) method to draw a single principal curve. Moreover, in order to reproduce the empirical adaptation reported under D65 and A illuminations, a new database of colorimetrically calibrated images of natural objects under these illuminants was gathered, thus overcoming the limitations of available databases. The results obtained by applying SPCA show that the psychophysical behavior on color discrimination thresholds, discount of the illuminant, and corresponding pairs in asymmetric color matching emerge directly from realistic data regularities, assuming no a priori functional form. These results provide stronger evidence for the hypothesis of a statistically driven organization of color sensors. Moreover, the obtained results suggest that the nonuniform resolution of color sensors at this low abstraction level may be guided by an error-minimization strategy rather than by an information-maximization goal.
- Published
- 2016
40. Dimensionality Reduction via Regression in Hyperspectral Imagery
- Author
-
Valero Laparra, Jesús Malo, and Gustau Camps-Valls
- Subjects
FOS: Computer and information sciences ,business.industry ,Dimensionality reduction ,Computer Vision and Pattern Recognition (cs.CV) ,Feature extraction ,Nonlinear dimensionality reduction ,Diffusion map ,Computer Science - Computer Vision and Pattern Recognition ,Pattern recognition ,Machine Learning (stat.ML) ,Collinearity ,Reduction (complexity) ,Statistics - Machine Learning ,Signal Processing ,Principal component analysis ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,Mathematics ,Curse of dimensionality - Abstract
This paper introduces a new unsupervised method for dimensionality reduction via regression (DRR). The algorithm belongs to the family of invertible transforms that generalize Principal Component Analysis (PCA) by using curvilinear instead of linear features. DRR identifies the nonlinear features through multivariate regression to ensure the reduction in redundancy between he PCA coefficients, the reduction of the variance of the scores, and the reduction in the reconstruction error. More importantly, unlike other nonlinear dimensionality reduction methods, the invertibility, volume-preservation, and straightforward out-of-sample extension, makes DRR interpretable and easy to apply. The properties of DRR enable learning a more broader class of data manifolds than the recently proposed Non-linear Principal Components Analysis (NLPCA) and Principal Polynomial Analysis (PPA). We illustrate the performance of the representation in reducing the dimensionality of remote sensing data. In particular, we tackle two common problems: processing very high dimensional spectral information such as in hyperspectral image sounding data, and dealing with spatial-spectral image patches of multispectral images. Both settings pose collinearity and ill-determination problems. Evaluation of the expressive power of the features is assessed in terms of truncation error, estimating atmospheric variables, and surface land cover classification error. Results show that DRR outperforms linear PCA and recently proposed invertible extensions based on neural networks (NLPCA) and univariate regressions (PPA)., Comment: 12 pages, 6 figures, 62 references
- Published
- 2016
- Full Text
- View/download PDF
41. A Color Contrast Definition for Perceptually Based Color Image Coding
- Author
-
M. J. Luque, Gustau Camps-Valls, Juan Manuel Gutiérrez, and Jesús Malo
- Subjects
Color histogram ,Color image ,Color normalization ,Computer science ,business.industry ,Color balance ,General Medicine ,Color space ,Color model ,Color depth ,Computer vision ,Artificial intelligence ,business ,Histogram equalization - Published
- 2012
42. Psychophysically Tuned Divisive Normalization Approximately Factorizes the PDF of Natural Images
- Author
-
Valero Laparra and Jesús Malo
- Subjects
Neurons ,Computational neuroscience ,business.industry ,Cognitive Neuroscience ,media_common.quotation_subject ,Models, Neurological ,Normalization (image processing) ,Pattern recognition ,Mutual information ,Information theory ,Machine learning ,computer.software_genre ,Visual processing ,Models of neural computation ,Arts and Humanities (miscellaneous) ,Perception ,Visual Perception ,Artificial intelligence ,Efficient coding hypothesis ,business ,computer ,Visual Cortex ,media_common ,Mathematics - Abstract
The conventional approach in computational neuroscience in favor of the efficient coding hypothesis goes from image statistics to perception. It has been argued that the behavior of the early stages of biological visual processing (e.g., spatial frequency analyzers and their nonlinearities) may be obtained from image samples and the efficient coding hypothesis using no psychophysical or physiological information. In this work we address the same issue in the opposite direction: from perception to image statistics. We show that psychophysically fitted image representation in V1 has appealing statistical properties, for example, approximate PDF factorization and substantial mutual information reduction, even though no statistical information is used to fit the V1 model. These results are complementary evidence in favor of the efficient coding hypothesis.
- Published
- 2010
43. Visual aftereffects and sensory nonlinearities from a single statistical framework
- Author
-
Valero Laparra and Jesús Malo
- Subjects
Normalization (statistics) ,texture aftereffect ,Computer science ,adaptation ,unsupervised learning ,scene statistics ,lcsh:RC321-571 ,Behavioral Neuroscience ,lcsh:Neurosciences. Biological psychiatry. Neuropsychiatry ,Decorrelation ,Biological Psychiatry ,color aftereffect ,Parametric statistics ,Original Research ,Curves analysis ,business.industry ,Optical illusion ,Nonparametric statistics ,Scene statistics ,Maximization ,sequential principal curves analysis ,Psychiatry and Mental health ,Neuropsychology and Physiological Psychology ,Neurology ,A priori and a posteriori ,Artificial intelligence ,business ,Algorithm ,Neuroscience ,motion aftereffect - Abstract
When adapted to a particular scenery our senses may fool us: colors are misinterpreted, certain spatial patterns seem to fade out, and static objects appear to move in reverse. A mere empirical description of the mechanisms tuned to color, texture, and motion may tell us where these visual illusions come from. However, such empirical models of gain control do not explain why these mechanisms work in this apparently dysfunctional manner. Current normative explanations of aftereffects based on scene statistics derive gain changes by (1) invoking decorrelation and linear manifold matching/equalization, or (2) using nonlinear divisive normalization obtained from parametric scene models. These principled approaches have different drawbacks: the first is not compatible with the known saturation nonlinearities in the sensors and it cannot fully accomplish information maximization due to its linear nature. In the second, gain change is almost determined a priori by the assumed parametric image model linked to divisive normalization. In this study we show that both the response changes that lead to aftereffects and the nonlinear behavior can be simultaneously derived from a single statistical framework: the Sequential Principal Curves Analysis (SPCA). As opposed to mechanistic models, SPCA is not intended to describe how physiological sensors work, but it is focused on explaining why they behave as they do. Nonparametric SPCA has two key advantages as a normative model of adaptation: (i) it is better than linear techniques as it is a flexible equalization that can be tuned for more sensible criteria other than plain decorrelation (either full information maximization or error minimization); and (ii) it makes no a priori functional assumption regarding the nonlinearity, so the saturations emerge directly from the scene data and the goal (and not from the assumed function). It turns out that the optimal responses derived from these more sensible criteria and SPCA are consistent with dysfunctional behaviors such as aftereffects.
- Published
- 2015
44. V1 non-linear properties emerge from local-to-global non-linear ICA
- Author
-
Juan Manuel Gutiérrez and Jesús Malo
- Subjects
Masking (art) ,business.industry ,Models, Neurological ,Neuroscience (miscellaneous) ,Independent component analysis ,Nonlinear system ,Wavelet ,Nonlinear Dynamics ,Receptive field ,Encoding (memory) ,Visual Perception ,Humans ,Automatic gain control ,Neural Networks, Computer ,Artificial intelligence ,Visual Fields ,business ,Algorithm ,Photic Stimulation ,Energy (signal processing) ,Visual Cortex ,Mathematics - Abstract
It has been argued that the aim of non-linearities in different visual and auditory mechanisms may be to remove the relations between the coefficients of the signal after global linear ICA-like stages. Specifically, in Schwartz and Simoncelli (2001), it was shown that masking effects are reproduced by fitting the parameters of a particular non-linearity in order to remove the dependencies between the energy of wavelet coefficients. In this work, we present a different result that supports the same efficient encoding hypothesis. However, this result is more general because, instead of assuming any specific functional form for the non-linearity, we show that by using an unconstrained approach, masking-like behavior emerges directly from natural images. This result is an additional indication that Barlow's efficient encoding hypothesis may explain not only the shape of receptive fields of V1 sensors but also their non-linear behavior.
- Published
- 2006
45. Linear transform for simultaneous diagonalization of covariance and perceptual metric matrix in image coding
- Author
-
Irene Epifanio, Jesús Malo, and Jaime Gutierrez
- Subjects
Covariance matrix ,business.industry ,Pattern recognition ,Covariance ,Weighting ,Matrix (mathematics) ,Redundancy (information theory) ,Artificial Intelligence ,Signal Processing ,Discrete cosine transform ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Software ,Transform coding ,Mathematics ,Image compression - Abstract
Two types ofredundancies are contained in images: statistical redundancy and psychovisual redundancy. Image representation techniques for image coding should remove both redundancies in order to obtain good results. In order to establish an appropriate representation, the standard approach to transform coding only considers the statistical redundancy, whereas the psychovisual factors are introduced after the selection ofthe representation as a simple scalar weighting in the transform domain. In this work, we take into account the psychovisual factors in the de8nition of the representation together with the statistical factors, by means of the perceptual metric and the covariance matrix, respectively. In general the ellipsoids described by these matrices are not aligned. Therefore, the optimal basis for image representation should simultaneously diagonalize both matrices. This approach to the basis selection problem has several advantages in the particular application ofimage coding. As the transform domain is Euclidean (by de8nition), the quantizer design is highly simpli8ed and at the same time, the use ofscalar quantizers is truly justi8ed. The proposed representation is compared to covariance-based representations such as the DCT and the KLT or PCA using standard JPEG-like and Max-Lloyd quantizers. ? 2003 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
- Published
- 2003
46. The Wilson-Cowan model describes Contrast Response and Subjective Distortion
- Author
-
Praveen Cyriac, Marcelo Bertalmío, Jesús Malo, Thomas Batard, and Marina Martinez-Garcia
- Subjects
Computer science ,media_common.quotation_subject ,05 social sciences ,050105 experimental psychology ,Sensory Systems ,Wilson–Cowan model ,03 medical and health sciences ,Ophthalmology ,0302 clinical medicine ,Quantum mechanics ,Distortion ,Contrast (vision) ,0501 psychology and cognitive sciences ,030217 neurology & neurosurgery ,media_common - Published
- 2017
47. Lossless coding of hyperspectral images with principal polynomial analysis
- Author
-
Valero Laparra, Naoufal Amrani, Gustau Camps-Valls, Jesús Malo, and Joan Serra-Sagrista
- Subjects
Lossless compression ,Data compaction ,business.industry ,Rounding ,Gaussian ,Dimensionality reduction ,Hyperspectral imaging ,Pattern recognition ,symbols.namesake ,Principal component analysis ,symbols ,Entropy (information theory) ,Artificial intelligence ,business ,Mathematics - Abstract
The transform in image coding aims to remove redundancy among data coefficients so that they can be independently coded, and to capture most of the image information in few coefficients. While the second goal ensures that discarding coefficients will not lead to large errors, the first goal ensures that simple (point-wise) coding schemes can be applied to the retained coefficients with optimal results. Principal Component Analysis (PCA) provides the best independence and data compaction for Gaussian sources. Yet, non-linear generalizations of PCA may provide better performance for more realistic non-Gaussian sources. Principal Polynomial Analysis (PPA) generalizes PCA by removing the non-linear relations among components using regression, and was analytically proved to perform better than PCA in dimensionality reduction. We explore here the suitability of reversible PPA for lossless compression of hyperspectral images. We found that reversible PPA performs worse than PCA due to the high impact of the rounding operation errors and to the amount of side information. We then propose two generalizations: Backwards PPA, where polynomial estimations are performed in reverse order, and Double-Sided PPA, where more than a single dimension is used in the predictions. Both yield better coding performance than canonical PPA and are comparable to PCA.
- Published
- 2014
48. Dimensionality reduction via regression on hyperspectral infrared sounding data
- Author
-
Valero Laparra, Gustavo Camps-Valls, and Jesús Malo
- Subjects
Clustering high-dimensional data ,Redundancy (information theory) ,business.industry ,Dimensionality reduction ,Principal component analysis ,Feature extraction ,Nonlinear dimensionality reduction ,Hyperspectral imaging ,Pattern recognition ,Artificial intelligence ,business ,Mathematics ,Curse of dimensionality - Abstract
This paper introduces a new method for dimensionality reduction via regression (DRR). The method generalizes Principal Component Analysis (PCA) in such a way that reduces the variance of the PCA scores. In order to do so, DRR relies on a deflationary process in which a non-linear regression reduces the redundancy between the PC scores. Unlike other nonlinear dimensionality reduction methods, DRR is easy to apply, it has out-of-sample extension, it is invertible, and the learned transformation is volume-preserving. These properties make the method useful for a wide range of applications, especially in very high dimensional data in general, and for hyperspectral image processing in particular. We illustrate the performance of the algorithm in reducing the dimensionality of IASI hyperspectral image sounding data. We compare DRR with related and invertible methods such as linear PCA and Principal Polynomial Analysis (PPA) in terms of reconstruction error, and expressive power of the extracted features to estimate atmospheric variables.
- Published
- 2014
49. The role of perceptual contrast non-linearities in image transform quantization
- Author
-
J. Soret, Jesús Malo, Francesc J. Ferri, José M. Artigas, and Jesús V. Albert
- Subjects
Image coding ,Training set ,business.industry ,Quantization (signal processing) ,media_common.quotation_subject ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Perception ,Signal Processing ,Computer vision ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Perceptual Distortion ,Minification ,business ,Algorithm ,Mathematics ,media_common - Abstract
The conventional quantizer design based on average error minimization over a training set does not guarantee a good subjective behavior on individual images even if perceptual metrics are used. In this work a novel criterion for transform coder design is analyzed in depth. Its aim is to bound the perceptual distortion in each individual quantization according to a non-linear model of early human vision. A common comparison framework is presented to describe the qualitative behavior of the optimal quantizers under the proposed criterion and the conventional rate-distortion based criterion. Several underlying metrics, with and without perceptual non-linearities, are used with both criteria. Analytical results show that the proposed design criterion gives rise to a JPEG-like quantization if a simple linear metric is used. Experimental results show that significant improvements over the perceptually weighted rate-distortion approach are obtained if a more meaningful non-linear metric is used.
- Published
- 2000
50. Image quality metric based on multidimensional contrast perception models
- Author
-
Jesús Malo, J.M. Artigas, Pascual Capilla, and Álvaro Pons
- Subjects
Computer science ,Image quality ,business.industry ,media_common.quotation_subject ,Pattern recognition ,Human-Computer Interaction ,Hardware and Architecture ,Feature (computer vision) ,Distortion ,Human visual system model ,Metric (mathematics) ,Contrast (vision) ,Computer vision ,Artificial intelligence ,Electrical and Electronic Engineering ,Set (psychology) ,business ,Representation (mathematics) ,media_common - Abstract
The procedure to compute the subjective difference between two input images should be equivalent to a straightforward difference between their perceived versions, hence reliable subjective difference metrics must be founded on a proper perception model. For image distortion evaluation purposes, perception can be considered as a set of signal transforms that successively map the original image in the spatial domain into a feature and a response space. The properties of the spatial pattern analyzers involved in these transforms determine the geometry of these different signal representation domains. In this work the general relations between the sensitivity of the human visual system and the perceptual geometry of the different representation spaces are presented. This general formalism is particularized through a novel physiological model of response summation of cortical cells that reproduce the psychophysical data of contrast incremental thresholds. In this way, a procedure to compute subjective distances between images in any representation domain is obtained. The reliability of the proposed scheme is tested in two different contexts. On the one hand, it reproduces the results of suprathreshold contrast matching experiences and subjective contrast scales (Georgeson and Shackleton, Vision Res. 34 (1994) 1061–1075; Swanson et al., Vision Res. 24 (1985) 63–75; Cannon, Vision Res. 19 (1979) 1045–1052; Biondini and Mattiello, Vision Res. 25 (1985) 1–9), and on the other hand, it provides a theoretical background that generalizes our previous perceptual difference model (Malo et al., Im. Vis. Comp. 15 (1997) 535–548) whose outputs are linearly related to experimental subjective assessment of distortion.
- Published
- 1999
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.