Author: "Assif, Liav" / Publication Type: Electronic Resources - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Assif, Liav"' showing total 5 results

Start Over Author "Assif, Liav" Publication Type Electronic Resources

5 results on '"Assif, Liav"'

1. A model for full local image interpretation

Author: Ben-Yosef, Guy, Assif, Liav, Harari, Daniel, Ullman, Shimon, Ben-Yosef, Guy, Assif, Liav, Harari, Daniel, and Ullman, Shimon
Abstract: We describe a computational model of humans' ability to provide a detailed interpretation of components in a scene. Humans can identify in an image meaningful components almost everywhere, and identifying these components is an essential part of the visual process, and of understanding the surrounding scene and its potential meaning to the viewer. Detailed interpretation is beyond the scope of current models of visual recognition. Our model suggests that this is a fundamental limitation, related to the fact that existing models rely on feed-forward but limited top-down processing. In our model, a first recognition stage leads to the initial activation of class candidates, which is incomplete and with limited accuracy. This stage then triggers the application of class-specific interpretation and validation processes, which recover richer and more accurate interpretation of the visible scene. We discuss implications of the model for visual interpretation by humans and by computer vision models., Comment: Published in the Proceedings of the 37th Annual Meeting of the Cognitive Science Society (CogSci), 2015
Published: 2021

2. Image interpretation by iterative bottom-up top-down processing

Author: Ullman, Shimon, Assif, Liav, Strugatski, Alona, Vatashsky, Ben-Zion, Levy, Hila, Netanyahu, Aviv, Yaari, Adam, Ullman, Shimon, Assif, Liav, Strugatski, Alona, Vatashsky, Ben-Zion, Levy, Hila, Netanyahu, Aviv, and Yaari, Adam
Abstract: Scene understanding requires the extraction and representation of scene components together with their properties and inter-relations. We describe a model in which meaningful scene structures are extracted from the image by an iterative process, combining bottom-up (BU) and top-down (TD) networks, interacting through a symmetric bi-directional communication between them (counter-streams structure). The model constructs a scene representation by the iterative use of three components. The first model component is a BU stream that extracts selected scene elements, properties and relations. The second component (cognitive augmentation) augments the extracted visual representation based on relevant non-visual stored representations. It also provides input to the third component, the TD stream, in the form of a TD instruction, instructing the model what task to perform next. The TD stream then guides the BU visual stream to perform the selected task in the next cycle. During this process, the visual representations extracted from the image can be combined with relevant non-visual representations, so that the final scene representation is based on both visual information extracted from the scene and relevant stored knowledge of the world. We describe how a sequence of TD-instructions is used to extract from the scene structures of interest, including an algorithm to automatically select the next TD-instruction in the sequence. The extraction process is shown to have favorable properties in terms of combinatorial generalization, generalizing well to novel scene structures and new combinations of objects, properties and relations not seen during training. Finally, we compare the model with relevant aspects of the human vision, and suggest directions for using the BU-TD scheme for integrating visual and cognitive components in the process of scene understanding.
Published: 2021

3. Structured learning and detailed interpretation of minimal object images

Author: Ben-Yosef, Guy, Assif, Liav, Ullman, Shimon, Ben-Yosef, Guy, Assif, Liav, and Ullman, Shimon
Abstract: We model the process of human full interpretation of object images, namely the ability to identify and localize all semantic features and parts that are recognized by human observers. The task is approached by dividing the interpretation of the complete object to the interpretation of multiple reduced but interpretable local regions. We model interpretation by a structured learning framework, in which there are primitive components and relations that play a useful role in local interpretation by humans. To identify useful components and relations used in the interpretation process, we consider the interpretation of minimal configurations, namely reduced local regions that are minimal in the sense that further reduction will turn them unrecognizable and uninterpretable. We show experimental results of our model, and results of predicting and testing relations that were useful to the model via transformed minimal images., Comment: Accepted to Workshop on Mutual Benefits of Cognitive and Computer Vision, at the International Conference on Computer Vision. Venice, Italy, 2017
Published: 2017

4. Atoms of recognition in human and computer vision

Author: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, McGovern Institute for Brain Research at MIT, Ullman, Shimon, Harari, Daniel, Assif, Liav, Fetaya, Ethan, Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, McGovern Institute for Brain Research at MIT, Ullman, Shimon, Harari, Daniel, Assif, Liav, and Fetaya, Ethan
Abstract: Discovering the visual features and representations used by the brain to recognize objects is a central problem in the study of vision. Recently, neural network models of visual object recognition, including biological and deep network models, have shown remarkable progress and have begun to rival human performance in some challenging tasks. These models are trained on image examples and learn to extract features and representations and to use them for categorization. It remains unclear, however, whether the representations and learning processes discovered by current models are similar to those used by the human visual system. Here we show, by introducing and using minimal recognizable images, that the human visual system uses features and processes that are not used by current models and that are critical for recognition. We found by psychophysical studies that at the level of minimal recognizable images a minute change in the image can have a drastic effect on recognition, thus identifying features that are critical for the task. Simulations then showed that current models cannot explain this sensitivity to precise feature configurations and, more generally, do not learn to recognize minimal images at a human level. The role of the features shown here is revealed uniquely at the minimal level, where the contribution of each feature is essential. A full understanding of the learning and use of such features will extend our understanding of visual recognition and its cortical mechanisms and will enhance the capacity of computational models to learn from visual experience and to deal with recognition and detailed image interpretation., European Research Council (Advanced Grant “Digital Baby”), National Science Foundation (U.S.) (STC Center for Brains, Minds and Machines Award CCF-1231216)
Published: 2017

5. A model for full local image interpretation

Author: Ben-Yosef, Guy, Ben-Yosef, Guy, Assif, Liav, Harari, Daniel, Ullman, Shimon, Ben-Yosef, Guy, Ben-Yosef, Guy, Assif, Liav, Harari, Daniel, and Ullman, Shimon
Abstract: We describe a computational model of humans' ability to provide a detailed interpretation of a scene‚Äôs components. Humans can identify in an image meaningful components almost everywhere, and identifying these components is an essential part of the visual process, and of understanding the surrounding scene and its potential meaning to the viewer. Detailed interpretation is beyond the scope of current models of visual recognition. Our model suggests that this is a fundamental limitation, related to the fact that existing models rely on feed-forward but limited top-down processing. In our model, a first recognition stage leads to the initial activation of class candidates, which is incomplete and with limited accuracy. This stage then triggers the application of class-specific interpretation and validation processes, which recover richer and more accurate interpretation of the visible scene. We discuss implications of the model for visual interpretation by humans and by computer vision models
Published: 2015

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

5 results on '"Assif, Liav"'

1. A model for full local image interpretation

2. Image interpretation by iterative bottom-up top-down processing

3. Structured learning and detailed interpretation of minimal object images

4. Atoms of recognition in human and computer vision

5. A model for full local image interpretation

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Publication Year Range

Publication Type

Database

Publisher

5 results on '"Assif, Liav"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources