10 results on '"attribute recognition"'
Search Results
2. Pedestrian attribute recognition using trainable Gabor wavelets
- Author
-
Junejo, Imran N., Ahmed, Naveed, and Lataifeh, Mohammad
- Published
- 2021
- Full Text
- View/download PDF
3. Few-shot object detection and attribute recognition from construction site images for improved field compliance.
- Author
-
Wang, Xiyu and El-Gohary, Nora
- Subjects
- *
OBJECT recognition (Computer vision) , *BUILDING sites , *COMPUTER vision , *HAZARDS , *DEEP learning - Abstract
Computer vision techniques can be used to detect site objects for identifying noncompliances that could lead to safety incidents. However, existing methods are limited in covering diverse hazard scenarios, detecting site objects with imbalanced distributions, and recognizing their intricate attributes to describe their conditions and functionality. To address these gaps, this paper proposes a deep learning-based method for identifying multiple fall-related objects and their associated attributes. The proposed method consists of three submethods: (1) a method for developing relevant datasets by retrieving images from open resources; (2) a method for few-shot object detection, which deals with imbalanced distributions; and (3) a method for attribute recognition to add semantic descriptions to the detected objects. The proposed method achieved an average precision and recall of 88.2% and 79.5% for few-shot object detection and 94.8% and 95.7% for attribute recognition, respectively, which indicates good performance. • Deep learning-based method to detect information for field compliance checking. • Conducts few-shot object detection for objects with imbalanced distribution. • Conducts attribute recognition to add descriptive condition information to objects. • Achieved good performance in extracting site visual information. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Dual Intercommunication Network: Enabling Interhemispheric Communications in Hemisphere-Inspired ANNs
- Author
-
Yanze Wu
- Subjects
Attribute recognition ,computer vision ,deep learning ,multi-task learning ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
The human brain has been a main source of inspiration for designing deep learning models. Recently, inspired by the specialized functions of two cerebral hemispheres in processing low and high spatial frequency information, some dual-path neural networks with global and local branches have been proposed to deal with both coarse- and fine-grained visual tasks simultaneously. However, in existing works, the interhemispheric communication mechanism, which is responded by the corpus callosum, the largest white matter structure in the human brain that connecting the left and right cerebral hemispheres, is still not fully explored and exploited. This paper aims to explore how the corpus callosum can inspire us to enable transfer and integration of information between global and local branches in hemisphere-inspired artificial neural networks, such that one branch can leverage the other's learned knowledge and benefit each other. To this end, we propose a gated intercommunication unit to selectively transfer useful knowledge between the two branches via attention mechanisms to alleviate the negative transfer. Experiments on sb-MNIST and two pedestrian attribute datasets show that the proposed method outperforms the compared ones in most cases.
- Published
- 2020
- Full Text
- View/download PDF
5. Person retrieval in surveillance using textual query: a review.
- Author
-
Galiyawala, Hiren and Raval, Mehul S.
- Subjects
VIDEO surveillance ,NATURAL language processing ,COMPUTER vision - Abstract
Recent advancement of research in biometrics, computer vision, and natural language processing has discovered opportunities for person retrieval from surveillance videos using textual query. The prime objective of a surveillance system is to locate a person using a description, e.g., a short woman with a pink t-shirt and white skirt carrying a black purse. She has brown hair. Such a description contains attributes like gender, height, type of clothing, colour of clothing, hair colour, and accessories. Such attributes are formally known as soft biometrics. They help bridge the semantic gap between a human description and a machine as a textual query contains the person's soft biometric attributes. It is also not feasible to manually search through huge volumes of surveillance footage to retrieve a specific person. Hence, automatic person retrieval using vision and language-based algorithms is becoming popular. In comparison to other state-of-the-art reviews, the contribution of the paper is as follows: 1. Recommends most discriminative soft biometrics for specific challenging conditions. 2. Integrates benchmark datasets and retrieval methods for objective performance evaluation. 3. A complete snapshot of techniques based on features, classifiers, number of soft biometric attributes, type of the deep neural networks, and performance measures. 4. The comprehensive coverage of person retrieval from handcrafted features based methods to end-to-end approaches based on natural language description. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
6. Exploiting interaction of fine and coarse features and attribute co-occurrence for person attribute recognition.
- Author
-
Sun, Zhiyong, Ye, Junyong, Wang, Tongqing, Jiang, Li, and Li, Yang
- Subjects
CONVOLUTIONAL neural networks ,COMPUTER vision ,SIGNAL convolution ,IMAGE representation ,VISUAL fields ,PREDICTION theory - Abstract
Person attribute recognition, i.e., the prediction of a fixed set of semantic attributes given an image of a person, becomes an important topic in the field of computer vision. Recently, methods based on convolutional neural networks have shown outstanding performance in this area. They usually employ a CNN network to mine the shared feature representation followed by several layers for attribute classification. To improve the representation ability of the model, many methods element-add or concatenate coarse and fine feature maps to fuse information at different feature levels. However, these methods didn't fully exploit the interaction of multi-level convolutional feature maps for person attribute analysis and not consider the correlation of attributes for the same person. In this paper, we introduce a kind of correlation feature, which exploits the high order interaction of coarse and fine feature maps to capture the robust feature representation from multi-level convolution layers as the image representation for person attribute recognition. Moreover, we propose an intraperson attribute loss to explicitly model the correlation of attributes for the same person. We experiment our proposed model on CIFAR-10 dataset, Berkeley Human Attributes dataset, PA-100 K dataset, and experimental results show the better performance of the feature representation and the effectiveness of intra-person attribute loss. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
7. Low-level features for visual attribute recognition: An evaluation.
- Author
-
Danaci, Emine Gul and Ikizler-Cinbis, Nazli
- Subjects
- *
PATTERN recognition systems , *COMPUTER vision , *OBJECT recognition (Computer vision) , *LEARNING , *HISTOGRAMS - Abstract
In recent years, visual attributes, which are mid-level representations that describe human-understandable aspects of objects and scenes, have become a popular topic of computer vision research. Visual attributes are being used in various tasks, including object recognition, people search, scene recognition, and many more. A critical step in attribute recognition is the extraction of low-level features, which encodes the local visual characteristics in images, and provides the representation used in the attribute prediction step. In this work, we explore the effects of utilizing different low-level features on learning visual attributes. In particular, we analyze the performance of various shape, color, texture and deep neural network features. Experiments have been carried out on four different datasets, which have been collected for different visual recognition tasks and extensive evaluations have been reported. Our results show that, while the supervised deep features are effective, using them in combination with low-level features can lead to significant improvements in attribute recognition performance. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
8. Attribute Grammar for Joint Parsing of Human Attribute, Part and Pose
- Author
-
Park, Seyoung
- Subjects
Computer science ,Attribute recognition ,Computer vision ,Pose estimation - Abstract
Robust pose estimation and attribute classification are of particular interest and important tasks to the computer vision community, and they are frequently used as the intermediate representations for other high-level tasks, such as human identification, visual search and human tracking. In this dissertation, we present a unified framework for joint inferring human body pose and human attribute in a parse graph with attributes augmented to nodes in the hierarchical representation. Particularly, unlike previous existing approaches mostly train models for the two tasks separately and combine the inference sequentially, we build a unified framework by integrating the three traditional grammar formulations in an And-Or graph representation: (i) Phrase structure grammar representing the hierarchical decomposition of the human body; (ii) Dependency grammar modeling the geometric articulation; and (iii) Attribute grammar accounting for the compatibility relations between different parts in the hierarchy. Furthermore, we propose extension of our model to integrate the deep learned features efficiently to provide the better performance with richer and deeper appearance representation. We also propose a technique to handle large variation of appearance and geometry. Particularly, unlike previous approaches define the parts by drawing square bounding box around keypoints or annotating precise bounding box for parts, our approach defines parts through the separate part proposal process. Finally, we demonstrate the effectiveness of both of joint modeling and integrating deep models by showing state-of-the-art performance on several recent public benchmark datasets. We also collect our own dataset and compare our approach with existing methods.
- Published
- 2016
9. Can adversarial networks hallucinate occluded people with a plausible aspect?
- Author
-
Federico Fulgeri, Stefano Alletto, Rita Cucchiara, Matteo Fabbri, and Simone Calderara
- Subjects
FOS: Computer and information sciences ,Attribute recognition ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,02 engineering and technology ,Image (mathematics) ,Computer graphics ,Occlusions ,Discriminative model ,GAN ,0202 electrical engineering, electronic engineering, information engineering ,Computer vision ,Video game ,Pixel ,Artificial neural network ,business.industry ,Deep learning ,020207 software engineering ,Hallucinating ,Signal Processing ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Software - Abstract
When you see a person in a crowd, occluded by other persons, you miss visual information that can be used to recognize, re-identify or simply classify him or her. You can imagine its appearance given your experience, nothing more. Similarly, AI solutions can try to hallucinate missing information with specific deep learning architectures, suitably trained with people with and without occlusions. The goal of this work is to generate a complete image of a person, given an occluded version in input, that should be a) without occlusion b) similar at pixel level to a completely visible people shape c) capable to conserve similar visual attributes (e.g. male/female) of the original one. For the purpose, we propose a new approach by integrating the state-of-the-art of neural network architectures, namely U-nets and GANs, as well as discriminative attribute classification nets, with an architecture specifically designed to de-occlude people shapes. The network is trained to optimize a Loss function which could take into account the aforementioned objectives. As well we propose two datasets for testing our solution: the first one, occluded RAP, created automatically by occluding real shapes of the RAP dataset (which collects also attributes of the people aspect); the second is a large synthetic dataset, AiC, generated in computer graphics with data extracted from the GTA video game, that contains 3D data of occluded objects by construction. Results are impressive and outperform any other previous proposal. This result could be an initial step to many further researches to recognize people and their behavior in an open crowded world., Under review at CVIU
- Published
- 2019
10. A Case Study on Attribute Recognition of Heated Metal Mark Image Using Deep Convolutional Neural Networks.
- Author
-
Mao , Keming, Lu , Duo, E , Dazhi, and Tan , Zhenhua
- Subjects
- *
HEATING of metals , *ARTIFICIAL neural networks , *QUALITATIVE research , *COMPUTER vision , *MACHINE learning - Abstract
Heated metal mark is an important trace to identify the cause of fire. However, traditional methods mainly focus on the knowledge of physics and chemistry for qualitative analysis and make it still a challenging problem. This paper presents a case study on attribute recognition of the heated metal mark image using computer vision and machine learning technologies. The proposed work is composed of three parts. Material is first generated. According to national standards, actual needs and feasibility, seven attributes are selected for research. Data generation and organization are conducted, and a small size benchmark dataset is constructed. A recognition model is then implemented. Feature representation and classifier construction methods are introduced based on deep convolutional neural networks. Finally, the experimental evaluation is carried out. Multi-aspect testings are performed with various model structures, data augments, training modes, optimization methods and batch sizes. The influence of parameters, recognitio efficiency and execution time are also analyzed. The results show that with a fine-tuned model, the recognition rate of attributes metal type, heating mode, heating temperature, heating duration, cooling mode, placing duration and relative humidity are 0.925, 0.908, 0.835, 0.917, 0.928, 0.805 and 0.92, respectively. The proposed method recognizes the attribute of heated metal mark with preferable effect, and it can be used in practical application. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.