99 results on '"attribute recognition"'
Search Results
2. Few-shot object detection and attribute recognition from construction site images for improved field compliance
- Author
-
Wang, Xiyu and El-Gohary, Nora
- Published
- 2024
- Full Text
- View/download PDF
3. Pedestrian Attribute Recognition Using Hierarchical Transformers
- Author
-
Lohani, Lalit, Thakare, Kamalakar Vijay, Nayak, Kamakshya Prasad, Dogra, Debi Prosad, Choi, Heeseung, Jung, Hyungjoo, Kim, Ig-Jae, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Antonacopoulos, Apostolos, editor, Chaudhuri, Subhasis, editor, Chellappa, Rama, editor, Liu, Cheng-Lin, editor, Bhattacharya, Saumik, editor, and Pal, Umapada, editor
- Published
- 2025
- Full Text
- View/download PDF
4. Evaluating Attribute Comprehension in Large Vision-Language Models
- Author
-
Zhang, Haiwen, Yang, Zixi, Liu, Yuanzhi, Wang, Xinran, He, Zheqi, Liang, Kongming, Ma, Zhanyu, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lin, Zhouchen, editor, Cheng, Ming-Ming, editor, He, Ran, editor, Ubul, Kurban, editor, Silamu, Wushouer, editor, Zha, Hongbin, editor, Zhou, Jie, editor, and Liu, Cheng-Lin, editor
- Published
- 2025
- Full Text
- View/download PDF
5. Durability and Bearing Capacity Assessment of Existing Bridge Pile Foundations for Sustainable Highway Reconstruction.
- Author
-
Zhang, Jian, Huang, Linxuan, Gao, Qiang, Ding, Hejie, Ren, Zhe, Liu, Chuanxiao, Cheng, Guangtan, and Wu, Duohua
- Abstract
Conducting a reasonable assessment of whether the durability and bearing capacity of concrete used in existing bridge pile foundations can meet all requirements is an important prerequisite for ensuring the reuse of existing bridge pile foundations. With this in mind, this study evaluates the project of the Beijing–Taibei Highway in Shandong Province that must be restored and enlarged. Using static load and concrete durability testing methods, the bearing capacity characteristics and durability of seven existing bridge pile foundations were investigated. Using attribute identification theory, a systematic technique for determining the reuse value of existing piles is proposed. The test examination findings demonstrate that the change curve corresponding to the pile foundation's ultimate bearing capacity matches the concrete compressive strength curve. The pile foundations had lower ultimate bearing capacity than average. The concrete compressive strengths of the three test piles were 18%, 16.3%, and 17.5% lower than the average compressive strengths of the seven test piles. A comprehensive durability evaluation method for assessing existing piles can effectively evaluate the reuse value of existing piles and promote sustainable highway reconstruction and expansion projects. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Region-Aware Fashion Contrastive Learning for Unified Attribute Recognition and Composed Retrieval.
- Author
-
WANG Kangping and ZHAO Mingbo
- Subjects
CLOTHING industry ,LANGUAGE & languages ,IMAGE retrieval ,IMAGE segmentation ,ACCURACY - Abstract
Clothing attribute recognition has become an essential technology, which enables users to automatically identify the characteristics of clothes and search for clothing images with similar attributes. However, existing methods cannot recognize newly added attributes and may fail to capture region-level visual features. To address the aforementioned issues, a region-aware fashion contrastive language-image pre-training (RaF-CLIP) model was proposed. This model aligned cropped and segmented images with category and multiple fine-grained attribute texts, achieving the matching of fashion region and corresponding texts through contrastive learning. Clothing retrieval found suitable clothing based on the user-specified clothing categories and attributes, and to further improve the accuracy of retrieval, an attribute-guided composed network (AGCN) as an additional component on RaF-CLIP was introduced, specifically designed for composed image retrieval. This task aimed to modify the reference image based on textual expressions to retrieve the expected target. By adopting a transformer-based bidirectional attention and gating mechanism, it realized the fusion and selection of image features and attribute text features. Experimental results show that the proposed model achieves a mean precision of 0.663 3 for attribute recognition tasks and a recall@10 (recall@ k is defined as the percentage of correct samples appearing in the top k retrieval results) of 39.18 for composed image retrieval task, satisfying user needs for freely searching for clothing through images and texts. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Investigating Person Attribute Recognition in Challenging Environments
- Author
-
Oukid, Ilyes, Boulahia, Said Yacine, Amamra, Abdenour, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Djamaa, Badis, editor, Boudane, Abdelhamid, editor, Mazari Abdessameud, Oussama, editor, and Hosni, Adil Imad Eddine, editor
- Published
- 2024
- Full Text
- View/download PDF
8. Scene Recognition With Objectness, Attribute, and Category Learning
- Author
-
Li-Hui Zhao, Jean-Paul Ainam, Ji Zhang, and Wenai Song
- Subjects
Scene classification ,object detection ,attribute recognition ,attribute annotation ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Scene classification has established itself as a challenging research problem. Compared to images of individual objects, scene images could be much more semantically complex and abstract. Their difference mainly lies in the level of granularity of recognition. Yet, image recognition serves as a key pillar for the good performance of scene recognition as the knowledge attained from object images can be used for accurate recognition of scenes. The existing scene recognition methods only take the category label of the scene into consideration. However, we find that the contextual information that contains detailed local descriptions are also beneficial in allowing the scene recognition model to be more discriminative. In this paper, we aim to improve scene recognition using attribute and category label information encoded in objects. Based on the complementarity of attribute and category labels, we propose a Multi-task Attribute-Scene Recognition (MASR) network which learns a category embedding and at the same time predicts scene attributes. Attribute acquisition and object annotation are tedious and time consuming tasks. We tackle the problem by proposing a partially supervised annotation strategy in which human intervention is significantly reduced. The strategy provides a much more cost-effective solution to real world scenarios, and requires considerably less annotation efforts. Moreover, we re-weight the attribute predictions considering the level of importance indicated by the object detected scores. Using the proposed method, we efficiently annotate attribute labels for four large-scale datasets, and systematically investigate how scene and attribute recognition benefit from each other. The experimental results demonstrate that MASR learns a more discriminative representation and achieves competitive recognition performance compared to the state-of-the-art methods.
- Published
- 2024
- Full Text
- View/download PDF
9. Using Smartphone Sensing for Recognition of Game Player Attributes During Gameplay
- Author
-
Khaquan, Muhammad Saad, Ehatisham-ul-Haq, Muhammad, Murtaza, Fiza, Raheel, Aasim, Arsalan, Aamir, Azam, Muhammad Awais, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, and Arai, Kohei, editor
- Published
- 2023
- Full Text
- View/download PDF
10. IFF: An Intelligent Fashion Forecasting System
- Author
-
Muttaraju, Chakita, Prabhu, Ramya Narasimha, Sheetal, S., Uma, D., Shylaja, S. S., Xhafa, Fatos, Series Editor, Buyya, Rajkumar, editor, Hernandez, Susanna Munoz, editor, Kovvur, Ram Mohan Rao, editor, and Sarma, T. Hitendra, editor
- Published
- 2023
- Full Text
- View/download PDF
11. Efficient Deep Learning Approach to Recognize Person Attributes by Using Hybrid Transformers for Surveillance Scenarios
- Author
-
S. Raghavendra, Ramyashree, S. K. Abhilash, Venu Madhav Nookala, and S. Kaliraj
- Subjects
Attribute recognition ,CNN ,deep neural network ,image classification ,transformers ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Numerous deep perception technologies and methods are built on the foundation of pedestrian feature identification. It covers various fields, including autonomous driving, spying, and object tracking. A recent study area is the identification of personality traits that has attracted much interest in video surveillance. Identifying a person’s distinct areas is complex and plays an incredibly significant role. This paper presents a current method applied to networks of primary convolutional neurons to locate the area connected to the Person attribute. Using Individual Feature Identification, the features of a person, such as gender, age, fashion sense, and equipment, have received much attention in video surveillance analytics. This Article adopted a Conv-Attentional image transformer that broke down the most discriminating Attribute and region into multiple grades. The feed-forward system and conv-attention are the components of serial blocks, and parallel blocks have two attention-focused tactics: direct cross-layer attention and feature interpolation. It also provides a flexible Attribute Localization Module (ALM) to learn the regional aspects of each Attribute are considered at several levels, and the most discriminating areas are selected adaptively. We draw the conclusion that hybrid transformers outperform pure transformers in this instance. The extensive experimental results indicate that the proposed hybrid technique achieves higher results than the current strategies on four unique private characteristic datasets, i.e., RapV2, RapV1, PETA, and PA100K.
- Published
- 2023
- Full Text
- View/download PDF
12. A robust attribute-aware and real-time multi-target multi-camera tracking system using multi-scale enriched features and hierarchical clustering.
- Author
-
Moghaddam, Mahnaz, Charmi, Mostafa, and Hassanpoor, Hossein
- Abstract
Multi-Camera Multi-Target Tracking (MTMCT) has challenges such as viewpoint and pose variations, scale and illumination changes, and occlusion. Available MTMCT approaches have high computational complexity and are not sufficiently robust in the mentioned challenges. In this work, an Attribute Recognition-based MTMCT(AR-MTMCT) framework is presented for real-time application. This framework performs object detection, re-Identification (Re-Id) feature extraction, and attribute recognition in an end-to-end manner. Applying attributes highly improves MTMC online tracking performance in the mentioned challenges. The pipeline of AR-MTMCT consists of three modules. The first module is a novel one-shot Single-Camera Tracking (SCT) architecture named Attribute Recognition-Multi Object Tracking (AR-MOT) which performs object detection, Re-Id feature extraction, and attributes recognition using one backbone through multi-task learning. Hierarchical clustering is performed in the second module to deal with the detection of several instances of one identity in the overlapping areas of cameras. In the last module, a new data association algorithm is performed using spatial information to reduce matching candidates. We also have proposed an efficient strategy in the data association algorithm to remove lost tracks by making a trade-off between the number of lost tracks and the maximum lost time. Evaluation and training of AR-MTMCT have been done on the large-scale MTA dataset. The proposed system has been improved by 20% and 11%, respectively, compared to the WDA method in IDF1 and IDs metrics. Also, the AR-MTMCT outperforms the state-of-the-art methods by a large margin on decreasing computational complexities. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
13. Attribute-guided attention and dependency learning for improving person re-identification based on data analysis technology.
- Author
-
Chang, Heyu, Qu, Dan, Wang, Kun, Zhang, Hongqi, Si, Nianwen, Yan, Gengxiao, and Li, Huazhong
- Subjects
DATA analysis ,LEARNING modules ,MACHINE learning ,LEARNING ,IDENTIFICATION - Abstract
Person re-identification (Re-ID) can determine whether a pedestrian target can be matched across diverse regions or cameras, thereby alleviating the problem between massive surveillance data and inefficient manual retrieval. Inspired by attribute-person recognition (APR) network, this paper proposes an improved Re-ID method based on attribute learning, which uses an attribute-guided attention mechanism module and an attribute dependency learning module to learn fine-grained attribute features and rich dependencies among them. After that, a joint model with the integration of attribute recognition and person identity recognition is built for end-to-end training. Experimental results show that the proposed method can effectively improve Re-ID accuracy and achieve a competitive recognition performance. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. Relation-Aware Attribute Network for Fine-Grained Clothing Recognition
- Author
-
Yang, Mingjian, Li, Yixin, Su, Zhuo, Zhou, Fan, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Mantoro, Teddy, editor, Lee, Minho, editor, Ayu, Media Anugerah, editor, Wong, Kok Wai, editor, and Hidayanto, Achmad Nizar, editor
- Published
- 2021
- Full Text
- View/download PDF
15. An Object Detection and Tracking Algorithm Combined with Semantic Information
- Author
-
Ji, Qingbo, Liu, Hang, Hou, Changbo, Zhang, Qiang, Mo, Hongwei, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin (Sherman), Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Xiong, Jinbo, editor, Wu, Shaoen, editor, Peng, Changgen, editor, and Tian, Youliang, editor
- Published
- 2021
- Full Text
- View/download PDF
16. Merging Super Resolution and Attribute Learning for Low-Resolution Person Attribute Recognition
- Author
-
Ramin Abbaszadi and Nazli Ikizler-Cinbis
- Subjects
Attribute recognition ,linear programming ,linear combination ,neural networks ,parametric inference ,super resolution ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
In video surveillance, visual person attributes such as gender, backpack, type of clothing are crucial for person search or re-identification. For detecting and retrieving these attributes with high accuracy, the availability of high-quality videos is a necessity in general. However, this cannot be guaranteed in general surveillance videos or images; beside improving hardware technology, improving inference algorithms on low-resolution data is valuable. This paper attempts to propose two solutions in this direction: designing a combined neural network architecture from existing architectures, and a novel combination approach toward re-identification on low-resolution videos. The proposed architecture introduces a combined Neural Network architecture, called SRMAR, that jointly trains Super Resolution and Multi Attribute Recognition models for more effective recognition. Experiments on two benchmark datasets demonstrate the effectiveness and applicability of the proposed neural network architecture for low-resolution multi-attribute recognition. Furthermore, a higher-level linear combination scheme that optimally combines the proposed SRMAR architecture and multi-attribute recognition network is presented, yielding superior results in low-resolution person attribute recognition.
- Published
- 2022
- Full Text
- View/download PDF
17. Automatic Recognition of Giant Panda Attributes from Their Vocalizations Based on Squeeze-and-Excitation Network.
- Author
-
Zhao, Qijun, Zhang, Yanqiu, Hou, Rong, He, Mengnan, Liu, Peng, Xu, Ping, Zhang, Zhihe, and Chen, Peng
- Subjects
- *
GIANT panda , *PANDAS , *ANIMAL communication , *KEYSTONE species , *AGE groups , *WILDLIFE conservation - Abstract
The giant panda (Ailuropoda melanoleuca) has long attracted the attention of conservationists as a flagship and umbrella species. Collecting attribute information on the age structure and sex ratio of the wild giant panda populations can support our understanding of their status and the design of more effective conservation schemes. In view of the shortcomings of traditional methods, which cannot automatically recognize the age and sex of giant pandas, we designed a SENet (Squeeze-and-Excitation Network)-based model to automatically recognize the attributes of giant pandas from their vocalizations. We focused on the recognition of age groups (juvenile and adult) and sex of giant pandas. The reason for using vocalizations is that among the modes of animal communication, sound has the advantages of long transmission distances, strong penetrating power, and rich information. We collected a dataset of calls from 28 captive giant panda individuals, with a total duration of 1298.02 s of recordings. We used MFCC (Mel-frequency Cepstral Coefficients), which is an acoustic feature, as inputs for the SENet. Considering that small datasets are not conducive to convergence in the training process, we increased the size of the training data via SpecAugment. In addition, we used focal loss to reduce the impact of data imbalance. Our results showed that the F1 scores of our method for recognizing age group and sex reached 96.46% ± 5.71% and 85.85% ± 7.99%, respectively, demonstrating that the automatic recognition of giant panda attributes based on their vocalizations is feasible and effective. This more convenient, quick, timesaving, and laborsaving attribute recognition method can be used in the investigation of wild giant pandas in the future. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
18. Seismic vulnerability assessment of mountainous gas pipelines based on improved Borda method and attribute recognition
- Author
-
Wenyue WANG, Ying WU, Xiao YOU, and Lang CHEN
- Subjects
mountainous gas pipelines ,seismic vulnerability assessment ,improved borda method ,attribute recognition ,Oils, fats, and waxes ,TP670-699 ,Gas industry ,TP751-762 - Abstract
The areas along the mountainous gas pipelines are complicated in geographic and geomorphic conditions, and thus greatly affected by earthquakes. Therefore, the safety and reliability assessment methods of mountainous gas pipelines under the influence of earthquakes were analyzed, which is of important strategic significance to ensure the safety of natural gas supply. From the prospective of the seismic risk and pipeline vulnerability factors, the method of calculating the weight of the index system under group decision-making with the improved Borda method was put forward. Meanwhile, the calculation methods of multi-index comprehensive measurement mentioned in the attribute recognition theory were divided into the types with "or" and "and" logics. Then, the calculation method of multi-index comprehensive measurement with "and" logics was proposed, and the assessment system for the seismic vulnerability of mountainous gas pipelines was established. In addition, the assessment system was applied to a mountainous gas pipeline in western Sichuan, and by introducing the attribute recognition theory, the attribute recognition was performed with the basic data combined with the expert opinions. In this way, the vulnerability level of the pipeline section was obtained. Generally, the results could provide reference for the prediction of the vulnerable pipeline sections in earthquakes and the development of the appropriate measures.
- Published
- 2021
- Full Text
- View/download PDF
19. Unsupervised Domain Attention Adaptation Network for Caricature Attribute Recognition
- Author
-
Ji, Wen, He, Kelei, Huo, Jing, Gu, Zheng, Gao, Yang, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Vedaldi, Andrea, editor, Bischof, Horst, editor, Brox, Thomas, editor, and Frahm, Jan-Michael, editor
- Published
- 2020
- Full Text
- View/download PDF
20. Attribute-Guided Global and Part-Level Identity Network for Person Re-Identification.
- Author
-
Pan, Shaoming, Feng, Wenqiang, and Chong, Yanwen
- Subjects
- *
DEEP learning , *PEDESTRIANS , *HUMAN body , *GLOBAL method of teaching - Abstract
Most of the person re-identification (re-ID) algorithms based on deep learning mainly learn the global feature representation of pedestrians, while ignoring the important role of fine-grained pedestrian attribute features on re-ID tasks. Pedestrian attributes are middle-level semantic features, which have invariance in different poses, camera views, and illumination conditions. Considering the robustness and promotion of pedestrian attributes for person re-ID task, we propose an Attribute-guided Global and Part-level identity Network (AGPNet), which consists of a global identity task, a part-level identity task, and a pedestrian attributes learning task. AGPNet takes advantage of perceived semantic information of pedestrian attributes and deploys them as guidance to attend to human body regions and learn robust feature representation in the feature representation construction stage. Extensive experiments on two large-scale person re-ID datasets (Market-1501 and DukeMTMC-reID) show the effectiveness of our method, which is competitive with the state-of-the-art algorithms. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
21. Enhance Feature Representation of Dual Networks for Attribute Prediction
- Author
-
Fang, Yuchun, Cao, Yilu, Zhang, Wei, Yuan, Qiulong, Barbosa, Simone Diniz Junqueira, Editorial Board Member, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Kotenko, Igor, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Gedeon, Tom, editor, Wong, Kok Wai, editor, and Lee, Minho, editor
- Published
- 2019
- Full Text
- View/download PDF
22. Fine-grained attribute-aware analysis for person re-identification.
- Author
-
Bai, Kunlong, Fu, Saiji, Yang, Linrui, and Liu, Dalian
- Abstract
How and where to learn the discriminative feature has always been a critical issue of person re-identification (re-ID). Most of the previous methods focus on extracting global representation from overall identity image and lack of high-level semantic attribute information. Based on the considerations above, we focus on looking for a way to make people's attribute play a role in person re-identification task. In this paper, a novel multitask-like network, namely, Attribute-Identity Complementary Network (AICNet), is designed. It contains two branches to learn the identity and attribute feature separately, and a reciprocal interaction process is added to enrich the discrimination of the resulting feature. In addition, in order to better extract high-level semantic attribute information, multi-scale features are combined in the process of attribute extraction. With the help of this attribute-identity complementary strategy, the generated feature can be guided to learn most distinctive attribute feature that is most relevant to identity feature. The present paper carries out extensive experiments on two large-scale datasets, including Market-1501, and DukeMTMC-reID, after which it is found that our attribute-identity complementary framework significantly outperforms the baseline model and achieves competitive performance compared with the state-of-the-art person re-ID methods. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. MEL⁃YOLO:多任务人眼属性识别及关键点定位网络.
- Author
-
吴东亮, 沈文忠, and 刘林嵩
- Abstract
Copyright of Journal of Data Acquisition & Processing / Shu Ju Cai Ji Yu Chu Li is the property of Editorial Department of Journal of Nanjing University of Aeronautics & Astronautics and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2022
- Full Text
- View/download PDF
24. AttributeNet: Attribute enhanced vehicle re-identification.
- Author
-
Quispe, Rodolfo, Lan, Cuiling, Zeng, Wenjun, and Pedrini, Helio
- Subjects
- *
CONVOLUTIONAL neural networks - Abstract
Vehicle Re-Identification (V-ReID) is a critical task that associates the same vehicle across images from different camera viewpoints. Many works explore attribute clues to enhance V-ReID; however, there is usually a lack of effective interaction between the attribute-related modules and final V-ReID objective. In this work, we propose a new method to efficiently explore discriminative information from vehicle attributes (for instance, color and type). We introduce AttributeNet (ANet) that jointly extracts identity-relevant features and attribute features. We enable the interaction by distilling the ReID-helpful attribute feature and adding it into the general ReID feature to increase the discrimination power. Moreover, we propose a constraint, named Amelioration Constraint (AC), which encourages the feature after adding attribute features onto the general ReID feature to be more discriminative than the original general ReID feature. We validate the effectiveness of our framework on three challenging datasets. Experimental results show that our method achieves the state-of-the-art performance. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
25. Dual Intercommunication Network: Enabling Interhemispheric Communications in Hemisphere-Inspired ANNs
- Author
-
Yanze Wu
- Subjects
Attribute recognition ,computer vision ,deep learning ,multi-task learning ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
The human brain has been a main source of inspiration for designing deep learning models. Recently, inspired by the specialized functions of two cerebral hemispheres in processing low and high spatial frequency information, some dual-path neural networks with global and local branches have been proposed to deal with both coarse- and fine-grained visual tasks simultaneously. However, in existing works, the interhemispheric communication mechanism, which is responded by the corpus callosum, the largest white matter structure in the human brain that connecting the left and right cerebral hemispheres, is still not fully explored and exploited. This paper aims to explore how the corpus callosum can inspire us to enable transfer and integration of information between global and local branches in hemisphere-inspired artificial neural networks, such that one branch can leverage the other's learned knowledge and benefit each other. To this end, we propose a gated intercommunication unit to selectively transfer useful knowledge between the two branches via attention mechanisms to alleviate the negative transfer. Experiments on sb-MNIST and two pedestrian attribute datasets show that the proposed method outperforms the compared ones in most cases.
- Published
- 2020
- Full Text
- View/download PDF
26. Pedestrian attribute recognition using trainable Gabor wavelets
- Author
-
Imran N. Junejo, Naveed Ahmed, and Mohammad Lataifeh
- Subjects
Deep learning ,Attribute recognition ,Computer vision ,Science (General) ,Q1-390 ,Social sciences (General) ,H1-99 - Abstract
Surveillance cameras are everywhere keeping an eye on pedestrians or people as they navigate through the scene. Within this context, our paper addresses the problem of pedestrian attribute recognition (PAR). This problem entails the extraction of different attributes such as age-group, clothing style, accessories, footwear style etc. This is a multi-label problem with a host of challenges even for human observers. As such, the topic has rightly attracted attention recently. In this work, we integrate trainable Gabor wavelet (TGW) layers inside a convolution neural network (CNN). Whereas other researchers have used fixed Gabor filters with the CNN, the proposed layers are learnable and adapt to the dataset for a better recognition. We test our method on publicly available challenging datasets and demonstrate considerable improvements over state of the art approaches.
- Published
- 2021
- Full Text
- View/download PDF
27. Person retrieval in surveillance using textual query: a review.
- Author
-
Galiyawala, Hiren and Raval, Mehul S.
- Subjects
VIDEO surveillance ,NATURAL language processing ,COMPUTER vision - Abstract
Recent advancement of research in biometrics, computer vision, and natural language processing has discovered opportunities for person retrieval from surveillance videos using textual query. The prime objective of a surveillance system is to locate a person using a description, e.g., a short woman with a pink t-shirt and white skirt carrying a black purse. She has brown hair. Such a description contains attributes like gender, height, type of clothing, colour of clothing, hair colour, and accessories. Such attributes are formally known as soft biometrics. They help bridge the semantic gap between a human description and a machine as a textual query contains the person's soft biometric attributes. It is also not feasible to manually search through huge volumes of surveillance footage to retrieve a specific person. Hence, automatic person retrieval using vision and language-based algorithms is becoming popular. In comparison to other state-of-the-art reviews, the contribution of the paper is as follows: 1. Recommends most discriminative soft biometrics for specific challenging conditions. 2. Integrates benchmark datasets and retrieval methods for objective performance evaluation. 3. A complete snapshot of techniques based on features, classifiers, number of soft biometric attributes, type of the deep neural networks, and performance measures. 4. The comprehensive coverage of person retrieval from handcrafted features based methods to end-to-end approaches based on natural language description. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
28. 目标多种多值属性的端端快速识别网络.
- Author
-
周伦钢, 孙怡峰, 王坤, 吴疆, 黄维贵, and 李炳龙
- Abstract
Copyright of Journal of Computer Engineering & Applications is the property of Beijing Journal of Computer Engineering & Applications Journal Co Ltd. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2021
- Full Text
- View/download PDF
29. Exploiting interaction of fine and coarse features and attribute co-occurrence for person attribute recognition.
- Author
-
Sun, Zhiyong, Ye, Junyong, Wang, Tongqing, Jiang, Li, and Li, Yang
- Subjects
CONVOLUTIONAL neural networks ,COMPUTER vision ,SIGNAL convolution ,IMAGE representation ,VISUAL fields ,PREDICTION theory - Abstract
Person attribute recognition, i.e., the prediction of a fixed set of semantic attributes given an image of a person, becomes an important topic in the field of computer vision. Recently, methods based on convolutional neural networks have shown outstanding performance in this area. They usually employ a CNN network to mine the shared feature representation followed by several layers for attribute classification. To improve the representation ability of the model, many methods element-add or concatenate coarse and fine feature maps to fuse information at different feature levels. However, these methods didn't fully exploit the interaction of multi-level convolutional feature maps for person attribute analysis and not consider the correlation of attributes for the same person. In this paper, we introduce a kind of correlation feature, which exploits the high order interaction of coarse and fine feature maps to capture the robust feature representation from multi-level convolution layers as the image representation for person attribute recognition. Moreover, we propose an intraperson attribute loss to explicitly model the correlation of attributes for the same person. We experiment our proposed model on CIFAR-10 dataset, Berkeley Human Attributes dataset, PA-100 K dataset, and experimental results show the better performance of the feature representation and the effectiveness of intra-person attribute loss. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
30. Collaborative Learning Network for Face Attribute Prediction
- Author
-
Wang, Shiyao, Deng, Zhidong, Wang, Zhenyang, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Lai, Shang-Hong, editor, Lepetit, Vincent, editor, Nishino, Ko, editor, and Sato, Yoichi, editor
- Published
- 2017
- Full Text
- View/download PDF
31. Person Attribute Recognition by Sequence Contextual Relation Learning.
- Author
-
Wu, Jingjing, Liu, Hao, Jiang, Jianguo, Qi, Meibin, Ren, Bo, Li, Xiaohong, and Wang, Yashen
- Subjects
- *
CONTEXTUAL learning , *IMAGE registration , *ALGORITHMS , *OBJECT recognition (Computer vision) , *IMAGE recognition (Computer vision) - Abstract
Person attribute recognition aims to identify the attribute labels from the pedestrian images. Extracting contextual relation from the images and attributes, including the spatial-semantic relations, the spatial context and the semantic correlation, is beneficial to enhance the discrimination of the features for recognizing the attributes. Thus, this work proposes a sequence contextual relation learning (SCRL) method to capture these relations. It first embeds the images and attributes into sequences in two branches. Then SCRL flexibly learns the contextual relation from the sequences with the parallel attention model structure, which integrates the inter-attention and intra-attention models. The inter-attention module is utilized to extract the spatial-semantic relations, while the intra-attention is designed to gain the spatial context and the semantic correlation. Both attention modules are comprised of several parallel attention units and each unit can obtain the pairwise relations in one subspace. Therefore, they obtain the relations in multiple subspaces, which can improve the comprehensiveness of the relation learning. Additionally, for the sake of better extraction of spatial-semantic relations, this paper employs connectionist temporal classification (CTC) loss which is capable of driving the network to enforce monotonic alignment between the image and attribute. It can also accelerate the convergence of the network by the algorithm in it. Extensive experiments on five public datasets, i.e., Market-1501 attribute, Duke attribute, PETA, RAP and PA-100K datasets, demonstrate the effectiveness of the proposed method. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
32. Recurrent Prediction With Spatio-Temporal Attention for Crowd Attribute Recognition.
- Author
-
Li, Qiaozhe, Zhao, Xin, He, Ran, and Huang, Kaiqi
- Subjects
- *
FORECASTING , *CROWDS , *OBJECT recognition (Computer vision) , *MACHINE learning , *PREDICTION models , *VIDEO compression - Abstract
Crowd attribute recognition is a challenging task for crowd video understanding because a crowd video often contains multiple attributes from various types. Traditional deep learning-based methods directly treat this recognition problem as a multiple binary classification problem and represent the video by vectorizing and fusing the separately learned spatial and temporal features in the fully connected layers. Therefore, the correlations between these attributes may not be well captured. In this paper, a bidirectional recurrent prediction model with a semantic-aware attention mechanism is proposed to explore the spatio-temporal and semantic relations between the attributes for more accurate recognition. The ConvLSTM is introduced for feature representation to capture the spatio-temporal structure of the crowd videos and facilitate the visual attention. The bidirectional recurrent attention module is proposed for sequential attribute prediction by associating each subcategory attributes to corresponding semantic-related regions iteratively. The experiments and evaluations on the challenging WWW crowd video dataset not only show that our approach significantly outperforms the state-of-the-art methods but also verify that our approach can effectively capture the spatio-temporal and semantic relations of the crowd attributes. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
33. 隧道施工期岩爆危险性评价的属性识别模型及工程应用.
- Author
-
何怡帆, 李天斌, and 曹海洋
- Abstract
Copyright of Hydrogeology & Engineering Geology / Shuiwendizhi Gongchengdizhi is the property of Hydrogeology & Engineering Geology Editorial Office and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2020
- Full Text
- View/download PDF
34. Attribute Based Approach for Clothing Recognition
- Author
-
Fan, Wang, Qiyang, Zhao, Qingjie, Liu, Baolin, Yin, Diniz Junqueira Barbosa, Simone, Series editor, Chen, Phoebe, Series editor, Du, Xiaoyong, Series editor, Filipe, Joaquim, Series editor, Kara, Orhun, Series editor, Kotenko, Igor, Series editor, Liu, Ting, Series editor, Sivalingam, Krishna M., Series editor, Washio, Takashi, Series editor, Tan, Tieniu, editor, Li, Xuelong, editor, Chen, Xilin, editor, Zhou, Jie, editor, Yang, Jian, editor, and Cheng, Hong, editor
- Published
- 2016
- Full Text
- View/download PDF
35. Deep Multi-Task Network for Learning Person Identity and Attributes
- Author
-
Philip Chikontwe and Hyo Jong Lee
- Subjects
Attribute recognition ,convolutional neural networks (CNNs) ,deep learning ,person re-identification ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Person re-identification (re-ID) has been gaining in popularity in the research community owing to its numerous applications and growing importance in the surveillance industry. Recent methods often employ partial features for person re-ID and offer fine-grained information beneficial for person retrieval. In this paper, we focus on learning improved partial discriminative features using a deep convolutional neural architecture, which includes a pyramid spatial pooling module for efficient person feature representation. Furthermore, we propose a multi-task convolutional network that learns both personal attributes and identities in an end-to-end framework. Our approach incorporates partial features and global features for identity and attribute prediction, respectively. Experiments on several large-scale person re-ID benchmark data sets demonstrate the accuracy of our approach. For example, we report rank-1 accuracies of 85.37% (+3.47 %) and 92.81% (+0.51 %) on the DukeMTMC re-ID and Market-1501 data sets, respectively. The proposed method shows encouraging improvements compared with the state-of-the-art methods.
- Published
- 2018
- Full Text
- View/download PDF
36. Multi-task Attribute Joint Feature Learning
- Author
-
Chang, Lu, Fang, Yuchun, Jiang, Xiaoda, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Yang, Jinfeng, editor, Yang, Jucheng, editor, Sun, Zhenan, editor, Shan, Shiguang, editor, Zheng, Weishi, editor, and Feng, Jianjiang, editor
- Published
- 2015
- Full Text
- View/download PDF
37. 基于动态多任务平衡方法的行人属性识别深度学习网络.
- Author
-
孙志勇, 叶俊勇, 汪同庆, 雷 莉, 连 捷, and 李 阳
- Abstract
Copyright of Journal of Computer-Aided Design & Computer Graphics / Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao is the property of Gai Kan Bian Wei Hui and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2019
- Full Text
- View/download PDF
38. Improving person re-identification by attribute and identity learning.
- Author
-
Lin, Yutian, Zheng, Liang, Zheng, Zhedong, Wu, Yu, Hu, Zhilan, Yan, Chenggang, and Yang, Yi
- Subjects
- *
LABELS , *INFORMATION commons , *PEDESTRIANS - Abstract
• We annotate attribute labels on two large-scale person re-identification datasets. • We propose APR to improve re-ID by exploiting global and detailed information. • We introduce a module to leverage the correlation between attributes. • We speed-up the retrieval of re-ID by ten times with only a 2.92% accuracy drop. • We achieve competitive re-ID performance with the state-of-the-art methods. Person re-identification (re-ID) and attribute recognition share a common target at learning pedestrian descriptions. Their difference consists in the granularity. Most existing re-ID methods only take identity labels of pedestrians into consideration. However, we find the attributes, containing detailed local descriptions, are beneficial in allowing the re-ID model to learn more discriminative feature representations. In this paper, based on the complementarity of attribute labels and ID labels, we propose an attribute-person recognition (APR) network, a multi-task network which learns a re-ID embedding and at the same time predicts pedestrian attributes. We manually annotate attribute labels for two large-scale re-ID datasets, and systematically investigate how person re-ID and attribute recognition benefit from each other. In addition, we re-weight the attribute predictions considering the dependencies and correlations among the attributes. The experimental results on two large-scale re-ID benchmarks demonstrate that by learning a more discriminative representation, APR achieves competitive re-ID performance compared with the state-of-the-art methods. We use APR to speed up the retrieval process by ten times with a minor accuracy drop of 2.92% on Market-1501. Besides, we also apply APR on the attribute recognition task and demonstrate improvement over the baselines. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
39. Image Understanding for Visual Dialog.
- Author
-
Yeongsu Cho and Incheol Kim
- Abstract
This study proposes a deep neural network model based on an encoder-decoder structure for visual dialogs. Ongoing linguistic understanding of the dialog history and context is important to generate correct answers to questions in visual dialogs followed by questions and answers regarding images. Nevertheless, in many cases, a visual understanding that can identify scenes or object attributes contained in images is beneficial. Hence, in the proposed model, by employing a separate person detector and an attribute recognizer in addition to visual features extracted from the entire input image at the encoding stage using a convolutional neural network, we emphasize attributes, such as gender, age, and dress concept of the people in the corresponding image and use them to generate answers. The results of the experiments conducted using VisDial v0.9, a large benchmark dataset, confirmed that the proposed model performed well. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
40. Massive Ship Fault Data Retrieval Algorithm Supporting Complex Query in Cloud Computing.
- Author
-
Lou, Hong
- Subjects
- *
INFORMATION retrieval , *CLOUD computing , *ALGORITHMS , *LAGRANGE equations , *SEARCH algorithms , *SHIPS - Abstract
Lou, H., 2019. Massive ship fault data retrieval algorithm supporting complex query in cloud computing. In: Guido-Aldana, P.A. and Mulahasan, S. (eds.), Advances in Water Resources and Exploration. Journal of Coastal Research, Special Issue No. 93, pp. 1013–1018. Coconut Creek (Florida), ISSN 0749-0208. Aiming at the problems of the current retrieval algorithm for massive ship fault data retrieval in cloud computing, which does not support complex query, large retrieval cost and low precision, a massive ship fault data retrieval algorithm supporting complex query in cloud computing based on query performance prediction is proposed. Introducing the Lagrangian algorithm to preprocess the ship fault data information; combining the attribute values of the ship fault data and the context to calculate the similarity between different attributes, and identifying the same attribute according to the similarity; The attribute recognition result is queried by using multiple retrieval models, and the model complex query request is converted into a one-dimensional query key value, and the query key values obtained by the plurality of models are predicted, and the search result with the optimal prediction performance of the complex query mode is used as the final result. The experimental results show that the proposed algorithm has good retrieval accuracy in complex query mode, and it has less overhead than the current search algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
41. Heated Metal Mark Attribute Recognition Based on Compressed CNNs Model.
- Author
-
Yin, He, Mao, Keming, Zhao, Jianzhe, Chang, Huidong, E, Dazhi, and Tan, Zhenhua
- Subjects
CONVOLUTIONAL neural networks ,HUMIDITY ,METALS ,MODEL railroads ,AUTOMATIC timers ,GRAPHICAL user interfaces - Abstract
This study considered heated metal mark attribute recognition based on compressed convolutional neural networks (CNNs) models. Based on our previous works, the heated metal mark image benchmark dataset was further expanded. State-of-the-art lightweight CNNs models were selected. Technologies of pruning, compressing, weight quantization were introduced and analyzed. Then, a multi-label model training method was devised. Moreover, the proposed models were deployed on Android devices. Finally, comprehensive experiments were evaluated. The results show that, with the fine-tuned compressed CNNs model, the recognition rate of attributes meta type, heating mode, heating temperature, heating duration, cooling mode, placing duration and relative humidity were 0.803, 0.837, 0.825, 0.812, 0.883, 0.817 and 0.894, respectively. The best model obtained an overall performance of 0.823. Comparing with traditional CNNs, the adopted compressed multi-label model greatly improved the training efficiency and reduced the space occupation, with a relatively small decrease in recognition accuracy. The running time on Android devices was acceptable. It is shown that the proposed model is applicable for real time application and is convenient to implement on mobile or embedded devices scenarios. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
42. Can adversarial networks hallucinate occluded people with a plausible aspect?
- Author
-
Fulgeri, Federico, Fabbri, Matteo, Alletto, Stefano, Calderara, Simone, and Cucchiara, Rita
- Subjects
PEDESTRIANS ,DEEP learning ,POKEMON Go ,COMPUTER graphics ,COST functions ,VIDEO games ,ARTIFICIAL neural networks - Abstract
Abstract When you see a person in a crowd, occluded by other persons, you miss visual information that can be used to recognize, re-identify or simply classify him or her. You can imagine its appearance given your experience, nothing more. Similarly, AI solutions can try to hallucinate missing information with specific deep learning architectures, suitably trained with people with and without occlusions. The goal of this work is to generate a complete image of a person, given an occluded version in input, that should be a) without occlusion b) similar at pixel level to a completely visible people shape c) capable to conserve similar visual attributes (e.g. male/female) of the original one. For the purpose, we propose a new approach by integrating the state-of-the-art of neural network architectures, namely U-nets and GANs, as well as discriminative attribute classification nets, with an architecture specifically designed to de-occlude people shapes. The network is trained to optimize a Loss function which could take into account the aforementioned objectives. As well we propose two datasets for testing our solution: the first one, occluded RAP, created automatically by occluding real shapes of the RAP dataset created by Li et al. (2016) (which collects also attributes of the people aspect); the second is a large synthetic dataset, AiC, generated in computer graphics with data extracted from the GTA video game, that contains 3D data of occluded objects by construction. Results are impressive and outperform any other previous proposal. This result could be an initial step to many further researches to recognize people and their behavior in an open crowded world. Highlights • We propose a novel Adversarial Network that solves occlusions in pedestrian images. • Our reconstruction keeps both the appearance and the background coherent. • We condition the occluded body part restoration on pedestrian attributes. • We devise a new way for synthetically generating occlusion and visible pairs. • We provide a huge CG dataset for pedestrian attribute recognition in crowded areas. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
43. Massive Ship Fault Data Retrieval Algorithm Supporting Complex Query in Cloud Computing.
- Author
-
Hong Lou
- Subjects
INFORMATION retrieval ,CLOUD computing ,ALGORITHMS ,SEARCH algorithms ,SHIPS ,LAGRANGE equations - Abstract
Aiming at the problems of the current retrieval algorithm for massive ship fault data retrieval in cloud computing, which does not support complex query, large retrieval cost and low precision, a massive ship fault data retrieval algorithm supporting complex query in cloud computing based on query performance prediction is proposed. Introducing the Lagrangian algorithm to preprocess the ship fault data information; combining the attribute values of the ship fault data and the context to calculate the similarity between different attributes, and identifying the same attribute according to the similarity; The attribute recognition result is queried by using multiple retrieval models, and the model complex query request is converted into a one-dimensional query key value, and the query key values obtained by the plurality of models are predicted, and the search result with the optimal prediction performance of the complex query mode is used as the final result. The experimental results show that the proposed algorithm has good retrieval accuracy in complex query mode, and it has less overhead than the current search algorithm. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
44. Assessment of Urban Ecosystem Health Based on Attribute Recognition Theory
- Author
-
Li, Mingfu, Yang, Yuhang, editor, and Ma, Maode, editor
- Published
- 2012
- Full Text
- View/download PDF
45. Object and attribute recognition for product image with self-supervised learning.
- Author
-
Dai, Yong, Li, Yi, and Sun, Bin
- Subjects
- *
PRODUCT image , *IMAGE recognition (Computer vision) , *PRODUCT attributes , *LATENT semantic analysis , *OBJECT recognition (Computer vision) , *PRODUCT design - Abstract
Accurate class and attribute recognition is the critical technique to convert the unstructured product image data into structured knowledge base, which provides strong support for product design in the future. However, objects of different classes sharing similar attribute may vary a lot in visual appearances, making it challenging to accurately recognize the objects and their attributes. Different from the traditional multi-label image recognition, the attribute of the object, as high-level semantic information, is not corresponding to certain regions of the object and requires to learn more fine-grained features to represent the latent high level semantic information for the attribute across different object categories. Therefore, a self-supervised method called Deconstruction and Reconstruction Learning Network (DRLN) is proposed to solve the above problems in this paper. The DRLN tries to learn more fine-grained and local feature of the input product image by a self-supervised task, which deconstructs the input product images by randomly shuffling their local regions and further reconstructs the features of corresponding deconstructed images. Besides, the proposed model is optimized in an end-to-end manner by learning from multiple tasks, i.e., the multi-label classification task, the adversarial discrimination task, and the location alignment task. Experimental results demonstrate that the proposed method outperforms the state-of-the-arts for multi-label learning problems on both our product image dataset and another public available attribute recognition dataset. To facilitate future research in this field, all the datasets and codes are directly available online. 1 1 https://github.com/Yong-DAI/DRLN. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
46. Heated Metal Mark Attribute Recognition Based on Compressed CNNs Model
- Author
-
He Yin, Keming Mao, Jianzhe Zhao, Huidong Chang, Dazhi E, and Zhenhua Tan
- Subjects
attribute recognition ,heated metal mark image ,compressed CNNs ,multi-label training ,Technology ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Biology (General) ,QH301-705.5 ,Physics ,QC1-999 ,Chemistry ,QD1-999 - Abstract
This study considered heated metal mark attribute recognition based on compressed convolutional neural networks (CNNs) models. Based on our previous works, the heated metal mark image benchmark dataset was further expanded. State-of-the-art lightweight CNNs models were selected. Technologies of pruning, compressing, weight quantization were introduced and analyzed. Then, a multi-label model training method was devised. Moreover, the proposed models were deployed on Android devices. Finally, comprehensive experiments were evaluated. The results show that, with the fine-tuned compressed CNNs model, the recognition rate of attributes meta type, heating mode, heating temperature, heating duration, cooling mode, placing duration and relative humidity were 0.803, 0.837, 0.825, 0.812, 0.883, 0.817 and 0.894, respectively. The best model obtained an overall performance of 0.823. Comparing with traditional CNNs, the adopted compressed multi-label model greatly improved the training efficiency and reduced the space occupation, with a relatively small decrease in recognition accuracy. The running time on Android devices was acceptable. It is shown that the proposed model is applicable for real time application and is convenient to implement on mobile or embedded devices scenarios.
- Published
- 2019
- Full Text
- View/download PDF
47. Scale coding bag of deep features for human attribute and action recognition.
- Author
-
Khan, Fahad Shahbaz, van de Weijer, Joost, Anwer, Rao Muhammad, Bagdanov, Andrew D., Felsberg, Michael, and Laaksonen, Jorma
- Subjects
- *
IMAGE recognition (Computer vision) , *BAG-of-words model (Computer science) , *ARTIFICIAL neural networks , *IMAGE compression , *PIXELS - Abstract
Most approaches to human attribute and action recognition in still images are based on image representation in which multi-scale local features are pooled across scale into a single, scale-invariant encoding. Both in bag-of-words and the recently popular representations based on convolutional neural networks, local features are computed at multiple scales. However, these multi-scale convolutional features are pooled into a single scale-invariant representation. We argue that entirely scale-invariant image representations are sub-optimal and investigate approaches to scale coding within a bag of deep features framework. Our approach encodes multi-scale information explicitly during the image encoding stage. We propose two strategies to encode multi-scale information explicitly in the final image representation. We validate our two scale coding techniques on five datasets: Willow, PASCAL VOC 2010, PASCAL VOC 2012, Stanford-40 and Human Attributes (HAT-27). On all datasets, the proposed scale coding approaches outperform both the scale-invariant method and the standard deep features of the same network. Further, combining our scale coding approaches with standard deep features leads to consistent improvement over the state of the art. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
48. RSCM: Region Selection and Concurrency Model for Multi-Class Weather Recognition.
- Author
-
Lin, Di, Lu, Cewu, Huang, Hui, and Jia, Jiaya
- Subjects
- *
BENCHMARKING (Management) , *IMAGE processing , *IMAGE recognition (Computer vision) , *MACHINE learning , *DATA analysis - Abstract
Toward weather condition recognition, we emphasize the importance of regional cues in this paper and address a few important problems regarding appropriate representation, its differentiation among regions, and weather-condition feature construction. Our major contribution is, first, to construct a multi-class benchmark data set containing 65 000 images from six common categories for sunny, cloudy, rainy, snowy, haze, and thunder weather. This data set also benefits weather classification and attribute recognition. Second, we propose a deep learning framework named region selection and concurrency model (RSCM) to help discover regional properties and concurrency. We evaluate RSCM on our multi-class benchmark data and another public data set for weather recognition. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
49. Attribute recognition model and its application of risk assessment for slope stability at tunnel portal.
- Author
-
Qian Zhang, Jing-chun Wang, and Hai-xia Zhang
- Subjects
- *
SLOPE stability , *RISK assessment , *MATRICES (Mathematics) , *SLOPES (Physical geography) , *MATHEMATICS - Abstract
Due to large span and complex construction technology, the portal section of tunnel is vulnerable to the threats of latent slope failures such as landslide, spalling and collapse. In the present paper, to ensure the construction safety of tunnel, the attribute recognition model of risk assessment for the slope stability at tunnel portal was established by using attribute mathematics theory. The integrated degree of rock mass, lithological feature, geological structural feature, weathering degree, seismic intensity, gradient, rainfall and construction factors were selected as the indices of the risk assessment for the stability slope at tunnel portal with grading criteria of each index. Moreover, the weights of the assessment indices were determined by the judgment matrix constructed from analytic hierarchy process. The attribute measurement functions were used to compute attribute measurement of single index and synthetic attribute measurement. The identification and classification of risk assessment for the slope stability at tunnel portal were assessed using the confidence criterion. For practical purposes, the stability of slope located in portal of an actual Tunnel, i.e., Jiefangcun Tunnel in Cheng-Lan Railway, was evaluated based on the previously established attribute recognition model. The corresponding risk treatment measures were then proposed based on the assessment results and monitor results to ensure the construction safety of tunnel portal. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
50. A Case Study on Attribute Recognition of Heated Metal Mark Image Using Deep Convolutional Neural Networks
- Author
-
Keming Mao, Duo Lu, Dazhi E, and Zhenhua Tan
- Subjects
attribute recognition ,heated metal mark ,convolutional neural networks ,Chemical technology ,TP1-1185 - Abstract
Heated metal mark is an important trace to identify the cause of fire. However, traditional methods mainly focus on the knowledge of physics and chemistry for qualitative analysis and make it still a challenging problem. This paper presents a case study on attribute recognition of the heated metal mark image using computer vision and machine learning technologies. The proposed work is composed of three parts. Material is first generated. According to national standards, actual needs and feasibility, seven attributes are selected for research. Data generation and organization are conducted, and a small size benchmark dataset is constructed. A recognition model is then implemented. Feature representation and classifier construction methods are introduced based on deep convolutional neural networks. Finally, the experimental evaluation is carried out. Multi-aspect testings are performed with various model structures, data augments, training modes, optimization methods and batch sizes. The influence of parameters, recognitio efficiency and execution time are also analyzed. The results show that with a fine-tuned model, the recognition rate of attributes metal type, heating mode, heating temperature, heating duration, cooling mode, placing duration and relative humidity are 0.925, 0.908, 0.835, 0.917, 0.928, 0.805 and 0.92, respectively. The proposed method recognizes the attribute of heated metal mark with preferable effect, and it can be used in practical application.
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.