13 results on '"Xilin, Chen"'
Search Results
2. Location Sensitive Network for Human Instance Segmentation
- Author
-
Xiangzhou Zhang, Bingpeng Ma, Hong Chang, Xilin Chen, and Shiguang Shan
- Subjects
Pixel ,Computer science ,business.industry ,Feature extraction ,Sampling (statistics) ,Pattern recognition ,Image segmentation ,Semantics ,Computer Graphics and Computer-Aided Design ,Encoding (memory) ,Image Processing, Computer-Assisted ,Humans ,Attention ,Segmentation ,Artificial intelligence ,Representation (mathematics) ,business ,Algorithms ,Software - Abstract
Location is an important distinguishing information for instance segmentation. In this paper, we propose a novel model, called Location Sensitive Network (LSNet), for human instance segmentation. LSNet integrates instance-specific location information into one-stage segmentation framework. Specifically, in the segmentation branch, Pose Attention Module (PAM) encodes the location information into the attention regions through coordinates encoding. Based on the location information provided by PAM, the segmentation branch is able to effectively distinguish instances in feature-level. Moreover, we propose a combination operation named Keypoints Sensitive Combination (KSCom) to utilize the location information from multiple sampling points. These sampling points construct the points representation for instances via human keypoints and random points. Human keypoints provide the spatial locations and semantic information of the instances, and random points expand the receptive fields. Based on the points representation for each instance, KSCom effectively reduces the mis-classified pixels. Our method is validated by the experiments on public datasets. LSNet-5 achieves 56.2 mAP at 18.5 FPS on COCOPersons. Besides, the proposed method is significantly superior to its peers in the case of severe occlusion.
- Published
- 2021
- Full Text
- View/download PDF
3. AttGAN: Facial Attribute Editing by Only Changing What You Want
- Author
-
Wangmeng Zuo, Meina Kan, Shiguang Shan, Xilin Chen, and Zhenliang He
- Subjects
FOS: Computer and information sciences ,business.industry ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,media_common.quotation_subject ,Computer Science - Computer Vision and Pattern Recognition ,Representation (systemics) ,Machine Learning (stat.ML) ,Pattern recognition ,02 engineering and technology ,Computer Graphics and Computer-Aided Design ,Image (mathematics) ,Task (project management) ,Constraint (information theory) ,Text mining ,Statistics - Machine Learning ,Face (geometry) ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,020201 artificial intelligence & image processing ,Quality (business) ,Artificial intelligence ,business ,Software ,media_common - Abstract
Facial attribute editing aims to manipulate single or multiple attributes of a face image, i.e., to generate a new face with desired attributes while preserving other details. Recently, generative adversarial net (GAN) and encoder-decoder architecture are usually incorporated to handle this task with promising results. Based on the encoder-decoder architecture, facial attribute editing is achieved by decoding the latent representation of the given face conditioned on the desired attributes. Some existing methods attempt to establish an attribute-independent latent representation for further attribute editing. However, such attribute-independent constraint on the latent representation is excessive because it restricts the capacity of the latent representation and may result in information loss, leading to over-smooth and distorted generation. Instead of imposing constraints on the latent representation, in this work we apply an attribute classification constraint to the generated image to just guarantee the correct change of desired attributes, i.e., to "change what you want". Meanwhile, the reconstruction learning is introduced to preserve attribute-excluding details, in other words, to "only change what you want". Besides, the adversarial learning is employed for visually realistic editing. These three components cooperate with each other forming an effective framework for high quality facial attribute editing, referred as AttGAN. Furthermore, our method is also directly applicable for attribute intensity control and can be naturally extended for attribute style manipulation. Experiments on CelebA dataset show that our method outperforms the state-of-the-arts on realistic attribute editing with facial details well preserved., Submitted to IEEE Transactions on Image Processing, Code: https://github.com/LynnHo/AttGAN-Tensorflow
- Published
- 2019
- Full Text
- View/download PDF
4. Fusing local patterns of gabor magnitude and phase for face recognition
- Author
-
Shufu Xie, Shiguang Shan, Xilin Chen, and Jie Chen
- Subjects
Image processing -- Technology application ,Mathematical optimization -- Usage ,Principal components analysis -- Usage ,Technology application ,Business ,Computers ,Electronics ,Electronics and electrical industries - Published
- 2010
5. Hierarchical ensemble of global and local classifiers for face recognition
- Author
-
Yu Su, Shiguang Shan, Xilin Chen, and Wen Gao
- Subjects
Fourier transformations -- Usage ,Image processing -- Research ,Business ,Computers ,Electronics ,Electronics and electrical industries - Published
- 2009
6. Locally linear regression for pose-invariant face recognition
- Author
-
Xiujuan Chai, Shiguang Shan, and Xilin Chen
- Subjects
Regression analysis -- Usage ,Invariants -- Analysis ,Business ,Computers ,Electronics ,Electronics and electrical industries - Abstract
A simple and efficient locally linear regression (LLR) method which generates the virtual frontal view from a given nonfrontal face image is presented.
- Published
- 2007
7. Histogram of Gabor phase patterns (HGPP): A novel object representation approach for face recognition
- Author
-
Zhang, Baochang, Shan, Shiguang, Xilin Chen, and Wen Gao
- Subjects
Object recognition (Computers) -- Methods ,Pattern recognition -- Methods ,Business ,Computers ,Electronics ,Electronics and electrical industries - Abstract
A study proposes a compact and effective object descriptor method, histogram of Gabor phase pattern (HGPP), for robust face recognition. This method is applied to the face recognition problem and the results on the large-scale FERET and CAS-PEAL databases have shown that the proposed algorithms have outperformed other well-known systems in terms of recognition rate.
- Published
- 2007
8. Automatic detection and recognition of signs from natural scenes
- Author
-
Xilin Chen, Jie Yang, Jing Zhang, and Waibel, Alex
- Subjects
Text processing -- Analysis ,Edge detection (Image processing) -- Analysis ,Business ,Computers ,Electronics ,Electronics and electrical industries - Abstract
Automatic detection and recognition of signs from natural scenes and its application to a sign translation task are discussed. The procedure can improve text detection rate and optical character recognition (OCR) accuracy.
- Published
- 2004
9. Learning Expressionlets via Universal Manifold Model for Dynamic Facial Expression Recognition
- Author
-
Xilin Chen, Shiguang Shan, Mengyi Liu, and Ruiping Wang
- Subjects
FOS: Computer and information sciences ,Facial expression ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,Computer Graphics and Computer-Aided Design ,Manifold ,Expression (mathematics) ,law.invention ,Set (abstract data type) ,Discriminative model ,Margin (machine learning) ,law ,0202 electrical engineering, electronic engineering, information engineering ,Embedding ,020201 artificial intelligence & image processing ,Artificial intelligence ,Representation (mathematics) ,business ,Manifold (fluid mechanics) ,Software - Abstract
Facial expression is temporally dynamic event which can be decomposed into a set of muscle motions occurring in different facial regions over various time intervals. For dynamic expression recognition, two key issues, temporal alignment and semantics-aware dynamic representation, must be taken into account. In this paper, we attempt to solve both problems via manifold modeling of videos based on a novel mid-level representation, i.e. \textbf{expressionlet}. Specifically, our method contains three key stages: 1) each expression video clip is characterized as a spatial-temporal manifold (STM) formed by dense low-level features; 2) a Universal Manifold Model (UMM) is learned over all low-level features and represented as a set of local modes to statistically unify all the STMs. 3) the local modes on each STM can be instantiated by fitting to UMM, and the corresponding expressionlet is constructed by modeling the variations in each local mode. With above strategy, expression videos are naturally aligned both spatially and temporally. To enhance the discriminative power, the expressionlet-based STM representation is further processed with discriminant embedding. Our method is evaluated on four public expression databases, CK+, MMI, Oulu-CASIA, and FERA. In all cases, our method outperforms the known state-of-the-art by a large margin., 12 pages
- Published
- 2016
- Full Text
- View/download PDF
10. A Benchmark and Comparative Study of Video-Based Face Recognition on COX Face Database
- Author
-
Ruiping Wang, Haihong Zhang, Shihong Lao, Alifu Kuerban, Xilin Chen, Zhiwu Huang, and Shiguang Shan
- Subjects
Male ,Models, Statistical ,Databases, Factual ,Biometrics ,Database ,Computer science ,Video Recording ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Image processing ,computer.software_genre ,Computer Graphics and Computer-Aided Design ,Facial recognition system ,Benchmarking ,Object-class detection ,Biometric Identification ,Face ,Face (geometry) ,Video tracking ,Image Processing, Computer-Assisted ,Benchmark (computing) ,Humans ,Female ,computer ,Software - Abstract
Face recognition with still face images has been widely studied, while the research on video-based face recognition is inadequate relatively, especially in terms of benchmark datasets and comparisons. Real-world video-based face recognition applications require techniques for three distinct scenarios: 1) Video-to-Still (V2S); 2) Still-to-Video (S2V); and 3) Video-to-Video (V2V), respectively, taking video or still image as query or target. To the best of our knowledge, few datasets and evaluation protocols have benchmarked for all the three scenarios. In order to facilitate the study of this specific topic, this paper contributes a benchmarking and comparative study based on a newly collected still/video face database, named COX 1 Face DB. Specifically, we make three contributions. First, we collect and release a large-scale still/video face database to simulate video surveillance with three different video-based face recognition scenarios (i.e., V2S, S2V, and V2V). Second, for benchmarking the three scenarios designed on our database, we review and experimentally compare a number of existing set-based methods. Third, we further propose a novel Point-to-Set Correlation Learning (PSCL) method, and experimentally show that it can be used as a promising baseline method for V2S/S2V face recognition on COX Face DB. Extensive experimental results clearly demonstrate that video-based face recognition needs more efforts, and our COX Face DB is a good benchmark database for evaluation. 1 COX Face DB was constructed by Institute of Computing Technology, Chinese Academy of Sciences ( C AS) under the sponsor of OMRON Social Solutions Co. Ltd. ( O SS), and the support of X injiang University.
- Published
- 2015
- Full Text
- View/download PDF
11. Learning Prototype Hyperplanes for Face Verification in the Wild
- Author
-
Dong Xu, Xilin Chen, Shiguang Shan, Meina Kan, and Wen Li
- Subjects
Biometry ,Feature extraction ,Sensitivity and Specificity ,Pattern Recognition, Automated ,Set (abstract data type) ,Imaging, Three-Dimensional ,Artificial Intelligence ,Image Interpretation, Computer-Assisted ,Feature (machine learning) ,Humans ,Mathematics ,business.industry ,Dimensionality reduction ,Cosine similarity ,Reproducibility of Results ,Pattern recognition ,Image Enhancement ,Linear discriminant analysis ,Computer Graphics and Computer-Aided Design ,Support vector machine ,ComputingMethodologies_PATTERNRECOGNITION ,Hyperplane ,Face ,Subtraction Technique ,Artificial intelligence ,business ,Algorithms ,Software - Abstract
In this paper, we propose a new scheme called Prototype Hyperplane Learning (PHL) for face verification in the wild using only weakly labeled training samples (i.e., we only know whether each pair of samples are from the same class or different classes without knowing the class label of each sample) by leveraging a large number of unlabeled samples in a generic data set. Our scheme represents each sample in the weakly labeled data set as a mid-level feature with each entry as the corresponding decision value from the classification hyperplane (referred to as the prototype hyperplane) of one Support Vector Machine (SVM) model, in which a sparse set of support vectors is selected from the unlabeled generic data set based on the learnt combination coefficients. To learn the optimal prototype hyperplanes for the extraction of mid-level features, we propose a Fisher’s Linear Discriminant-like (FLD-like) objective function by maximizing the discriminability on the weakly labeled data set with a constraint enforcing sparsity on the combination coefficients of each SVM model, which is solved by using an alternating optimization method. Then, we use the recent work called Side-Information based Linear Discriminant (SILD) analysis for dimensionality reduction and a cosine similarity measure for final face verification. Comprehensive experiments on two data sets, Labeled Faces in the Wild (LFW) and YouTube Faces, demonstrate the effectiveness of our scheme.
- Published
- 2013
- Full Text
- View/download PDF
12. Discriminant Analysis on Riemannian Manifold of Gaussian Distributions for Face Recognition with Image Sets
- Author
-
Xilin Chen, Ruiping Wang, Zhiwu Huang, Shiguang Shan, and Wen Wang
- Subjects
Gaussian ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,02 engineering and technology ,Riemannian geometry ,symbols.namesake ,Gaussian function ,0202 electrical engineering, electronic engineering, information engineering ,Information geometry ,Mathematics ,Manifold alignment ,business.industry ,020206 networking & telecommunications ,Pattern recognition ,Riemannian manifold ,Mixture model ,Linear discriminant analysis ,Computer Graphics and Computer-Aided Design ,Statistical manifold ,ComputingMethodologies_PATTERNRECOGNITION ,symbols ,020201 artificial intelligence & image processing ,Artificial intelligence ,Kernel Fisher discriminant analysis ,business ,Software ,Distribution (differential geometry) - Abstract
To address the problem of face recognition with image sets, we aim to capture the underlying data distribution in each set and thus facilitate more robust classification. To this end, we represent image set as the Gaussian mixture model (GMM) comprising a number of Gaussian components with prior probabilities and seek to discriminate Gaussian components from different classes. Since in the light of information geometry, the Gaussians lie on a specific Riemannian manifold, this paper presents a method named discriminant analysis on Riemannian manifold of Gaussian distributions (DARG). We investigate several distance metrics between Gaussians and accordingly two discriminative learning frameworks are presented to meet the geometric and statistical characteristics of the specific manifold. The first framework derives a series of provably positive definite probabilistic kernels to embed the manifold to a high-dimensional Hilbert space, where conventional discriminant analysis methods developed in Euclidean space can be applied, and a weighted Kernel discriminant analysis is devised which learns discriminative representation of the Gaussian components in GMMs with their prior probabilities as sample weights. Alternatively, the other framework extends the classical graph embedding method to the manifold by utilizing the distance metrics between Gaussians to construct the adjacency graph, and hence the original manifold is embedded to a lower-dimensional and discriminative target manifold with the geometric structure preserved and the interclass separability maximized. The proposed method is evaluated by face identification and verification tasks on four most challenging and largest databases, YouTube Celebrities, COX, YouTube Face DB, and Point-and-Shoot Challenge, to demonstrate its superiority over the state-of-the-art.To address the problem of face recognition with image sets, we aim to capture the underlying data distribution in each set and thus facilitate more robust classification. To this end, we represent image set as the Gaussian mixture model (GMM) comprising a number of Gaussian components with prior probabilities and seek to discriminate Gaussian components from different classes. Since in the light of information geometry, the Gaussians lie on a specific Riemannian manifold, this paper presents a method named discriminant analysis on Riemannian manifold of Gaussian distributions (DARG). We investigate several distance metrics between Gaussians and accordingly two discriminative learning frameworks are presented to meet the geometric and statistical characteristics of the specific manifold. The first framework derives a series of provably positive definite probabilistic kernels to embed the manifold to a high-dimensional Hilbert space, where conventional discriminant analysis methods developed in Euclidean space can be applied, and a weighted Kernel discriminant analysis is devised which learns discriminative representation of the Gaussian components in GMMs with their prior probabilities as sample weights. Alternatively, the other framework extends the classical graph embedding method to the manifold by utilizing the distance metrics between Gaussians to construct the adjacency graph, and hence the original manifold is embedded to a lower-dimensional and discriminative target manifold with the geometric structure preserved and the interclass separability maximized. The proposed method is evaluated by face identification and verification tasks on four most challenging and largest databases, YouTube Celebrities, COX, YouTube Face DB, and Point-and-Shoot Challenge, to demonstrate its superiority over the state-of-the-art.
- Published
- 2017
- Full Text
- View/download PDF
13. Histogram of Gabor Phase Patterns (HGPP): A Novel Object Representation Approach for Face Recognition.
- Author
-
Baochang Zhang, Shiguang Shan, Xilin Chen, and Wen Gao
- Subjects
GABOR transforms ,FACE perception ,WAVELETS (Mathematics) ,GAUSSIAN processes ,DATA compression ,OPTICAL character recognition devices ,HUMAN-computer interaction - Abstract
A novel object descriptor, histogram of Gabor phase pattern (HGPP), is proposed for robust face recognition. In HGPP, the quadrant-bit codes are first extracted from faces based on the Gabor transformation. Global Gabor phase pattern (GGPP) and local Gabor phase pattern (LGPP) are then proposed to encode the phase variations. GGPP captures the variations derived from the orientation changing of Gabor wavelet at a given scale (frequency), while LGPP encodes the local neighborhood variations by using a novel local XOR pattern (LXP) operator. They are both divided into the nonoverlapping rectangular regions, from which spatial histograms are extracted and concatenated into an extended histogram feature to represent the original image. Finally, the recognition is performed by using the nearest-neighbor classifier with histogram intersection as the similarity measurement. The features of HGPP lie in two aspects: 1) HGPP can describe the general face images robustly without the training procedure; 2) HGPP encodes the Gabor phase information, while most previous face recognition methods exploit the Gabor magnitude information. In addition, Fisher separation criterion is further used to improve the performance of HGPP by weighing the subregions of the image according to their discriminative powers. The proposed methods are success fully applied to face recognition, and the experiment results on the large-scale FERET and CAS-PEAL databases show that the proposed algorithms significantly outperform other well-known systems in terms of recognition rate. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.