24 results on '"Yinfu Feng"'
Search Results
2. Cross-Lingual Product Retrieval in E-Commerce Search.
- Author
-
Wenya Zhu, Xiaoyu Lv, Baosong Yang, Yinghua Zhang, Xu Yong, Linlong Xu, Yinfu Feng, Haibo Zhang 0013, Qing Da, Anxiang Zeng, and Ronghua Chen
- Published
- 2022
- Full Text
- View/download PDF
3. DHA: Product Title Generation with Discriminative Hierarchical Attention for E-commerce.
- Author
-
Wenya Zhu, Yinghua Zhang, Yu Zhang 0006, Yu-Hang Zhou, Yinfu Feng, Yuxiang Wu, Qing Da, and Anxiang Zeng
- Published
- 2022
- Full Text
- View/download PDF
4. A Primal-Dual Online Algorithm for Online Matching Problem in Dynamic Environments.
- Author
-
Yu-Hang Zhou, Peng Hu, Chen Liang, Huan Xu, Guangda Huzhang, Yinfu Feng, Qing Da, Xinshang Wang, and Anxiang Zeng
- Published
- 2021
- Full Text
- View/download PDF
5. A 3D human motion refinement method based on sparse motion bases selection.
- Author
-
Zhao Wang, Yinfu Feng, Shuang Liu 0006, Jun Xiao 0001, Xiaosong Yang, and Jian J. Zhang 0001
- Published
- 2016
- Full Text
- View/download PDF
6. Adaptive Unsupervised Multi-view Feature Selection for Visual Concept Recognition.
- Author
-
Yinfu Feng, Jun Xiao 0001, Yueting Zhuang, and Xiaoming Liu 0002
- Published
- 2012
- Full Text
- View/download PDF
7. Fast view-based 3D model retrieval via unsupervised multiple feature fusion and online projection learning
- Author
-
Yueting Zhuang, Jun Xiao, Yinfu Feng, and Mingming Ji
- Subjects
Computer science ,business.industry ,Sorting ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,Machine learning ,computer.software_genre ,Discriminative model ,Ranking ,Control and Systems Engineering ,Feature (computer vision) ,Signal Processing ,0202 electrical engineering, electronic engineering, information engineering ,Graph (abstract data type) ,020201 artificial intelligence & image processing ,Learning to rank ,Computer Vision and Pattern Recognition ,Artificial intelligence ,Electrical and Electronic Engineering ,Projection (set theory) ,Representation (mathematics) ,business ,computer ,Software - Abstract
Since each visual feature only reflects a unique characteristic about a 3-dimensional (3D) model and different visual features have diverse discriminative power in model representation, it would be beneficial to fuse multiple visual features in 3D model retrieval. To this end, we propose a fast view-based 3D model retrieval framework in this article. This framework comprises two parts: the first one is an Unsupervised Multiple Feature Fusion algorithm (UMFF), which is used to learn a compact yet discriminative feature representation from the original multiple visual features; and the second one is an efficient Online Projection Learning algorithm (OPL), which is designed to fast transfer the input multiple visual features of a newcome model into its corresponding low-dimensional feature representation. In this framework, many existing ranking algorithms such as the simple distance-based ranking method can be directly adopted for sorting all 3D models in the database using the learned new feature representation and returning the top ranked models to the user. Extensive experiments on two public 3D model databases demonstrate the efficiency and the effectiveness of the proposed approach over its competitors. The proposed framework cannot only dramatically improve the retrieval performance but also reduce the computational cost in dealing with the newcome models. HighlightsAn Unsupervised Multiple Feature Fusion (UMFF) algorithm is proposed to fuse multiple features for 3D model representation.The ? 1 -graph method is used to learn roust and datum-adaptive graph to preserve local geometric structure information.An efficient Online Projection Learning (OPL) algorithm is designed to solve the out-of-sample problem.
- Published
- 2016
8. A locally weighted sparse graph regularized Non-Negative Matrix Factorization method
- Author
-
Jun Xiao, Kang Zhou, Yinfu Feng, and Yueting Zhuang
- Subjects
Dense graph ,business.industry ,Cognitive Neuroscience ,Pattern recognition ,Facial recognition system ,Regularization (mathematics) ,Computer Science Applications ,Non-negative matrix factorization ,Matrix decomposition ,symbols.namesake ,Empirical likelihood ,Artificial Intelligence ,Gaussian noise ,symbols ,Graph (abstract data type) ,Artificial intelligence ,business ,Mathematics - Abstract
Owing to the well interpretation ability, Non-Negative Matrix Factorization (NMF) has attracted much attention from computer vision and machine learning communities. However, the standard NMF adopts a least square error function as the empirical likelihood term in the model, which is sensitive to the noise and outliers. So, it is not robust in practice. To overcome this problem, we propose a noise robust NMF method named as Locally Weighted Sparse Graph regularized Non-negative Matrix Factorization (LWSG_NMF). Since many real-world noises can be broadly decomposed into the dense Gaussian random noise and the sparse block noise, we propose a sparse noise assumption. Based on this assumption, we reformulate the empirical likelihood term of the standard NMF by explicitly imposing a sparse noise term. Meanwhile, a locally weighted sparse graph regularization term is also incorporated in our model to exploit the local geometric structure information of data. Different from the other existing graph-based methods, we take the effect of noise into account in learning our graph regularization term. An iterative optimization method is also proposed to solve the objective function of LWSG_NMF. Extensive experiments on three public benchmark datasets demonstrate the robustness and the effectiveness of our proposed method for human face recognition and handwritten digital recognition in the presence of noise.
- Published
- 2015
9. Efficient semi-supervised multiple feature fusion with out-of-sample extension for 3D model retrieval
- Author
-
Yinfu Feng, Xiaosong Yang, Jun Xiao, Jian J. Zhang, Yueting Zhuang, and Mingming Ji
- Subjects
business.industry ,Computer science ,Cognitive Neuroscience ,Divergence-from-randomness model ,Pattern recognition ,Semi-supervised learning ,Machine learning ,computer.software_genre ,Computer Science Applications ,Support vector machine ,Discriminative model ,Artificial Intelligence ,Feature (computer vision) ,Visual Word ,Artificial intelligence ,Representation (mathematics) ,business ,computer ,Time complexity - Abstract
Multiple visual features have been proposed and used in 3-dimensional (3D) model retrieval in recent years. Since each visual feature reflects a unique characteristic about the model, they have unequal discriminative power with respect to a specific category of 3D model, and they are complementary to each other in model representation. Thus, it would be beneficial to combine multiple visual features together in 3D model retrieval. In light of this, we propose an efficient Semi-supervised Multiple Feature Fusion (SMFF) method for view-based 3D model retrieval in this paper. Specifically, We first extract multiple visual features to describe both the local and global appearance characteristics of multiple 2D projected images that are generated from 3D models. Then, SMFF is adopted to learn a more compact and discriminative low-dimensional feature representation via multiple feature fusion using both the labeled and unlabeled 3D models. Once the low-dimensional features have been learned, many existing methods such as SVM and KNN can be used in the subsequent retrieval phase. Moreover, an out-of-sample extension of SMFF is provided to calculate the low-dimensional features for the newly added 3D models in linear time. Experiments on two public 3D model datasets demonstrate that using such a learned feature representation can significantly improve the performance of 3D model retrieval and the proposed method outperforms the other competitors.
- Published
- 2015
10. A human motion feature based on semi-supervised learning of GMM
- Author
-
Tian Qi, Jun Xiao, Xiaosong Yang, Hanzhi Zhang, Yinfu Feng, Yueting Zhuang, and Jianjun Zhang
- Subjects
Computer Networks and Communications ,Computer science ,business.industry ,Feature vector ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,Semi-supervised learning ,Mixture model ,Motion capture ,Motion (physics) ,Hardware and Architecture ,Feature (computer vision) ,Motion estimation ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,Structure from motion ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,business ,Software ,Information Systems - Abstract
Using motion capture to create naturally looking motion sequences for virtual character animation has become a standard procedure in the games and visual effects industry. With the fast growth of motion data, the task of automatically annotating new motions is gaining an importance. In this paper, we present a novel statistic feature to represent each motion according to the pre-labeled categories of key-poses. A probabilistic model is trained with semi-supervised learning of the Gaussian mixture model (GMM). Each pose in a given motion could then be described by a feature vector of a series of probabilities by GMM. A motion feature descriptor is proposed based on the statistics of all pose features. The experimental results and comparison with existing work show that our method performs more accurately and efficiently in motion retrieval and annotation.
- Published
- 2014
11. Feature Correlation Hypergraph: Exploiting High-order Potentials for Multimodal Recognition
- Author
-
Jianke Zhu, Yue Gao, Chaoqun Hong, Luming Zhang, Yinfu Feng, and Deng Cai
- Subjects
Hypergraph ,Boosting (machine learning) ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Scale-invariant feature transform ,Pattern recognition ,Machine learning ,computer.software_genre ,Computer Science Applications ,Human-Computer Interaction ,Wavelet ,Image texture ,Control and Systems Engineering ,Computer Science::Computer Vision and Pattern Recognition ,Histogram ,Artificial intelligence ,Electrical and Electronic Engineering ,Cluster analysis ,business ,computer ,Classifier (UML) ,Software ,Information Systems ,Mathematics - Abstract
In computer vision and multimedia analysis, it is common to use multiple features (or multimodal features) to represent an object. For example, to well characterize a natural scene image, we typically extract a set of visual features to represent its color, texture, and shape. However, it is challenging to integrate multimodal features optimally. Since they are usually high-order correlated, e.g., the histogram of gradient (HOG), bag of scale invariant feature transform descriptors, and wavelets are closely related because they collaboratively reflect the image texture. Nevertheless, the existing algorithms fail to capture the high-order correlation among multimodal features. To solve this problem, we present a new multimodal feature integration framework. Particularly, we first define a new measure to capture the high-order correlation among the multimodal features, which can be deemed as a direct extension of the previous binary correlation. Therefore, we construct a feature correlation hypergraph (FCH) to model the high-order relations among multimodal features. Finally, a clustering algorithm is performed on FCH to group the original multimodal features into a set of partitions. Moreover, a multiclass boosting strategy is developed to obtain a strong classifier by combining the weak classifiers learned from each partition. The experimental results on seven popular datasets show the effectiveness of our approach.
- Published
- 2014
12. A semantic feature for human motion retrieval
- Author
-
Xiaosong Yang, Yinfu Feng, Yueting Zhuang, Jianjun Zhang, Jun Xiao, and Tian Qi
- Subjects
business.industry ,Semantic feature ,Computer science ,Animation ,Mixture model ,Computer Graphics and Computer-Aided Design ,Motion capture ,Motion (physics) ,Feature (computer vision) ,Motion estimation ,Structure from motion ,Computer vision ,Artificial intelligence ,business ,Software - Abstract
With the explosive growth of motion capture data, it becomes very imperative in animation production to have an efficient search engine to retrieve motions from large motion repository. However, because of the high dimension of data space and complexity of matching methods, most of the existing approaches cannot return the result in real time. This paper proposes a high level semantic feature in a low dimensional space to represent the essential characteristic of different motion classes. On the basis of the statistic training of Gauss Mixture Model, this feature can effectively achieve motion matching on both global clip level and local frame level. Experiment results show that our approach can retrieve similar motions with rankings from large motion database in real-time and also can make motion annotation automatically on the fly. Copyright © 2013 John Wiley & Sons, Ltd.
- Published
- 2013
13. A 3D human motion refinement method based on sparse motion bases selection
- Author
-
Shuang Liu, Yinfu Feng, Zhao Wang, Jian J. Zhang, Xiaosong Yang, and Jun Xiao
- Subjects
Computer science ,business.industry ,020206 networking & telecommunications ,020207 software engineering ,02 engineering and technology ,Kinematics ,Motion capture ,Motion (physics) ,Quarter-pixel motion ,Feature (computer vision) ,Motion estimation ,0202 electrical engineering, electronic engineering, information engineering ,Structure from motion ,Computer vision ,Artificial intelligence ,business ,Computer animation ,ComputingMethodologies_COMPUTERGRAPHICS - Abstract
Motion capture (MOCAP) is an important technique that is widely used in many areas such as computer animation, film industry, physical training and so on. Even with professional MOCAP system, the missing marker problems always occur. Motion refinement is an essential preprocessing step for MOCAP data based applications. Although many existing approaches for motion refinement have been developed, it is still a challenging task due to the complexity and diversity of human motion. A data driven based motion refinement method is proposed in this paper, which modifies the traditional sparse coding process for special task of motion recovery from missing parts. Meanwhile, the objective function is derived by taking both statistical and kinematical property of motion data into account. Poselet model and moving window grouping are applied in the proposed method to achieve a fine-grained feature representation, which preserves the embedded spatial-temporal kinematic information. 5 motion dictionaries are learnt for each kind of poselet from training data in parallel. The motion refine problem is finally solved as an e1-minimization problem. Compared with several state-of-art motion refine methods, the experimental result shows that our approach outperforms the competitors.
- Published
- 2016
14. Adaptive multi-view feature selection for human motion retrieval
- Author
-
Xiaosong Yang, Yinfu Feng, Tian Qi, Zhao Wang, and Jian J. Zhang
- Subjects
Optimization problem ,Feature selection ,02 engineering and technology ,External Data Representation ,Motion (physics) ,Human motion retrieval ,Discriminative model ,Data retrieval ,0202 electrical engineering, electronic engineering, information engineering ,Multi-view learning ,Electrical and Electronic Engineering ,Representation (mathematics) ,Mathematics ,business.industry ,020207 software engineering ,Pattern recognition ,Trace ratio minimization problem ,Control and Systems Engineering ,Feature (computer vision) ,Signal Processing ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Software - Abstract
Human motion retrieval plays an important role in many motion data based applications. In the past, many researchers tended to use a single type of visual feature as data representation. Because different visual feature describes different aspects about motion data, and they have dissimilar discriminative power with respect to one particular class of human motion, it led to poor retrieval performance. Thus, it would be beneficial to combine multiple visual features together for motion data representation. In this article, we present an Adaptive Multi-view Feature Selection (AMFS) method for human motion retrieval. Specifically, we first use a local linear regression model to automatically learn multiple view-based Laplacian graphs for preserving the local geometric structure of motion data. Then, these graphs are combined together with a non-negative view-weight vector to exploit the complementary information between different features. Finally, in order to discard the redundant and irrelevant feature components from the original high-dimensional feature representation, we formulate the objective function of AMFS as a general trace ratio optimization problem, and design an effective algorithm to solve the corresponding optimization problem. Extensive experiments on two public human motion database, i.e., HDM05 and MSR Action3D, demonstrate the effectiveness of the proposed AMFS over the state-of-art methods for motion data retrieval. The scalability with large motion dataset, and insensitivity with the algorithm parameters, make our method can be widely used in real-world applications. Display Omitted An Adaptive Multi-view Feature Selection (AMFS) algorithm is proposed to fuse multiple features formotion data retrieval.The local regression model isused to learn a datum-adaptive graph for each feature to preserve local structure information.The selection matrix is learnt from all local graphs by exploiting the complementary information between different features.An efficient iterative optimization approach is designed to solve objective function represented in trace ratio form.
- Published
- 2016
15. Predicting missing markers in human motion capture using l1-sparse representation
- Author
-
Jun Xiao, Yinfu Feng, and Wenyuan Hu
- Subjects
Current (mathematics) ,Computer science ,business.industry ,Perspective (graphical) ,Representation (systemics) ,Contrast (statistics) ,Observable ,Pattern recognition ,Sparse approximation ,Missing data ,Machine learning ,computer.software_genre ,Computer Graphics and Computer-Aided Design ,Artificial intelligence ,Linear combination ,business ,computer ,Software - Abstract
Missing marker problem is very common in human motion capture. In contrast to most current methods which handle this problem based on trying to learn a reliable predictor from the observations, we consider it from the perspective of sparse representation and propose a novel method which is named l1-sparse representation of missing markers prediction (L1-SRMMP). We assume that the incomplete pose can be represented by a linear combination of a few poses from the training set and the representation is sparse. Therefore, we cast the predicting missing markers as finding a sparse representation of the observable data of the incomplete pose, and then we use it to predict the missing data. In order to get a sparse representation, we employ l1-norm in our objective function. Moreover, we propose presentation coefficient weighted update (PCWU) algorithm to mitigate the limited capacity problem of the training set. Experimental results demonstrate the effectiveness and efficiency of our method to predict the missing markers in human motion capture. Copyright © 2011 John Wiley & Sons, Ltd.
- Published
- 2011
16. The R1947X mutation of NF1 causing autosomal dominant neurofibromatosis type 1 in a Chinese family
- Author
-
Xiaoying Yang, Qinbo Yang, Qing Wang, Yinfu Feng, Changzheng Huang, and Mugen Liu
- Subjects
Male ,congenital, hereditary, and neonatal diseases and abnormalities ,Neurofibromatosis 1 ,Tumor suppressor gene ,Genetic Linkage ,DNA Mutational Analysis ,Asian People ,Genetic linkage ,Genes, Neurofibromatosis 1 ,Genetics ,medicine ,Humans ,Coding region ,Neurofibromatosis ,neoplasms ,Molecular Biology ,Gene ,Genes, Dominant ,Base Sequence ,biology ,medicine.disease ,Neurofibromin 1 ,Penetrance ,Pedigree ,nervous system diseases ,Mutation ,Mutation (genetic algorithm) ,biology.protein ,Female ,Chromosomes, Human, Pair 17 - Abstract
Neurofibromatosis type 1 is a common autosomal dominant disorder with a high rate of penetrance. It is caused by the mutation of the tumor suppressor gene NF1, which encodes neurofibromin. The main function of neurofibromin is down-regulating the biological activity of the proto-oncoprotein Ras by acting as a Ras-specific GTPase activating protein. In this study, we identified a Chinese family affected with neurofibromatosis type 1. The known gene NF1 associated with NF1 was studied by linkage analysis and by direct sequencing of the entire coding region and exon-intron boundaries of the NF1 gene. The R1947X mutation of NF1 was identified, which was co-segregated with affected individuals in the Chinese family, but not present in unaffected family members. This is the first report, which states that the R1947X mutation of NF1 may be one of reasons for neurofibromatosis type 1 in Chinese population.
- Published
- 2008
17. Sparse motion bases selection for human motion denoising
- Author
-
Mingming Ji, Jian J. Zhang, Jun Xiao, Yinfu Feng, Yueting Zhuang, and Xiaosong Yang
- Subjects
business.industry ,Noise reduction ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Data-driven ,Motion (physics) ,Poselet model ,Quarter-pixel motion ,Human motion denoising ,symbols.namesake ,Gaussian noise ,Control and Systems Engineering ,Motion estimation ,ℓ1-minimization ,Signal Processing ,symbols ,Structure from motion ,Computer vision ,Artificial intelligence ,Noise (video) ,Data pre-processing ,Computer Vision and Pattern Recognition ,Electrical and Electronic Engineering ,business ,Software ,Mathematics - Abstract
Human motion denoising is an indispensable step of data preprocessing for many motion data based applications. In this paper, we propose a data-driven based human motion denoising method that sparsely selects the most correlated subset of motion bases for clean motion reconstruction. Meanwhile, it takes the statistic property of two common noises, i.e., Gaussian noise and outliers, into account in deriving the objective functions. In particular, our method firstly divides each human pose into five partitions termed as poselets to gain a much fine-grained pose representation. Then, these poselets are reorganized into multiple overlapped poselet groups using a lagged window moving across the entire motion sequence to preserve the embedded spatial-temporal motion patterns. Afterward, five compacted and representative motion dictionaries are constructed in parallel by means of fast K-SVD in the training phase; they are used to remove the noise and outliers from noisy motion sequences in the testing phase by solving ?1-minimization problems. Extensive experiments show that our method outperforms its competitors. More importantly, compared with other data-driven based method, our method does not need to specifically choose the training data, it can be more easily applied to real-world applications. HighlightsA fine-grained pose representation model is proposed to boost the performance.We present a data-driven based motion denoising method by solving ?1-minimization problems.The proposed model selects the most correlated motion bases for motion reconstruction.Our method does not need to choose the training data and can be implemented in parallel mode.
- Published
- 2015
18. Mining Spatial-Temporal Patterns and Structural Sparsity for Human Motion Data Denoising
- Author
-
Jun Xiao, Mingming Ji, Xuelong Li, Xiaosong Yang, Jian J. Zhang, Yinfu Feng, and Yueting Zhuang
- Subjects
Computer science ,Noise reduction ,Movement ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Motion capture ,Pattern Recognition, Automated ,Machine Learning ,Robustness (computer science) ,Image Processing, Computer-Assisted ,Structure from motion ,Data Mining ,Humans ,Computer vision ,Human Activities ,Electrical and Electronic Engineering ,Computer animation ,Noise measurement ,business.industry ,Pattern recognition ,Signal Processing, Computer-Assisted ,Computer Science Applications ,Human-Computer Interaction ,Control and Systems Engineering ,Outlier ,Artificial intelligence ,business ,Software ,Algorithms ,Information Systems - Abstract
Motion capture is an important technique with a wide range of applications in areas such as computer vision, computer animation, film production, and medical rehabilitation. Even with the professional motion capture systems, the acquired raw data mostly contain inevitable noises and outliers. To denoise the data, numerous methods have been developed, while this problem still remains a challenge due to the high complexity of human motion and the diversity of real-life situations. In this paper, we propose a data-driven-based robust human motion denoising approach by mining the spatial-temporal patterns and the structural sparsity embedded in motion data. We first replace the regularly used entire pose model with a much fine-grained partlet model as feature representation to exploit the abundant local body part posture and movement similarities. Then, a robust dictionary learning algorithm is proposed to learn multiple compact and representative motion dictionaries from the training data in parallel. Finally, we reformulate the human motion denoising problem as a robust structured sparse coding problem in which both the noise distribution information and the temporal smoothness property of human motion have been jointly taken into account. Compared with several state-of-the-art motion denoising methods on both the synthetic and real noisy motion data, our method consistently yields better performance than its counterparts. The outputs of our approach are much more stable than that of the others. In addition, it is much easier to setup the training dataset of our method than that of the other data-driven-based methods.
- Published
- 2015
19. Real-time motion data annotation via action string
- Author
-
Jun Xiao, Xiaosong Yang, Hanzhi Zhang, Yinfu Feng, Tian Qi, Jianjun Zhang, and Yueting Zhuang
- Subjects
Computer science ,business.industry ,String (computer science) ,Probabilistic logic ,Pattern recognition ,String searching algorithm ,Mixture model ,Computer Graphics and Computer-Aided Design ,Motion capture ,Feature model ,Motion (physics) ,Computer vision ,Artificial intelligence ,String metric ,business ,Software - Abstract
Even though there is an explosive growth of motion capture data, there is still a lack of efficient and reliable methods to automatically annotate all the motions in a database. Moreover, because of the popularity of mocap devices in home entertainment systems, real-time human motion annotation or recognition becomes more and more imperative. This paper presents a new motion annotation method that achieves both the aforementioned two targets at the same time. It uses a probabilistic pose feature based on the Gaussian Mixture Model to represent each pose. After training a clustered pose feature model, a motion clip could be represented as an action string. Then, a dynamic programming-based string matching method is introduced to compare the differences between action strings. Finally, in order to achieve the real-time target, we construct a hierarchical action string structure to quickly label each given action string. The experimental results demonstrate the efficacy and efficiency of our method. Copyright © 2014 John Wiley & Sons, Ltd.
- Published
- 2014
20. Human motion retrieval based on freehand sketch
- Author
-
Yinfu Feng, Zhangpeng Tang, Jian Zhang, Jun Xiao, and Xiaosong Yang
- Subjects
Sketch recognition ,Computer science ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Computer Graphics and Computer-Aided Design ,Sketch ,Tree (data structure) ,Sketch-based modeling ,Feature (computer vision) ,Computer graphics (images) ,Graph (abstract data type) ,Computer vision ,Artificial intelligence ,User interface ,business ,Software ,Computer animation ,ComputingMethodologies_COMPUTERGRAPHICS - Abstract
In this paper, we present an integrated framework of human motion retrieval based on freehand sketch. With some simple rules, the user can acquire a desired motion by sketching several key postures. To retrieve efficiently and accurately by sketch, the 3D postures are projected onto several 2D planes. The limb direction feature is proposed to represent the input sketch and the projected-postures. Furthermore, a novel index structure based on k-d tree is constructed to index the motions in the database, which speeds up the retrieval process. With our posture-by-posture retrieval algorithm, a continuous motion can be got directly or generated by using a pre-computed graph structure. What's more, our system provides an intuitive user interface. The experimental results demonstrate the effectiveness of our method. Copyright © 2014 John Wiley & Sons, Ltd.
- Published
- 2014
21. Adaptive Unsupervised Multi-view Feature Selection for Visual Concept Recognition
- Author
-
Jun Xiao, Yinfu Feng, Yueting Zhuang, and Xiaoming Liu
- Subjects
Computer science ,business.industry ,Iterative method ,Data cluster ,Cognitive neuroscience of visual object recognition ,Pattern recognition ,Feature selection ,Machine learning ,computer.software_genre ,Non-negative matrix factorization ,Discriminative model ,Unsupervised learning ,Leverage (statistics) ,Artificial intelligence ,business ,computer - Abstract
To reveal and leverage the correlated and complemental information between different views, a great amount of multi-view learning algorithms have been proposed in recent years. However, unsupervised feature selection in multi-view learning is still a challenge due to lack of data labels that could be utilized to select the discriminative features. Moreover, most of the traditional feature selection methods are developed for the single-view data, and are not directly applicable to the multi-view data. Therefore, we propose an unsupervised learning method called Adaptive Unsupervised Multi-view Feature Selection (AUMFS) in this paper. AUMFS attempts to jointly utilize three kinds of vital information, i.e., data cluster structure, data similarity and the correlations between different views, contained in the original data together for feature selection. To achieve this goal, a robust sparse regression model with the l2,1-norm penalty is introduced to predict data cluster labels, and at the same time, multiple view-dependent visual similar graphs are constructed to flexibly model the visual similarity in each view. Then, AUMFS integrates data cluster labels prediction and adaptive multi-view visual similar graph learning into a unified framework. To solve the objective function of AUMFS, a simple yet efficient iterative method is proposed. We apply AUMFS to three visual concept recognition applications (i.e., social image concept recognition, object recognition and video-based human action recognition) on four benchmark datasets. Experimental results show the proposed method significantly outperforms several state-of-the-art feature selection methods. More importantly, our method is not very sensitive to the parameters and the optimization method converges very fast.
- Published
- 2013
22. Active learning for social image retrieval using Locally Regressive Optimal Design
- Author
-
Hong Zhang, Yi Yang, Jun Xiao, Yinfu Feng, and Zheng-Jun Zha
- Subjects
Optimal design ,Computer science ,business.industry ,Active learning (machine learning) ,Cognitive Neuroscience ,Local regression ,Relevance feedback ,Function (mathematics) ,Construct (python library) ,computer.software_genre ,Machine learning ,Computer Science Applications ,Artificial Intelligence ,Point (geometry) ,Artificial Intelligence & Image Processing ,Artificial intelligence ,Data mining ,business ,computer - Abstract
In this paper, we propose a novel active learning algorithm, called Locally Regressive Optimal Design (LROD), to improve the effectiveness of relevance feedback-based social image retrieval. Our algorithm assumes that for each data point, the label values of both this data point and its neighbors can be well estimated using a locally regressive function. Specifically, we adopt a local linear regression model to predict the label value of each data point in a local patch. The regularized local model predication error of the local patch is defined as our local loss function. Then, a unified objective function is proposed to minimize the summation of these local loss functions over all the data points, so that an optimal predicated label value can be assigned to each data point. Finally, we embed it into a semi-supervised learning framework to construct the final objective function. Experiment results on MSRA-MM2.0 database demonstrate the efficiency and effectiveness of the proposed algorithm for relevance feedback-based social image retrieval. © 2012 Elsevier B.V..
- Published
- 2012
23. Sketch-based human motion retrieval via selected 2D geometric posture descriptor
- Author
-
Zhidong Xiao, Jun Xiao, Yinfu Feng, and Zhangpeng Tang
- Subjects
Motion retrieval ,Similarity (geometry) ,Computer science ,Computer animation ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Feature selection ,Sketch-based ,Motion (physics) ,Discriminative model ,Computer vision ,Electrical and Electronic Engineering ,Representation (mathematics) ,ComputingMethodologies_COMPUTERGRAPHICS ,business.industry ,Pattern recognition ,Sketch ,Feature (computer vision) ,Control and Systems Engineering ,Signal Processing ,Artificial intelligence ,Computer Vision and Pattern Recognition ,business ,Laplace operator ,Software - Abstract
Sketch-based human motion retrieval is a hot topic in computer animation in recent years. In this paper, we present a novel sketch-based human motion retrieval method via selected 2-dimensional (2D) Geometric Posture Descriptor (2GPD). Specially, we firstly propose a rich 2D pose feature call 2D Geometric Posture Descriptor (2GPD), which is effective in encoding the 2D posture similarity by exploiting the geometric relationships among different human body parts. Since the original 2GPD is of high dimension and redundant, a semi-supervised feature selection algorithm derived from Laplacian Score is then adopted to select the most discriminative feature component of 2GPD as feature representation, and we call it as selected 2GPD. Finally, a posture-by-posture motion retrieval algorithm is used to retrieve a motion sequence by sketching several key postures. Experimental results on CMU human motion database demonstrate the effectiveness of our proposed approach.
- Full Text
- View/download PDF
24. Exploiting temporal stability and low-rank structure for motion capture data refinement
- Author
-
Jian J. Zhang, Xiaosong Yang, Yueting Zhuang, Rong Song, Jun Xiao, and Yinfu Feng
- Subjects
Data refinement ,Matrix completion ,Information Systems and Management ,Rank (linear algebra) ,Property (programming) ,Computer science ,Stability (learning theory) ,computer.software_genre ,Missing data ,Motion capture ,Computer Science Applications ,Theoretical Computer Science ,Set (abstract data type) ,Motion capture data ,Artificial Intelligence ,Control and Systems Engineering ,Temporal stability ,Data mining ,Noise (video) ,computer ,Software - Abstract
Inspired by the development of the matrix completion theories and algorithms, a low-rank based motion capture (mocap) data refinement method has been developed, which has achieved encouraging results. However, it does not guarantee a stable outcome if we only consider the low-rank property of the motion data. To solve this problem, we propose to exploit the temporal stability of human motion and convert the mocap data refinement problem into a robust matrix completion problem, where both the low-rank structure and temporal stability properties of the mocap data as well as the noise effect are considered. An efficient optimization method derived from the augmented Lagrange multiplier algorithm is presented to solve the proposed model. Besides, a trust data detection method is also introduced to improve the degree of automation for processing the entire set of the data and boost the performance. Extensive experiments and comparisons with other methods demonstrate the effectiveness of our approaches on both predicting missing data and de-noising. © 2014 Elsevier Inc. All rights reserved.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.