25 results on '"Nie, Liqiang"'
Search Results
2. Source-free Style-diversity Adversarial Domain Adaptation with Privacy-preservation for person re-identification
- Author
-
Qu, Xiaofeng, Liu, Li, Zhu, Lei, Nie, Liqiang, and Zhang, Huaxiang
- Published
- 2024
- Full Text
- View/download PDF
3. Multi-level adversarial attention cross-modal hashing
- Author
-
Wang, Benhui, Zhang, Huaxiang, Zhu, Lei, Nie, Liqiang, and Liu, Li
- Published
- 2023
- Full Text
- View/download PDF
4. Human activity recognition by manifold regularization based dynamic graph convolutional networks
- Author
-
Liu, Weifeng, Fu, Sichao, Zhou, Yicong, Zha, Zheng-Jun, and Nie, Liqiang
- Published
- 2021
- Full Text
- View/download PDF
5. Hashtag our stories: Hashtag recommendation for micro-videos via harnessing multiple modalities
- Author
-
Cao, Da, Miao, Lianhai, Rong, Huigui, Qin, Zheng, and Nie, Liqiang
- Published
- 2020
- Full Text
- View/download PDF
6. Cross-modal recipe retrieval via parallel- and cross-attention networks learning
- Author
-
Cao, Da, Chu, Jingjing, Zhu, Ningbo, and Nie, Liqiang
- Published
- 2020
- Full Text
- View/download PDF
7. HpLapGCN: Hypergraph p-Laplacian graph convolutional networks
- Author
-
Fu, Sichao, Liu, Weifeng, Zhou, Yicong, and Nie, Liqiang
- Published
- 2019
- Full Text
- View/download PDF
8. Low-rank regularized tensor discriminant representation for image set classification
- Author
-
Jing, Peiguang, Su, Yuting, Li, Zhengnan, Liu, Jing, and Nie, Liqiang
- Published
- 2019
- Full Text
- View/download PDF
9. Guest editorial: Image/video understanding and analysis
- Author
-
Chang, Xiaojun, Liang, Xiaodan, Yan, Yan, and Nie, Liqiang
- Published
- 2020
- Full Text
- View/download PDF
10. From action to activity: Sensor-based activity recognition.
- Author
-
Liu, Ye, Nie, Liqiang, Liu, Li, and Rosenblum, David S.
- Subjects
- *
HUMAN activity recognition , *FEATURE extraction , *DATA mining , *PATTERN recognition systems , *DATA analysis - Abstract
As compared to actions, activities are much more complex, but semantically they are more representative of a human׳s real life. Techniques for action recognition from sensor-generated data are mature. However, few efforts have targeted sensor-based activity recognition. In this paper, we present an efficient algorithm to identify temporal patterns among actions and utilize the identified patterns to represent activities for automated recognition. Experiments on a real-world dataset demonstrated that our approach is able to recognize activities with high accuracy from temporal patterns, and that temporal patterns can be used effectively as a mid-level feature for activity representation. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
11. Version-sensitive mobile App recommendation.
- Author
-
Cao, Da, Nie, Liqiang, He, Xiangnan, Wei, Xiaochi, Shen, Jialie, Wu, Shunxiang, and Chua, Tat-Seng
- Subjects
- *
MOBILE apps , *REAL-time computing , *DATA analysis , *RECOMMENDER systems , *MATHEMATICAL models - Abstract
Being part and parcel of the daily life for billions of people all over the globe, the domain of mobile Applications (Apps) is the fastest growing sector of mobile market today. Users, however, are frequently overwhelmed by the vast number of released Apps and frequently updated versions. Towards this end, we propose a novel version-sensitive mobile App recommendation framework. It is able to recommend appropriate Apps to right users by jointly exploring the version progression and dual-heterogeneous data. It is helpful for alleviating the data sparsity problem caused by version division. As a byproduct, it can be utilized to solve the in-matrix and out-of-matrix cold-start problems. Considering the progression of versions within the same categories, the performance of our proposed framework can be further improved. It is worth emphasizing that our proposed version progression modeling can work as a plug-in component to be embedded into most of the existing latent factor-based algorithms. To support the online learning, we design an incremental update strategy for the framework to adapt the dynamic data in real-time. Extensive experiments on a real-world dataset have demonstrated the promising performance of our proposed approach with both offline and online protocols. Relevant data, code, and parameter settings are available at http://apprec.wixsite.com/version . [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
12. Event graph based contradiction recognition from big data collection.
- Author
-
Liu, Maofu, Wang, Limin, Nie, Liqiang, Dai, Jianhua, and Ji, Donghong
- Subjects
- *
GRAPH theory , *BIG data , *SEMANTICS , *STATISTICAL hypothesis testing , *DATA analysis - Abstract
Conventional models relying on similarity utilizing low-level surface statistical, syntactic and lexical semantic features are suboptimal in contradiction recognition, especially for the large-scale data, such as the sensor and traffic data. To tackle this problem, this work treats the text and hypothesis pair as event graph and proposes a novel model based on event graph to recognize contradiction in big data collection. The proposed model is capable of seamlessly sewing the high-level event semantic features corresponding to the conflicting linguistic phenomena to identify contradictory construction. Experimental results show that our event graph based contradiction recognition model achieves significant improvement as compared to state-of-the-art competitors. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
13. Exploring heterogeneous features for query-focused summarization of categorized community answers.
- Author
-
Wei, Wei, Ming, ZhaoYan, Nie, Liqiang, Li, Guohui, Li, Jianjun, Zhu, Feida, Shang, Tianfeng, and Luo, Changyin
- Subjects
- *
ONLINE data processing , *FEATURE extraction , *QUERYING (Computer science) , *INFORMATION sharing , *WEB services - Abstract
Community-based question answering (cQA) is a popular type of online knowledge-sharing web service where users ask questions and obtain answers contributed by others. To enhance knowledge sharing, cQA also provides users with a retrieval function to access the historical question-answer pairs (QAs). However, it is still ineffective in that the retrieval result is typically a ranking list of potentially relevant QAs, rather than a succinct and informative answer. To alleviate the problem, this paper proposes a three-level scheme, which aims to generate a query-focused summary-style answer in terms of two factors, i.e., novelty and redundancy . Specifically, we first retrieve a set of QAs to the given query, and then develop a smoothed Naive Bayes model to identify the topics of answers, by exploiting their associated category information. Next, to compute the global ranking scores of answers, we first propose a parameterized graph-based method to model a Markov random walk on a graph that is parameterized by the heterogeneous features of answers, and then combine the ranking scores with the relevance scores of answers. Based on the computed global ranking scores, we utilize two different strategies to construct top- K candidate answer set, and finally solve a constrained optimization problem on the sentence set of top- K answers to generate a summary towards a user’s query. Experiments on real-world data demonstrate the effectiveness of our proposed approach as compared to the baselines. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
14. Reconstruction regularized low-rank subspace learning for cross-modal retrieval.
- Author
-
Wu, Jianlong, Xie, Xingxu, Nie, Liqiang, Lin, Zhouchen, and Zha, Hongbin
- Subjects
- *
COMPUTATIONAL complexity , *LATENT semantic analysis - Abstract
• The novel reconstruction regularization term can preserve the essential information. • Low-rank constraint can well explore the correlation among samples. • An efficient algorithm is presented to optimize the problem with convergence guarantee. • RRLSL can be applied to both supervised and unsupervised situations. • Superior performance is achieved with lowest computational complexity. With the rapid increase of multi-modal data through the internet, cross-modal matching or retrieval has received much attention recently. It aims to use one type of data as query and retrieve results from the database of another type. For this task, the most popular approach is the latent subspace learning, which learns a shared subspace for multi-modal data, so that we can efficiently measure cross-modal similarity. Instead of adopting traditional regularization terms, we hope that the latent representation could recover the multi-modal information, which works as a reconstruction regularization term. Besides, we assume that different view features for samples of the same category share the same representation in the latent space. Since the number of classes is generally smaller than the number of samples and the feature dimension, therefore the latent feature matrix of training instances should be low-rank. We try to learn the optimal latent representation, and propose a reconstruction based term to recover original multi-modal data and a low-rank term to regularize the learning of subspace. Our method can deal with both supervised and unsupervised cross-modal retrieval tasks. For those situations where the semantic labels are not easy to obtain, our proposed method can also work very well. We propose an efficient algorithm to optimize our framework. To evaluate the performance of our method, we conduct extensive experiments on various datasets. The experimental results show that our proposed method is very efficient and outperforms the state-of-the-art subspace learning approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
15. Multimodal matching-aware co-attention networks with mutual knowledge distillation for fake news detection.
- Author
-
Hu, Linmei, Zhao, Ziwang, Qi, Weijian, Song, Xuemeng, and Nie, Liqiang
- Subjects
- *
FAKE news , *IMAGE registration - Abstract
Fake news often involves multimedia information such as text and image to mislead readers, proliferating and expanding its influence. Most existing fake news detection methods apply the co-attention mechanism to fuse multimodal features while ignoring the consistency of image and text in co-attention. In this paper, we propose multimodal matching-aware co-attention networks with mutual knowledge distillation for improving fake news detection. Specifically, we design an image-text matching-aware co-attention mechanism which captures the alignment of image and text for better multimodal fusion. The image-text matching representation can be obtained via a vision-language pre-trained model. Additionally, based on the designed image-text matching-aware co-attention mechanism, we propose to build two co-attention networks respectively centered on text and image for mutual knowledge distillation to improve fake news detection. Extensive experiments on three benchmark datasets demonstrate that our proposed model outperforms existing methods on multimodal fake news detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Cross-modal dual subspace learning with adversarial network.
- Author
-
Shang, Fei, Zhang, Huaxiang, Sun, Jiande, Nie, Liqiang, and Liu, Li
- Subjects
- *
QUADRUPLETS , *STATISTICAL sampling , *MULTIMODAL user interfaces , *INTRA-aortic balloon counterpulsation , *BISTATIC radar - Abstract
Cross-modal retrieval has recently attracted much interest along with the rapid development of multimodal data, and effectively utilizing the complementary relationship of different modal data and eliminating the heterogeneous gap as much as possible are the two key challenges. In this paper, we present a novel network model termed cross-modal Dual Subspace learning with Adversarial Network (DSAN). The main contributions are as follows: (1) Dual subspaces (visual subspace and textual subspace) are proposed, which can better mine the underlying structure information of different modalities as well as modality-specific information. (2) An improved quadruplet loss is proposed, which takes into account the relative distance and absolute distance between positive and negative samples, together with the introduction of the idea of hard sample mining. (3) Intra-modal constrained loss is proposed to maximize the distance of the most similar cross-modal negative samples and their corresponding cross-modal positive samples. In particular, feature preserving and modality classification act as two antagonists. DSAN tries to narrow the heterogeneous gap between different modalities, and distinguish the original modality of random samples in dual subspaces. Comprehensive experimental results demonstrate that, DSAN significantly outperforms 9 state-of-the-art methods on four cross-modal datasets. • Dual parallel subspaces are proposed, which can better mine the underlying structure information of different modalities as well as modality-specific information. • An improved quadruplet loss is proposed, which integrates relative distance, absolute distance and hard sample mining. On the one hand, it pushes forward the boundaries of positive and negative samples to a certain extent. The introduction of the idea of hard sample mining reduces the complexity of the model and further improves its performance. • An Intra-modal constrained loss is proposed to maximize the distance of the most similar cross-modal negative sample and its corresponding cross-modal positive samples. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
17. Learning robust affinity graph representation for multi-view clustering.
- Author
-
Jing, Peiguang, Su, Yuting, Li, Zhengnan, and Nie, Liqiang
- Subjects
- *
REPRESENTATIONS of graphs , *LAPLACIAN matrices , *GRASSMANN manifolds , *ALGORITHMS , *FEATURE selection , *WATERMARKS - Abstract
Recently, an increasingly pervasive trend in real-word applications is that the data are collected from multiple sources or represented by multiple views. Owing to the powerful ability of affinity graph in capturing the structural relationships among samples, constructing a robust and meaningful affinity graph has been extensively studied, especially in spectral clustering tasks. However, conventional spectral clustering extended to multi-view scenarios cannot obtain the satisfactory performance due to the presence of noise and the heterogeneity among different views. In this paper, we propose a robust affinity graph learning framework to deal with this issue. First, we employ an improved feature selection algorithm that integrates the advantages of hypergraph embedding and sparse regression to select significant features such that more robust graph Laplacian matrices for various views on this basis can be constructed. Second, we model hypergraph Laplacians as points on a Grassmann manifold and propose a Consistent Affinity Graph Learning (CAGL) algorithm to fuse all views. CAGL aims to learn a latent common affinity matrix shared by all Laplacian matrices by taking both the clustering quality evaluation criterion and the view consistency loss into account. We also develop an alternating descent algorithm to optimize the objective function of CAGL. Experiments on five publicly available datasets demonstrate that our proposed method obtains promising results compared with state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
18. HesGCN: Hessian graph convolutional networks for semi-supervised classification.
- Author
-
Fu, Sichao, Liu, Weifeng, Tao, Dapeng, Zhou, Yicong, and Nie, Liqiang
- Subjects
- *
ARTIFICIAL neural networks , *HESSIAN matrices , *MATHEMATICAL regularization , *DATA distribution - Abstract
Manifold or local geometry of samples have been recognized as a powerful tool in machine learning areas, especially in the graph-based semi-supervised learning (GSSL) problems. Over recent decades, plenty of manifold assumption-based SSL algorithms (MSSL) have been proposed including graph embedding and graph regularization models, where the objective is to utilize the local geometry of data distributions. One of most representative MSSL approaches is graph convolutional networks (GCN), which effectively generalizes the convolutional neural networks to deal with the graphs with the arbitrary structures by constructing and fusing the Laplacian-based structure information. However, the null space of the Laplacian remains unchanged along the underlying manifold, it causes the poor extrapolating ability of the model. In this paper, we introduce a variant of GCN, i.e. Hessian graph convolutional networks (HesGCN). In particularly, we get a more efficient convolution layer rule by optimizing the one-order spectral graph Hessian convolutions. In addition, the spectral graph Hessian convolutions is a combination of the Hessian matrix and the spectral graph convolutions. Hessian gets a richer null space by the existence of its two-order derivatives, which can describe the intrinsic local geometry structure of data accurately. Thus, HesGCN can learn more efficient data features by fusing the original feature information with its structure information based on Hessian. We conduct abundant experiments on four public datasets. Extensive experiment results validate the superiority of our proposed HesGCN compared with many state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
19. Multi-criteria active deep learning for image classification.
- Author
-
Yuan, Jin, Hou, Xingxing, Xiao, Yaoqiang, Cao, Da, Guan, Weili, and Nie, Liqiang
- Subjects
- *
DEEP learning , *CONVOLUTIONAL neural networks , *CLASSIFICATION , *IMAGE - Abstract
Abstract As a robust and heuristic technique in machine learning, active learning has been established as an effective method for addressing large volumes of unlabeled data; it interactively queries users (or certain information sources) to obtain desired outputs at new data points. With regard to deep learning techniques (e.g., CNN) and their applications (e.g., image classification), labeling work is of great significance as training processes for obtaining parameters in neural networks which requires abundant labeled samples. Although a few active learning algorithms have been proposed for devising certain straightforward sampling strategies (e.g., density, similarity, uncertainty, and label-based measure) for deep learning algorithms, these employ onefold sampling strategies and do not consider the relationship among multiple sampling strategies. To this end, we devised a novel solution " m ulti- c riteria a ctive l eep l earning"(MCADL) to learn an active learning strategy for deep neural networks in image classification. Our sample selection strategy selects informative samples by considering multiple criteria simultaneously (i.e., density, similarity, uncertainty, and label-based measure). Moreover, our approach is capable of adjusting weights adaptively to fuse the results from multiple criteria effectively by exploring the utilities of the criteria at different training stages. Through extensive experiments on two popular image datasets (i.e., MNIST and CIFAR-10), we demonstrate that our proposed method consistently outperforms highly competitive active learning approaches; thereby, it can be verified that our multi-criteria active learning proposal is rational and our solution is effective. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
20. Multi-view face hallucination using SVD and a mapping model.
- Author
-
Jian, Muwei, Cui, Chaoran, Nie, Xiushan, Zhang, Huaxiang, Nie, Liqiang, and Yin, Yilong
- Subjects
- *
HALLUCINATIONS , *SINGULAR value decomposition , *CARTOGRAPHY , *RECONSTRUCTION (Graph theory) , *MATRICES (Mathematics) - Abstract
Abstract Multi-view face hallucination (MFH) presents a challenge issue in face recognition domain. In this paper, an efficient method based on singular value decomposition (SVD) and a mapping model is proposed for multi-view face hallucination. Based on an approximately same linear mapping relationship across different views, two corresponding matrices obtained from the SVD of the low resolution (LR) image for the high-resolution (HR) multi-view face images can be constructed via the mapping model using global reconstruction. Experiments show that our proposed multi-view face-hallucination scheme is effective and produces promising super-resolved results. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
21. Identifying advisor-advisee relationships from co-author networks via a novel deep model.
- Author
-
Liu, Wenqiang, Zhao, Zhongying, Zhang, Yong, Qian, Yuhua, Nie, Liqiang, and Yin, Yilong
- Subjects
- *
AUTHORSHIP collaboration , *SOCIAL network analysis , *RELATIONSHIP marketing , *SCIENTIFIC community , *SCHOLARLY periodicals - Abstract
Advisor-advisee is one of the most important relationships in research publication networks. Identifying it can benefit many interesting applications, such as double-blind peer review, academic circle mining, and scientific community analysis. However, the advisor-advisee relationships are often hidden in research publication network and vary over time, thus are difficult to detect. In this paper, we present a time-aware Advisor-advisee Relationship Mining Model (tARMM) to better identify such relationships. It is a deep model equipped with improved Refresh Gate Recurrent Units (RGRU). Extensive experiments over real-world DBLP data have well verified the effectiveness of our proposed model. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
22. A classification model for semantic entailment recognition with feature combination.
- Author
-
Liu, Maofu, Zhang, Luming, Hu, Huijun, Nie, Liqiang, and Dai, Jianhua
- Subjects
- *
SEMANTIC computing , *TEXT recognition , *MULTIMEDIA systems - Abstract
Recent years have witnessed the fast development of multimedia platforms in China, such as Youku, LeTV and Weibo. Images and videos are usually uploaded with textual descriptions, such as titles and introductions of these media. These texts are the key to multimedia content understanding, and this paper is dedicated to multimedia understanding with visual content entailment via recognizing semantic entailment in these texts. In fact, the natural language processing community has been manifesting increasing interest in semantic entailment recognition in English texts. Yet, so far not much attention has been paid to semantic entailment recognition in Chinese texts. Therefore, this paper investigates on multimedia semantic entailment with Chinese texts. Recognizing semantic entailment in Chinese texts can be cast as a classification problem. In this paper, a classification model is constructed based on support vector machine to detect high-level semantic entailment relations in Chinese text pair, including entailment and non-entailment for the Binary-Class and forward entailment, reverse entailment, bidirectional entailment, contradiction and independence for the Multi-Class. We explore different semantic feature combinations based on three kinds of Chinese textual features, including Chinese surface textual, Chinese lexical semantic and Chinese syntactic features, and utilize them to feed our classification model. The experiment results show that the accuracy of our classification model for semantic entailment recognition with the feature combination using all the three kinds of Chinese textual features achieves a much better performance than the baseline in Multi-Class and slightly better results than the baseline in the Binary-Class. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
23. Bridge the semantic gap between pop music acoustic feature and emotion: Build an interpretable model.
- Author
-
Zhang, JiangLong, Huang, XiangLin, Yang, Lifang, and Nie, Liqiang
- Subjects
- *
POPULAR music , *AUDIO acoustics , *SEMANTIC computing , *EMOTIONS , *MACHINE learning - Abstract
Music emotion recognition (MER) is an important topic in music understanding, recommendation, retrieval and human computer interaction. Great success has been achieved by machine learning methods in estimating human emotional response to music. However, few of them pay much attention in semantic interpret for emotion response. In our work, we first train an interpretable model between acoustic audio and emotion. Filter, wrapper and shrinkage methods are applied to select important features. We then apply statistical models to build and explain the emotion model. Extensive experimental results reveal that the shrinkage methods outperform the wrapper methods and the filter methods in arousal emotion. In addition, we observed that only a small set of the extracted features have the key effects to arousal. While, most of our extracted features have small contribution to valence music perception. Ultimately, we obtain a higher average accuracy rate in arousal, compared to that in valence. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
24. Surface defect classification in large-scale strip steel image collection via hybrid chromosome genetic algorithm.
- Author
-
Hu, Huijun, Liu, Ya, Liu, Maofu, and Nie, Liqiang
- Subjects
- *
STEEL strip , *SURFACE defects , *GENETIC algorithms , *IMAGE processing , *FEATURE extraction , *KERNEL functions - Abstract
In this paper, hybrid chromosome genetic algorithm is applied to establishing the real-time classification model for surface defects in a large-scale strip steel image collection. After image preprocessing, four types of visual features, comprising geometric feature, shape feature, texture feature and grayscale feature, are extracted from the defect target image and its corresponding preprocessed image. In order to use genetic algorithm to optimize classification model based on hybrid chromosome, the structure of hybrid chromosome is designed to seamlessly integrate the kernel function, visual features and model parameters. And then the chromosome and the SVM classification model will be evolved and optimized according to the genetic operations and the fitness evaluation. In the end, the final SVM classifier is established using the decoding result of the optimal chromosome. The experimental results show that our method is effective and efficient in classifying the surface defects in a large-scale strip steel image collection. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
25. On robust image spam filtering via comprehensive visual modeling.
- Author
-
Shen, Jialie, Deng, Robert H., Cheng, Zhiyong, Nie, Liqiang, and Yan, Shuicheng
- Subjects
- *
ROBUST control , *IMAGE analysis , *SPAM filtering (Email) , *INTERNET , *INFORMATION theory , *INTERNET service providers - Abstract
The Internet has brought about fundamental changes in the way peoples generate and exchange media information. Over the last decade, unsolicited message images (image spams) have become one of the most serious problems for Internet service providers (ISPs), business firms and general end users. In this paper, we report a novel system called RoBoTs ( Ro bust Bo os T rap based s pam detector) to support accurate and robust image spam filtering. The system is developed based on multiple visual properties extracted from different levels of granularity, aiming to capture more discriminative contents for effective spam image identification. In addition, a resampling based learning framework is developed to effectively integrate random forest and linear discriminative analysis (LDA) to generate comprehensive signature of spam images. It can facilitate more accurate and robust spam classification process with very limited amount of initial training examples. Using three public available test collections, the proposed system is empirically compared with the state-of-the-art techniques. Our results demonstrate its significantly higher performance from different perspectives. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.