230 results on '"Jiye Liang"'
Search Results
2. Multiple metric learning via local metric fusion
- Author
-
Xinyao Guo, Lin Li, Chuangyin Dang, Jiye Liang, and Wei Wei
- Subjects
Information Systems and Management ,Artificial Intelligence ,Control and Systems Engineering ,Software ,Computer Science Applications ,Theoretical Computer Science - Published
- 2023
3. Self-Constrained Spectral Clustering
- Author
-
Liang Bai, Jiye Liang, and Yunxiao Zhao
- Subjects
Computational Theory and Mathematics ,Artificial Intelligence ,Applied Mathematics ,Computer Vision and Pattern Recognition ,Software - Abstract
As a leading graph clustering technique, spectral clustering is one of the most widely used clustering methods to capture complex clusters in data. Some additional prior information can help it to further reduce the difference between its clustering results and users' expectations. However, it is hard to get the prior information under unsupervised scene to guide the clustering process. To solve this problem, we propose a self-constrained spectral clustering algorithm. In this algorithm, we extend the objective function of spectral clustering by adding pairwise and label self-constrained terms to it. We provide the theoretical analysis to show the roles of the self-constrained terms and the extensibility of the proposed algorithm. Based on the new objective function, we build an optimization model for self-constrained spectral clustering so that we can simultaneously learn the clustering results and constraints. Furthermore, we propose an iterative method to solve the new optimization problem. Compared to other existing versions of spectral clustering algorithms, the new algorithm can discover a high-quality cluster structure of a data set without prior information. Extensive experiments on benchmark data sets illustrate the effectiveness of the proposed algorithm.
- Published
- 2023
4. Coupling learning for feature selection in categorical data
- Author
-
Feng Wang, Jiye Liang, and Peng Song
- Subjects
Artificial Intelligence ,Computer Vision and Pattern Recognition ,Software - Published
- 2023
5. RSS-Bagging: Improving Generalization Through the Fisher Information of Training Data
- Author
-
Jieting Wang, Feijiang Li, Jue Li, Chenping Hou, Yuhua Qian, and Jiye Liang
- Subjects
Artificial Intelligence ,Computer Networks and Communications ,Software ,Computer Science Applications - Published
- 2023
6. Evaluating Classification Model Against Bayes Error Rate
- Author
-
Qingqiang Chen, Fuyuan Cao, Ying Xing, and Jiye Liang
- Subjects
Computational Theory and Mathematics ,Artificial Intelligence ,Applied Mathematics ,Computer Vision and Pattern Recognition ,Software - Published
- 2023
7. Hybrid sampling-based contrastive learning for imbalanced node classification
- Author
-
Caixia Cui, Jie Wang, Wei Wei, and Jiye Liang
- Subjects
Artificial Intelligence ,Computer Vision and Pattern Recognition ,Software - Published
- 2022
8. Hierarchical metric learning with intra-level and inter-level regularization
- Author
-
Lin Li, Ting Li, Wei Wei, Xinyao Guo, and Jiye Liang
- Subjects
Artificial Intelligence ,Computer Vision and Pattern Recognition ,Software - Published
- 2022
9. Centroids-guided deep multi-view K-means clustering
- Author
-
Jing Liu, Fuyuan Cao, and Jiye Liang
- Subjects
Information Systems and Management ,Artificial Intelligence ,Control and Systems Engineering ,Software ,Computer Science Applications ,Theoretical Computer Science - Published
- 2022
10. Clustering mixed type data: a space structure-based approach
- Author
-
Feijiang Li, Yuhua Qian, Jieting Wang, Furong Peng, and Jiye Liang
- Subjects
Artificial Intelligence ,Computer Vision and Pattern Recognition ,Software - Published
- 2022
11. Efficient Causal Structure Learning from Multiple Interventional Datasets with Unknown Targets
- Author
-
Yunxia Wang, Fuyuan Cao, Kui Yu, and Jiye Liang
- Subjects
General Medicine - Abstract
We consider the problem of reducing the false discovery rate in multiple high-dimensional interventional datasets under unknown targets. Traditional algorithms merged directly multiple causal graphs learned, which ignores the contradictions of different datasets, leading to lots of inconsistent directions of edges. For reducing the contradictory information, we propose a new algorithm, which first learns an interventional Markov equivalence class (I-MEC) before merging multiple graphs. It utilizes the full power of the constraints available in interventional data and combines ideas from local learning, intervention, and search-and-score techniques in a principled and effective way in different intervention experiments. Specifically, local learning on multiple datasets is used to build a causal skeleton. Perfect intervention destroys some possible triangles, leading to the identification of more possible V-structures. And then a theoretically correct I-MEC is learned. Search and scoring techniques based on the learned I-MEC further identify the remaining unoriented edges. Both theoretical analysis and experiments on benchmark Bayesian networks with the number of variables from 20 to 724 validate that the effectiveness of our algorithm in reducing the false discovery rate in high-dimensional interventional data.
- Published
- 2022
12. Controlling Underestimation Bias in Reinforcement Learning via Quasi-median Operation
- Author
-
Wei Wei, Yujia Zhang, Jiye Liang, Lin Li, and Yyuze Li
- Subjects
General Medicine - Abstract
How to get a good value estimation is one of the key problems in reinforcement learning (RL). Current off-policy methods, such as Maxmin Q-learning, TD3 and TADD, suffer from the underestimation problem when solving the overestimation problem. In this paper, we propose the Quasi-Median Operation, a novel way to mitigate the underestimation bias by selecting the quasi-median from multiple state-action values. Based on the quasi-median operation, we propose Quasi-Median Q-learning (QMQ) for the discrete action tasks and Quasi-Median Delayed Deep Deterministic Policy Gradient (QMD3) for the continuous action tasks. Theoretically, the underestimation bias of our method is improved while the estimation variance is significantly reduced compared to Maxmin Q-learning, TD3 and TADD. We conduct extensive experiments on the discrete and continuous action tasks, and results show that our method outperforms the state-of-the-art methods.
- Published
- 2022
13. Instance Selection: A Bayesian Decision Theory Perspective
- Author
-
Qingqiang Chen, Fuyuan Cao, Ying Xing, and Jiye Liang
- Subjects
General Medicine - Abstract
In this paper, we consider the problem of lacking theoretical foundation and low execution efficiency of the instance selection methods based on the k-nearest neighbour rule when processing large-scale data. We point out that the core idea of these methods can be explained from the perspective of Bayesian decision theory, that is, to find which instances are reducible, irreducible, and deleterious. Then, based on the percolation theory, we establish the relationship between these three types of instances and local homogeneous cluster (i.e., a set of instances with the same labels). Finally, we propose a method based on an accelerated k-means algorithm to construct local homogeneous clusters and remove the superfluous instances. The performance of our method is studied on extensive synthetic and benchmark data sets. Our proposed method can handle large-scale data more effectively than the state-of-the-art instance selection methods. All code and data results are available at https://github.com/CQQXY161120/Instance-Selection.
- Published
- 2022
14. Study on the Change in Vegetation Coverage in Desert Oasis and Its Driving Factors from 1990 to 2020 Based on Google Earth Engine
- Author
-
Xu Li, Ziyan Shi, Jun Yu, and Jiye Liang
- Subjects
Fluid Flow and Transfer Processes ,Process Chemistry and Technology ,General Engineering ,General Materials Science ,desert oasis ,vegetation coverage ,Google Earth Engine ,Hurst index ,coefficient of variation ,change detection ,Instrumentation ,Computer Science Applications - Abstract
Fractional Vegetation Cover (FVC) is an important indicator to evaluate the quality of the regional ecological environment. Alar City is a typical desert oasis region. Investigating the spatial and temporal changes in its vegetation cover at different stages is a guide to the ecological balance and sustainable green development of the Tarim River basin. Based on the Google Earth Engine (GEE) cloud platform, this study analyzed the spatial and temporal characteristics and trends of vegetation cover changes in Alar City from 1990 to 2020 using the Hurst index and coefficient of variation. The results show that the spatial distribution of vegetation in the study area in the last 30 years shows a wave-like characteristic with an overall apparent upward trend. The vegetation cover in the study area is predominantly increasing and the spatial distribution shows a phased and regional character. Compared with 1990, there is a significant increase in the area of cultivated land in 2020. Among them, the areas of vegetation growth mainly occur in the basin around the Tarim River. Human activities have weakened the influence of natural factors on FVC. The results of the study suggest that the GEE platform can be an effective tool for permanently monitoring vegetation.
- Published
- 2023
- Full Text
- View/download PDF
15. Weak multi-label learning with missing labels via instance granular discrimination
- Author
-
Anhui Tan, Xiaowan Ji, Jiye Liang, Yuzhi Tao, Wei-Zhi Wu, and Witold Pedrycz
- Subjects
Information Systems and Management ,Artificial Intelligence ,Control and Systems Engineering ,Software ,Computer Science Applications ,Theoretical Computer Science - Published
- 2022
16. A trilevel analysis of uncertainty measuresin partition-based granular computing
- Author
-
Baoli Wang, Jiye Liang, and Yiyu Yao
- Subjects
Linguistics and Language ,Artificial Intelligence ,Language and Linguistics - Published
- 2022
17. An unsupervised multi-manifold discriminant isomap algorithm based on the pairwise constraints
- Author
-
Xiaofang Gao, Jiye Liang, Wenjian Wang, Xuefei Bai, and Lina Jia
- Subjects
Artificial Intelligence ,Computer Vision and Pattern Recognition ,Software - Published
- 2022
18. Group-wise interactive region learning for zero-shot recognition
- Author
-
Ting Guo, Jiye Liang, and Guo-Sen Xie
- Subjects
Information Systems and Management ,Artificial Intelligence ,Control and Systems Engineering ,Software ,Computer Science Applications ,Theoretical Computer Science - Published
- 2023
19. A group incremental approach for feature selection on hybrid data
- Author
-
Feng Wang, Wei Wei, and Jiye Liang
- Subjects
Geometry and Topology ,Software ,Theoretical Computer Science - Published
- 2022
20. Random Deep Graph Matching
- Author
-
Yu Xie, Zhiguo Qin, Maoguo Gong, Bin Yu, and Jiye Liang
- Subjects
Computational Theory and Mathematics ,Computer Science Applications ,Information Systems - Published
- 2022
21. GUIDE: Training Deep Graph Neural Networks via Guided Dropout Over Edges
- Author
-
Jie Wang, Jianqing Liang, Jiye Liang, and Kaixuan Yao
- Subjects
Artificial Intelligence ,Computer Networks and Communications ,Software ,Computer Science Applications - Abstract
Graph neural networks (GNNs) have made great progress in graph-based semi-supervised learning (GSSL). However, most existing GNNs are confronted with the oversmoothing issue that limits their expressive ability. A key factor that leads to this problem is the excessive aggregation of information from other classes when updating the node representation. To alleviate this limitation, we propose an effective method called GUIded Dropout over Edges (GUIDE) for training deep GNNs. The core of the method is to reduce the influence of nodes from other classes by removing a certain number of inter-class edges. In GUIDE, we drop edges according to the edge strength, which is defined as the time an edge acts as a bridge along the shortest path between node pairs. We find that the stronger the edge strength, the more likely it is to be an inter-class edge. In this way, GUIDE can drop more inter-class edges and keep more intra-class edges. Therefore, nodes in the same community or class are more similar, whereas different classes are more separated in the embedded space. In addition, we perform some theoretical analysis of the proposed method, which explains why it is effective in alleviating the oversmoothing problem. To validate its rationality and effectiveness, we conduct experiments on six public benchmarks with different GNNs backbones. Experimental results demonstrate that GUIDE consistently outperforms state-of-the-art methods in both shallow and deep GNNs.
- Published
- 2022
22. Local Causal Discovery in Multiple Manipulated Datasets
- Author
-
Yunxia Wang, Fuyuan Cao, Kui Yu, and Jiye Liang
- Subjects
Artificial Intelligence ,Computer Networks and Communications ,Software ,Computer Science Applications - Abstract
We consider the problem of distinguishing direct causes from direct effects of a target variable of interest from multiple manipulated datasets with unknown manipulated variables and nonidentical data distributions. Recent studies have shown that datasets attained from manipulated experiments (i.e., manipulated data) contain richer causal information than observational data for causal structure learning. Thus, in this article, we propose a new algorithm, which makes full use of the interventional properties of a causal model to discover the direct causes and direct effects of a target variable from multiple datasets with different manipulations. It is more suited to real-world cases and is also a challenge to be addressed in this article. First, we apply the backward framework to learn parents and children (PC) of a given target from multiple manipulated datasets. Second, we orient some edges connected to the target in advance through the assumption that the target variable is not manipulated and then orient the remaining undirected edges by finding invariant V-structures from multiple datasets. Third, we analyze the correctness of the proposed algorithm. To the best of our knowledge, the proposed algorithm is the first that can identify the local causal structure of a given target from multiple manipulated datasets with unknown manipulated variables. Experimental results on standard Bayesian networks validate the effectiveness of our algorithm.
- Published
- 2022
23. Graph Neural Networks with Interlayer Feature Representation for Image Super-Resolution
- Author
-
Shenggui Tang, Kaixuan Yao, Jianqing Liang, Zhiqiang Wang, and Jiye Liang
- Published
- 2023
24. Nitrogen Preference of Dominant Species during Hailuogou Glacier Retreat Succession on the Eastern Tibetan Plateau
- Author
-
Yulin Huang, Liushan Du, Yanbao Lei, and Jiye Liang
- Subjects
ammonium ,soluble organic N ,nitrogen isotopes ,Ecology ,nitrogen uptake preference ,primary succession ,nitrate ,glacier retreat area ,Plant Science ,dominant plants ,Ecology, Evolution, Behavior and Systematics - Abstract
Plant nitrogen (N) uptake preference is a key factor affecting plant nutrient acquisition, vegetation composition and ecosystem function. However, few studies have investigated the contribution of different N sources to plant N strategies, especially during the process of primary succession of a glacial retreat area. By measuring the natural abundance of N isotopes (δ15N) of dominant plants and soil, we estimated the relative contribution of different N forms (ammonium-NH4+, nitrate-NO3− and soluble organic N-DON) and absorption preferences of nine dominant plants of three stages (12, 40 and 120 years old) of the Hailuogou glacier retreat area. Along with the chronosequence of primary succession, dominant plants preferred to absorb NO3− in the early (73.5%) and middle (46.5%) stages. At the late stage, soil NH4+ contributed more than 60.0%, In addition, the contribution of DON to the total N uptake of plants was nearly 19.4%. Thus, the dominant plants’ preference for NO3− in the first two stages changes to NH4+ in the late stages during primary succession. The contribution of DON to the N source of dominant plants should not be ignored. It suggests that the shift of N uptake preference of dominant plants may reflect the adjustment of their N acquisition strategy, in response to the changes in their physiological traits and soil nutrient conditions. Better knowledge of plant preferences for different N forms could significantly improve our understanding on the potential feedbacks of plant N acquisition strategies to environmental changes, and provide valuable suggestions for the sustainable management of plantations during different successional stages.
- Published
- 2023
- Full Text
- View/download PDF
25. Exploring the Role of Edge Distribution in Graph Convolutional Networks
- Author
-
Liancheng He, Liang Bai, Xian Yang, and Jiye Liang
- Published
- 2023
26. Fuzzy rough discrimination and label weighting for multi-label feature selection
- Author
-
Wei-Zhi Wu, Chao Chen, Lin Sun, Jia Zhang, Anhui Tan, and Jiye Liang
- Subjects
Similarity (geometry) ,Series (mathematics) ,Computer science ,business.industry ,Cognitive Neuroscience ,Pattern recognition ,Feature selection ,Fuzzy logic ,Computer Science Applications ,Weighting ,Matrix (mathematics) ,ComputingMethodologies_PATTERNRECOGNITION ,Discriminative model ,Artificial Intelligence ,Artificial intelligence ,business ,Feature learning - Abstract
Fuzzy rough set is a theoretical framework of fuzzy uncertainty management, and discernibility matrix offers a mathematical foundation for algorithm construction of feature learning. The approaches of fuzzy rough set and discernibility matrix have been successfully applied in single-label learning. However, few works have been done on investigating the foundation of fuzzy rough discernibility matrix on multi-label data. There will be two pivotal problems to be addressed when using fuzzy rough discernibility matrix for multi-label data analysis. One is how to extract sample-level and label-level correlations; and the other is how to utilize the discernibility matrix for algorithm construction. For this reason, in this paper the fuzzy rough discrimination matrix is introduced to deal with the problem of multi-label feature selection. First, the significance of labels in the label space is captured based on the label correlation. Labels with different significances contribute to different weights for measuring the similarity between samples. Hence, a sample similarity matrix in the label space is computed based on the label weighting strategy. Then, a framework of a fuzzy decision system is formalized, in which the discernibility matrix of fuzzy rough sets is introduced as a foundation to evaluate the sample discrimination ability of features. Under the discernibility matrix criterion, a multi-label learning algorithm is developed to select discriminative features from multi-label data. A series of experimental analysis verifies the effectiveness and efficiency of the proposed method.
- Published
- 2021
27. Semi-supervised learning with mixed-order graph convolutional networks
- Author
-
Junbiao Cui, Jiye Liang, Jie Wang, and Jianqing Liang
- Subjects
Information Systems and Management ,Exploit ,Computer science ,02 engineering and technology ,Semi-supervised learning ,Machine learning ,computer.software_genre ,Theoretical Computer Science ,Artificial Intelligence ,Simple (abstract algebra) ,Node (computer science) ,0202 electrical engineering, electronic engineering, information engineering ,Feature (machine learning) ,Adjacency matrix ,business.industry ,05 social sciences ,050301 education ,Computer Science Applications ,Control and Systems Engineering ,Benchmark (computing) ,Graph (abstract data type) ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,0503 education ,computer ,Software - Abstract
Recently, graph convolutional networks (GCN) have made substantial progress in semi-supervised learning (SSL). However, established GCN-based methods have two major limitations. First, GCN-based methods are restricted by the oversmoothing issue that limits their ability to extract knowledge from distant but informative nodes. Second, most available GCN-based methods exploit only the feature information of unlabeled nodes, and the pseudo-labels of unlabeled nodes, which contain important information about the data distribution, are not fully utilized. To address these issues, we propose a novel end-to-end ensemble framework, which is named mixed-order graph convolutional networks (MOGCN). MOGCN consists of two modules. (1) It constructs multiple simple GCN learners with multi-order adjacency matrices, which can directly capture the high-order connectivity among the nodes to alleviate the problem of oversmoothing. (2) To efficiently combine the results from multiple GCN learners, MOGCN employs a novel ensemble module, in which the pseudo-labels of unlabeled nodes from various GCN learners are used to augment the diversity among the learners. We conduct experiments on three public benchmark datasets to evaluate the performance of MOGCN on semi-supervised node classification tasks. The experimental results demonstrate that MOGCN consistently outperforms state-of-the-art methods.
- Published
- 2021
28. A bi-level metric learning framework via self-paced learning weighting
- Author
-
Jing Yan, Wei Wei, Xinyao Guo, Chuangyin Dang, and Jiye Liang
- Subjects
Artificial Intelligence ,Signal Processing ,Computer Vision and Pattern Recognition ,Software - Published
- 2023
29. Metric learning with clustering-based constraints
- Author
-
Chuangyin Dang, Jiye Liang, Wei Wei, Jianqing Liang, and Xinyao Guo
- Subjects
Euclidean distance ,Data point ,Theoretical computer science ,Relation (database) ,Artificial Intelligence ,Computer science ,Margin (machine learning) ,Metric (mathematics) ,Computational intelligence ,Pairwise comparison ,Computer Vision and Pattern Recognition ,Cluster analysis ,Software - Abstract
In most of the existing metric learning methods, the relation is fixed throughout the metric learning process. However, the fixed relation may be harmful to learn a good metric. The adversarial metric learning implements a dynamic update of the pairwise constraints. Inspired by the idea of dynamically updating constraints, we propose in this paper a metric learning model with clustering-based constraints (ML-CC), wherein the triple constraints of large margin are iteratively generated with the clusters of data points. The proposed method can overcome the shortage of the fixed triple constraints constructed under the Euclidian distance. The experimental results on synthetic and real datasets indicate that the performance of the ML-CC is superior to that of the existing state-of-the-art metric learning methods.
- Published
- 2021
30. Logic could be learned from images
- Author
-
Xinyan Liang, Jiye Liang, Yuhua Qian, Yanhong She, Deyu Li, and Qian Guo
- Subjects
FOS: Computer and information sciences ,Divide and conquer algorithms ,0209 industrial biotechnology ,Relation (database) ,business.industry ,Computer science ,Logical reasoning ,Human intelligence ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,Computational intelligence ,02 engineering and technology ,Task (project management) ,020901 industrial engineering & automation ,Artificial Intelligence ,Pattern recognition (psychology) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Bitwise operation ,Software - Abstract
Logic reasoning is a significant ability of human intelligence and also an important task in artificial intelligence. The existing logic reasoning methods, quite often, need to design some reasoning patterns beforehand. This has led to an interesting question: can logic reasoning patterns be directly learned from given data? The problem is termed as a data concept logic. In this study, a learning logic task from images, called a LiLi task, first is proposed. This task is to learn and reason the logic relation from images, without presetting any reasoning patterns. As a preliminary exploration, we design six LiLi data sets (Bitwise And, Bitwise Or, Bitwise Xor, Addition, Subtraction and Multiplication), in which each image is embedded with a n-digit number. It is worth noting that a learning model beforehand does not know the meaning of the n-digit numbers embedded in images and the relation between the input images and the output image. In order to tackle the task, in this work we use many typical neural network models and produce fruitful results. However, these models have the poor performances on the difficult logic task. For furthermore addressing this task, a novel network framework called a divide and conquer model by adding some label information is designed, achieving a high testing accuracy.
- Published
- 2021
31. An IoT-Oriented Gesture Recognition System Based on ResNet-Mediapipe Hybrid Model
- Author
-
Zhuo Huang, Jian Li, Jiye Liang, Baizhi Zen, and Jiani Tan
- Published
- 2022
32. Dual Bidirectional Graph Convolutional Networks for Zero-shot Node Classification
- Author
-
Qin Yue, Jiye Liang, Junbiao Cui, and Liang Bai
- Published
- 2022
33. Fall Detection System Based on Millimeter Wave Radar and Machine Learning
- Author
-
Jiye Liang, Yu Huang, and Zhuo Huang
- Published
- 2022
34. Graph-based semi-supervised learning via improving the quality of the graph dynamically
- Author
-
Jie Wang, Jiye Liang, Junbiao Cui, and Wei Wei
- Subjects
Correctness ,Theoretical computer science ,Computer science ,media_common.quotation_subject ,Graph based ,Inference ,02 engineering and technology ,Semi-supervised learning ,Named graph ,Artificial Intelligence ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Graph (abstract data type) ,020201 artificial intelligence & image processing ,Quality (business) ,Cluster analysis ,Software ,media_common - Abstract
Graph-based semi-supervised learning (GSSL) is an important paradigm among semi-supervised learning approaches and includes the two processes of graph construction and label inference. In most traditional GSSL methods, the two processes are completed independently. Once the graph is constructed, the result of label inference cannot be changed. Therefore, the quality of the graph directly determines the GSSL’s performance. Most traditional graph construction methods make certain assumptions about the data distribution, resulting in the quality of the graph heavily depends on the correctness of these assumptions. Therefore, it is difficult to handle complex and various data distribution for traditional graph construction methods. To overcome such issues, this paper proposes a framework named Graph-based Semi-supervised Learning via Improving the Quality of the Graph Dynamically. In it, the graph construction based on the weighted fusion of multiple clustering results and the label inference are integrated into a unified framework to achieve their mutual guidance and dynamic improvement. Moreover, the proposed framework is a general framework, and most existing GSSL methods can be embedded into it so as to improve their performance. Finally, the working mechanism, the effectiveness in improving the performance of GSSL methods and the advantage compared with other GSSL methods based on dynamic graph construction methods of the proposal are verified through systematic experiments.
- Published
- 2021
35. Accelerating ReliefF using information granulation
- Author
-
Wei Wei, Da Wang, and Jiye Liang
- Subjects
Relief algorithm ,business.industry ,Computer science ,Computational intelligence ,Feature selection ,Pattern recognition ,Class (biology) ,Granulation ,Artificial Intelligence ,Feature (computer vision) ,Pattern recognition (psychology) ,Preprocessor ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Software - Abstract
Feature selection is an essential preprocessing requirement when solving a classification problem. In this respect, the Relief algorithm and its derivatives have been demonstrated to be a class of successful feature selectors. However, the computational cost of these algorithms is very high when large-scale datasets are processed. To solve this problem, we propose the fast ReliefF algorithm based on the information granulation of instances (IG-FReliefF). The algorithm uses K-means to granulate the dataset and selects the significant granules among them using the criteria defined by information entropy and information granulation, and then evaluates each feature on the dataset composed of the selected granules. Extensive experiments show that the proposed algorithm is more efficient than the existing representative algorithms, especially on large-scale data sets, and the proposed algorithm is almost the same as the comparison algorithm in terms of classification performance.
- Published
- 2021
36. k-Mnv-Rep: A k-type clustering algorithm for matrix-object data
- Author
-
Liqin Yu, Jiye Liang, Xiao-Zhi Gao, Fuyuan Cao, and Jing Liu
- Subjects
Information Systems and Management ,Heuristic (computer science) ,Computer science ,Feature vector ,02 engineering and technology ,Theoretical Computer Science ,Set (abstract data type) ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,Cluster (physics) ,Cluster analysis ,Measure (data warehouse) ,business.industry ,05 social sciences ,050301 education ,Pattern recognition ,Object (computer science) ,Computer Science Applications ,Data set ,Task (computing) ,Control and Systems Engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,0503 education ,Software - Abstract
In matrix-object data, an object (or a sample) is described by more than one feature vector (record) and all of those feature vectors are responsible for the observed classification of the object. A task for matrix-object data is to cluster it into a set of groups by analyzing and utilizing the information of feature vectors. Matrix-object data are widespread in many real applications. Previous studies typically address data sets that an object is generally represented by a feature vector, which may be violated in many real-world tasks. In this paper, we propose a k-multi-numeric-values-representatives (abbr. k-Mnv-Rep) algorithm to cluster numeric matrix-object data. In this algorithm, a new dissimilarity measure between two numeric matrix-objects is defined and a new heuristic method of updating cluster centers is given. Furthermore, we also propose a k-multi-values-representatives (abbr. k-Mv-Rep) algorithm to cluster hybrid matrix-object data. The two proposed algorithms break the limitations of the previous studies, and can be applied to address matrix-object data sets that exist widely in many real-world tasks. The benefits and effectiveness of the two algorithms are shown by some experiments on real and synthetic data sets.
- Published
- 2021
37. Generalization Performance of Pure Accuracy and Its Application in Selective Ensemble Learning
- Author
-
Jieting Wang, Yuhua Qian, Feijiang Li, Jiye Liang, and Qingfu Zhang
- Subjects
Computational Theory and Mathematics ,Artificial Intelligence ,Applied Mathematics ,Computer Vision and Pattern Recognition ,Software - Abstract
The pure accuracy measure is used to eliminate random consistency from the accuracy measure. Biases to both majority and minority classes in the pure accuracy are lower than that in the accuracy measure. In this paper, we demonstrate that compared with the accuracy measure and F-measure, the pure accuracy measure is class distribution insensitive and discriminative for good classifiers. The advantages make the pure accuracy measure suitable for traditional classification. Further, we mainly focus on two points: exploring a tighter generalization bound on pure accuracy based learning paradigm and designing a learning algorithm based on the pure accuracy measure. Particularly, with the self-bounding property, we build an algorithm-independent generalization bound on the pure accuracy measure, which is tighter than the existing bound of an order O(1/√N) (N is the number of instances). The proposed bound is free from making a smoothness or convex assumption on the hypothesis functions. In addition, we design a learning algorithm optimizing the pure accuracy measure and use it in the selective ensemble learning setting. The experiments on sixteen benchmark data sets and four image data sets demonstrate that the proposed method statistically performs better than the other eight representative benchmark algorithms.
- Published
- 2022
38. A Bayesian matrix factorization model for dynamic user embedding in recommender system
- Author
-
Kaihan Zhang, Zhiqiang Wang, Jiye Liang, and Xingwang Zhao
- Subjects
General Computer Science ,Theoretical Computer Science - Published
- 2022
39. Incomplete multi-view clustering via local and global co-regularization
- Author
-
Jiye Liang, Xiaolin Liu, Liang Bai, Fuyuan Cao, and Dianhui Wang
- Subjects
General Computer Science - Published
- 2022
40. MAGDM-oriented dual hesitant fuzzy multigranulation probabilistic models based on MULTIMOORA
- Author
-
Baoli Wang, Jiye Liang, Deyu Li, and Chao Zhang
- Subjects
Soft computing ,0209 industrial biotechnology ,Computer science ,business.industry ,Granular computing ,Probabilistic logic ,Context (language use) ,Computational intelligence ,02 engineering and technology ,Fuzzy logic ,Expression (mathematics) ,Group decision-making ,020901 industrial engineering & automation ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Software - Abstract
In real world, multi-attribute group decision making (MAGDM) is a complicated cognitive process that involves expression, fusion and analysis of multi-source uncertain information. Among diverse soft computing tools for addressing MAGDM, the ones from granular computing (GrC) frameworks perform excellently via efficient strategies for multi-source uncertain information. However, they usually lack convincing semantic interpretations for MAGDM due to extreme information fusion rules and instabilities of information analysis mechanisms. This work adopts a typical GrC framework named multigranulation probabilistic models to enrich semantic interpretations for GrC-based MAGDM approaches, and constructs MAGDM-oriented multigranulation probabilistic models with dual hesitant fuzzy (DHF) information in light of the MULTIMOORA (Multi-Objective Optimization by Ratio Analysis plus the full MULTIplicative form) method. After reviewing several basic knowledge, we first put forward four types of DHF multigranulation probabilistic models. Then, according to the MULTIMOORA method, a DHF MAGDM algorithm is designed via the proposed theoretical models in the context of person-job (P-J) fit. Finally, an illustrative case study for P-J fit is investigated, and corresponding validity tests and comparative analysis are conducted as well to demonstrate the rationality of the presented models.
- Published
- 2020
41. A multiple k-means clustering ensemble algorithm to find nonlinearly separable clusters
- Author
-
Jiye Liang, Liang Bai, and Fuyuan Cao
- Subjects
Computer science ,k-means clustering ,020206 networking & telecommunications ,02 engineering and technology ,Ensemble learning ,Separable space ,ComputingMethodologies_PATTERNRECOGNITION ,Hardware and Architecture ,Robustness (computer science) ,Signal Processing ,Scalability ,0202 electrical engineering, electronic engineering, information engineering ,Cluster (physics) ,020201 artificial intelligence & image processing ,Cluster analysis ,Algorithm ,Software ,Linear separability ,Information Systems - Abstract
Cluster ensemble is an important research content of ensemble learning, which is used to aggregate several base clusterings to generate a single output clustering with improved robustness and quality. Since clustering is unsupervised, where the “accuracy” does not have a clear meaning, most of existing ensemble methods try to obtain the most consistent clustering result with base clusterings. However, it is difficult for these methods to realize “Multi-weaks equal to a Strong”. For example, on a data set with nonlinearly separable clusters, if the base clusterings are produced by some linear clusterers, these methods generally cannot integrate them to obtain a good nonlinear clustering. In this paper, we select k-means as a base clusterer and provide an ensemble clusterer (algorithm) of multiple k-means clusterings based on a local hypothesis. In the new algorithm, we study the extraction of the local-credible labels from a base clustering, the production of different base clusterings, the construction of cluster relation and the final assignment of each object. The proposed ensemble clusterer not only inherits the scalability of k-means but also overcomes its limitation that it only can find linearly separable clusters. Finally, the experimental results illustrate its effectiveness and efficiency.
- Published
- 2020
42. Clustering method based on sample's stability
- Author
-
Wenjian Wang, Yuhua Qian, Feijiang Li, Jiye Liang, and Jieting Wang
- Subjects
General Computer Science ,Statistics ,Sample (statistics) ,Cluster analysis ,Engineering (miscellaneous) ,Stability (probability) ,Mathematics - Published
- 2020
43. Clustering method based on sample's stability
- Author
-
Wenjian WANG, Jieting WANG, Jiye LIANG, and Yuhua QIAN
- Subjects
General Computer Science ,Engineering (miscellaneous) - Published
- 2020
44. A Novel Preference Measure for Multi-Granularity Probabilistic Linguistic Term Sets and its Applications in Large-Scale Group Decision-Making
- Author
-
Baoli Wang and Jiye Liang
- Subjects
Computer science ,Semantics (computer science) ,Rank (computer programming) ,Probabilistic logic ,Computational intelligence ,02 engineering and technology ,Linguistics ,Preference ,Theoretical Computer Science ,Group decision-making ,Term (time) ,Computational Theory and Mathematics ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Set (psychology) ,Software - Abstract
Comparing probabilistic linguistic term sets (PLTSs) is quite essential in solving PLTS-expressed multi-attribute group decision-making problems (PLTS-MAGDM). Researchers have designed various comparison measures to obtain the rank of PLTSs. However, most of the existing PLTS comparison measures need additional tedious adjustments before conducting a specific computation. Besides, these measures do not adequately consider the effects of the semantics of the basic linguistic term set and the probabilistic distributions. This paper proposes a new preference degree for g-granularity probabilistic term sets (g-GPLTSs) to overcome the two shortcomings simultaneously by integrating the effect from basic linguistic terms and probabilistic distributions without any adjustment. Moreover, the g-GPLTS preference degree also shows the extended adaptability for comparing PLTSs with unbalanced semantics. Based on the newly proposed preference degree, we construct a useful min-conflict model to solve PLTS-MAGDM with a large number of experts expressing the three-way primary grading. Finally, an illustrative example concerning software supplier selections, followed by the comparative analysis, is presented to verify the feasibility and effectiveness of the proposed method.
- Published
- 2020
45. A fusion collaborative filtering method for sparse data in recommender systems
- Author
-
Jiye Liang, Chenjiao Feng, Zhiqiang Wang, and Peng Song
- Subjects
Information Systems and Management ,Computer science ,05 social sciences ,050301 education ,02 engineering and technology ,Similarity measure ,Recommender system ,computer.software_genre ,Computer Science Applications ,Theoretical Computer Science ,Matrix decomposition ,Factorization ,Artificial Intelligence ,Control and Systems Engineering ,Robustness (computer science) ,0202 electrical engineering, electronic engineering, information engineering ,Collaborative filtering ,020201 artificial intelligence & image processing ,Data mining ,0503 education ,computer ,Software ,Sparse matrix - Abstract
Collaborative filtering is a fundamental technique in recommender systems, for which memory-based and matrix-factorization-based collaborative filtering are the two types of widely used methods. However, the performance of these two types of methods is limited in the case of sparse data, particularly with extremely sparse data. To improve the effectiveness of the methods in a sparse scenario, this paper proposes a multi-factor similarity measure that captures linear and nonlinear correlations between users resulting from extreme behavior. Subsequently, a fusion method that simultaneously considers the multi-factor similarity and the global rating information in a probability matrix factorization framework is proposed. In our framework, users’ local relations are integrated into the global ratings optimization process, so that prediction accuracy and robustness are improved in sparse data, particularly in extremely sparse data. To verify the performance of the proposed methods, we conduct experiments on four public datasets. The experimental results show that the fusion method is superior to the typical matrix factorization models used in collaborative filtering and significantly improves both the prediction results and robustness in sparse data.
- Published
- 2020
46. Fusing Fuzzy Monotonic Decision Trees
- Author
-
Jieting Wang, Weiping Ding, Yuhua Qian, Jiye Liang, and Feijiang Li
- Subjects
Majority rule ,Computer science ,business.industry ,Applied Mathematics ,Decision tree ,Pattern recognition ,Monotonic function ,02 engineering and technology ,Fuzzy logic ,Computational Theory and Mathematics ,Artificial Intelligence ,Control and Systems Engineering ,0202 electrical engineering, electronic engineering, information engineering ,Entropy (information theory) ,020201 artificial intelligence & image processing ,Artificial intelligence ,Rough set ,Decision table ,business ,Interpretability - Abstract
Ordinal classification is an important classification task, in which there exists a monotonic constraint between features and the decision class. In this article, we aim to develop a method of fusing ordinal decision trees with fuzzy rough-set-based attribute reduction. Most of the existing attribute reduction methods for ordinal decision tables are based on the dominance rough set theory or significance measures. However, the crisp dominance relation is difficult in making full use of the information of attribute values; and the reducts based on significance measures are poor in interpretability and may contain unnecessary attributes. In this article, we first define a discernibility matrix with fuzzy dominance rough set. With this discernibility matrix, multiple reducts can be found, which provide multiple complementary feature subspaces with original information. Then, diverse ordinal trees can be established from these feature subspaces, and finally, the trees are fused by majority voting. The experimental results show that the proposed fusion method performs significantly better than other fusion methods using dominance rough set or significance measures.
- Published
- 2020
47. A Three-Level Optimization Model for Nonlinearly Separable Clustering
- Author
-
Jiye Liang and Liang Bai
- Subjects
Optimization problem ,Computer science ,Iterative method ,02 engineering and technology ,General Medicine ,Spectral clustering ,Separable space ,Data set ,Set (abstract data type) ,Kernel (linear algebra) ,ComputingMethodologies_PATTERNRECOGNITION ,Kernel (statistics) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Cluster analysis ,Algorithm ,Linear separability - Abstract
Due to the complex structure of the real-world data, nonlinearly separable clustering is one of popular and widely studied clustering problems. Currently, various types of algorithms, such as kernel k-means, spectral clustering and density clustering, have been developed to solve this problem. However, it is difficult for them to balance the efficiency and effectiveness of clustering, which limits their real applications. To get rid of the deficiency, we propose a three-level optimization model for nonlinearly separable clustering which divides the clustering problem into three sub-problems: a linearly separable clustering on the object set, a nonlinearly separable clustering on the cluster set and an ensemble clustering on the partition set. An iterative algorithm is proposed to solve the optimization problem. The proposed algorithm can use low computational cost to effectively recognize nonlinearly separable clusters. The performance of this algorithm has been studied on synthetical and real data sets. Comparisons with other nonlinearly separable clustering algorithms illustrate the efficiency and effectiveness of the proposed algorithm.
- Published
- 2020
48. Multi-attribute group decision-making method based on multi-granulation weights and three-way decisions
- Author
-
Peng Song, Jifang Pang, Xiaoqiang Guan, Jiye Liang, and Baoli Wang
- Subjects
Computer science ,business.industry ,Applied Mathematics ,Rank (computer programming) ,Rationality ,02 engineering and technology ,Machine learning ,computer.software_genre ,Field (computer science) ,Theoretical Computer Science ,Group decision-making ,Artificial Intelligence ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Rough set ,Artificial intelligence ,Grading (education) ,business ,computer ,Software ,Decision analysis ,Interpretability - Abstract
With the increasing complexity of decision-making problems and environment, the integration and fusion of three-way decisions, rough set and multi-attribute group decision-making (MAGDM) have become a major trend in the field of decision analysis. Although many researchers have presented various MAGDM methods under different environments, there are still some imperfections, such as the weight information is not comprehensive or flexible enough, the decision results lack interpretability, and the impact of risk attitude is not fully taken into account. In order to overcome the above shortcomings and improve the scientificity and rationality of decision-making, a novel data-driven MAGDM method under interval-valued intuitionistic uncertain linguistic environment is established based on the idea of multi-granulation and three-way decisions. Our contributions can be identified as follows: (1) The multi-granulation weight mining and fusion methods for experts and attributes are proposed, respectively; (2) The coarse-granulation grading information based on three-way decisions is developed to enhance the interpretability and reference value of decision results; (3) The expected value with risk attitude factor is defined to compare interval-valued intuitionistic uncertain linguistic variables (IVIULVs) and then is used to grade and rank alternatives under different risk attitudes. To illustrate the feasibility and practicality of the proposed method, a case of logistics supplier selection in e-commerce enterprises is demonstrated. Furthermore, the advantages and characteristics of the proposed method are highlighted via detailed comparison and thorough analysis.
- Published
- 2020
49. Interval-valued hesitant fuzzy multi-granularity three-way decisions in consensus processes with applications to multi-attribute group decision making
- Author
-
Chao Zhang, Jiye Liang, and Deyu Li
- Subjects
Information Systems and Management ,Operations research ,Computer science ,05 social sciences ,050301 education ,Context (language use) ,02 engineering and technology ,Space (commercial competition) ,Fuzzy logic ,Computer Science Applications ,Theoretical Computer Science ,Group decision-making ,Artificial Intelligence ,Control and Systems Engineering ,0202 electrical engineering, electronic engineering, information engineering ,Selection (linguistics) ,Collective wisdom ,020201 artificial intelligence & image processing ,Rough set ,0503 education ,Software - Abstract
Multi-attribute group decision making (MAGDM) is a common activity for multi-variable complicated decision making situations by integrating collective wisdom. Aiming at fusing granular computing with three-way decisions (3WD) to study scheme synthesis and analysis of solution space, multi-granularity three-way decisions (MG-3WD) provide multi-dimension problem solving methods for MAGDM problems. By using MG-3WD frameworks, this paper intends to study viable strategies of processing consensus and conflicting opinions provided by different decision makers in the interval-valued hesitant fuzzy (IVHF) MAGDM problem. More specifically, after reviewing the relevant literature, four kinds of IVHF multigranulation decision-theoretic rough sets (MG-DTRSs) over two universes are proposed according to different risk appetites of experts firstly. Then, we explore some fundamental propositions of newly proposed models. Afterwards, solutions to MAGDM problems in the context of mergers and acquisitions (M&A) target selections by using the presented IVHF MG-DTRSs over two universes are constructed. At last, a M&A target selection case study, along with a sensitivity analysis and a comparative analysis, is applied to illustrate the established decision making approaches.
- Published
- 2020
50. Multi-granularity three-way decisions with adjustable hesitant fuzzy linguistic multigranulation decision-theoretic rough sets over two universes
- Author
-
Deyu Li, Chao Zhang, and Jiye Liang
- Subjects
Information Systems and Management ,Basis (linear algebra) ,business.industry ,Computer science ,05 social sciences ,050301 education ,Context (language use) ,Rationality ,02 engineering and technology ,Term (logic) ,Computer Science Applications ,Theoretical Computer Science ,Group decision-making ,Decision-theoretic rough sets ,Artificial Intelligence ,Control and Systems Engineering ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Granularity ,Rough set ,business ,0503 education ,Software - Abstract
The notion of hesitant fuzzy linguistic term sets (HFLTSs), which enables experts to utilize a few possible linguistic terms to evaluate varieties of common qualitative information, plays a significant role in handling situations in cases where these experts are hesitant in offering linguistic expressions. For addressing the challenges of information analysis and information fusion in hesitant fuzzy linguistic (HFL) group decision making, in accordance with the multi-granularity three-way decisions paradigm, the primary purpose of this study is to develop the notion of multigranulation decision-theoretic rough sets (MG-DTRSs) into the HFL background within the two-universe framework. Having revisited the relevant literature, we first propose a hybrid model named adjustable HFL MG-DTRSs over two universes by introducing an adjustable parameter for the expected risk appetite of experts, in which both optimistic and pessimistic versions of HFL MG-DTRSs over two universes are special cases of the adjustable version. Second, some of the fundamental properties of the proposed model are discussed. Then, on the basis of the presented hybrid model, a group decision making approach within the HFL context is further constructed. Finally, a practical example, a comparative analysis, and a validity test concerning person-job fit problems are explored to reveal the rationality and practicability of the constructed decision making rule.
- Published
- 2020
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.