Author: "Xia, Xiaobo" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Xia, Xiaobo"' showing total 667 results

Start Over Author "Xia, Xiaobo"

667 results on '"Xia, Xiaobo"'

1. MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Author: Luo, Run, Zhang, Haonan, Chen, Longze, Lin, Ting-En, Liu, Xiong, Wu, Yuchuan, Yang, Min, Wang, Minzheng, Zeng, Pengpeng, Gao, Lianli, Shen, Heng Tao, Li, Yunshui, Xia, Xiaobo, Huang, Fei, Song, Jingkuan, and Li, Yongbin
Subjects: Computer Science - Computation and Language
Abstract: The development of Multimodal Large Language Models (MLLMs) has seen significant advancements with increasing demands in various fields (e.g., multimodal agents, embodied intelligence). While model-driven approaches attempt to enhance MLLMs capabilities through diverse architectures, the gains have become increasingly marginal. Conversely, data-driven methods, which scale up image-text instruction data, are more effective but face limited data diversity and complexity challenges. The absence of high-quality data constitutes a significant development barrier for MLLMs. To address the data quality bottleneck, we propose MMEvol, a novel multimodal instruction data evolution framework. This framework iteratively improve data quality through a refined combination of fine-grained perception, cognitive reasoning, and interaction evolution, generating a more complex and diverse image-text instruction dataset that empowers MLLMs with enhanced capabilities. Beginning with an initial set of instructions, SEED-163K, we utilize MMEvol to systematically broaden the diversity of instruction types, extend visual reasoning steps to improve cognitive reasoning abilities, and thoroughly explore fine-grained information within images to enhance visual understanding and robustness. To comprehensively evaluate the effectiveness of our approach, we conduct extensive qualitative analysis and quantitative experiments across 13 vision-language tasks. Compared to baseline models trained with the initial seed data, the results demonstrate that our method achieves an average accuracy improvement of 3.1 percentage points. Furthermore, our approach reaches state-of-the-art (SOTA) performance in nine tasks using significantly less data compared to state-of-the-art models.
Published: 2024

2. Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection

Author: Li, Yewen, Wang, Chaojie, Xia, Xiaobo, He, Xu, An, Ruyi, Li, Dong, Liu, Tongliang, An, Bo, and Wang, Xinrun
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Unsupervised out-of-distribution (U-OOD) detection is to identify OOD data samples with a detector trained solely on unlabeled in-distribution (ID) data. The likelihood function estimated by a deep generative model (DGM) could be a natural detector, but its performance is limited in some popular "hard" benchmarks, such as FashionMNIST (ID) vs. MNIST (OOD). Recent studies have developed various detectors based on DGMs to move beyond likelihood. However, despite their success on "hard" benchmarks, most of them struggle to consistently surpass or match the performance of likelihood on some "non-hard" cases, such as SVHN (ID) vs. CIFAR10 (OOD) where likelihood could be a nearly perfect detector. Therefore, we appeal for more attention to incremental effectiveness on likelihood, i.e., whether a method could always surpass or at least match the performance of likelihood in U-OOD detection. We first investigate the likelihood of variational DGMs and find its detection performance could be improved in two directions: i) alleviating latent distribution mismatch, and ii) calibrating the dataset entropy-mutual integration. Then, we apply two techniques for each direction, specifically post-hoc prior and dataset entropy-mutual calibration. The final method, named Resultant, combines these two directions for better incremental effectiveness compared to either technique alone. Experimental results demonstrate that the Resultant could be a new state-of-the-art U-OOD detector while maintaining incremental effectiveness on likelihood in a wide range of tasks.
Published: 2024

3. Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs

Author: Zhang, Lei, Li, Yunshui, Li, Jiaming, Xia, Xiaobo, Yang, Jiaxi, Luo, Run, Wang, Minzheng, Chen, Longze, Liu, Junhao, and Yang, Min
Subjects: Computer Science - Computation and Language
Abstract: Some recently developed code large language models (Code LLMs) have been pre-trained on repository-level code data (Repo-Code LLMs), enabling these models to recognize repository structures and utilize cross-file information for code completion. However, in real-world development scenarios, simply concatenating the entire code repository often exceeds the context window limits of these Repo-Code LLMs, leading to significant performance degradation. In this study, we conducted extensive preliminary experiments and analyses on six Repo-Code LLMs. The results indicate that maintaining the topological dependencies of files and increasing the code file content in the completion prompts can improve completion accuracy; pruning the specific implementations of functions in all dependent files does not significantly reduce the accuracy of completions. Based on these findings, we proposed a strategy named Hierarchical Context Pruning (HCP) to construct completion prompts with high informational code content. The HCP models the code repository at the function level, maintaining the topological dependencies between code files while removing a large amount of irrelevant code content, significantly reduces the input length for repository-level code completion. We applied the HCP strategy in experiments with six Repo-Code LLMs, and the results demonstrate that our proposed method can significantly enhance completion accuracy while substantially reducing the length of input. Our code and data are available at https://github.com/Hambaobao/HCP-Coder.
Published: 2024

4. DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Author: Luo, Run, Li, Yunshui, Chen, Longze, He, Wanwei, Lin, Ting-En, Liu, Ziqiang, Zhang, Lei, Song, Zikai, Xia, Xiaobo, Liu, Tongliang, Yang, Min, and Hui, Binyuan
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: The development of large language models (LLMs) has significantly advanced the emergence of large multimodal models (LMMs). While LMMs have achieved tremendous success by promoting the synergy between multimodal comprehension and creation, they often face challenges when confronted with out-of-distribution data, such as which can hardly distinguish orientation, quantity, color, structure, etc. This is primarily due to their reliance on image encoders trained to encode images into task-relevant features, which may lead them to disregard irrelevant details. Delving into the modeling capabilities of diffusion models for images naturally prompts the question: Can diffusion models serve as the eyes of large language models for image perception? In this paper, we propose DEEM, a simple but effective approach that utilizes the generative feedback of diffusion models to align the semantic distributions of the image encoder. This addresses the drawbacks of previous methods that solely relied on image encoders like CLIP-ViT, thereby enhancing the model's resilience against out-of-distribution samples and reducing visual hallucinations. Importantly, this is achieved without requiring additional training modules and with fewer training parameters. We extensively evaluated DEEM on both our newly constructed RobustVQA benchmark and other well-known benchmarks, POPE and MMVP, for visual hallucination and perception. In particular, DEEM improves LMM's visual perception performance to a large extent (e.g., 4% higher on RobustVQA, 6.5% higher on MMVP and 12.8 % higher on POPE ). Compared to the state-of-the-art interleaved content generation models, DEEM exhibits enhanced robustness and a superior capacity to alleviate model hallucinations while utilizing fewer trainable parameters, less pre-training data (10%), and a smaller base model size., Comment: 25 pages. arXiv admin note: text overlap with arXiv:2401.10208 by other authors
Published: 2024

5. Few-Shot Adversarial Prompt Learning on Vision-Language Models

Author: Zhou, Yiwei, Xia, Xiaobo, Lin, Zhiwei, Han, Bo, and Liu, Tongliang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language, Computer Science - Cryptography and Security, Computer Science - Machine Learning
Abstract: The vulnerability of deep neural networks to imperceptible adversarial perturbations has attracted widespread attention. Inspired by the success of vision-language foundation models, previous efforts achieved zero-shot adversarial robustness by aligning adversarial visual features with text supervision. However, in practice, they are still unsatisfactory due to several issues, including heavy adaptation cost, suboptimal text supervision, and uncontrolled natural generalization capacity. In this paper, to address these issues, we propose a few-shot adversarial prompt framework where adapting input sequences with limited data makes significant adversarial robustness improvement. Specifically, we achieve this by providing adversarially correlated text supervision that is end-to-end learned from adversarial examples. We also propose a novel training objective that enhances the consistency of multi-modal features while encourages differentiated uni-modal features between natural and adversarial examples. The proposed framework gives access to learn adversarial text supervision, which provides superior cross-modal adversarial alignment and matches state-of-the-art zero-shot adversarial robustness with only 1% training data. Code is available at: https://github.com/lionel-w2/FAP., Comment: NeurIPS 2024
Published: 2024

6. Tackling Noisy Labels with Network Parameter Additive Decomposition

Author: Wang, Jingyi, Xia, Xiaobo, Lan, Long, Wu, Xinghao, Yu, Jun, Yang, Wenjing, Han, Bo, and Liu, Tongliang
Subjects: Computer Science - Machine Learning
Abstract: Given data with noisy labels, over-parameterized deep networks suffer overfitting mislabeled data, resulting in poor generalization. The memorization effect of deep networks shows that although the networks have the ability to memorize all noisy data, they would first memorize clean training data, and then gradually memorize mislabeled training data. A simple and effective method that exploits the memorization effect to combat noisy labels is early stopping. However, early stopping cannot distinguish the memorization of clean data and mislabeled data, resulting in the network still inevitably overfitting mislabeled data in the early training stage.In this paper, to decouple the memorization of clean data and mislabeled data, and further reduce the side effect of mislabeled data, we perform additive decomposition on network parameters. Namely, all parameters are additively decomposed into two groups, i.e., parameters $\mathbf{w}$ are decomposed as $\mathbf{w}=\bm{\sigma}+\bm{\gamma}$. Afterward, the parameters $\bm{\sigma}$ are considered to memorize clean data, while the parameters $\bm{\gamma}$ are considered to memorize mislabeled data. Benefiting from the memorization effect, the updates of the parameters $\bm{\sigma}$ are encouraged to fully memorize clean data in early training, and then discouraged with the increase of training epochs to reduce interference of mislabeled data. The updates of the parameters $\bm{\gamma}$ are the opposite. In testing, only the parameters $\bm{\sigma}$ are employed to enhance generalization. Extensive experiments on both simulated and real-world benchmarks confirm the superior performance of our method., Comment: Accepted by IEEE T-PAMI
Published: 2024

7. Mitigating Label Noise on Graph via Topological Sample Selection

Author: Wu, Yuhao, Yao, Jiangchao, Xia, Xiaobo, Yu, Jun, Wang, Ruxin, Han, Bo, and Liu, Tongliang
Subjects: Computer Science - Machine Learning
Abstract: Despite the success of the carefully-annotated benchmarks, the effectiveness of existing graph neural networks (GNNs) can be considerably impaired in practice when the real-world graph data is noisily labeled. Previous explorations in sample selection have been demonstrated as an effective way for robust learning with noisy labels, however, the conventional studies focus on i.i.d data, and when moving to non-iid graph data and GNNs, two notable challenges remain: (1) nodes located near topological class boundaries are very informative for classification but cannot be successfully distinguished by the heuristic sample selection. (2) there is no available measure that considers the graph topological information to promote sample selection in a graph. To address this dilemma, we propose a $\textit{Topological Sample Selection}$ (TSS) method that boosts the informative sample selection process in a graph by utilising topological information. We theoretically prove that our procedure minimizes an upper bound of the expected risk under target clean distribution, and experimentally show the superiority of our method compared with state-of-the-art baselines., Comment: ICML 2024
Published: 2024

8. Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision

Author: Wang, Zhaoqing, Xia, Xiaobo, Chen, Ziye, He, Xiao, Guo, Yandong, Gong, Mingming, and Liu, Tongliang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Current state-of-the-art open-vocabulary segmentation methods typically rely on image-mask-text triplet annotations for supervision. However, acquiring such detailed annotations is labour-intensive and poses scalability challenges in complex real-world scenarios. While existing weakly-supervised approaches leverage image-text pairs to reduce the expansive annotation cost, the lack of mask supervision makes it difficult for the model to locate multiple instances and accurately group pixels with similar semantics, significantly hampering versatility and performance. In this paper, we introduce Unpair-Seg, a novel weakly-supervised open-vocabulary segmentation framework that learns from unpaired image-mask and image-text pairs, which can be independently and efficiently collected. Unpair-Seg initially predicts a set of binary masks and generates pseudo labels by identifying confident pairs of masks and text entities. We then train a feature adapter to align region embeddings with text embeddings based on these pseudo labels, achieving open-vocabulary segmentation. However, the inherent noise in the mask-entity correspondence poses a challenge to obtaining reliable pairs. To address this, we employ a vision-language large model to re-caption the input images and extract precise entities, and we design a multi-scale matching strategy to reduce noisy mask-entity pairs. Our Unpair-Seg framework demonstrates impressive performance, achieving 14.6\% and 19.5\% mIoU on the ADE-847 and PASCAL Context-459 datasets, significantly narrowing the gap between fully-supervised and weakly-supervised methods., Comment: 27 pages, 18 figures, 10 tables
Published: 2024

9. One-Shot Learning as Instruction Data Prospector for Large Language Models

Author: Li, Yunshui, Hui, Binyuan, Xia, Xiaobo, Yang, Jiaxi, Yang, Min, Zhang, Lei, Si, Shuzheng, Chen, Ling-Hao, Liu, Junhao, Liu, Tongliang, Huang, Fei, and Li, Yongbin
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Contemporary practices in instruction tuning often hinge on enlarging data scaling without a clear strategy for ensuring data quality, inadvertently introducing noise that may compromise model performance. To address this challenge, we introduce \textsc{Nuggets}, a novel and efficient methodology that leverages one-shot learning to discern and select high-quality instruction data from extensive datasets. \textsc{Nuggets} assesses the potential of individual instruction examples to act as effective one-shot learning instances, thereby identifying those that can significantly improve performance across diverse tasks. \textsc{Nuggets} utilizes a scoring system based on the impact of candidate examples on the perplexity of a diverse anchor set, facilitating the selection of the most advantageous data for instruction tuning. Through comprehensive evaluations on two benchmarks, including MT-Bench and Alpaca-Eval, we show that instruction tuning with the top 1\% of examples curated by \textsc{Nuggets} substantially outperforms conventional methods employing the entire dataset., Comment: ACL 2024
Published: 2023

10. ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance

Author: Chen, Ling-Hao, Zhang, Yuanshuo, Huang, Taohua, Su, Liangcai, Lin, Zeyi, Xiao, Xi, Xia, Xiaobo, and Liu, Tongliang
Subjects: Computer Science - Machine Learning
Abstract: Deep learning has achieved remarkable success in graph-related tasks, yet this accomplishment heavily relies on large-scale high-quality annotated datasets. However, acquiring such datasets can be cost-prohibitive, leading to the practical use of labels obtained from economically efficient sources such as web searches and user tags. Unfortunately, these labels often come with noise, compromising the generalization performance of deep networks. To tackle this challenge and enhance the robustness of deep learning models against label noise in graph-based tasks, we propose a method called ERASE (Error-Resilient representation learning on graphs for lAbel noiSe tolerancE). The core idea of ERASE is to learn representations with error tolerance by maximizing coding rate reduction. Particularly, we introduce a decoupled label propagation method for learning representations. Before training, noisy labels are pre-corrected through structural denoising. During training, ERASE combines prototype pseudo-labels with propagated denoised labels and updates representations with error resilience, which significantly improves the generalization performance in node classification. The proposed method allows us to more effectively withstand errors caused by mislabeled nodes, thereby strengthening the robustness of deep networks in handling noisy graph data. Extensive experimental results show that our method can outperform multiple baselines with clear margins in broad noise levels and enjoy great scalability. Codes are released at https://github.com/eraseai/erase., Comment: 24 pages, 14 figures, 15 tables and a project page at https://eraseai.github.io/ERASE-page
Published: 2023

11. Refined Coreset Selection: Towards Minimal Coreset Size under Model Performance Constraints

Author: Xia, Xiaobo, Liu, Jiale, Zhang, Shaokun, Wu, Qingyun, Wei, Hongxin, and Liu, Tongliang
Subjects: Computer Science - Machine Learning
Abstract: Coreset selection is powerful in reducing computational costs and accelerating data processing for deep learning algorithms. It strives to identify a small subset from large-scale data, so that training only on the subset practically performs on par with full data. Practitioners regularly desire to identify the smallest possible coreset in realistic scenes while maintaining comparable model performance, to minimize costs and maximize acceleration. Motivated by this desideratum, for the first time, we pose the problem of refined coreset selection, in which the minimal coreset size under model performance constraints is explored. Moreover, to address this problem, we propose an innovative method, which maintains optimization priority order over the model performance and coreset size, and efficiently optimizes them in the coreset selection procedure. Theoretically, we provide the convergence guarantee of the proposed method. Empirically, extensive experiments confirm its superiority compared with previous strategies, often yielding better model performance with smaller coreset sizes., Comment: 22 pages, 10 tables, 4 figures
Published: 2023

12. Out-of-distribution Detection Learning with Unreliable Out-of-distribution Sources

Author: Zheng, Haotian, Wang, Qizhou, Fang, Zhen, Xia, Xiaobo, Liu, Feng, Liu, Tongliang, and Han, Bo
Subjects: Computer Science - Machine Learning, Computer Science - Multimedia
Abstract: Out-of-distribution (OOD) detection discerns OOD data where the predictor cannot make valid predictions as in-distribution (ID) data, thereby increasing the reliability of open-world classification. However, it is typically hard to collect real out-of-distribution (OOD) data for training a predictor capable of discerning ID and OOD patterns. This obstacle gives rise to data generation-based learning methods, synthesizing OOD data via data generators for predictor training without requiring any real OOD data. Related methods typically pre-train a generator on ID data and adopt various selection procedures to find those data likely to be the OOD cases. However, generated data may still coincide with ID semantics, i.e., mistaken OOD generation remains, confusing the predictor between ID and OOD data. To this end, we suggest that generated data (with mistaken OOD generation) can be used to devise an auxiliary OOD detection task to facilitate real OOD detection. Specifically, we can ensure that learning from such an auxiliary task is beneficial if the ID and the OOD parts have disjoint supports, with the help of a well-designed training procedure for the predictor. Accordingly, we propose a powerful data generation-based learning method named Auxiliary Task-based OOD Learning (ATOL) that can relieve the mistaken OOD generation. We conduct extensive experiments under various OOD detection setups, demonstrating the effectiveness of our method against its advanced counterparts., Comment: Accepted by NeurIPS 2023
Published: 2023

13. IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models

Author: Zhang, Shaokun, Xia, Xiaobo, Wang, Zhaoqing, Chen, Ling-Hao, Liu, Jiale, Wu, Qingyun, and Liu, Tongliang
Subjects: Computer Science - Computation and Language
Abstract: In-context learning is a promising paradigm that utilizes in-context examples as prompts for the predictions of large language models. These prompts are crucial for achieving strong performance. However, since the prompts need to be sampled from a large volume of annotated examples, finding the right prompt may result in high annotation costs. To address this challenge, this paper introduces an influence-driven selective annotation method that aims to minimize annotation costs while improving the quality of in-context examples. The essence of our method is to select a pivotal subset from a large-scale unlabeled data pool to annotate for the subsequent sampling of prompts. Specifically, a directed graph is first constructed to represent unlabeled data. Afterward, the influence of candidate unlabeled subsets is quantified with a diffusion process. A simple yet effective greedy algorithm for unlabeled data selection is lastly introduced. It iteratively selects the data if it provides a maximum marginal gain with respect to quantified influence. Compared with previous efforts on selective annotations, our influence-driven method works in an end-to-end manner, avoids an intractable explicit balance between data diversity and representativeness, and enjoys theoretical support. Experiments confirm the superiority of the proposed method on various benchmarks, achieving better performance under lower time consumption during subset selection. The project page is available at https://skzhang1.github.io/IDEAL/., Comment: Accepted by ICLR 2024
Published: 2023

14. VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

Author: Qiu, Jianing, Wu, Jian, Wei, Hao, Shi, Peilun, Zhang, Minqing, Sun, Yunyun, Li, Lin, Liu, Hanruo, Liu, Hongyi, Hou, Simeng, Zhao, Yuyang, Shi, Xuehui, Xian, Junfang, Qu, Xiaoxia, Zhu, Sirui, Pan, Lijie, Chen, Xiaoniao, Zhang, Xiaojia, Jiang, Shuai, Wang, Kebing, Yang, Chenlong, Chen, Mingqiang, Fan, Sujie, Hu, Jianhua, Lv, Aiguo, Miao, Hui, Guo, Li, Zhang, Shujun, Pei, Cheng, Fan, Xiaojuan, Lei, Jianqin, Wei, Ting, Duan, Junguo, Liu, Chun, Xia, Xiaobo, Xiong, Siqi, Li, Junhong, Lo, Benny, Tham, Yih Chung, Wong, Tien Yin, Wang, Ningli, and Yuan, Wu
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassification of disease phenotype, and systemic biomarker and disease prediction, with each application enhanced with expert-level intelligence and accuracy. The generalist intelligence of VisionFM outperformed ophthalmologists with basic and intermediate levels in jointly diagnosing 12 common ophthalmic diseases. Evaluated on a new large-scale ophthalmic disease diagnosis benchmark database, as well as a new large-scale segmentation and detection benchmark database, VisionFM outperformed strong baseline deep neural networks. The ophthalmic image representations learned by VisionFM exhibited noteworthy explainability, and demonstrated strong generalizability to new ophthalmic modalities, disease spectrum, and imaging devices. As a foundation model, VisionFM has a large capacity to learn from diverse ophthalmic imaging data and disparate datasets. To be commensurate with this capacity, in addition to the real data used for pre-training, we also generated and leveraged synthetic ophthalmic imaging data. Experimental results revealed that synthetic data that passed visual Turing tests, can also enhance the representation learning capability of VisionFM, leading to substantial performance gains on downstream ophthalmic AI tasks. Beyond the ophthalmic AI applications developed, validated, and demonstrated in this work, substantial further applications can be achieved in an efficient and cost-effective manner using VisionFM as the foundation.
Published: 2023

15. Multi-Label Noise Transition Matrix Estimation with Label Correlations: Theory and Algorithm

Author: Li, Shikun, Xia, Xiaobo, Zhang, Hansong, Ge, Shiming, and Liu, Tongliang
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Noisy multi-label learning has garnered increasing attention due to the challenges posed by collecting large-scale accurate labels, making noisy labels a more practical alternative. Motivated by noisy multi-class learning, the introduction of transition matrices can help model multi-label noise and enable the development of statistically consistent algorithms for noisy multi-label learning. However, estimating multi-label noise transition matrices remains a challenging task, as most existing estimators in noisy multi-class learning rely on anchor points and accurate fitting of noisy class posteriors, which is hard to satisfy in noisy multi-label learning. In this paper, we address this problem by first investigating the identifiability of class-dependent transition matrices in noisy multi-label learning. Building upon the identifiability results, we propose a novel estimator that leverages label correlations without the need for anchor points or precise fitting of noisy class posteriors. Specifically, we first estimate the occurrence probability of two noisy labels to capture noisy label correlations. Subsequently, we employ sample selection techniques to extract information implying clean label correlations, which are then used to estimate the occurrence probability of one noisy label when a certain clean label appears. By exploiting the mismatches in label correlations implied by these occurrence probabilities, we demonstrate that the transition matrix becomes identifiable and can be acquired by solving a bilinear decomposition problem. Theoretically, we establish an estimation error bound for our multi-label transition matrix estimator and derive a generalization error bound for our statistically consistent algorithm. Empirically, we validate the effectiveness of our estimator in estimating multi-label noise transition matrices, leading to excellent classification performance.
Published: 2023

16. Regularly Truncated M-estimators for Learning with Noisy Labels

Author: Xia, Xiaobo, Lu, Pengqian, Gong, Chen, Han, Bo, Yu, Jun, and Liu, Tongliang
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The sample selection approach is very popular in learning with noisy labels. As deep networks learn pattern first, prior methods built on sample selection share a similar training procedure: the small-loss examples can be regarded as clean examples and used for helping generalization, while the large-loss examples are treated as mislabeled ones and excluded from network parameter updates. However, such a procedure is arguably debatable from two folds: (a) it does not consider the bad influence of noisy labels in selected small-loss examples; (b) it does not make good use of the discarded large-loss examples, which may be clean or have meaningful information for generalization. In this paper, we propose regularly truncated M-estimators (RTME) to address the above two issues simultaneously. Specifically, RTME can alternately switch modes between truncated M-estimators and original M-estimators. The former can adaptively select small-losses examples without knowing the noise rate and reduce the side-effects of noisy labels in them. The latter makes the possibly clean examples but with large losses involved to help generalization. Theoretically, we demonstrate that our strategies are label-noise-tolerant. Empirically, comprehensive experimental results show that our method can outperform multiple baselines and is robust to broad noise types and levels., Comment: 16 pages, 11 tables, 9 figures
Published: 2023

17. Making Binary Classification from Multiple Unlabeled Datasets Almost Free of Supervision

Author: Wu, Yuhao, Xia, Xiaobo, Yu, Jun, Han, Bo, Niu, Gang, Sugiyama, Masashi, and Liu, Tongliang
Subjects: Computer Science - Machine Learning
Abstract: Training a classifier exploiting a huge amount of supervised data is expensive or even prohibited in a situation, where the labeling cost is high. The remarkable progress in working with weaker forms of supervision is binary classification from multiple unlabeled datasets which requires the knowledge of exact class priors for all unlabeled datasets. However, the availability of class priors is restrictive in many real-world scenarios. To address this issue, we propose to solve a new problem setting, i.e., binary classification from multiple unlabeled datasets with only one pairwise numerical relationship of class priors (MU-OPPO), which knows the relative order (which unlabeled dataset has a higher proportion of positive examples) of two class-prior probabilities for two datasets among multiple unlabeled datasets. In MU-OPPO, we do not need the class priors for all unlabeled datasets, but we only require that there exists a pair of unlabeled datasets for which we know which unlabeled dataset has a larger class prior. Clearly, this form of supervision is easier to be obtained, which can make labeling costs almost free. We propose a novel framework to handle the MU-OPPO problem, which consists of four sequential modules: (i) pseudo label assignment; (ii) confident example collection; (iii) class prior estimation; (iv) classifier training with estimated class priors. Theoretically, we analyze the gap between estimated class priors and true class priors under the proposed framework. Empirically, we confirm the superiority of our framework with comprehensive experiments. Experimental results demonstrate that our framework brings smaller estimation errors of class priors and better performance of binary classification., Comment: 38 pages, 5 figures, 10 tables
Published: 2023

18. Transferring Annotator- and Instance-dependent Transition Matrix for Learning from Crowds

Author: Li, Shikun, Xia, Xiaobo, Deng, Jiankang, Ge, Shiming, and Liu, Tongliang
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Learning from crowds describes that the annotations of training data are obtained with crowd-sourcing services. Multiple annotators each complete their own small part of the annotations, where labeling mistakes that depend on annotators occur frequently. Modeling the label-noise generation process by the noise transition matrix is a power tool to tackle the label noise. In real-world crowd-sourcing scenarios, noise transition matrices are both annotator- and instance-dependent. However, due to the high complexity of annotator- and instance-dependent transition matrices (AIDTM), annotation sparsity, which means each annotator only labels a little part of instances, makes modeling AIDTM very challenging. Prior works simplify the problem by assuming the transition matrix is instance-independent or using simple parametric ways, which lose modeling generality. Motivated by this, we target a more realistic problem, estimating general AIDTM in practice. Without losing modeling generality, we parameterize AIDTM with deep neural networks. To alleviate the modeling challenge, we suppose every annotator shares its noise pattern with similar annotators, and estimate AIDTM via knowledge transfer. We hence first model the mixture of noise patterns by all annotators, and then transfer this modeling to individual annotators. Furthermore, considering that the transfer from the mixture of noise patterns to individuals may cause two annotators with highly different noise generations to perturb each other, we employ the knowledge transfer between identified neighboring annotators to calibrate the modeling. Theoretical analyses are derived to demonstrate that both the knowledge transfer from global to individuals and the knowledge transfer between neighboring individuals can help model general AIDTM. Experiments confirm the superiority of the proposed approach on synthetic and real-world crowd-sourcing data., Comment: Accepted by IEEE TPAMI. 22 pages, 4 figures, and 8 tables
Published: 2023

19. Genome-wide identification of the sorghum OVATE gene family and revelation of its expression characteristics in sorghum seeds and leaves

Author: An, Yanlin, Xia, Xiaobo, Zhang, Xiaoqin, Liu, Li, Jiang, Sixia, Jing, Tingting, and Zhang, Feng
Published: 2024
Full Text: View/download PDF

20. Robust Generalization against Photon-Limited Corruptions via Worst-Case Sharpness Minimization

Author: Huang, Zhuo, Zhu, Miaoxi, Xia, Xiaobo, Shen, Li, Yu, Jun, Gong, Chen, Han, Bo, Du, Bo, and Liu, Tongliang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Robust generalization aims to tackle the most challenging data distributions which are rare in the training set and contain severe noises, i.e., photon-limited corruptions. Common solutions such as distributionally robust optimization (DRO) focus on the worst-case empirical risk to ensure low training error on the uncommon noisy distributions. However, due to the over-parameterized model being optimized on scarce worst-case data, DRO fails to produce a smooth loss landscape, thus struggling on generalizing well to the test set. Therefore, instead of focusing on the worst-case risk minimization, we propose SharpDRO by penalizing the sharpness of the worst-case distribution, which measures the loss changes around the neighbor of learning parameters. Through worst-case sharpness minimization, the proposed method successfully produces a flat loss curve on the corrupted distributions, thus achieving robust generalization. Moreover, by considering whether the distribution annotation is available, we apply SharpDRO to two problem settings and design a worst-case selection process for robust generalization. Theoretically, we show that SharpDRO has a great convergence guarantee. Experimentally, we simulate photon-limited corruptions using CIFAR10/100 and ImageNet30 datasets and show that SharpDRO exhibits a strong generalization ability against severe corruptions and exceeds well-known baseline methods with large performance gains., Comment: CVPR 2023
Published: 2023

21. Dynamics-Aware Loss for Learning with Label Noise

Author: Li, Xiu-Chuan, Xia, Xiaobo, Zhu, Fei, Liu, Tongliang, Zhang, Xu-Yao, and Liu, Cheng-Lin
Subjects: Computer Science - Machine Learning
Abstract: Label noise poses a serious threat to deep neural networks (DNNs). Employing robust loss functions which reconcile fitting ability with robustness is a simple but effective strategy to handle this problem. However, the widely-used static trade-off between these two factors contradicts the dynamics of DNNs learning with label noise, leading to inferior performance. Therefore, we propose a dynamics-aware loss (DAL) to solve this problem. Considering that DNNs tend to first learn beneficial patterns, then gradually overfit harmful label noise, DAL strengthens the fitting ability initially, then gradually improves robustness. Moreover, at the later stage, to further reduce the negative impact of label noise and combat underfitting simultaneously, we let DNNs put more emphasis on easy examples than hard ones and introduce a bootstrapping term. Both the detailed theoretical analyses and extensive experimental results demonstrate the superiority of our method. Our source code can be found in https://github.com/XiuchuanLi/DAL., Comment: accepted by Pattern Recognition Journal
Published: 2023

22. HumanMAC: Masked Motion Completion for Human Motion Prediction

Author: Chen, Ling-Hao, Zhang, Jiawei, Li, Yewen, Pang, Yiren, Xia, Xiaobo, and Liu, Tongliang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Human motion prediction is a classical problem in computer vision and computer graphics, which has a wide range of practical applications. Previous effects achieve great empirical performance based on an encoding-decoding style. The methods of this style work by first encoding previous motions to latent representations and then decoding the latent representations into predicted motions. However, in practice, they are still unsatisfactory due to several issues, including complicated loss constraints, cumbersome training processes, and scarce switch of different categories of motions in prediction. In this paper, to address the above issues, we jump out of the foregoing style and propose a novel framework from a new perspective. Specifically, our framework works in a masked completion fashion. In the training stage, we learn a motion diffusion model that generates motions from random noise. In the inference stage, with a denoising procedure, we make motion prediction conditioning on observed motions to output more continuous and controllable predictions. The proposed framework enjoys promising algorithmic properties, which only needs one loss in optimization and is trained in an end-to-end manner. Additionally, it accomplishes the switch of different categories of motions effectively, which is significant in realistic tasks, e.g., the animation task. Comprehensive experiments on benchmarks confirm the superiority of the proposed framework. The project page is available at https://lhchen.top/Human-MAC., Comment: Accepted by ICCV 2023
Published: 2023

23. Harnessing Out-Of-Distribution Examples via Augmenting Content and Style

Author: Huang, Zhuo, Xia, Xiaobo, Shen, Li, Han, Bo, Gong, Mingming, Gong, Chen, and Liu, Tongliang
Subjects: Computer Science - Machine Learning
Abstract: Machine learning models are vulnerable to Out-Of-Distribution (OOD) examples, and such a problem has drawn much attention. However, current methods lack a full understanding of different types of OOD data: there are benign OOD data that can be properly adapted to enhance the learning performance, while other malign OOD data would severely degenerate the classification result. To Harness OOD data, this paper proposes a HOOD method that can leverage the content and style from each image instance to identify benign and malign OOD data. Particularly, we design a variational inference framework to causally disentangle content and style features by constructing a structural causal model. Subsequently, we augment the content and style through an intervention process to produce malign and benign OOD data, respectively. The benign OOD data contain novel styles but hold our interested contents, and they can be leveraged to help train a style-invariant model. In contrast, the malign OOD data inherit unknown contents but carry familiar styles, by detecting them can improve model robustness against deceiving anomalies. Thanks to the proposed novel disentanglement and data augmentation techniques, HOOD can effectively deal with OOD examples in unknown and open environments, whose effectiveness is empirically validated in three typical OOD applications including OOD detection, open-set semi-supervised learning, and open-set domain adaptation., Comment: ICLR 2023
Published: 2022

24. Pluralistic Image Completion with Probabilistic Mixture-of-Experts

Author: Xia, Xiaobo, Yang, Wenhao, Ren, Jie, Li, Yewen, Zhan, Yibing, Han, Bo, and Liu, Tongliang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Pluralistic image completion focuses on generating both visually realistic and diverse results for image completion. Prior methods enjoy the empirical successes of this task. However, their used constraints for pluralistic image completion are argued to be not well interpretable and unsatisfactory from two aspects. First, the constraints for visual reality can be weakly correlated to the objective of image completion or even redundant. Second, the constraints for diversity are designed to be task-agnostic, which causes the constraints to not work well. In this paper, to address the issues, we propose an end-to-end probabilistic method. Specifically, we introduce a unified probabilistic graph model that represents the complex interactions in image completion. The entire procedure of image completion is then mathematically divided into several sub-procedures, which helps efficient enforcement of constraints. The sub-procedure directly related to pluralistic results is identified, where the interaction is established by a Gaussian mixture model (GMM). The inherent parameters of GMM are task-related, which are optimized adaptively during training, while the number of its primitives can control the diversity of results conveniently. We formally establish the effectiveness of our method and demonstrate it with comprehensive experiments.
Published: 2022

25. Selective-Supervised Contrastive Learning with Noisy Labels

Author: Li, Shikun, Xia, Xiaobo, Ge, Shiming, and Liu, Tongliang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Deep networks have strong capacities of embedding data into latent representations and finishing following tasks. However, the capacities largely come from high-quality annotated labels, which are expensive to collect. Noisy labels are more affordable, but result in corrupted representations, leading to poor generalization performance. To learn robust representations and handle noisy labels, we propose selective-supervised contrastive learning (Sel-CL) in this paper. Specifically, Sel-CL extend supervised contrastive learning (Sup-CL), which is powerful in representation learning, but is degraded when there are noisy labels. Sel-CL tackles the direct cause of the problem of Sup-CL. That is, as Sup-CL works in a \textit{pair-wise} manner, noisy pairs built by noisy labels mislead representation learning. To alleviate the issue, we select confident pairs out of noisy ones for Sup-CL without knowing noise rates. In the selection process, by measuring the agreement between learned representations and given labels, we first identify confident examples that are exploited to build confident pairs. Then, the representation similarity distribution in the built confident pairs is exploited to identify more confident pairs out of noisy pairs. All obtained confident pairs are finally used for Sup-CL to enhance representations. Experiments on multiple noisy datasets demonstrate the robustness of the learned representations by our method, following the state-of-the-art performance. Source codes are available at https://github.com/ShikunLi/Sel-CL, Comment: Accepted to CVPR 2022. 12 pages, 5 figure, and 10 tables
Published: 2022

26. Injectable, antioxidative, and loaded with exosomes/Liproxstatin-1 hydrogel as a potential treatment for retinal ischemia–reperfusion by inhibiting ferroptosis and apoptosis

Author: You, Mengling, Rong, Rong, Liang, Zhuotao, Xie, Sijian, Ma, Xiaoya, and Xia, Xiaobo
Published: 2024
Full Text: View/download PDF

27. Objects in Semantic Topology

Author: Yang, Shuo, Sun, Peize, Jiang, Yi, Xia, Xiaobo, Zhang, Ruiheng, Yuan, Zehuan, Wang, Changhu, Luo, Ping, and Xu, Min
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: A more realistic object detection paradigm, Open-World Object Detection, has arisen increasing research interests in the community recently. A qualified open-world object detector can not only identify objects of known categories, but also discover unknown objects, and incrementally learn to categorize them when their annotations progressively arrive. Previous works rely on independent modules to recognize unknown categories and perform incremental learning, respectively. In this paper, we provide a unified perspective: Semantic Topology. During the life-long learning of an open-world object detector, all object instances from the same category are assigned to their corresponding pre-defined node in the semantic topology, including the `unknown' category. This constraint builds up discriminative feature representations and consistent relationships among objects, thus enabling the detector to distinguish unknown objects out of the known categories, as well as making learned features of known objects undistorted when learning new categories incrementally. Extensive experiments demonstrate that semantic topology, either randomly-generated or derived from a well-trained language model, could outperform the current state-of-the-art open-world object detectors by a large margin, e.g., the absolute open-set error is reduced from 7832 to 2546, exhibiting the inherent superiority of semantic topology on open-world object detection., Comment: ICLR 2022
Published: 2021

28. Multi-genome comprehensive identification of SSR/SV and development of molecular markers database to serve Sorghum bicolor (L.) breeding

Author: An, Yanlin, Xia, Xiaobo, Zheng, Huayan, Yu, Shirui, Jing, Tingting, and Zhang, Feng
Published: 2023
Full Text: View/download PDF

29. Endoplasmic reticulum stress: molecular mechanism and therapeutic targets

Author: Chen, Xingyi, Shi, Chaoran, He, Meihui, Xiong, Siqi, and Xia, Xiaobo
Published: 2023
Full Text: View/download PDF

30. Genome-wide identification of the PYL gene family of tea plants (Camellia sinensis) revealed its expression profiles under different stress and tissues

Author: An, Yanlin, Mi, Xiaozeng, Xia, Xiaobo, Qiao, Dahe, Yu, Shirui, Zheng, Huayan, Jing, Tingting, and Zhang, Feng
Published: 2023
Full Text: View/download PDF

31. Regional differences of the sclera in the ocular hypertensive rat model induced by circumlimbal suture

Author: Xia, Mingfang, Zhang, Endong, Yao, Fei, Xia, Zhaohua, Zhou, Mingmin, Ran, Xufang, and Xia, Xiaobo
Published: 2023
Full Text: View/download PDF

32. Kernel Mean Estimation by Marginalized Corrupted Distributions

Author: Xia, Xiaobo, Shan, Shuo, Gong, Mingming, Wang, Nannan, Gao, Fei, Wei, Haikun, and Liu, Tongliang
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Estimating the kernel mean in a reproducing kernel Hilbert space is a critical component in many kernel learning algorithms. Given a finite sample, the standard estimate of the target kernel mean is the empirical average. Previous works have shown that better estimators can be constructed by shrinkage methods. In this work, we propose to corrupt data examples with noise from known distributions and present a new kernel mean estimator, called the marginalized kernel mean estimator, which estimates kernel mean under the corrupted distribution. Theoretically, we show that the marginalized kernel mean estimator introduces implicit regularization in kernel mean estimation. Empirically, we show on a variety of datasets that the marginalized kernel mean estimator obtains much lower estimation error than the existing estimators.
Published: 2021

33. Instance Correction for Learning with Open-set Noisy Labels

Author: Xia, Xiaobo, Liu, Tongliang, Han, Bo, Gong, Mingming, Yu, Jun, Niu, Gang, and Sugiyama, Masashi
Subjects: Computer Science - Machine Learning
Abstract: The problem of open-set noisy labels denotes that part of training data have a different label space that does not contain the true class. Lots of approaches, e.g., loss correction and label correction, cannot handle such open-set noisy labels well, since they need training data and test data to share the same label space, which does not hold for learning with open-set noisy labels. The state-of-the-art methods thus employ the sample selection approach to handle open-set noisy labels, which tries to select clean data from noisy data for network parameters updates. The discarded data are seen to be mislabeled and do not participate in training. Such an approach is intuitive and reasonable at first glance. However, a natural question could be raised "can such data only be discarded during training?". In this paper, we show that the answer is no. Specifically, we discuss that the instances of discarded data could consist of some meaningful information for generalization. For this reason, we do not abandon such data, but use instance correction to modify the instances of the discarded data, which makes the predictions for the discarded data consistent with given labels. Instance correction are performed by targeted adversarial attacks. The corrected data are then exploited for training to help generalization. In addition to the analytical results, a series of empirical evidences are provided to justify our claims.
Published: 2021

34. Sample Selection with Uncertainty of Losses for Learning with Noisy Labels

Author: Xia, Xiaobo, Liu, Tongliang, Han, Bo, Gong, Mingming, Yu, Jun, Niu, Gang, and Sugiyama, Masashi
Subjects: Computer Science - Machine Learning
Abstract: In learning with noisy labels, the sample selection approach is very popular, which regards small-loss data as correctly labeled during training. However, losses are generated on-the-fly based on the model being trained with noisy labels, and thus large-loss data are likely but not certainly to be incorrect. There are actually two possibilities of a large-loss data point: (a) it is mislabeled, and then its loss decreases slower than other data, since deep neural networks "learn patterns first"; (b) it belongs to an underrepresented group of data and has not been selected yet. In this paper, we incorporate the uncertainty of losses by adopting interval estimation instead of point estimation of losses, where lower bounds of the confidence intervals of losses derived from distribution-free concentration inequalities, but not losses themselves, are used for sample selection. In this way, we also give large-loss but less selected data a try; then, we can better distinguish between the cases (a) and (b) by seeing if the losses effectively decrease with the uncertainty after the try. As a result, we can better explore underrepresented data that are correctly labeled but seem to be mislabeled at first glance. Experiments demonstrate that the proposed method is superior to baselines and robust to a broad range of label noise types.
Published: 2021

35. The correlation between primary open-angle glaucoma (POAG) and gut microbiota: a pilot study towards predictive, preventive, and personalized medicine

Author: Chen, Si, Wang, Nan, Xiong, Siqi, and Xia, Xiaobo
Published: 2023
Full Text: View/download PDF

36. The Epidemiology and Clinical Characteristics of Fungemia in a Tertiary Hospital in Southern China: A 6-Year Retrospective Study

Author: Chen, Meini, Hu, Dongmei, Li, Tianmin, Zheng, Dongyan, Liao, Wanqing, Xia, Xiaobo, and Cao, Cunwei
Published: 2023
Full Text: View/download PDF

37. A phase-separated protein hub modulates resistance to Fusarium head blight in wheat

Author: He, Yi, Yang, Xiujuan, Xia, Xiaobo, Wang, Yuhua, Dong, Yifan, Wu, Lei, Jiang, Peng, Zhang, Xu, Jiang, Cong, Ma, Hongxiang, Ma, Wujun, Liu, Cong, Whitford, Ryan, Tucker, Matthew R., Zhang, Zhengguang, and Li, Gang
Published: 2024
Full Text: View/download PDF

38. JA-mediated MYC2/LOX/AOS feedback loop regulates osmotic stress response in tea plant

Author: Zhu, Junyan, Chen, Hongrong, Liu, Lu, Xia, Xiaobo, Yan, Xiaomei, Mi, Xiaozeng, Liu, Shengrui, and Wei, Chaoling
Published: 2024
Full Text: View/download PDF

39. Extended T: Learning with Mixed Closed-set and Open-set Noisy Labels

Author: Xia, Xiaobo, Liu, Tongliang, Han, Bo, Wang, Nannan, Deng, Jiankang, Li, Jiatong, and Mao, Yinian
Subjects: Computer Science - Machine Learning
Abstract: The label noise transition matrix $T$, reflecting the probabilities that true labels flip into noisy ones, is of vital importance to model label noise and design statistically consistent classifiers. The traditional transition matrix is limited to model closed-set label noise, where noisy training data has true class labels within the noisy label set. It is unfitted to employ such a transition matrix to model open-set label noise, where some true class labels are outside the noisy label set. Thus when considering a more realistic situation, i.e., both closed-set and open-set label noise occurs, existing methods will undesirably give biased solutions. Besides, the traditional transition matrix is limited to model instance-independent label noise, which may not perform well in practice. In this paper, we focus on learning under the mixed closed-set and open-set label noise. We address the aforementioned issues by extending the traditional transition matrix to be able to model mixed label noise, and further to the cluster-dependent transition matrix to better approximate the instance-dependent label noise in real-world applications. We term the proposed transition matrix as the cluster-dependent extended transition matrix. An unbiased estimator (i.e., extended $T$-estimator) has been designed to estimate the cluster-dependent extended transition matrix by only exploiting the noisy data. Comprehensive synthetic and real experiments validate that our method can better model the mixed label noise, following its more robust performance than the prior state-of-the-art label-noise learning methods.
Published: 2020

40. New insight of metabolomics in ocular diseases in the context of 3P medicine

Author: Zhang, Quyan, Wang, Nan, Rui, Yuhua, Xia, Yang, Xiong, Siqi, and Xia, Xiaobo
Published: 2023
Full Text: View/download PDF

41. Class2Simi: A Noise Reduction Perspective on Learning with Noisy Labels

Author: Wu, Songhua, Xia, Xiaobo, Liu, Tongliang, Han, Bo, Gong, Mingming, Wang, Nannan, Liu, Haifeng, and Niu, Gang
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Learning with noisy labels has attracted a lot of attention in recent years, where the mainstream approaches are in pointwise manners. Meanwhile, pairwise manners have shown great potential in supervised metric learning and unsupervised contrastive learning. Thus, a natural question is raised: does learning in a pairwise manner mitigate label noise? To give an affirmative answer, in this paper, we propose a framework called Class2Simi: it transforms data points with noisy class labels to data pairs with noisy similarity labels, where a similarity label denotes whether a pair shares the class label or not. Through this transformation, the reduction of the noise rate is theoretically guaranteed, and hence it is in principle easier to handle noisy similarity labels. Amazingly, DNNs that predict the clean class labels can be trained from noisy data pairs if they are first pretrained from noisy data points. Class2Simi is computationally efficient because not only this transformation is on-the-fly in mini-batches, but also it just changes loss computation on top of model prediction into a pairwise manner. Its effectiveness is verified by extensive experiments.
Published: 2020

42. Part-dependent Label Noise: Towards Instance-dependent Label Noise

Author: Xia, Xiaobo, Liu, Tongliang, Han, Bo, Wang, Nannan, Gong, Mingming, Liu, Haifeng, Niu, Gang, Tao, Dacheng, and Sugiyama, Masashi
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Learning with the \textit{instance-dependent} label noise is challenging, because it is hard to model such real-world noise. Note that there are psychological and physiological evidences showing that we humans perceive instances by decomposing them into parts. Annotators are therefore more likely to annotate instances based on the parts rather than the whole instances, where a wrong mapping from parts to classes may cause the instance-dependent label noise. Motivated by this human cognition, in this paper, we approximate the instance-dependent label noise by exploiting \textit{part-dependent} label noise. Specifically, since instances can be approximately reconstructed by a combination of parts, we approximate the instance-dependent \textit{transition matrix} for an instance by a combination of the transition matrices for the parts of the instance. The transition matrices for parts can be learned by exploiting anchor points (i.e., data points that belong to a specific class almost surely). Empirical evaluations on synthetic and real-world datasets demonstrate our method is superior to the state-of-the-art approaches for learning from the instance-dependent label noise.
Published: 2020

43. Multi-Class Classification from Noisy-Similarity-Labeled Data

Author: Wu, Songhua, Xia, Xiaobo, Liu, Tongliang, Han, Bo, Gong, Mingming, Wang, Nannan, Liu, Haifeng, and Niu, Gang
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: A similarity label indicates whether two instances belong to the same class while a class label shows the class of the instance. Without class labels, a multi-class classifier could be learned from similarity-labeled pairwise data by meta classification learning. However, since the similarity label is less informative than the class label, it is more likely to be noisy. Deep neural networks can easily remember noisy data, leading to overfitting in classification. In this paper, we propose a method for learning from only noisy-similarity-labeled data. Specifically, to model the noise, we employ a noise transition matrix to bridge the class-posterior probability between clean and noisy data. We further estimate the transition matrix from only noisy data and build a novel learning system to learn a classifier which can assign noise-free class labels for instances. Moreover, we theoretically justify how our proposed method generalizes for learning classifiers. Experimental results demonstrate the superiority of the proposed method over the state-of-the-art method on benchmark-simulated and real-world noisy-label datasets.
Published: 2020

44. Phosphoinositide 3-kinase as a therapeutic target in angiogenic disease

Author: Wu, Wenyi, Xia, Xiaobo, Tang, Luosheng, Luo, Jing, Xiong, Siqi, Ma, Gaoen, and Lei, Hetian
Published: 2023
Full Text: View/download PDF

45. Comparison of clinical outcomes between cystotome-assisted prechop phacoemulsification surgery and femtosecond laser-assisted cataract surgery for hard nucleus cataracts

Author: He, Ye, Wang, Chao, Zhou, Xuezhi, Peng, Jingjie, Zhang, Xuan, Wang, Yujue, Rui, Yuhua, Zhang, Cheng, Zhang, Wulong, Feng, Lemeng, Dai, Shirui, Xia, Xiaobo, and Song, Weitao
Published: 2023
Full Text: View/download PDF

46. Pathologically high intraocular pressure disturbs normal iron homeostasis and leads to retinal ganglion cell ferroptosis in glaucoma

Author: Yao, Fei, Peng, Jingjie, Zhang, Endong, Ji, Dan, Gao, Zhaolin, Tang, Yixiong, Yao, Xueyan, and Xia, Xiaobo
Published: 2023
Full Text: View/download PDF

47. Are Anchor Points Really Indispensable in Label-Noise Learning?

Author: Xia, Xiaobo, Liu, Tongliang, Wang, Nannan, Han, Bo, Gong, Chen, Niu, Gang, and Sugiyama, Masashi
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: In label-noise learning, \textit{noise transition matrix}, denoting the probabilities that clean labels flip into noisy labels, plays a central role in building \textit{statistically consistent classifiers}. Existing theories have shown that the transition matrix can be learned by exploiting \textit{anchor points} (i.e., data points that belong to a specific class almost surely). However, when there are no anchor points, the transition matrix will be poorly learned, and those current consistent classifiers will significantly degenerate. In this paper, without employing anchor points, we propose a \textit{transition-revision} ($T$-Revision) method to effectively learn transition matrices, leading to better classifiers. Specifically, to learn a transition matrix, we first initialize it by exploiting data points that are similar to anchor points, having high \textit{noisy class posterior probabilities}. Then, we modify the initialized matrix by adding a \textit{slack variable}, which can be learned and validated together with the classifier by using noisy data. Empirical results on benchmark-simulated and real-world label-noise datasets demonstrate that without using exact anchor points, the proposed method is superior to the state-of-the-art label-noise learning methods., Comment: Accepted by NeurIPS 2019
Published: 2019

48. Translocator protein 18 kDa regulates retinal neuron apoptosis and pyroptosis in glaucoma

Author: Zeng, Zhou, You, Mengling, Rong, Rong, Fan, Cong, Chen, Meini, Li, Haibo, Ji, Dan, and Xia, Xiaobo
Published: 2023
Full Text: View/download PDF

49. Pathologically high intraocular pressure induces mitochondrial dysfunction through Drp1 and leads to retinal ganglion cell PANoptosis in glaucoma

Author: Zeng, Zhou, You, Mengling, Fan, Cong, Rong, Rong, Li, Haibo, and Xia, Xiaobo
Published: 2023
Full Text: View/download PDF

50. Simultaneously deplete reactive oxygen species and inhibit pyroptosis by dopamine/thioketal-containing polymers delivering disulfiram in combination with Cu(II) for acute glaucoma

Author: Zhou, Xuezhi, Rong, Rong, Liang, Ganghao, Li, Haibo, You, Mengling, Zeng, Zhou, Xiao, Haihua, Ji, Dan, and Xia, Xiaobo
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

667 results on '"Xia, Xiaobo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources