Author: "An, Huaming" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"An, Huaming"' showing total 15,981 results

1. AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems

Author: Zhu, Zhiyu, Jin, Zhibo, Hu, Hongsheng, Xue, Minhui, Sun, Ruoxi, Camtepe, Seyit, Gauravaram, Praveen, and Chen, Huaming
Subjects: Computer Science - Artificial Intelligence
Abstract: AI systems, in particular with deep learning techniques, have demonstrated superior performance for various real-world applications. Given the need for tailored optimization in specific scenarios, as well as the concerns related to the exploits of subsurface vulnerabilities, more comprehensive and in-depth testing of AI systems has become a pivotal topic. We have seen the emergence of testing tools in real-world applications that aim to expand testing capabilities. However, they often concentrate on ad-hoc tasks, rendering them unsuitable for simultaneously testing multiple aspects or components. Furthermore, trustworthiness issues arising from adversarial attacks and the difficulty of interpreting deep learning models pose new challenges for developing more comprehensive and in-depth AI system testing tools. In this study, we design and implement a testing tool, AI-Compass, to comprehensively and effectively evaluate AI systems. The tool extensively assesses multiple measurements of adversarial robustness and model interpretability, and performs neuron analysis. The feasibility of the proposed testing tool is thoroughly validated across various modalities, including image classification, object detection, and text classification. Extensive experiments demonstrate that AI-Compass is the state-of-the-art tool for a comprehensive assessment of the robustness and trustworthiness of AI systems. Our research sheds light on a general solution for the AI systems testing landscape.
Published: 2024

2. CAKD: A Correlation-Aware Knowledge Distillation Framework Based on Decoupling Kullback-Leibler Divergence

Author: Zhang, Zao, Chen, Huaming, Ning, Pei, Yang, Nan, and Yuan, Dong
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: In knowledge distillation, a primary focus has been on transforming and balancing multiple distillation components. In this work, we emphasize the importance of thoroughly examining each distillation component, as we observe that not all elements are equally crucial. From this perspective, we decouple the Kullback-Leibler (KL) divergence into three unique elements: Binary Classification Divergence (BCD), Strong Correlation Divergence (SCD), and Weak Correlation Divergence (WCD). Each of these elements presents varying degrees of influence. Leveraging these insights, we present the Correlation-Aware Knowledge Distillation (CAKD) framework. CAKD is designed to prioritize the facets of the distillation components that have the most substantial influence on predictions, thereby optimizing knowledge transfer from teacher to student models. Our experiments demonstrate that adjusting the effect of each element enhances the effectiveness of knowledge transformation. Furthermore, evidence shows that our novel CAKD framework consistently outperforms the baseline across diverse models and datasets. Our work further highlights the importance and effectiveness of closely examining the impact of different parts of the distillation process.
Published: 2024
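
For orientation, the quantity that the CAKD abstract decouples is the Kullback-Leibler divergence between teacher and student predictions. The sketch below shows only the conventional, undecomposed temperature-scaled KL distillation loss. The abstract does not define the BCD, SCD, and WCD terms, so they are not reproduced here; the function names and the default temperature are illustrative assumptions, not the authors' code.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax, shifted by the maximum for numerical stability."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def distillation_loss(teacher_logits, student_logits, temperature=4.0):
    """Conventional KD loss: T^2 * KL(teacher || student) on softened outputs.

    The temperature value is an assumed default, not taken from the paper.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return temperature ** 2 * kl_divergence(p, q)


if __name__ == "__main__":
    teacher = [5.0, 1.0, 0.5]
    student = [3.0, 2.0, 0.1]
    print(distillation_loss(teacher, student))
```

The loss is zero when the student reproduces the teacher's logits and grows as the softened distributions diverge.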

3. RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance

Author: Jin, Haolin, Sun, Zechao, and Chen, Huaming
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Large Language Models (LLMs) have shown incredible potential in code generation tasks, and recent research in prompt engineering has enhanced LLMs' understanding of textual information. However, ensuring the accuracy of generated code often requires extensive testing and validation by programmers. While LLMs can typically generate code based on task descriptions, their accuracy remains limited, especially for complex tasks that require a deeper understanding of both the problem statement and the code generation process. This limitation is primarily due to the LLMs' need to simultaneously comprehend text and generate syntactically and semantically correct code, without having the capability to automatically refine the code. In real-world software development, programmers rarely produce flawless code in a single attempt based on the task description alone; they rely on iterative feedback and debugging to refine their programs. Inspired by this process, we introduce a novel architecture of LLM-based agents for code generation and automatic debugging: Refinement and Guidance Debugging (RGD). The RGD framework is a multi-LLM-based agent debugger that leverages three distinct LLM agents: Guide Agent, Debug Agent, and Feedback Agent. RGD decomposes the code generation task into multiple steps, ensuring a clearer workflow and enabling iterative code refinement based on self-reflection and feedback. Experimental results demonstrate that RGD exhibits remarkable code generation capabilities, achieving state-of-the-art performance with a 9.8% improvement on the HumanEval dataset and a 16.2% improvement on the MBPP dataset compared to the state-of-the-art approaches and traditional direct prompting approaches. We highlight the effectiveness of the RGD framework in enhancing LLMs' ability to generate and refine code autonomously.
Published: 2024

4. Leveraging Information Consistency in Frequency and Spatial Domain for Adversarial Attacks

Author: Jin, Zhibo, Zhang, Jiayu, Zhu, Zhiyu, Wang, Xinyi, Huang, Yiyun, and Chen, Huaming
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Adversarial examples are a key method to exploit deep neural networks. Using gradient information, such examples can be generated in an efficient way without altering the victim model. Recent frequency domain transformation has further enhanced the transferability of such adversarial examples, such as spectrum simulation attack. In this work, we investigate the effectiveness of frequency domain-based attacks, aligning with similar findings in the spatial domain. Furthermore, such consistency between the frequency and spatial domains provides insights into how gradient-based adversarial attacks induce perturbations across different domains, which is yet to be explored. Hence, we propose a simple, effective, and scalable gradient-based adversarial attack algorithm leveraging the information consistency in both frequency and spatial domains. We evaluate the algorithm for its effectiveness against different models. Extensive experiments demonstrate that our algorithm achieves state-of-the-art results compared to other gradient-based algorithms. Our code is available at: https://github.com/LMBTough/FSA., Comment: Accepted by PRICAI 2024
Published: 2024

5. Enhancing Model Interpretability with Local Attribution over Global Exploration

Author: Zhu, Zhiyu, Jin, Zhibo, Zhang, Jiayu, and Chen, Huaming
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: In the field of artificial intelligence, AI models are frequently described as 'black boxes' due to the obscurity of their internal mechanisms. This has ignited research interest in model interpretability, especially in attribution methods that offer precise explanations of model decisions. Current attribution algorithms typically evaluate the importance of each parameter by exploring the sample space. A large number of intermediate states are introduced during the exploration process, which may reach the model's Out-of-Distribution (OOD) space. Such intermediate states will impact the attribution results, making it challenging to grasp the relative importance of features. In this paper, we first define the local space and its relevant properties, and we propose the Local Attribution (LA) algorithm that leverages these properties. The LA algorithm comprises both targeted and untargeted exploration phases, which are designed to effectively generate intermediate states for attribution that thoroughly encompass the local space. Compared to the state-of-the-art attribution methods, our approach achieves an average improvement of 38.21% in attribution effectiveness. Extensive ablation studies in our experiments also validate the significance of each component in our algorithm. Our code is available at: https://github.com/LMBTough/LA/, Comment: Accepted by ACMMM 2024
Published: 2024

6. Deep Learning with Data Privacy via Residual Perturbation

Author: Tao, Wenqi, Ling, Huaming, Shi, Zuoqiang, and Wang, Bao
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Computer Vision and Pattern Recognition
Abstract: Protecting data privacy in deep learning (DL) is of crucial importance. Several celebrated privacy notions have been established and used for privacy-preserving DL. However, many existing mechanisms achieve privacy at the cost of significant utility degradation and computational overhead. In this paper, we propose a stochastic differential equation-based residual perturbation for privacy-preserving DL, which injects Gaussian noise into each residual mapping of ResNets. Theoretically, we prove that residual perturbation guarantees differential privacy (DP) and reduces the generalization gap of DL. Empirically, we show that residual perturbation is computationally efficient and outperforms the state-of-the-art differentially private stochastic gradient descent (DPSGD) in utility maintenance without sacrificing membership privacy.
Published: 2024

7. Fast and Scalable Semi-Supervised Learning for Multi-View Subspace Clustering

Author: Ling, Huaming, Bao, Chenglong, Song, Jiebo, and Shi, Zuoqiang
Subjects: Computer Science - Machine Learning
Abstract: In this paper, we introduce a Fast and Scalable Semi-supervised Multi-view Subspace Clustering (FSSMSC) method, a novel solution to the high computational complexity commonly found in existing approaches. FSSMSC features linear computational and space complexity relative to the size of the data. The method generates a consensus anchor graph across all views, representing each data point as a sparse linear combination of chosen landmarks. Unlike traditional methods that manage the anchor graph construction and the label propagation process separately, this paper proposes a unified optimization model that facilitates simultaneous learning of both. An effective alternating update algorithm with convergence guarantees is proposed to solve the unified optimization model. Additionally, the method employs the obtained anchor graph and landmarks' low-dimensional representations to deduce low-dimensional representations for raw data. Following this, a straightforward clustering approach is conducted on these low-dimensional representations to achieve the final clustering results. The effectiveness and efficiency of FSSMSC are validated through extensive experiments on multiple benchmark datasets of varying scales., Comment: 40 pages, 7 figures
Published: 2024

8. From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future

Author: Jin, Haolin, Huang, Linghan, Cai, Haipeng, Yan, Jun, Li, Bo, and Chen, Huaming
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: With the rise of large language models (LLMs), researchers are increasingly exploring their applications in various vertical domains, such as software engineering. LLMs have achieved remarkable success in areas including code generation and vulnerability detection. However, they also exhibit numerous limitations and shortcomings. LLM-based agents, a novel technology with the potential for Artificial General Intelligence (AGI), combine LLMs as the core for decision-making and action-taking, addressing some of the inherent limitations of LLMs such as lack of autonomy and self-improvement. Despite numerous studies and surveys exploring the possibility of using LLMs in software engineering, the literature lacks a clear distinction between LLMs and LLM-based agents. A unified standard and benchmarking to qualify an LLM solution as an LLM-based agent in its domain is still at an early stage. In this survey, we broadly investigate the current practice and solutions for LLMs and LLM-based agents for software engineering. In particular, we summarise six key topics: requirement engineering, code generation, autonomous decision-making, software design, test generation, and software maintenance. We review and differentiate the work of LLMs and LLM-based agents from these six topics, examining their differences and similarities in tasks, benchmarks, and evaluation metrics. Finally, we discuss the models and benchmarks used, providing a comprehensive analysis of their applications and effectiveness in software engineering. We anticipate this work will shed some light on pushing the boundaries of LLM-based agents in software engineering for future research.
Published: 2024

9. QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression

Author: Wang, Wenshan, Wang, Yihang, Fan, Yixing, Liao, Huaming, and Guo, Jiafeng
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: In-context learning (ICL) capabilities are foundational to the success of large language models (LLMs). Recently, context compression has attracted growing interest since it can largely reduce reasoning complexities and computation costs of LLMs. In this paper, we introduce a novel Query-gUIded aTtention cOmpression (QUITO) method, which leverages the attention of the question over the contexts to filter out useless information. Specifically, we take a trigger token to calculate the attention distribution of the context in response to the question. Based on the distribution, we propose three different filtering methods to satisfy the budget constraints of the context length. We evaluate QUITO using two widely-used datasets, namely, NaturalQuestions and ASQA. Experimental results demonstrate that QUITO significantly outperforms established baselines across various datasets and downstream LLMs, underscoring its effectiveness. Our code is available at https://github.com/Wenshansilvia/attention_compressor.
Published: 2024
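
The QUITO abstract describes filtering context by attention score under a length budget but does not spell out its three filtering methods. As a rough illustration of the general idea only, the sketch below keeps the highest-scoring sentences that fit a word budget and restores their original order. The scores are supplied by the caller (in the paper they would come from the model's attention); the function name, the sentence-level granularity, and the whitespace word count are all assumptions made for this example.

```python
def filter_context(sentences, scores, budget):
    """Keep the highest-scoring sentences whose total word count fits the budget.

    sentences: list of context sentences (strings)
    scores:    one relevance score per sentence (assumed to be precomputed)
    budget:    maximum number of whitespace-separated words to keep
    """
    if len(sentences) != len(scores):
        raise ValueError("sentences and scores must have the same length")

    # Visit sentences from most to least relevant.
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)

    kept, used = set(), 0
    for i in ranked:
        cost = len(sentences[i].split())
        if used + cost <= budget:
            kept.add(i)
            used += cost

    # Restore document order so the compressed context still reads naturally.
    return [sentences[i] for i in sorted(kept)]


if __name__ == "__main__":
    context = ["a b c", "d e", "f g h i"]
    print(filter_context(context, [0.1, 0.7, 0.2], budget=5))
```

A greedy pass like this is only one possible budget policy; it skips a sentence that does not fit and continues with lower-scoring ones that do.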

10. Threats and Defenses in Federated Learning Life Cycle: A Comprehensive Survey and Challenges

Author: Li, Yanli, Guo, Zhongliang, Yang, Nan, Chen, Huaming, Yuan, Dong, and Ding, Weiping
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Artificial Intelligence
Abstract: Federated Learning (FL) offers innovative solutions for privacy-preserving collaborative machine learning (ML). Despite its promising potential, FL is vulnerable to various attacks due to its distributed nature, affecting the entire life cycle of FL services. These threats can harm the model's utility or compromise participants' privacy, either directly or indirectly. In response, numerous defense frameworks have been proposed, demonstrating effectiveness in specific settings and scenarios. To provide a clear understanding of the current research landscape, this paper reviews the most representative and state-of-the-art threats and defense frameworks throughout the FL service life cycle. We start by identifying FL threats that harm utility and privacy, including those with potential or direct impacts. Then, we dive into the defense frameworks, analyze the relationship between threats and defenses, and compare the trade-offs among different defense strategies. Finally, we summarize current research bottlenecks and offer insights into future research directions to conclude this survey. We hope this survey sheds light on trustworthy FL research and contributes to the FL community.
Published: 2024

11. Edge AI: A Taxonomy, Systematic Review and Future Directions

Author: Gill, Sukhpal Singh, Golec, Muhammed, Hu, Jianmin, Xu, Minxian, Du, Junhui, Wu, Huaming, Walia, Guneet Kaur, Murugesan, Subramaniam Subramanian, Ali, Babar, Kumar, Mohit, Ye, Kejiang, Verma, Prabal, Kumar, Surendra, Cuadrado, Felix, and Uhlig, Steve
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Edge Artificial Intelligence (AI) incorporates a network of interconnected systems and devices that receive, cache, process, and analyze data in close communication with the location where the data is captured with AI technology. Recent advancements in AI efficiency, the widespread use of Internet of Things (IoT) devices, and the emergence of edge computing have unlocked the enormous scope of Edge AI. Edge AI aims to optimize data processing efficiency and velocity while ensuring data confidentiality and integrity. Despite being a relatively new field of research from 2014 to the present, it has shown significant and rapid development over the last five years. This article presents a systematic literature review for Edge AI to discuss the existing research, recent advancements, and future research directions. We created a collaborative edge AI learning system for cloud and edge computing analysis, including an in-depth study of the architectures that facilitate this mechanism. The taxonomy for Edge AI facilitates the classification and configuration of Edge AI systems while examining its potential influence across many fields by encompassing infrastructure, cloud computing, fog computing, services, use cases, ML and deep learning, and resource management. This study highlights the significance of Edge AI in processing real-time data at the edge of the network. Additionally, it emphasizes the research challenges encountered by Edge AI systems, including constraints on resources, vulnerabilities to security threats, and problems with scalability. Finally, this study highlights the potential future research directions that aim to address the current limitations of Edge AI by providing innovative solutions., Comment: Preprint Version Accepted for Publication in Springer Cluster Computing, 2024
Published: 2024

12. Fairpriori: Improving Biased Subgroup Discovery for Deep Neural Network Fairness

Author: Zhou, Kacy, Wen, Jiawen, Yang, Nan, Yuan, Dong, Lu, Qinghua, and Chen, Huaming
Subjects: Computer Science - Machine Learning, Computer Science - Computers and Society, Computer Science - Software Engineering
Abstract: While deep learning has become a core functional module of most software systems, concerns regarding the fairness of ML predictions have emerged as a significant issue that affects prediction results due to discrimination. Intersectional bias, which disproportionately affects members of subgroups, is a prime example of this. For instance, a machine learning model might exhibit bias against darker-skinned women, while not showing bias against individuals with darker skin or women. This problem calls for effective fairness testing before the deployment of such deep learning models in real-world scenarios. However, research into detecting such bias is currently limited compared to research on individual and group fairness. Existing tools to investigate intersectional bias lack important features such as support for multiple fairness metrics, fast and efficient computation, and user-friendly interpretation. This paper introduces Fairpriori, a novel biased subgroup discovery method, which aims to address these limitations. Fairpriori incorporates the frequent itemset generation algorithm to facilitate effective and efficient investigation of intersectional bias by producing fast fairness metric calculations on subgroups of a dataset. Through comparison with the state-of-the-art methods (e.g., Themis, FairFictPlay, and TestSGD) under similar conditions, Fairpriori demonstrates superior effectiveness and efficiency when identifying intersectional bias. Specifically, Fairpriori is easier to use and interpret, supports a wider range of use cases by accommodating multiple fairness metrics, and exhibits higher efficiency in computing fairness metrics. These findings showcase Fairpriori's potential for effectively uncovering subgroups affected by intersectional bias, supported by its open-source tooling at https://anonymous.4open.science/r/Fairpriori-0320., Comment: 11 pages
Published: 2024

13. Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo Labeling

Author: Li, Haoran, Li, Xingjian, Shi, Jiahua, Chen, Huaming, Du, Bo, Kihara, Daisuke, Barthelemy, Johan, Shen, Jun, and Xu, Min
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Cryo-Electron Tomography (cryo-ET) is a 3D imaging technology facilitating the study of macromolecular structures at near-atomic resolution. Recent volumetric segmentation approaches on cryo-ET images have drawn widespread interest in the biological sector. However, existing methods heavily rely on manually labeled data, which requires highly professional skills, thereby hindering the adoption of fully-supervised approaches for cryo-ET images. Some unsupervised domain adaptation (UDA) approaches have been designed to enhance the segmentation network performance using unlabeled data. However, applying these methods directly to cryo-ET image segmentation tasks remains challenging due to two main issues: 1) the source data, usually obtained through simulation, contain a certain level of noise, while the target data, collected directly from raw data in real-world scenarios, have unpredictable noise levels; 2) the source data used for training typically consist of known macromolecules, while the target domain data are often unknown, causing the model's segmenter to be biased towards these known macromolecules, leading to a domain shift problem. To address these challenges, in this work, we introduce the first voxel-wise unsupervised domain adaptation approach, termed Vox-UDA, specifically for cryo-ET subtomogram segmentation. Vox-UDA incorporates a noise generation module to simulate target-like noises in the source dataset for cross-noise level adaptation. Additionally, we propose a denoised pseudo-labeling strategy based on an improved Bilateral Filter to alleviate the domain shift problem. Experimental results on both simulated and real cryo-ET subtomogram datasets demonstrate the superiority of our proposed approach compared to state-of-the-art UDA methods., Comment: 11 pages
Published: 2024

14. On Security Weaknesses and Vulnerabilities in Deep Learning Systems

Author: Lai, Zhongzheng, Chen, Huaming, Sun, Ruoxi, Zhang, Yu, Xue, Minhui, and Yuan, Dong
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: The security guarantee of AI-enabled software systems (particularly using deep learning techniques as a functional core) is pivotal against the adversarial attacks exploiting software vulnerabilities. However, little attention has been paid to a systematic investigation of vulnerabilities in such systems. A common situation learned from the open source software community is that deep learning engineers frequently integrate off-the-shelf or open-source learning frameworks into their ecosystems. In this work, we specifically look into deep learning (DL) frameworks and perform the first systematic study of vulnerabilities in DL systems through a comprehensive analysis of identified vulnerabilities from Common Vulnerabilities and Exposures (CVE) and open-source DL tools, including TensorFlow, Caffe, OpenCV, Keras, and PyTorch. We propose a two-stream data analysis framework to explore vulnerability patterns from various databases. We investigate the unique DL frameworks and libraries development ecosystems that appear to be decentralized and fragmented. By revisiting the Common Weakness Enumeration (CWE) List, which provides the traditional software vulnerability related practices, we observed that it is more challenging to detect and fix the vulnerabilities throughout the DL systems lifecycle. Moreover, we conducted a large-scale empirical study of 3,049 DL vulnerabilities to better understand the patterns of vulnerability and the challenges in fixing them. We have released the full replication package at https://github.com/codelzz/Vulnerabilities4DLSystem. We anticipate that our study can advance the development of secure DL systems.
Published: 2024

15. DMS: Addressing Information Loss with More Steps for Pragmatic Adversarial Attacks

Author: Zhu, Zhiyu, Zhang, Jiayu, Wang, Xinyi, Jin, Zhibo, and Chen, Huaming
Subjects: Computer Science - Cryptography and Security, Computer Science - Machine Learning
Abstract: Despite the exceptional performance of deep neural networks (DNNs) across different domains, they are vulnerable to adversarial samples, in particular for tasks related to computer vision. Such vulnerability is further influenced by the digital container formats used in computers, where discrete numerical values are commonly used for storing the pixel values. This paper examines how information loss in file formats impacts the effectiveness of adversarial attacks. Notably, we observe a pronounced hindrance to the adversarial attack performance due to the information loss of the non-integer pixel values. To address this issue, we explore leveraging the gradient information of the attack samples within the model to mitigate the information loss. We introduce the Do More Steps (DMS) algorithm, which hinges on two core techniques: gradient ascent-based adversarial integerization (DMS-AI) and integrated gradients-based attribution selection (DMS-AS). Our goal is to alleviate this lossy process to retain the attack performance when storing these adversarial samples digitally. In particular, DMS-AI integerizes the non-integer pixel values according to the gradient direction, and DMS-AS selects the non-integer pixels by comparing attribution results. We conduct thorough experiments to assess the effectiveness of our approach, including the implementations of the DMS-AI and DMS-AS on two large-scale datasets with various latest gradient-based attack methods. Our empirical findings conclusively demonstrate the superiority of our proposed DMS-AI and DMS-AS pixel integerization methods over the standardised methods, such as rounding, truncating and upper approaches, in maintaining attack integrity.
Published: 2024
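
The DMS abstract contrasts standard integerization (rounding, truncating, and an "upper" approach) with choosing the integer according to the gradient direction. The sketch below illustrates that contrast on a flat list of pixel values. It is a simplified reading of the abstract, not the authors' DMS-AI implementation: the rule "positive loss gradient rounds up, negative rounds down", the interpretation of "upper" as the ceiling, and all function names are assumptions.

```python
import math


def integerize_round(pixels):
    """Baseline: round half up (avoids Python's round-half-to-even behaviour)."""
    return [math.floor(p + 0.5) for p in pixels]


def integerize_truncate(pixels):
    """Baseline: drop the fractional part."""
    return [int(p) for p in pixels]


def integerize_upper(pixels):
    """Baseline: always take the ceiling (assumed meaning of 'upper')."""
    return [math.ceil(p) for p in pixels]


def integerize_by_gradient(pixels, grads, low=0, high=255):
    """Pick floor or ceiling per pixel following the sign of the loss gradient.

    Assumed rule: a positive gradient means increasing the pixel increases the
    attack loss, so round up; a negative gradient rounds down; zero falls back
    to ordinary rounding. Results are clipped to the valid pixel range.
    """
    result = []
    for p, g in zip(pixels, grads):
        if g > 0:
            value = math.ceil(p)
        elif g < 0:
            value = math.floor(p)
        else:
            value = math.floor(p + 0.5)
        result.append(min(high, max(low, value)))
    return result


if __name__ == "__main__":
    adversarial = [10.2, 10.7, 254.9]
    gradient = [1.0, -1.0, 1.0]
    print(integerize_round(adversarial))
    print(integerize_by_gradient(adversarial, gradient))
```

In this toy example, ordinary rounding moves the first two pixels against the gradient sign, whereas the gradient-guided rule keeps each stored value on the side that the attack favours.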

16. OceanCastNet: A Deep Learning Ocean Wave Model with Energy Conservation

Author: Zhang, Ziliang, Yu, Huaming, and Ren, Danqin
Subjects: Physics - Atmospheric and Oceanic Physics, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Traditional wave forecasting models, although based on energy conservation equations, are computationally expensive. On the other hand, existing deep learning geophysical fluid models, while computationally efficient, often suffer from issues such as energy dissipation in long-term forecasts. This paper proposes a novel energy-balanced deep learning wave forecasting model called OceanCastNet (OCN). By incorporating wind fields at the current, previous, and future time steps, as well as wave fields at the current and previous time steps as input variables, OCN maintains energy balance within the model. Furthermore, the model employs adaptive Fourier operators as its core components and designs a masked loss function to better handle the impact of land-sea boundaries. A series of experiments on the ERA5 dataset demonstrate that OCN can achieve short-term forecast accuracy comparable to traditional models while exhibiting an understanding of the wave generation process. In comparative experiments under both normal and extreme conditions, OCN consistently outperforms the widely used WaveWatch III model in the industry. Even after long-term forecasting, OCN maintains a stable and energy-rich state. By further constructing a simple meteorological model, OCN-wind, which considers energy balance, this paper confirms the importance of energy constraints for improving the long-term forecast performance of deep learning meteorological models. This finding provides new ideas for future research on deep learning geophysical fluid models.
Published: 2024

17. MVMS-RCN: A Dual-Domain Unfolding CT Reconstruction with Multi-sparse-view and Multi-scale Refinement-correction

Author: Fan, Xiaohong, Chen, Ke, Yi, Huaming, Yang, Yin, and Zhang, Jianping
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: X-ray Computed Tomography (CT) is one of the most important diagnostic imaging techniques in clinical applications. Sparse-view CT imaging reduces the number of projection views to lower the radiation dose and alleviate the potential risk of radiation exposure. Most existing deep learning (DL) and deep unfolding sparse-view CT reconstruction methods: 1) do not fully use the projection data; 2) do not always link their architecture designs to a mathematical theory; 3) do not flexibly deal with multi-sparse-view reconstruction assignments. This paper aims to use mathematical ideas and design optimal DL imaging algorithms for sparse-view tomography reconstructions. We propose a novel dual-domain deep unfolding unified framework that offers a great deal of flexibility for multi-sparse-view CT reconstruction with different sampling views through a single model. This framework combines the theoretical advantages of model-based methods with the superior reconstruction performance of DL-based methods, resulting in the expected generalizability of DL. We propose a refinement module that utilizes the unfolding projection domain to refine full-sparse-view projection errors, as well as an image domain correction module that distills multi-scale geometric error corrections to reconstruct sparse-view CT. This provides us with a new way to explore the potential of projection information and a new perspective on designing network architectures. All parameters of our proposed framework are learnable end to end, and our method possesses the potential to be applied to plug-and-play reconstruction. Extensive experiments demonstrate that our framework is superior to other existing state-of-the-art methods. Our source code is available at https://github.com/fanxiaohong/MVMS-RCN., Comment: 12 pages, submitted
Published: 2024

18. Holistic Evaluation Metrics: Use Case Sensitive Evaluation Metrics for Federated Learning

Author: Li, Yanli, Ibrahim, Jehad, Chen, Huaming, Yuan, Dong, and Choo, Kim-Kwang Raymond
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: A large number of federated learning (FL) algorithms have been proposed for different applications and from varying perspectives. However, the evaluation of such approaches often relies on a single metric (e.g., accuracy). Such a practice fails to account for the unique demands and diverse requirements of different use cases. Thus, how to comprehensively evaluate an FL algorithm and determine the most suitable candidate for a designated use case remains an open question. To mitigate this research gap, we introduce the Holistic Evaluation Metrics (HEM) for FL in this work. Specifically, we collectively focus on three primary use cases, which are Internet of Things (IoT), smart devices, and institutions. The evaluation metric encompasses various aspects including accuracy, convergence, computational efficiency, fairness, and personalization. We then assign a respective importance vector for each use case, reflecting their distinct performance requirements and priorities. The HEM index is finally generated by integrating these metric components with their respective importance vectors. Through evaluating different FL algorithms in these three prevalent use cases, our experimental results demonstrate that HEM can effectively assess and identify the FL algorithms best suited to particular scenarios. We anticipate this work sheds light on the evaluation process for pragmatic FL algorithms in real-world applications.
Published: 2024

19. Residual Chain Prediction for Autonomous Driving Path Planning

Author: Zhou, Liguo, Zhou, Yirui, Liu, Huaming, and Knoll, Alois
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence
Abstract: In the rapidly evolving field of autonomous driving systems, the refinement of path planning algorithms is paramount for navigating vehicles through dynamic environments, particularly in complex urban scenarios. Traditional path planning algorithms, which are heavily reliant on static rules and manually defined parameters, often fall short in such contexts, highlighting the need for more adaptive, learning-based approaches. Among these, behavior cloning emerges as a noteworthy strategy for its simplicity and efficiency, especially within the realm of end-to-end path planning. However, behavior cloning faces challenges, such as covariate shift when employing traditional Manhattan distance as the metric. Addressing this, our study introduces the novel concept of Residual Chain Loss. Residual Chain Loss dynamically adjusts the loss calculation process to enhance the temporal dependency and accuracy of predicted path points, significantly improving the model's performance without additional computational overhead. Through testing on the nuScenes dataset, we underscore the method's substantial advancements in addressing covariate shift, facilitating dynamic loss adjustments, and ensuring seamless integration with end-to-end path planning frameworks. Our findings highlight the potential of Residual Chain Loss to revolutionize the planning component of autonomous driving systems, marking a significant step forward in the quest for a level 5 autonomous driving system., Comment: 6 pages, 2 figures
Published: 2024

20. Quantum Computing: Vision and Challenges

Author: Gill, Sukhpal Singh, Cetinkaya, Oktay, Marrone, Stefano, Claudino, Daniel, Haunschild, David, Schlote, Leon, Wu, Huaming, Ottaviani, Carlo, Liu, Xiaoyuan, Machupalli, Sree Pragna, Kaur, Kamalpreet, Arora, Priyansh, Liu, Ji, Farouk, Ahmed, Song, Houbing Herbert, Uhlig, Steve, and Ramamohanarao, Kotagiri
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Emerging Technologies, Quantum Physics
Abstract: The recent development of quantum computing, which uses entanglement, superposition, and other quantum fundamental concepts, can provide substantial processing advantages over traditional computing. These quantum features help solve many complex problems that cannot be solved otherwise with conventional computing methods. These problems include modeling quantum mechanics, logistics, chemical-based advances, drug design, statistical science, sustainable energy, banking, reliable communication, and quantum chemical engineering. The last few years have witnessed remarkable progress in quantum software and algorithm creation and quantum hardware research, which has significantly advanced the prospect of realizing quantum computers. It would be helpful to have comprehensive literature research on this area to grasp the current status and find outstanding problems that require considerable attention from the research community working in the quantum computing industry. To better understand quantum computing, this paper examines the foundations and vision based on current research in this area. We discuss cutting-edge developments in quantum computer hardware advancement and subsequent advances in quantum cryptography, quantum software, and high-scalability quantum computers. Many potential challenges and exciting new trends for quantum technology research and development are highlighted in this paper for a broader debate., Comment: Technical Report: 15 pages, 2 figures
Published: 2024

21. Greater moisture impacts on radial growth of Larix sibirica in the eastern Altay Mountains since the 1990s

Author: Gou, Xiaoxia, Zhang, Tongwen, Yu, Shulong, Shang, Huaming, Zhang, Ruibo, Qin, Li, Liu, Kexiang, Jiang, Shengxia, Guo, Dong, Fan, Yuting, Abudureheman, Ruxianguli, and Zhang, Heli
Published: 2024
Full Text: View/download PDF

22. Ssd-kdgan: a lightweight SSD target detection method based on knowledge distillation and generative adversarial networks

Author: Wang, Huilin, Qian, Huaming, and Feng, Shuai
Published: 2024
Full Text: View/download PDF

23. Both real-valued and binary multi-feature fusion histograms for 3D local shape representation

Author: Hao, Linbo, Wang, Xincheng, Shen, Ying, Xu, Ke, and Wang, Huaming
Published: 2024
Full Text: View/download PDF

24. Longitudinal trajectory of technological growth in Sub-Sahara Africa: new insights for achieving carbon dioxide emissions reduction and environmental sustainability

Author: Jamatutu, Seidu Abdulai, Abbass, Kashif, Song, Huaming, Gawusu, Sidique, and Yeboah, Kyei Emmanuel
Published: 2024
Full Text: View/download PDF

25. Practical prognostic tools to predict the risk of postoperative delirium in older patients undergoing cardiac surgery: visual and dynamic nomograms

Author: Bah, Chernor Sulaiman, Mbambara, Bongani, Xie, Xianhai, Li, Junlin, Iddi, Asha Khatib, Chen, Chen, Jiang, Hui, Feng, Yue, Zhong, Yi, Zhang, Xinlong, Xia, Huaming, Yan, Libo, Si, Yanna, Zhang, Juan, and Zou, Jianjun
Published: 2024
Full Text: View/download PDF

26. Contributions to Rock Fracture Induced by High Ground Stress in Deep Mining: A Review

Author: An, Huaming and Mu, Xinghai
Published: 2024
Full Text: View/download PDF

27. Adaptive event-triggered exponential convergence tolerant control for fuzzy communication delay network systems with actuator faults, deception attacks and disturbances

Author: Yan, Shuya, Qian, Huaming, and Hui, Chen
Published: 2024
Full Text: View/download PDF

28. Isolation and Identification of Postharvest Rot Pathogens in Citrus × tangelo and Their Potential Inhibition with Acidic Electrolyzed Water

Author: Ji, Ying, Wang, Jieqiong, Liu, Ye, Liu, Shaoyan, Jiang, Xuanjing, and Huang, Huaming
Published: 2024
Full Text: View/download PDF

29. A bandwidth-fair migration-enabled task offloading for vehicular edge computing: a deep reinforcement learning approach

Author: Tang, Chaogang, Li, Zhao, Xiao, Shuo, Wu, Huaming, and Chen, Wei
Published: 2024
Full Text: View/download PDF

30. FDNet: Frequency Domain Denoising Network For Cell Segmentation in Astrocytes Derived From Induced Pluripotent Stem Cells

Author: Li, Haoran, Shi, Jiahua, Chen, Huaming, Du, Bo, Maksour, Simon, Phillips, Gabrielle, Dottori, Mirella, and Shen, Jun
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Artificially generated induced pluripotent stem cells (iPSCs) from somatic cells play an important role in disease modeling and drug screening of neurodegenerative diseases. Astrocytes differentiated from iPSCs are important targets to investigate neuronal metabolism. The astrocyte differentiation progress can be monitored through the variations of morphology observed from microscopy images at different differentiation stages, then determined by molecular biology techniques upon maturation. However, the astrocytes usually "perfectly" blend into the background and some of them are covered by interference information (i.e., dead cells, media sediments, and cell debris), which makes astrocytes difficult to observe. Due to the lack of annotated datasets, the existing state-of-the-art deep learning approaches cannot be used to address this issue. In this paper, we introduce a new task named astrocyte segmentation with a novel dataset, called IAI704, which contains 704 images and their corresponding pixel-level annotation masks. Moreover, a novel frequency domain denoising network, named FDNet, is proposed for astrocyte segmentation. In detail, our FDNet consists of a contextual information fusion module (CIF), an attention block (AB), and a Fourier transform block (FTB). CIF and AB fuse multi-scale feature embeddings to localize the astrocytes. FTB transforms feature embeddings into the frequency domain and conducts a high-pass filter to eliminate interference information. Experimental results demonstrate the superiority of our proposed FDNet over the state-of-the-art substitutes in astrocyte segmentation, offering insights for iPSC differentiation progress prediction., Comment: Accepted by The IEEE International Symposium on Biomedical Imaging (ISBI) 2024
Published: 2024

31. Benchmarking Transferable Adversarial Attacks

Author: Jin, Zhibo, Zhang, Jiayu, Zhu, Zhiyu, and Chen, Huaming
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The robustness of deep learning models against adversarial attacks remains a pivotal concern. This study presents, for the first time, an exhaustive review of the transferability aspect of adversarial attacks. It systematically categorizes and critically evaluates various methodologies developed to augment the transferability of adversarial attacks. This study encompasses a spectrum of techniques, including Generative Structure, Semantic Similarity, Gradient Editing, Target Modification, and Ensemble Approach. Concurrently, this paper introduces a benchmark framework, TAA-Bench, integrating ten leading methodologies for adversarial attack transferability, thereby providing a standardized and systematic platform for comparative analysis across diverse model architectures. Through comprehensive scrutiny, we delineate the efficacy and constraints of each method, shedding light on their underlying operational principles and practical utility. This review endeavors to be a quintessential resource for both scholars and practitioners in the field, charting the complex terrain of adversarial transferability and setting a foundation for future explorations in this vital sector. The associated codebase is accessible at: https://github.com/KxPlaug/TAA-Bench, Comment: Accepted by NDSS 2024 Workshop
Published: 2024

32. Large Language Models Based Fuzzing Techniques: A Survey

Author: Huang, Linghan, Zhao, Peizhou, Chen, Huaming, and Ma, Lei
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: In the modern era where software plays a pivotal role, software security and vulnerability analysis have become essential for software development. Fuzzing test, as an efficient software testing method, is widely used in various domains. Moreover, the rapid development of Large Language Models (LLMs) has facilitated their application in the field of software testing, demonstrating remarkable performance. Considering that existing fuzzing test techniques are not entirely automated and software vulnerabilities continue to evolve, there is a growing trend towards employing fuzzing test generated based on large language models. This survey provides a systematic overview of the approaches that fuse LLMs and fuzzing tests for software testing. In this paper, a statistical analysis and discussion of the literature in three areas, namely LLMs, fuzzing test, and fuzzing test generated based on LLMs, are conducted by summarising the state-of-the-art methods up until 2024. Our survey also investigates the potential for widespread deployment and application of fuzzing test techniques generated by LLMs in the future., Comment: 9 pages submission under review
Published: 2024

33. PAM: Prompting Audio-Language Models for Audio Quality Assessment

Author: Deshmukh, Soham, Alharthi, Dareen, Elizalde, Benjamin, Gamper, Hannes, Ismail, Mahmoud Al, Singh, Rita, Raj, Bhiksha, and Wang, Huaming
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: While audio quality is a key performance metric for various audio processing tasks, including generative modeling, its objective measurement remains a challenge. Audio-Language Models (ALMs) are pre-trained on audio-text pairs that may contain information about audio quality, the presence of artifacts, or noise. Given an audio input and a text prompt related to quality, an ALM can be used to calculate a similarity score between the two. Here, we exploit this capability and introduce PAM, a no-reference metric for assessing audio quality for different audio processing tasks. Contrary to other "reference-free" metrics, PAM does not require computing embeddings on a reference dataset nor training a task-specific model on a costly set of human listening scores. We extensively evaluate the reliability of PAM against established metrics and human listening scores on four tasks: text-to-audio (TTA), text-to-music generation (TTM), text-to-speech (TTS), and deep noise suppression (DNS). We perform multiple ablation studies with controlled distortions, in-the-wild setups, and prompt choices. Our evaluation shows that PAM correlates well with existing metrics and human listening scores. These results demonstrate the potential of ALMs for computing a general-purpose audio quality metric.
Published: 2024

34. GE-AdvGAN: Improving the transferability of adversarial samples by gradient editing-based adversarial generative model

Author: Zhu, Zhiyu, Chen, Huaming, Wang, Xinyi, Zhang, Jiayu, Jin, Zhibo, Choo, Kim-Kwang Raymond, Shen, Jun, and Yuan, Dong
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Adversarial generative models, such as Generative Adversarial Networks (GANs), are widely applied for generating various types of data, e.g., images, text, and audio. Accordingly, their promising performance has led to the GAN-based adversarial attack methods in the white-box and black-box attack scenarios. The importance of transferable black-box attacks lies in their ability to be effective across different models and settings, more closely aligning with real-world applications. However, it remains challenging to retain the performance in terms of transferable adversarial examples for such methods. Meanwhile, we observe that some enhanced gradient-based transferable adversarial attack algorithms require prolonged time for adversarial sample generation. Thus, in this work, we propose a novel algorithm named GE-AdvGAN to enhance the transferability of adversarial samples whilst improving the algorithm's efficiency. The main approach is via optimising the training process of the generator parameters. With the functional and characteristic similarity analysis, we introduce a novel gradient editing (GE) mechanism and verify its feasibility in generating transferable samples on various models. Moreover, by exploring the frequency domain information to determine the gradient editing direction, GE-AdvGAN can generate highly transferable adversarial samples while minimizing the execution time in comparison to the state-of-the-art transferable adversarial attack algorithms. The performance of GE-AdvGAN is comprehensively evaluated by large-scale experiments on different datasets, whose results demonstrate the superiority of our algorithm. The code for our algorithm is available at: https://github.com/LMBTough/GE-advGAN, Comment: Accepted by SIAM International Conference on Data Mining (SDM24)
Published: 2024

35. Modern Computing: Vision and Challenges

Author: Gill, Sukhpal Singh, Wu, Huaming, Patros, Panos, Ottaviani, Carlo, Arora, Priyansh, Pujol, Victor Casamayor, Haunschild, David, Parlikad, Ajith Kumar, Cetinkaya, Oktay, Lutfiyya, Hanan, Stankovski, Vlado, Li, Ruidong, Ding, Yuemin, Qadir, Junaid, Abraham, Ajith, Ghosh, Soumya K., Song, Houbing Herbert, Sakellariou, Rizos, Rana, Omer, Rodrigues, Joel J. P. C., Kanhere, Salil S., Dustdar, Schahram, Uhlig, Steve, Ramamohanarao, Kotagiri, and Buyya, Rajkumar
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Over the past six decades, the computing systems field has experienced significant transformations, profoundly impacting society with transformational developments, such as the Internet and the commodification of computing. Underpinned by technological advancements, computer systems, far from being static, have been continuously evolving and adapting to cover multifaceted societal niches. This has led to new paradigms such as cloud, fog, edge computing, and the Internet of Things (IoT), which offer fresh economic and creative opportunities. Nevertheless, this rapid change poses complex research challenges, especially in maximizing potential and enhancing functionality. As such, to maintain an economical level of performance that meets ever-tighter requirements, one must understand the drivers of new model emergence and expansion, and how contemporary challenges differ from past ones. To that end, this article investigates and assesses the factors influencing the evolution of computing systems, covering established systems and architectures as well as newer developments, such as serverless computing, quantum computing, and on-device AI on edge devices. Trends emerge when one traces technological trajectory, which includes the rapid obsolescence of frameworks due to business and technical constraints, a move towards specialized systems and models, and varying approaches to centralized and decentralized control. This comprehensive review of modern computing systems looks ahead to the future of research in the field, highlighting key challenges and emerging trends, and underscoring their importance in cost-effectively driving technological progress., Comment: Preprint submitted to Telematics and Informatics Reports, Elsevier (2024)
Published: 2024
Full Text: View/download PDF

36. FairCompass: Operationalising Fairness in Machine Learning

Author: Liu, Jessica, Chen, Huaming, Shen, Jun, and Choo, Kim-Kwang Raymond
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computers and Society, Computer Science - Software Engineering
Abstract: As artificial intelligence (AI) increasingly becomes an integral part of our societal and individual activities, there is a growing imperative to develop responsible AI solutions. Although a diverse assortment of machine learning fairness solutions has been proposed in the literature, there is reportedly a lack of practical implementation of these tools in real-world applications. Industry experts have participated in thorough discussions on the challenges associated with operationalising fairness in the development of machine learning-empowered solutions, in which a shift toward human-centred approaches is promptly advocated to mitigate the limitations of existing techniques. In this work, we propose a human-in-the-loop approach for fairness auditing, presenting a mixed visual analytical system (hereafter referred to as 'FairCompass'), which integrates both subgroup discovery technique and the decision tree-based schema for end users. Moreover, we innovatively integrate an Exploration, Guidance and Informed Analysis loop, to facilitate the use of the Knowledge Generation Model for Visual Analytics in FairCompass. We evaluate the effectiveness of FairCompass for fairness auditing in a real-world scenario, and the findings demonstrate the system's potential for real-world deployability. We anticipate this work will address the current gaps in research for fairness and facilitate the operationalisation of fairness in machine learning systems., Comment: Accepted in IEEE Transactions on Artificial Intelligence
Published: 2023

37. MFABA: A More Faithful and Accelerated Boundary-based Attribution Method for Deep Neural Networks

Author: Zhu, Zhiyu, Chen, Huaming, Zhang, Jiayu, Wang, Xinyi, Jin, Zhibo, Xue, Minhui, Zhu, Dongxiao, and Choo, Kim-Kwang Raymond
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: To better understand the output of deep neural networks (DNN), attribution-based methods have been an important approach for model interpretability, which assign a score for each input dimension to indicate its importance towards the model outcome. Notably, the attribution methods use the axioms of sensitivity and implementation invariance to ensure the validity and reliability of attribution results. Yet, the existing attribution methods present challenges for effective interpretation and efficient computation. In this work, we introduce MFABA, an attribution algorithm that adheres to axioms, as a novel method for interpreting DNN. Additionally, we provide the theoretical proof and in-depth analysis for the MFABA algorithm, and conduct a large-scale experiment. The results demonstrate its superiority by achieving over 101.5142 times faster speed than the state-of-the-art attribution algorithms. The effectiveness of MFABA is thoroughly evaluated through the statistical analysis in comparison to other methods, and the full implementation package is open-source at: https://github.com/LMBTough/MFABA, Comment: Accepted by The 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24)
Published: 2023

38. Code Ownership in Open-Source AI Software Security

Author: Wen, Jiawen, Yuan, Dong, Ma, Lei, and Chen, Huaming
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: As open-source AI software projects become an integral component in AI software development, it is critical to develop novel methods to ensure and measure the security of the open-source projects for developers. Code ownership, pivotal in the evolution of such projects, offers insights into developer engagement and potential vulnerabilities. In this paper, we leverage the code ownership metrics to empirically investigate the correlation with the latent vulnerabilities across five prominent open-source AI software projects. The findings from the large-scale empirical study suggest a positive relationship between high-level ownership (characterised by a limited number of minor contributors) and a decrease in vulnerabilities. Furthermore, we innovatively introduce the time metrics, anchored on the project's duration, individual source code file timelines, and the count of impacted releases. These metrics adeptly categorise distinct phases of open-source AI software projects and their respective vulnerability intensities. With these novel code ownership metrics, we have implemented a Python-based command-line application to aid project curators and quality assurance professionals in evaluating and benchmarking their on-site projects. We anticipate this work will spur continued research on securing open-source AI projects and measuring their security., Comment: 8 pages, in submission for review
Published: 2023

39. Numerical Investigation of Land Reclamation Effects on Hydrodynamics and Mangroves in Shacheng Bay for the Last 36 Years

Author: Wu, Zetao, Yu, Huaming, Shola, Ayinde Akeem, Chang, Xiaofeng, and Jiang, Wanjun
Published: 2024
Full Text: View/download PDF

40. Research progress in the digitization of additive manufacturing model processing

Author: Liu, Huaming, Peng, Xitai, and Zhou, Runmin
Published: 2024
Full Text: View/download PDF

41. CloudAIBus: a testbed for AI based cloud computing environments

Author: Velu, Sasidharan, Gill, Sukhpal Singh, Murugesan, Subramaniam Subramanian, Wu, Huaming, and Li, Xingwang
Published: 2024
Full Text: View/download PDF

42. Leveraging Information Consistency in Frequency and Spatial Domain for Adversarial Attacks

Author: Jin, Zhibo, Zhang, Jiayu, Zhu, Zhiyu, Wang, Xinyi, Huang, Yiyun, and Chen, Huaming; Goos, Gerhard (Series Editor), Hartmanis, Juris (Founding Editor), Bertino, Elisa (Editorial Board Member), Gao, Wen (Editorial Board Member), Steffen, Bernhard (Editorial Board Member), Yung, Moti (Editorial Board Member), Hadfi, Rafik (editor), Anthony, Patricia (editor), Sharma, Alok (editor), Ito, Takayuki (editor), and Bai, Quan (editor)
Published: 2025
Full Text: View/download PDF

43. Study on impedance spectroscopy based on dynamic equivalent circuit of solar cell

Author: Xiao, Wenbo, Li, Ao, Wu, Huaming, Li, Yongbo, and Xiao, Bangzhi
Published: 2024
Full Text: View/download PDF

44. Engineering of a Substrate Affinity Reduced S-Adenosyl-methionine Synthetase as a Novel Biosensor for Growth-Coupling Selection of L-Methionine Overproducers

Author: Huang, Jianfeng, Liu, Jinhui, Dong, Huaming, Shi, Jingjing, You, Xiaoyan, and Zhang, Yanfei
Published: 2024
Full Text: View/download PDF

45. ADAM12 Silencing Mediated by FOXC2 Represses Meningioma Progression Through Inactivating the JAK1/STAT3/VEGFA Pathway

Author: Zhang, Huaming and Yang, Bing
Published: 2024
Full Text: View/download PDF

46. Research on the influence of image motion blur on the effectiveness of machine vision-based metal scraps separation system

Author: Li, Yifeng, Zhou, Yan, and Liu, Huaming
Published: 2024
Full Text: View/download PDF

47. The impact of thyroid function on total spine bone mineral density in postmenopausal women

Author: Ji, Jiazhong, Li, Zhaoyang, Xue, Long, Xue, Huaming, Wen, Tao, Yang, Tao, Ma, Tong, and Tu, Yihui
Published: 2024
Full Text: View/download PDF

48. Regioselective hydroformylation with subnanometre Rh clusters in MFI zeolite

Author: Dou, Xiaomeng, Yan, Tao, Qian, Lixiang, Hou, Huaming, Lopez-Haro, Miguel, Marini, Carlo, Agostini, Giovanni, Meira, Debora M., Zhang, Xiangjie, Zhang, Liang, Cao, Zhi, and Liu, Lichen
Published: 2024
Full Text: View/download PDF

49. Classification of Sailboat Tell Tail Based on Deep Learning

Author: Chang, Xiaofeng, Yu, Jintao, Gao, Ying, Ding, Hongchen, Liu, Yulong, and Yu, Huaming
Published: 2024
Full Text: View/download PDF

50. Honest Score Client Selection Scheme: Preventing Federated Learning Label Flipping Attacks in Non-IID Scenarios

Author: Li, Yanli, Chen, Huaming, Bao, Wei, Xu, Zhengmeng, and Yuan, Dong
Subjects: Computer Science - Cryptography and Security
Abstract: Federated Learning (FL) is a promising technology that enables multiple actors to build a joint model without sharing their raw data. The distributed nature makes FL vulnerable to various poisoning attacks, including model poisoning attacks and data poisoning attacks. Today, many byzantine-resilient FL methods have been introduced to mitigate the model poisoning attack, while the effectiveness when defending against data poisoning attacks remains unclear. In this paper, we focus on the most representative data poisoning attack - "label flipping attack" and monitor its effectiveness when attacking the existing FL methods. The results show that the existing FL methods perform similarly in independent and identically distributed (IID) settings but fail to maintain the model robustness in Non-IID settings. To mitigate the weaknesses of existing FL methods in Non-IID scenarios, we introduce the Honest Score Client Selection (HSCS) scheme and the corresponding HSCSFL framework. In the HSCSFL, the server collects a clean dataset for evaluation. Under each iteration, the server collects the gradients from clients and then performs HSCS to select aggregation candidates. The server first evaluates the performance of each class of the global model and generates the corresponding risk vector to indicate which class could be potentially attacked. Similarly, the server evaluates the client's model and records the performance of each class as the accuracy vector. The dot product of each client's accuracy vector and global risk vector is generated as the client's honest score; only the top p% honest score clients are included in the following aggregation. Finally, the server aggregates the gradients and uses the outcome to update the global model. The comprehensive experimental results show our HSCSFL effectively enhances the FL robustness and defends against the "label flipping attack."
Published: 2023
