Author: "Xu, Weiyu" - Searchworks@Jio Institute Digital Library Search Results

576 results on '"Xu, Weiyu"'

1. Towards unlocking the mystery of adversarial fragility of neural networks

Author: Gao, Jingchao, Mudumbai, Raghu, Wu, Xiaodong, Yi, Jirong, Xu, Catherine, Xie, Hui, and Xu, Weiyu
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Information Theory, Electrical Engineering and Systems Science - Signal Processing
Abstract: In this paper, we study the adversarial robustness of deep neural networks for classification tasks. We look at the smallest magnitude of possible additive perturbations that can change the output of a classification algorithm. We provide a matrix-theoretic explanation of the adversarial fragility of deep neural networks for classification. In particular, our theoretical results show that neural networks' adversarial robustness can degrade as the input dimension $d$ increases. Analytically, we show that neural networks' adversarial robustness can be only $1/\sqrt{d}$ of the best possible adversarial robustness. Our matrix-theoretic explanation is consistent with an earlier information-theoretic feature-compression-based explanation for the adversarial fragility of neural networks.
Comment: 21 pages
Published: 2024

2. Camouflage Adversarial Attacks on Multiple Agent Systems

Author: Lu, Ziqing, Liu, Guanlin, Lai, Lifeng, and Xu, Weiyu
Subjects: Computer Science - Multiagent Systems
Abstract: Multi-agent reinforcement learning (MARL) systems based on the Markov decision process (MDP) have emerged in many critical applications. To improve the robustness/defense of MARL systems against adversarial attacks, the study of various adversarial attacks on reinforcement learning systems is very important. Previous works on adversarial attacks considered some possible features to attack in an MDP, such as action poisoning attacks, reward poisoning attacks, and state perception attacks. In this paper, we propose a brand-new form of attack on MARL systems called the camouflage attack. In the camouflage attack, the attackers change the appearances of some objects without changing the actual objects themselves, and the camouflaged appearances may look the same to all the targeted recipient (victim) agents. The camouflaged appearances can mislead the recipient agents into misguided actions. We design algorithms that give the optimal camouflage attacks minimizing the rewards of recipient agents. Our numerical and theoretical results show that camouflage attacks can rival the more conventional, but likely more difficult, state perception attacks. We also investigate cost-constrained camouflage attacks and show numerically how cost budgets affect the attack performance.
Comment: arXiv admin note: text overlap with arXiv:2311.00859
Published: 2024

3. Characteristics Investigation and Optimization of the Long Stator Section on High-Speed EMS Maglev Trains

Author: Duan, Jiaheng, Shi, Liming, Xu, Weiyu, Li, Zixin, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, and Li, Jian, editor
Published: 2025

4. gcDLSeg: Integrating Graph-cut into Deep Learning for Binary Semantic Segmentation

Author: Xie, Hui, Xu, Weiyu, Wang, Ya Xing, Buatti, John, and Wu, Xiaodong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: Binary semantic segmentation in computer vision is a fundamental problem. As a model-based segmentation method, the graph-cut approach was one of the most successful binary segmentation methods thanks to its global optimality guarantee of the solutions and its practical polynomial-time complexity. Recently, many deep learning (DL) based methods have been developed for this task and yielded remarkable performance, resulting in a paradigm shift in this field. To combine the strengths of both approaches, we propose in this study to integrate the graph-cut approach into a deep learning network for end-to-end learning. Unfortunately, backward propagation through the graph-cut module in the DL network is challenging due to the combinatorial nature of the graph-cut algorithm. To tackle this challenge, we propose a novel residual graph-cut loss and a quasi-residual connection, enabling the backward propagation of the gradients of the residual graph-cut loss for effective feature learning guided by the graph-cut segmentation model. In the inference phase, globally optimal segmentation is achieved with respect to the graph-cut energy defined on the optimized image features learned from DL networks. Experiments on the public AZH chronic wound data set and the pancreas cancer data set from the medical segmentation decathlon (MSD) demonstrated promising segmentation accuracy and improved robustness against adversarial attacks.
Comment: 12 pages
Published: 2023

5. Optimal Cost Constrained Adversarial Attacks For Multiple Agent Systems

Author: Lu, Ziqing, Liu, Guanlin, Lai, Lifeng, and Xu, Weiyu
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security, Computer Science - Multiagent Systems
Abstract: Finding optimal adversarial attack strategies is an important topic in reinforcement learning and the Markov decision process. Previous studies usually assume one all-knowing coordinator (attacker) for whom attacking different recipient (victim) agents incurs uniform costs. However, in reality, instead of using one limitless central attacker, the attacks often need to be performed by distributed attack agents. We formulate the problem of performing optimal adversarial agent-to-agent attacks using distributed attack agents, in which we impose distinct cost constraints on each different attacker-victim pair. We propose an optimal method integrating within-step static constrained attack-resource allocation optimization and between-step dynamic programming to achieve the optimal adversarial attack in a multi-agent system. Our numerical results show that the proposed attacks can significantly reduce the rewards received by the attacked agents.
Comment: Submitted to ICCASP2024
Published: 2023

6. Trust, but Verify: Robust Image Segmentation using Deep Learning

Author: Zaman, Fahim Ahmed, Wu, Xiaodong, Xu, Weiyu, Sonka, Milan, and Mudumbai, Raghuraman
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: We describe a method for verifying the output of a deep neural network for medical image segmentation that is robust to several classes of random as well as worst-case perturbations, i.e., adversarial attacks. This method is based on a general approach recently developed by the authors called "Trust, but Verify", wherein an auxiliary verification network produces predictions about certain masked features in the input image using the segmentation as an input. A well-designed auxiliary network will produce high-quality predictions when the input segmentations are accurate, but will produce low-quality predictions when the segmentations are incorrect. Checking the predictions of such a network against the original image allows us to detect bad segmentations. However, to ensure the verification method is truly robust, we need a method for checking the quality of the predictions that does not itself rely on a black-box neural network. Indeed, we show that previous methods for segmentation evaluation that do use deep neural regression networks are vulnerable to false negatives, i.e., they can inaccurately label bad segmentations as good. We describe the design of a verification network that avoids such vulnerability and present results to demonstrate its robustness compared to previous methods.
Comment: 5 Pages, 8 Figures, conference
Published: 2023

7. Outlier Detection Using Generative Models with Theoretical Performance Guarantees

Author: Yi, Jirong, Gao, Jingchao, Wang, Tianming, Wu, Xiaodong, and Xu, Weiyu
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: This paper considers the problem of recovering signals modeled by generative models from linear measurements contaminated with sparse outliers. We propose an outlier detection approach for reconstructing the ground-truth signals modeled by generative models under sparse outliers. We establish theoretical recovery guarantees for reconstruction of signals using generative models in the presence of outliers, giving lower bounds on the number of correctable outliers. Our results are applicable to both linear generator neural networks and nonlinear generator neural networks with an arbitrary number of layers. We propose an iterative alternating direction method of multipliers (ADMM) algorithm for solving the outlier detection problem via $\ell_1$ norm minimization, and a gradient descent algorithm for solving the outlier detection problem via squared $\ell_1$ norm minimization. We conduct extensive experiments using variational auto-encoders and deep convolutional generative adversarial networks, and the experimental results show that the signals can be successfully reconstructed under outliers using our approach. Our approach outperforms the traditional Lasso and $\ell_2$ minimization approaches.
Comment: arXiv admin note: substantial text overlap with arXiv:1810.11335
Published: 2023

8. Linear Progressive Coding for Semantic Communication using Deep Neural Networks

Author: Riherd, Eva, Mudumbai, Raghu, and Xu, Weiyu
Subjects: Electrical Engineering and Systems Science - Signal Processing
Abstract: We propose a general method for semantic representation of images and other data using progressive coding. Semantic coding allows for specific pieces of information to be selectively encoded into a set of measurements that can be highly compressed compared to the size of the original raw data. We consider a hierarchical method of coding where a partial amount of semantic information is first encoded into a coarse representation of the data, which is then refined by additional encodings that add further semantic information. Such hierarchical coding is especially well-suited for semantic communication, i.e., transferring semantic information over noisy channels. Our proposed method can be considered as a generalization of both progressive image compression and source coding for semantic communication. We present results from experiments on the MNIST and CIFAR-10 datasets that show that progressive semantic coding can provide timely previews of semantic information with a small number of initial measurements while achieving overall accuracy and efficiency comparable to non-progressive methods.
Published: 2023

9. Distributed Dual Coordinate Ascent with Imbalanced Data on a General Tree Network

Author: Cho, Myung, Lai, Lifeng, and Xu, Weiyu
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Information Theory
Abstract: In this paper, we investigate the impact of imbalanced data on the convergence of distributed dual coordinate ascent in a tree network for solving an empirical loss minimization problem in distributed machine learning. To address this issue, we propose a method called delayed generalized distributed dual coordinate ascent that takes into account the information of the imbalanced data, and provide an analysis of the proposed algorithm. Numerical experiments confirm the effectiveness of our proposed method in improving the convergence speed of distributed dual coordinate ascent in a tree network.
Comment: To be published in IEEE 2023 Workshop on Machine Learning for Signal Processing (MLSP)
Published: 2023

10. To AI or not to AI, to Buy Local or not to Buy Local: A Mathematical Theory of Real Price

Author: Cai, Huan, Xu, Catherine, and Xu, Weiyu
Subjects: Economics - Theoretical Economics, Computer Science - Artificial Intelligence, Mathematics - Optimization and Control, 65
Abstract: In the past several decades, the world's economy has become increasingly globalized. On the other hand, there are also ideas advocating the practice of "buy local", by which people buy locally produced goods and services rather than those produced farther away. In this paper, we establish a mathematical theory of real price that determines the optimal global versus local spending of an agent which achieves the agent's optimal tradeoff between spending and obtained utility. Our theory of real price depends on the asymptotic analysis of a Markov chain transition probability matrix related to the network of producers and consumers. We show that the real price of a product or service can be determined from the involved Markov chain matrix, and can be dramatically different from the product's label price. In particular, we show that the label prices of products and services are often not "real" or directly "useful": given two products offering the same myopic utility, the one with lower label price may not necessarily offer better asymptotic utility. This theory shows that the globality or locality of the products and services does have different impacts on the spending-utility tradeoff of a customer. The established mathematical theory of real price can be used to determine whether to adopt or not to adopt certain artificial intelligence (AI) technologies from an economic perspective.
Comment: 16 pages, 3 figures
Published: 2023

11. Experimental Study on the Influence of Superplasticizer on the Performance of Ecotype Ultra-High Performance Concrete (E-UHPC)

Author: Xu, Gao, Deng, Chungang, Xu, Weiyu, di Prisco, Marco, Series Editor, Chen, Sheng-Hong, Series Editor, Vayas, Ioannis, Series Editor, Kumar Shukla, Sanjay, Series Editor, Sharma, Anuj, Series Editor, Kumar, Nagesh, Series Editor, Wang, Chien Ming, Series Editor, Cui, Zhen-Dong, Series Editor, Lu, Xinzheng, Series Editor, and Feng, Guangliang, editor
Published: 2024

12. Frequency Characteristics Analysis of Wireless Power Transfer System in Seawater

Author: Xu, Weiyu, Shi, Liming, Yin, Zhenggang, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Cai, Chunwei, editor, Qu, Xiaohui, editor, Mai, Ruikun, editor, Zhang, Pengcheng, editor, Chai, Wenping, editor, and Wu, Shuai, editor
Published: 2024

13. Optimal Compression for Minimizing Classification Error Probability: an Information-Theoretic Approach

Author: Gao, Jingchao, Tang, Ao, and Xu, Weiyu
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Information Theory
Abstract: We formulate the problem of performing optimal data compression under the constraint that the compressed data can be used for accurate classification in machine learning. We show that this translates to a problem of minimizing the mutual information between data and its compressed version under the constraint that the classification error probability is small when using the compressed data for machine learning. We then provide analytical and computational methods to characterize the optimal trade-off between data compression and classification error probability. First, we provide an analytical characterization of the optimal compression strategy for data with binary labels. Second, for data with multiple labels, we formulate a set of convex optimization problems to characterize the optimal trade-off, from which the optimal trade-off between the classification error and compression efficiency can be obtained by numerically solving the formulated optimization problems. We further show the improvements of our formulations over the information-bottleneck methods in classification performance.
Comment: This work was done in Summer 2021
Published: 2022

14. A deep learning network with differentiable dynamic programming for retina OCT surface segmentation

Author: Xie, Hui, Xu, Weiyu, and Wu, Xiaodong
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Multiple-surface segmentation in Optical Coherence Tomography (OCT) images is a challenging problem, further complicated by the frequent presence of weak image boundaries. Recently, many deep learning (DL) based methods have been developed for this task and have yielded remarkable performance. Unfortunately, due to the scarcity of training data in medical imaging, it is challenging for DL networks to learn the global structure of the target surfaces, including surface smoothness. To bridge this gap, this study proposes to seamlessly unify a U-Net for feature learning with a constrained differentiable dynamic programming module to achieve end-to-end learning for retina OCT surface segmentation that explicitly enforces surface smoothness. It effectively utilizes the feedback from the downstream model optimization module to guide feature learning, yielding a better enforcement of global structures of the target surfaces. Experiments on Duke AMD (age-related macular degeneration) and JHU MS (multiple sclerosis) OCT datasets for retinal layer segmentation demonstrated very promising segmentation accuracy.
Published: 2022

15. Optimal Needle Placement for Prostate Rotating-Shield Brachytherapy (RSBT)

Author: Yi, Jirong, Adams, Quentin E., Hopfensperger, Karolyn M., Flynn, Ryan T., Kim, Yusung, Buatti, John M., Xu, Weiyu, and Wu, Xiaodong
Subjects: Physics - Medical Physics
Abstract: Purpose: To present an efficient NEEdle Position Optimization (NEEPO) algorithm for prostate rotating shield brachytherapy (RSBT). With RSBT, the increased flexibility beyond conventional high-dose-rate brachytherapy (HDR-BT) due to the partially shielded radiation source has been shown by Adams et al. in 2020 to enable improved urethra sparing (23.1%), enhanced dose escalation (29.9%), or both, with 20 needles without NEEPO-optimized positions. Within this regime of improved dosimetry, we propose in this work that the benefits of RSBT can be maintained while also reducing the number of needles needed for the delivery. The goal of NEEPO is to provide the capability to further increase the dosimetric benefit of RSBT and to minimize the number of needles needed to satisfy a dosimetric goal. Methods: The NEEPO algorithm generates a needle pool for a given patient and then iteratively constructs a subset of needles from the pool based on relative needle importance as determined by total dwell times within needles. The NEEPO algorithm is based on a convex optimization formulation using a quadratic dosimetric penalty function, dwell time regularization by total variation, and a block sparsity regularization term to enable iterative removal of low-importance needles. RSBT treatment plans for 26 patients were generated using single fraction prescriptions with both dose escalation and urethra sparing goals, and compared to baseline HDR-BT treatment plans.
Comment: 12 pages
Published: 2021

16. Optimal Pooling Matrix Design for Group Testing with Dilution (Row Degree) Constraints

Author: Yi, Jirong, Cho, Myung, Wu, Xiaodong, Mudumbai, Raghu, and Xu, Weiyu
Subjects: Quantitative Biology - Quantitative Methods, Computer Science - Information Theory, Electrical Engineering and Systems Science - Signal Processing, Statistics - Applications
Abstract: In this paper, we consider the problem of designing an optimal pooling matrix for group testing (for example, for COVID-19 virus testing) with the constraint that no more than $r>0$ samples can be pooled together, which we call the "dilution constraint". This problem translates to designing a matrix with elements being either 0 or 1 that has no more than $r$ '1's in each row and has a certain performance guarantee of identifying anomalous elements. We explicitly give pooling matrix designs that satisfy the dilution constraint and have performance guarantees of identifying anomalous elements, and prove their optimality in saving the largest number of tests, namely showing that the designed matrices have the largest width-to-height ratio among all constraint-satisfying 0-1 matrices.
Comment: group testing design, COVID-19
Published: 2020

17. Error Correction Codes for COVID-19 Virus and Antibody Testing: Using Pooled Testing to Increase Test Reliability

Author: Yi, Jirong, Cho, Myung, Wu, Xiaodong, Xu, Weiyu, and Mudumbai, Raghu
Subjects: Quantitative Biology - Quantitative Methods, Statistics - Methodology
Abstract: We consider a novel method to increase the reliability of COVID-19 virus or antibody tests by using specially designed pooled testing. Instead of testing nasal swab or blood samples from individual persons, we propose to test mixtures of samples from many individuals. The pooled sample testing method proposed in this paper also serves a different purpose: increasing test reliability and providing accurate diagnoses even if the tests themselves are not very accurate. Our method uses ideas from compressed sensing and error-correction coding to correct for a certain number of errors in the test results. The intuition is that when each individual's sample is part of many pooled sample mixtures, the test results from all of the sample mixtures contain redundant information about each individual's diagnosis, which can be exploited to automatically correct for wrong test results in exactly the same way that error correction codes correct errors introduced in noisy communication channels. While such redundancy can also be achieved by simply testing each individual's sample multiple times, we present simulations and theoretical arguments that show that our method is significantly more efficient in increasing diagnostic accuracy. In contrast to group testing and compressed sensing, which aim to reduce the number of required tests, this proposed error correction code idea purposefully uses pooled testing to increase test accuracy, and works not only in the "undersampling" regime, but also in the "oversampling" regime, where the number of tests is bigger than the number of subjects. The results in this paper run against traditional beliefs that, "even though pooled testing increased test capacity, pooled testings were less reliable than testing individuals separately."
Comment: 14 pages, 15 figures
Published: 2020

18. Derivation of Information-Theoretically Optimal Adversarial Attacks with Applications to Robust Machine Learning

Author: Yi, Jirong, Mudumbai, Raghu, and Xu, Weiyu
Subjects: Computer Science - Machine Learning, Computer Science - Information Theory, Statistics - Machine Learning
Abstract: We consider the theoretical problem of designing an optimal adversarial attack on a decision system that maximally degrades the achievable performance of the system as measured by the mutual information between the degraded signal and the label of interest. This problem is motivated by the existence of adversarial examples for machine learning classifiers. By adopting an information-theoretic perspective, we seek to identify conditions under which adversarial vulnerability is unavoidable, i.e., even optimally designed classifiers will be vulnerable to small adversarial perturbations. We present derivations of the optimal adversarial attacks for discrete and continuous signals of interest, i.e., finding the optimal perturbation distributions to minimize the mutual information between the degraded signal and a signal following a continuous or discrete distribution. In addition, we show that it is much harder to achieve adversarial attacks for minimizing mutual information when multiple redundant copies of the input signal are available. This provides additional support to the recently proposed "feature compression" hypothesis as an explanation for the adversarial vulnerability of deep learning classifiers. We also report on results from computational experiments to illustrate our theoretical results.
Comment: 16 pages, 5 theorems, 6 figures
Published: 2020

19. Low-Cost and High-Throughput Testing of COVID-19 Viruses and Antibodies via Compressed Sensing: System Concepts and Computational Experiments

Author: Yi, Jirong, Mudumbai, Raghu, and Xu, Weiyu
Subjects: Quantitative Biology - Quantitative Methods, Computer Science - Information Theory, Electrical Engineering and Systems Science - Signal Processing, Quantitative Biology - Biomolecules
Abstract: Coronavirus disease 2019 (COVID-19) is an ongoing pandemic infectious disease outbreak that has significantly harmed and threatened the health and lives of millions or even billions of people. COVID-19 has also significantly and negatively impacted the social and economic activities of many countries. With no approved vaccine available at this moment, extensive testing of COVID-19 viruses in people is essential for disease diagnosis, virus spread confinement, contact tracing, and determining the right conditions for people to return to normal economic activities. Identifying people who have antibodies for COVID-19 can also help select persons who are suitable for undertaking certain essential activities or returning to the workforce. However, the throughputs of current testing technologies for COVID-19 viruses and antibodies are often quite limited, and are not sufficient for dealing with the anticipated fast-oscillating waves of COVID-19 spread affecting a significant portion of the earth's population. In this paper, we propose to use compressed sensing (group testing can be seen as a special case of compressed sensing when it is applied to COVID-19 detection) to achieve high-throughput rapid testing of COVID-19 viruses and antibodies, which can potentially provide a speedup of tens of times or more compared with current testing technologies. The proposed compressed sensing system for high-throughput testing can utilize expander graph based compressed sensing matrices developed by us \cite{Weiyuexpander2007}.
Comment: 11 pages
Published: 2020

20. Do Deep Minds Think Alike? Selective Adversarial Attacks for Fine-Grained Manipulation of Multiple Deep Neural Networks

Author: Khan, Zain, Yi, Jirong, Mudumbai, Raghu, Wu, Xiaodong, and Xu, Weiyu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Recent works have demonstrated the existence of adversarial examples targeting a single machine learning system. In this paper we ask a simple but fundamental question of "selective fooling": given multiple machine learning systems assigned to solve the same classification problem and taking the same input signal, is it possible to construct a perturbation to the input signal that manipulates the outputs of these multiple machine learning systems simultaneously in arbitrary pre-defined ways? For example, is it possible to selectively fool a set of "enemy" machine learning systems without fooling the other "friend" machine learning systems? The answer to this question depends on the extent to which these different machine learning systems "think alike". We formulate the problem of "selective fooling" as a novel optimization problem, and report on a series of experiments on the MNIST dataset. Our preliminary findings from these experiments show that it is in fact very easy to selectively manipulate multiple MNIST classifiers simultaneously, even when the classifiers are identical in their architectures, training algorithms and training datasets except for random initialization during training. This suggests that two nominally equivalent machine learning systems do not in fact "think alike" at all, and opens the possibility for many novel applications and deeper understandings of the working principles of deep neural networks.
Comment: 9 pages, submitted to ICML 2020
Published: 2020

21. Research on a class-incremental learning method based on sonar images

Author: CHEN Xinzhe, LIANG Hong, and XU Weiyu
Subjects: Sonar image recognition, Generative replay, Class-incremental learning, Motor vehicles. Aeronautics. Astronautics, TL1-4050
Abstract: Due to the low resolution and the small number of samples of sonar images, existing class-incremental learning networks suffer from serious catastrophic forgetting of historical task targets, resulting in a low average recognition rate over all task targets. Based on the generative replay framework, this paper proposes an improved class-incremental learning network. A new deep convolutional generative adversarial network is designed and built to replace the variational autoencoder as the reconstruction model of the generative replay incremental network, improving image reconstruction. A new convolutional neural network is constructed to replace the multi-layer perceptron as the recognition network of the generative replay incremental network, improving image classification and recognition performance. The results show that the improved generative replay incremental network alleviates the catastrophic forgetting of historical task targets and significantly improves the average recognition rate over all task targets.
Published: 2023

22. Trust but Verify: An Information-Theoretic Explanation for the Adversarial Fragility of Machine Learning Systems, and a General Defense against Adversarial Attacks

Author: Yi, Jirong, Xie, Hui, Zhou, Leixin, Wu, Xiaodong, Xu, Weiyu, and Mudumbai, Raghuraman
Subjects: Computer Science - Cryptography and Security, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Deep-learning based classification algorithms have been shown to be susceptible to adversarial attacks: minor changes to the input of classifiers can dramatically change their outputs, while being imperceptible to humans. In this paper, we present a simple hypothesis about a feature compression property of artificial intelligence (AI) classifiers and present theoretical arguments to show that this hypothesis successfully accounts for the observed fragility of AI classifiers to small adversarial perturbations. Drawing on ideas from information and coding theory, we propose a general class of defenses for detecting classifier errors caused by abnormally small input perturbations. We further show theoretical guarantees for the performance of this detection method. We present experimental results with (a) a voice recognition system, and (b) a digit recognition system using the MNIST database, to demonstrate the effectiveness of the proposed defense methods. The ideas in this paper are motivated by a simple analogy between AI classifiers and the standard Shannon model of a communication system., Comment: 44 Pages, 2 Theorems, 35 Figures, 29 Tables. arXiv admin note: substantial text overlap with arXiv:1901.09413
Published: 2019

23. Fast Single Image Reflection Suppression via Convex Optimization

Author: Yang, Yang, Ma, Wenye, Zheng, Yin, Cai, Jian-Feng, and Xu, Weiyu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Removing undesired reflections from images taken through the glass is of great importance in computer vision. It serves as a means to enhance the image quality for aesthetic purposes as well as to preprocess images in machine learning and pattern recognition applications. We propose a convex model to suppress the reflection from a single input image. Our model implies a partial differential equation with gradient thresholding, which is solved efficiently using Discrete Cosine Transform. Extensive experiments on synthetic and real-world images demonstrate that our approach achieves desirable reflection suppression results and dramatically reduces the execution time., Comment: 9 pages, 8 figures, IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019
Published: 2019

24. Large Scale 2D Spectral Compressed Sensing in Continuous Domain

Author: Cai, Jian-Feng, Xu, Weiyu, and Yang, Yang
Subjects: Electrical Engineering and Systems Science - Signal Processing
Abstract: We consider the problem of spectral compressed sensing in continuous domain, which aims to recover a 2-dimensional spectrally sparse signal from partially observed time samples. The signal is assumed to be a superposition of s complex sinusoids. We propose a semidefinite program for the 2D signal recovery problem. Our model is able to handle large scale 2D signals of size 500 × 500, whereas traditional approaches only handle signals of size around 20 × 20., Comment: 5 pages, 2 figures, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Published: 2019
Full Text: View/download PDF

25. An Information-Theoretic Explanation for the Adversarial Fragility of AI Classifiers

Author: Xie, Hui, Yi, Jirong, Xu, Weiyu, and Mudumbai, Raghu
Subjects: Computer Science - Information Theory, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: We present a simple hypothesis about a compression property of artificial intelligence (AI) classifiers and present theoretical arguments to show that this hypothesis successfully accounts for the observed fragility of AI classifiers to small adversarial perturbations. We also propose a new method for detecting when small input perturbations cause classifier errors, and show theoretical guarantees for the performance of this detection method. We present experimental results with a voice recognition system to demonstrate this method. The ideas in this paper are motivated by a simple analogy between AI classifiers and the standard Shannon model of a communication system., Comment: 5 pages
Published: 2019

26. How did Donald Trump Surprisingly Win the 2016 United States Presidential Election? an Information-Theoretic Perspective (Clean Sensing for Big Data Analytics: Optimal Strategies, Estimation Error Bounds Tighter than the Cramér-Rao Bound)

Author: Xu, Weiyu, Lai, Lifeng, and Khajehnejad, Amin
Subjects: Computer Science - Information Theory, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Donald Trump was lagging behind in nearly all opinion polls leading up to the 2016 US presidential election, but he surprisingly won the election. This raises the following important questions: 1) why were most opinion polls inaccurate in 2016? and 2) how can the accuracy of opinion polls be improved? In this paper, we study the inaccuracies of opinion polls in the 2016 election through the lens of information theory. We first propose a general framework of parameter estimation, called clean sensing (polling), which performs optimal parameter estimation with sensing cost constraints, from heterogeneous and potentially distorted data sources. We then cast the opinion polling as a problem of parameter estimation from potentially distorted heterogeneous data sources, and derive the optimal polling strategy using heterogeneous and possibly distorted data under cost constraints. Our results show that a larger number of data samples does not necessarily lead to better polling accuracy, which gives a possible explanation of the inaccuracies of opinion polls in 2016. The optimal sensing strategy should instead optimally allocate sensing resources over heterogeneous data sources according to several factors including data quality, and, moreover, for a particular data source, it should strike an optimal balance between the quality of data samples, and the quantity of data samples. As a byproduct of this research, in a general setting, we derive a group of new lower bounds on the mean-squared errors of general unbiased and biased parameter estimators. These new lower bounds can be tighter than the classical Cramér-Rao bound (CRB) and Chapman-Robbins bound. Our derivations are via studying the Lagrange dual problems of certain convex programs. The classical Cramér-Rao bound and Chapman-Robbins bound follow naturally from our results for special cases of these convex programs., Comment: 45 pages
Published: 2018

27. Outlier Detection using Generative Models with Theoretical Performance Guarantees

Author: Yi, Jirong, Le, Anh Duc, Wang, Tianming, Wu, Xiaodong, and Xu, Weiyu
Subjects: Computer Science - Information Theory, Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: This paper considers the problem of recovering signals from compressed measurements contaminated with sparse outliers, which has arisen in many applications. In this paper, we propose a generative model neural network approach for reconstructing the ground truth signals under sparse outliers. We propose an iterative alternating direction method of multipliers (ADMM) algorithm for solving the outlier detection problem via $\ell_1$ norm minimization, and a gradient descent algorithm for solving the outlier detection problem via squared $\ell_1$ norm minimization. We establish the recovery guarantees for reconstruction of signals using generative models in the presence of outliers, and give an upper bound on the number of outliers allowed for recovery. Our results are applicable to both the linear generator neural network and the nonlinear generator neural network with an arbitrary number of layers. We conduct extensive experiments using variational auto-encoder and deep convolutional generative adversarial networks, and the experimental results show that the signals can be successfully reconstructed under outliers using our approach. Our approach outperforms the traditional Lasso and $\ell_2$ minimization approach., Comment: 38 Pages, 15 Figures, 10 Lemmas or Theorems with Proofs
Published: 2018

28. MSE-optimal 1-bit Precoding for Multiuser MIMO via Branch and Bound

Author: Jacobsson, Sven, Xu, Weiyu, Durisi, Giuseppe, and Studer, Christoph
Subjects: Computer Science - Information Theory
Abstract: In this paper, we solve the sum mean-squared error (MSE)-optimal 1-bit quantized precoding problem exactly for small-to-moderate sized multiuser multiple-input multiple-output (MU-MIMO) systems via branch and bound. To this end, we reformulate the original NP-hard precoding problem as a tree search and deploy a number of strategies that improve the pruning efficiency without sacrificing optimality. We evaluate the error-rate performance and the complexity of the resulting 1-bit branch-and-bound (BB-1) precoder, and compare its efficacy to that of existing, suboptimal algorithms for 1-bit precoding in MU-MIMO systems.
Published: 2018

29. Necessary and Sufficient Null Space Condition for Nuclear Norm Minimization in Low-Rank Matrix Recovery

Author: Yi, Jirong and Xu, Weiyu
Subjects: Mathematics - Optimization and Control, Computer Science - Information Theory, Computer Science - Learning, Electrical Engineering and Systems Science - Signal Processing, Statistics - Machine Learning
Abstract: Low-rank matrix recovery has found many applications in science and engineering such as machine learning, signal processing, collaborative filtering, system identification, and Euclidean embedding. However, the low-rank matrix recovery problem is NP-hard and thus challenging. A commonly used heuristic approach is nuclear norm minimization. In [12,14,15], the authors established the necessary and sufficient null space conditions for nuclear norm minimization to recover every possible low-rank matrix with rank at most r (the strong null space condition). In addition, in [12], Oymak et al. established a null space condition for successful recovery of a given low-rank matrix (the weak null space condition) using nuclear norm minimization, and derived the phase transition for nuclear norm minimization. In this paper, we show that the weak null space condition in [12] is only a sufficient condition for successful matrix recovery using nuclear norm minimization, and is not a necessary condition as claimed in [12]. We further give a weak null space condition for low-rank matrix recovery, which is both necessary and sufficient for the success of nuclear norm minimization. At the core of our derivation are an inequality for characterizing the nuclear norms of block matrices, and the conditions for equality to hold in that inequality., Comment: 17 pages, 0 figures
Published: 2018

30. Symbol Error Rate Performance of Box-relaxation Decoders in Massive MIMO

Author: Thrampoulidis, Christos, Xu, Weiyu, and Hassibi, Babak
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Information Theory
Abstract: The maximum-likelihood (ML) decoder for symbol detection in large multiple-input multiple-output wireless communication systems is typically computationally prohibitive. In this paper, we study a popular and practical alternative, namely the Box-relaxation optimization (BRO) decoder, which is a natural convex relaxation of the ML. For iid real Gaussian channels with additive Gaussian noise, we obtain exact asymptotic expressions for the symbol error rate (SER) of the BRO. The formulas are particularly simple, they yield useful insights, and they allow accurate comparisons to the matched-filter bound (MFB) and to the zero-forcing decoder. For BPSK signals the SER performance of the BRO is within 3dB of the MFB for square systems, and it approaches the MFB as the number of receive antennas grows large compared to the number of transmit antennas. Our analysis further characterizes the empirical density function of the solution of the BRO, and shows that error events for any fixed number of symbols are asymptotically independent. The fundamental tool behind the analysis is the convex Gaussian min-max theorem.
Published: 2017
Full Text: View/download PDF

31. Separation-Free Super-Resolution from Compressed Measurements is Possible: an Orthonormal Atomic Norm Minimization Approach

Author: Xu, Weiyu, Yi, Jirong, Dasgupta, Soura, Cai, Jian-Feng, Jacob, Mathews, and Cho, Myung
Subjects: Computer Science - Information Theory, Computer Science - Learning, Mathematics - Optimization and Control
Abstract: We consider the problem of recovering the superposition of $R$ distinct complex exponential functions from compressed non-uniform time-domain samples. Total Variation (TV) minimization or atomic norm minimization was proposed in the literature to recover the $R$ frequencies or the missing data. However, it is known that in order for TV minimization and atomic norm minimization to recover the missing data or the frequencies, the underlying $R$ frequencies are required to be well-separated, even when the measurements are noiseless. This paper shows that the Hankel matrix recovery approach can super-resolve the $R$ complex exponentials and their frequencies from compressed non-uniform measurements, regardless of how close their frequencies are to each other. We propose a new concept of orthonormal atomic norm minimization (OANM), and demonstrate that the success of Hankel matrix recovery in separation-free super-resolution comes from the fact that the nuclear norm of a Hankel matrix is an orthonormal atomic norm. More specifically, we show that, in traditional atomic norm minimization, the underlying parameter values must be well separated to achieve successful signal recovery, if the atoms are changing continuously with respect to the continuously-valued parameter. In contrast, the OANM can succeed even when the original atoms are arbitrarily close. As a byproduct of this research, we provide one matrix-theoretic inequality of nuclear norm, and give its proof from the theory of compressed sensing., Comment: 39 pages
Published: 2017

32. Fast dose optimization for rotating shield brachytherapy

Author: Cho, Myung, Wu, Xiaodong, Dakhah, Hossein, Yi, Jirong, Flynn, Ryan T., Kim, Yusung, and Xu, Weiyu
Subjects: Physics - Medical Physics, Mathematics - Optimization and Control
Abstract: Purpose: To provide a fast computational method, based on the proximal graph solver (POGS) - a convex optimization solver using the alternating direction method of multipliers (ADMM), for calculating an optimal treatment plan in rotating shield brachytherapy (RSBT). RSBT treatment planning has more degrees of freedom than conventional high-dose-rate brachytherapy (HDR-BT) due to the addition of emission direction, and this necessitates a fast optimization technique to enable clinical usage. // Methods: The multi-helix RSBT (H-RSBT) delivery technique was considered with five representative cervical cancer patients. Treatment plans were generated for all patients using the POGS method and the previously considered commercial solver IBM CPLEX. The rectum, bladder, sigmoid, high-risk clinical target volume (HR-CTV), and HR-CTV boundary were the structures considered in our optimization problem, called the asymmetric dose-volume optimization with smoothness control. Dose calculation resolution was 1x1x3 mm^3 for all cases. The H-RSBT applicator has 6 helices, with 33.3 mm of translation along the applicator per helical rotation and 1.7 mm spacing between dwell positions, yielding 17.5 degree emission angle spacing per 5 mm along the applicator. // Results: For each patient, HR-CTV D90, HR-CTV D100, rectum D2cc, sigmoid D2cc, and bladder D2cc matched within 1% for CPLEX and POGS. Also, we obtained similar EQD2 figures between CPLEX and POGS. POGS was around 18 times faster than CPLEX. Over all patients, total optimization times were 32.1-65.4 seconds for CPLEX and 2.1-3.9 seconds for POGS. // Conclusions: POGS reduced treatment plan optimization time for RSBT by a factor of around 18, with similar HR-CTV D90, OAR D2cc values, and EQD2 figures relative to CPLEX, which is significant progress toward clinical translation of RSBT. POGS is also applicable to conventional HDR-BT., Comment: 9 pages, 3 figures
Published: 2017
Full Text: View/download PDF

33. Distributed Dual Coordinate Ascent in General Tree Networks and Communication Network Effect on Synchronous Machine Learning

Author: Cho, Myung, Lai, Lifeng, and Xu, Weiyu
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Information Theory, Computer Science - Machine Learning
Abstract: Due to the large size of data and the limited data storage volume of a single computer or a single server, data are often stored in a distributed manner. Thus, performing large-scale machine learning operations with the distributed datasets through communication networks is often required. In this paper, we study the convergence rate of the distributed dual coordinate ascent for distributed machine learning problems in a general tree-structured network. Since a tree network model can be understood as the generalization of a star network model, our algorithm can be thought of as the generalization of the distributed dual coordinate ascent in a star network model. We provide the convergence rate of the distributed dual coordinate ascent over a general tree network in a recursive manner and analyze the network effect on the convergence rate. Secondly, by considering network communication delays, we optimize the distributed dual coordinate ascent algorithm to maximize its convergence speed. From our analytical result, we can choose the optimal number of local iterations depending on the communication delay severity to achieve the fastest convergence speed. In numerical experiments, we consider machine learning scenarios over communication networks, where local workers cannot directly reach a central node due to constraints in communication, and demonstrate the usability of our distributed dual coordinate ascent algorithm in tree networks. Additionally, we show that adapting the number of local and global iterations to network communication delays in the distributed dual coordinate ascent algorithm can improve its convergence speed., Comment: 34 pages, 18 figures
Published: 2017

34. Tree Network Design for Faster Distributed Machine Learning Process with Distributed Dual Coordinate Ascent

Author: Cho, Myung, primary, Chikkam, Meghana, additional, Xu, Weiyu, additional, and Lai, Lifeng, additional
Published: 2024
Full Text: View/download PDF

35. Phaseless super-resolution in the continuous domain

Author: Cho, Myung, Thrampoulidis, Christos, Xu, Weiyu, and Hassibi, Babak
Subjects: Computer Science - Information Theory
Abstract: Phaseless super-resolution refers to the problem of super-resolving a signal from only its low-frequency Fourier magnitude measurements. In this paper, we consider the phaseless super-resolution problem of recovering a sum of sparse Dirac delta functions which can be located anywhere in the continuous time-domain. For such signals in the continuous domain, we propose a novel Semidefinite Programming (SDP) based signal recovery method to achieve phaseless super-resolution. This work extends the recent work of Jaganathan et al. [1], which considered phaseless super-resolution for discrete signals on the grid.
Published: 2016

36. Compressed Hypothesis Testing: To Mix or Not to Mix?

Author: Cho, Myung, Xu, Weiyu, and Lai, Lifeng
Subjects: Computer Science - Information Theory
Abstract: In this paper, we study the problem of determining $k$ anomalous random variables that have different probability distributions from the remaining $(n-k)$ random variables. Instead of sampling each individual random variable separately as in the conventional hypothesis testing, we propose to perform hypothesis testing using mixed observations that are functions of multiple random variables. We characterize the error exponents for correctly identifying the $k$ anomalous random variables under fixed time-invariant mixed observations, random time-varying mixed observations, and deterministic time-varying mixed observations. For our error exponent characterization, we introduce the notions of inner conditional Chernoff information and outer conditional Chernoff information. It is demonstrated that mixed observations can strictly improve the error exponents of hypothesis testing, over separate observations of individual random variables. We further characterize the optimal sensing vector maximizing the error exponents, which leads to explicit constructions of the optimal mixed observations in special cases of hypothesis testing for Gaussian random variables. These results show that mixed observations of random variables can reduce the number of required samples in hypothesis testing applications. In order to solve large-scale hypothesis testing problems, we also propose efficient algorithms - LASSO based and message passing based hypothesis testing algorithms., Comment: compressed sensing, hypothesis testing, Chernoff information, anomaly detection, anomalous random variable, quickest detection. arXiv admin note: substantial text overlap with arXiv:1208.2311
Published: 2016

37. Computable performance guarantees for compressed sensing matrices

Author: Cho, Myung, Mishra, Kumar Vijay, and Xu, Weiyu
Subjects: Computer Science - Information Theory
Abstract: The null space condition for $\ell_1$ minimization in compressed sensing is a necessary and sufficient condition on the sensing matrices under which a sparse signal can be uniquely recovered from the observation data via $\ell_1$ minimization. However, verifying the null space condition is known to be computationally challenging. Most of the existing methods can provide only upper and lower bounds on the proportion parameter that characterizes the null space condition. In this paper, we propose new polynomial-time algorithms to establish upper bounds of the proportion parameter. We leverage these techniques to find upper bounds and further develop a new procedure, a tree search algorithm, that is able to precisely and quickly verify the null space condition. Numerical experiments show that the execution speed and accuracy of the results obtained from our methods far exceed those of the previous methods which rely on Linear Programming (LP) relaxation and Semidefinite Programming (SDP).
Published: 2016

38. Efficient Optimal Joint Channel Estimation and Data Detection for Massive MIMO Systems

Author: Alshamary, Haider Ali Jasim and Xu, Weiyu
Subjects: Computer Science - Information Theory, Mathematics - Optimization and Control
Abstract: In this paper, we propose an efficient optimal joint channel estimation and data detection algorithm for massive MIMO wireless systems. Our algorithm is optimal in terms of the generalized likelihood ratio test (GLRT). For massive MIMO systems, we show that the expected complexity of our algorithm grows polynomially in the channel coherence time. Simulation results demonstrate significant performance gains of our algorithm compared with suboptimal non-coherent detection algorithms. To the best of our knowledge, this is the first algorithm which efficiently achieves GLRT-optimal non-coherent detections for massive MIMO systems with general constellations., Comment: 5 pages, 4 figures, Conference
Published: 2016

39. BER Analysis of the box relaxation for BPSK Signal Recovery

Author: Thrampoulidis, Christos, Abbasi, Ehsan, Xu, Weiyu, and Hassibi, Babak
Subjects: Computer Science - Information Theory
Abstract: We study the problem of recovering an $n$-dimensional vector of $\{\pm1\}^n$ (BPSK) signals from $m$ noise corrupted measurements $\mathbf{y}=\mathbf{A}\mathbf{x}_0+\mathbf{z}$. In particular, we consider the box relaxation method which relaxes the discrete set $\{\pm1\}^n$ to the convex set $[-1,1]^n$ to obtain a convex optimization algorithm followed by hard thresholding. When the noise $\mathbf{z}$ and measurement matrix $\mathbf{A}$ have iid standard normal entries, we obtain an exact expression for the bit-wise probability of error $P_e$ in the limit of $n$ and $m$ growing and $\frac{m}{n}$ fixed. At high SNR our result shows that the $P_e$ of box relaxation is within 3dB of the matched filter bound (MFB) for square systems, and that it approaches the MFB as $m$ grows large compared to $n$. Our result also indicates that as $m,n\rightarrow\infty$, for any fixed set of size $k$, the error events of the corresponding $k$ bits in the box relaxation method are independent., Comment: 5 pages, 2 figures
Published: 2015
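
The box relaxation described in the abstract above can be illustrated with a minimal sketch. This is not the authors' code: the solver (projected gradient descent), the function name `box_relaxation_decode`, the problem sizes, and the noise scale `sigma` are all assumptions chosen for illustration. The sketch minimizes $\frac{1}{2}\|\mathbf{y}-\mathbf{A}\mathbf{x}\|_2^2$ over $[-1,1]^n$ and then hard-thresholds the result to $\{\pm1\}^n$.

```python
import numpy as np

def box_relaxation_decode(A, y, iters=500):
    """Hypothetical helper: projected gradient descent on
    0.5 * ||y - A x||^2 subject to x in [-1, 1]^n, then hard thresholding."""
    n = A.shape[1]
    # Step size 1/L, where L = ||A||_2^2 is the Lipschitz constant of the gradient.
    step = 1.0 / np.linalg.norm(A, 2) ** 2
    x = np.zeros(n)
    for _ in range(iters):
        grad = A.T @ (A @ x - y)                  # gradient of the least-squares objective
        x = np.clip(x - step * grad, -1.0, 1.0)   # projection onto the box [-1, 1]^n
    # Hard thresholding back to the BPSK alphabet {+1, -1}.
    return np.where(x >= 0, 1.0, -1.0)

# Illustrative setup (assumed values): m = 3n measurements, low noise.
rng = np.random.default_rng(0)
n, m, sigma = 32, 96, 0.1
x0 = rng.choice([-1.0, 1.0], size=n)              # transmitted BPSK vector
A = rng.standard_normal((m, n))                   # iid standard normal measurement matrix
y = A @ x0 + sigma * rng.standard_normal(m)       # noisy measurements

x_hat = box_relaxation_decode(A, y)
ber = float(np.mean(x_hat != x0))                 # empirical bit error rate
print("empirical BER:", ber)
```

Projected gradient descent is used here only because it keeps the example short; any convex solver that handles box constraints would serve the same purpose.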

40. Precise Phase Transition of Total Variation Minimization

Author: Zhang, Bingwen, Xu, Weiyu, Cai, Jian-Feng, and Lai, Lifeng
Subjects: Computer Science - Information Theory, Computer Science - Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Characterizing the phase transitions of convex optimizations in recovering structured signals or data is of central importance in compressed sensing, machine learning and statistics. The phase transitions of many convex optimization signal recovery methods such as $\ell_1$ minimization and nuclear norm minimization are well understood thanks to research in recent years. However, rigorously characterizing the phase transition of total variation (TV) minimization in recovering sparse-gradient signals is still open. In this paper, we fully characterize the phase transition curve of the TV minimization. Our proof builds on Donoho, Johnstone and Montanari's conjectured phase transition curve for the TV approximate message passing algorithm (AMP), together with the linkage between the minmax Mean Square Error of a denoising problem and the high-dimensional convex geometry for TV minimization., Comment: 6 pages
Published: 2015

41. Block Iterative Reweighted Algorithms for Super-Resolution of Spectrally Sparse Signals

Author: Cho, Myung, Mishra, Kumar Vijay, Cai, Jian-Feng, and Xu, Weiyu
Subjects: Computer Science - Information Theory
Abstract: We propose novel algorithms that enhance the performance of recovering unknown continuous-valued frequencies from undersampled signals. Our iterative reweighted frequency recovery algorithms employ the support knowledge gained from earlier steps of our algorithms as block prior information to enhance frequency recovery. Our methods improve the performance of the atomic norm minimization which is a useful heuristic in recovering continuous-valued frequency contents. Numerical results demonstrate that our block iterative reweighted methods provide both better recovery performance and faster speed than other known methods.
Published: 2015
Full Text: View/download PDF

42. Distributed Channel Estimation and Pilot Contamination Analysis for Massive MIMO-OFDM Systems

Author: Zaib, Alam, Masood, Mudassir, Ali, Anum, Xu, Weiyu, and Al-Naffouri, Tareq Y.
Subjects: Computer Science - Information Theory
Abstract: Massive MIMO communication systems, by virtue of utilizing a very large number of antennas, have the potential to yield higher spectral and energy efficiency in comparison with the conventional MIMO systems. In this paper, we consider uplink channel estimation in massive MIMO-OFDM systems with frequency selective channels. With an increased number of antennas, the channel estimation problem becomes very challenging as an exceptionally large number of channel parameters have to be estimated. We propose an efficient distributed linear minimum mean square error (LMMSE) algorithm that can achieve near optimal channel estimates at very low complexity by exploiting the strong spatial correlations and symmetry of large antenna array elements. The proposed method involves solving a (fixed) reduced dimensional LMMSE problem at each antenna followed by a repetitive sharing of information through collaboration among neighboring antenna elements. To further enhance the channel estimates and/or reduce the number of reserved pilot tones, we propose a data-aided estimation technique that relies on finding a set of most reliable data carriers. We also analyse the effect of pilot contamination on the mean square error (MSE) performance of different channel estimation techniques. Unlike the conventional approaches, we use stochastic geometry to obtain an analytical expression for the interference variance (or power) across OFDM frequency tones and use it to derive the MSE expressions for different algorithms under both noise and pilot contaminated regimes. Simulation results validate our analysis and the near optimal MSE performance of the proposed estimation algorithms., Comment: 16 pages, 15 figures
Published: 2015

43. Projected Wirtinger Gradient Descent for Low-Rank Hankel Matrix Completion in Spectral Compressed Sensing

Author: Cai, Jian-Feng, Liu, Suhui, and Xu, Weiyu
Subjects: Computer Science - Information Theory, Computer Science - Learning, Mathematics - Optimization and Control
Abstract: This paper considers reconstructing a spectrally sparse signal from a small number of randomly observed time-domain samples. The signal of interest is a linear combination of complex sinusoids at $R$ distinct frequencies. The frequencies can assume any continuous values in the normalized frequency domain $[0,1)$. After converting the spectrally sparse signal recovery into a low rank structured matrix completion problem, we propose a feasible point approach, the projected Wirtinger gradient descent (PWGD) algorithm, to solve this structured matrix completion problem efficiently. We further accelerate our proposed algorithm by a scheme inspired by FISTA. We give the convergence analysis of our proposed algorithms. Extensive numerical experiments are provided to illustrate the efficiency of our proposed algorithm. Unlike earlier approaches, our algorithm can solve problems of very large dimensions very efficiently., Comment: 12 pages
Published: 2015

44. Optimal Non-coherent Data Detection for Massive SIMO Wireless Systems with General Constellations: A Polynomial Complexity Solution

Author: Alshamary, Haider Ali Jasim, Anjum, Md Fahim, Al-Naffouri, Tareq, Zaib, Alam, and Xu, Weiyu
Subjects: Computer Science - Information Theory
Abstract: Massive MIMO systems can greatly increase spectral and energy efficiency over traditional MIMO systems by exploiting large antenna arrays. However, increasing the number of antennas at the base station (BS) makes the uplink non-coherent data detection very challenging in massive MIMO systems. In this paper we consider the joint maximum likelihood (ML) channel estimation and data detection problem for massive SIMO (single input multiple output) wireless systems, which is a special case of wireless systems with large antenna arrays. We propose exact ML non-coherent data detection algorithms for both constant-modulus and nonconstant-modulus constellations, with a low expected complexity. Despite the large number of unknown channel coefficients for massive SIMO systems, we show that the expected computational complexity of these algorithms is linear in the number of receive antennas and polynomial in channel coherence time. Simulation results show the performance gains (up to 5 dB improvement) of the optimal non-coherent data detection with a low computational complexity., Comment: Journal version. Conference version accepted to IEEE Signal Processing Workshop. arXiv admin note: substantial text overlap with arXiv:1411.6739
Published: 2015

45. Robust recovery of complex exponential signals from random Gaussian projections via low rank Hankel matrix reconstruction

Author: Cai, Jian-Feng, Qu, Xiaobo, Xu, Weiyu, and Ye, Gui-Bo
Subjects: Computer Science - Information Theory, Mathematics - Numerical Analysis, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: This paper explores robust recovery of a superposition of $R$ distinct complex exponential functions from a few random Gaussian projections. We assume that the signal of interest is $(2N-1)$-dimensional and $R \ll 2N-1$. This framework covers a large class of signals arising from real applications in biology, automation, imaging science, etc. To reconstruct such a signal, our algorithm is to seek a low-rank Hankel matrix of the signal by minimizing its nuclear norm subject to the consistency on the sampled data. Our theoretical results show that a robust recovery is possible as long as the number of projections exceeds $O(R\ln^2 N)$. No incoherence or separation condition is required in our proof. Our method can be applied to spectral compressed sensing where the signal of interest is a superposition of $R$ complex sinusoids. Compared to existing results, our result here does not need any separation condition on the frequencies, while achieving better or comparable bounds on the number of measurements. Furthermore, our method provides theoretical guidance on how many samples are required in the state-of-the-art non-uniform sampling in NMR spectroscopy. The performance of our algorithm is further demonstrated by numerical experiments.
Comment: 17 pages
Published: 2015

46. Structures and magnetic properties of two hexanuclear [Co2Ln4] complexes

Author: Wang, Yingying, Yuan, Zhuangdong, Ren, Hong, Xu, Weiyu, Xu, Jiali, Zhang, Huanzhen, Sha, Jingquan, and Zhang, Haifeng
Published: 2020
Full Text: View/download PDF

47. Linear Progressive Coding for Semantic Communication using Deep Neural Networks

Author: Riherd, Eva, Mudumbai, Raghu, and Xu, Weiyu
Published: 2024
Full Text: View/download PDF

48. Optimal Cost Constrained Adversarial Attacks for Multiple Agent Systems

Author: Lu, Ziqing, Liu, Guanlin, Lai, Lifeng, and Xu, Weiyu
Published: 2024
Full Text: View/download PDF

49. Optimal non-coherent data detection for massive SIMO wireless systems: A polynomial complexity solution

Author: Alshamary, Haider Ali Jasim, Al-Naffouri, Tareq, Zaib, Alam, and Xu, Weiyu
Subjects: Computer Science - Information Theory
Abstract: Massive MIMO systems have made significant progress in increasing spectral and energy efficiency over traditional MIMO systems by exploiting large antenna arrays. In this paper we consider the joint maximum likelihood (ML) channel estimation and data detection problem for massive SIMO (single input multiple output) wireless systems. Despite the large number of unknown channel coefficients for massive SIMO systems, we improve an algorithm to achieve the exact ML non-coherent data detection with a low expected complexity. We show that the expected computational complexity of this algorithm is linear in the number of receive antennas and polynomial in channel coherence time. Simulation results show the performance gain of the optimal non-coherent data detection with a low computational complexity.
Comment: 7 pages, 5 figures
Published: 2014

50. Spectral Super-resolution With Prior Knowledge

Author: Mishra, Kumar Vijay, Cho, Myung, Kruger, Anton, and Xu, Weiyu
Subjects: Computer Science - Information Theory
Abstract: We address the problem of super-resolution frequency recovery using prior knowledge of the structure of a spectrally sparse, undersampled signal. In many applications of interest, some structure information about the signal spectrum is often known. The prior information might be simply knowing precisely some signal frequencies or the likelihood of a particular frequency component in the signal. We devise a general semidefinite program to recover these frequencies using theories of positive trigonometric polynomials. Our theoretical analysis shows that, given sufficient prior information, perfect signal reconstruction is possible using signal samples no more than thrice the number of signal frequencies. Numerical experiments demonstrate great performance enhancements using our method. We show that the nominal resolution necessary for the grid-free results can be improved if prior information is suitably employed.
Comment: 13 pages, 8 figures. arXiv admin note: text overlap with arXiv:1404.7041, arXiv:1311.0950
Published: 2014
