Author: "Miao, Fei" / Search Limiters: Full Text - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Miao, Fei"' showing total 256 results

Start Over Author "Miao, Fei" Search Limiters Full Text

256 results on '"Miao, Fei"'

1. YOLO-MARL: You Only LLM Once for Multi-agent Reinforcement Learning

Author: Zhuang, Yuan, Shen, Yi, Zhang, Zhili, Chen, Yuxiao, and Miao, Fei
Subjects: Computer Science - Multiagent Systems
Abstract: Advancements in deep multi-agent reinforcement learning (MARL) have positioned it as a promising approach for decision-making in cooperative games. However, it still remains challenging for MARL agents to learn cooperative strategies for some game environments. Recently, large language models (LLMs) have demonstrated emergent reasoning capabilities, making them promising candidates for enhancing coordination among the agents. However, due to the model size of LLMs, it can be expensive to frequently infer LLMs for actions that agents can take. In this work, we propose You Only LLM Once for MARL (YOLO-MARL), a novel framework that leverages the high-level task planning capabilities of LLMs to improve the policy learning process of multi-agents in cooperative games. Notably, for each game environment, YOLO-MARL only requires one time interaction with LLMs in the proposed strategy generation, state interpretation and planning function generation modules, before the MARL policy training process. This avoids the ongoing costs and computational time associated with frequent LLMs API calls during training. Moreover, the trained decentralized normal-sized neural network-based policies operate independently of the LLM. We evaluate our method across three different environments and demonstrate that YOLO-MARL outperforms traditional MARL algorithms.
Published: 2024

2. CUQDS: Conformal Uncertainty Quantification under Distribution Shift for Trajectory Prediction

Author: Huang, Huiqun, He, Sihong, and Miao, Fei
Subjects: Computer Science - Machine Learning, Computer Science - Robotics
Abstract: Trajectory prediction models that can infer both finite future trajectories and their associated uncertainties of the target vehicles in an online setting (e.g., real-world application scenarios) is crucial for ensuring the safe and robust navigation and path planning of autonomous vehicle motion. However, the majority of existing trajectory prediction models have neither considered reducing the uncertainty as one objective during the training stage nor provided reliable uncertainty quantification during inference stage under potential distribution shift. Therefore, in this paper, we propose the Conformal Uncertainty Quantification under Distribution Shift framework, CUQDS, to quantify the uncertainty of the predicted trajectories of existing trajectory prediction models under potential data distribution shift, while considering improving the prediction accuracy of the models and reducing the estimated uncertainty during the training stage. Specifically, CUQDS includes 1) a learning-based Gaussian process regression module that models the output distribution of the base model (any existing trajectory prediction or time series forecasting neural networks) and reduces the estimated uncertainty by additional loss term, and 2) a statistical-based Conformal P control module to calibrate the estimated uncertainty from the Gaussian process regression module in an online setting under potential distribution shift between training and testing data., Comment: 9 pages, 2 figures
Published: 2024

3. $\alpha$-OCC: Uncertainty-Aware Camera-based 3D Semantic Occupancy Prediction

Author: Su, Sanbao, Chen, Nuo, Juefei-Xu, Felix, Feng, Chen, and Miao, Fei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In the realm of autonomous vehicle (AV) perception, comprehending 3D scenes is paramount for tasks such as planning and mapping. Camera-based 3D Semantic Occupancy Prediction (OCC) aims to infer scene geometry and semantics from limited observations. While it has gained popularity due to affordability and rich visual cues, existing methods often neglect the inherent uncertainty in models. To address this, we propose an uncertainty-aware camera-based 3D semantic occupancy prediction method ($\alpha$-OCC). Our approach includes an uncertainty propagation framework (Depth-UP) from depth models to enhance geometry completion (up to 11.58\% improvement) and semantic segmentation (up to 12.95\% improvement) for a variety of OCC models. Additionally, we propose a hierarchical conformal prediction (HCP) method to quantify OCC uncertainty, effectively addressing the high-level class imbalance in OCC datasets. On the geometry level, we present a novel KL-based score function that significantly improves the occupied recall of safety-critical classes (45\% improvement) with minimal performance overhead (3.4\% reduction). For uncertainty quantification, we demonstrate the ability to achieve smaller prediction set sizes while maintaining a defined coverage guarantee. Compared with baselines, it reduces up to 92\% set size. Our contributions represent significant advancements in OCC accuracy and robustness, marking a noteworthy step forward in autonomous perception systems.
Published: 2024

4. Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous Environments

Author: Wang, Han, He, Sihong, Zhang, Zhili, Miao, Fei, and Anderson, James
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems, Mathematics - Optimization and Control
Abstract: We explore a Federated Reinforcement Learning (FRL) problem where $N$ agents collaboratively learn a common policy without sharing their trajectory data. To date, existing FRL work has primarily focused on agents operating in the same or ``similar" environments. In contrast, our problem setup allows for arbitrarily large levels of environment heterogeneity. To obtain the optimal policy which maximizes the average performance across all potentially completely different environments, we propose two algorithms: FedSVRPG-M and FedHAPG-M. In contrast to existing results, we demonstrate that both FedSVRPG-M and FedHAPG-M, both of which leverage momentum mechanisms, can exactly converge to a stationary point of the average performance function, regardless of the magnitude of environment heterogeneity. Furthermore, by incorporating the benefits of variance-reduction techniques or Hessian approximation, both algorithms achieve state-of-the-art convergence results, characterized by a sample complexity of $\mathcal{O}\left(\epsilon^{-\frac{3}{2}}/N\right)$. Notably, our algorithms enjoy linear convergence speedups with respect to the number of agents, highlighting the benefit of collaboration among agents in finding a common policy.
Published: 2024

5. Constrained Reinforcement Learning Under Model Mismatch

Author: Sun, Zhongchang, He, Sihong, Miao, Fei, and Zou, Shaofeng
Subjects: Computer Science - Machine Learning
Abstract: Existing studies on constrained reinforcement learning (RL) may obtain a well-performing policy in the training environment. However, when deployed in a real environment, it may easily violate constraints that were originally satisfied during training because there might be model mismatch between the training and real environments. To address the above challenge, we formulate the problem as constrained RL under model uncertainty, where the goal is to learn a good policy that optimizes the reward and at the same time satisfy the constraint under model mismatch. We develop a Robust Constrained Policy Optimization (RCPO) algorithm, which is the first algorithm that applies to large/continuous state space and has theoretical guarantees on worst-case reward improvement and constraint violation at each iteration during the training. We demonstrate the effectiveness of our algorithm on a set of RL tasks with constraints., Comment: ICML 2024
Published: 2024

6. Safety Guaranteed Robust Multi-Agent Reinforcement Learning with Hierarchical Control for Connected and Automated Vehicles

Author: Zhang, Zhili, Ahmad, H M Sabbir, Sabouni, Ehsan, Sun, Yanchao, Huang, Furong, Li, Wenchao, and Miao, Fei
Subjects: Computer Science - Robotics, Computer Science - Multiagent Systems
Abstract: We address the problem of coordination and control of Connected and Automated Vehicles (CAVs) in the presence of imperfect observations in mixed traffic environment. A commonly used approach is learning-based decision-making, such as reinforcement learning (RL). However, most existing safe RL methods suffer from two limitations: (i) they assume accurate state information, and (ii) safety is generally defined over the expectation of the trajectories. It remains challenging to design optimal coordination between multi-agents while ensuring hard safety constraints under system state uncertainties (e.g., those that arise from noisy sensor measurements, communication, or state estimation methods) at every time step. We propose a safety guaranteed hierarchical coordination and control scheme called Safe-RMM to address the challenge. Specifically, the high-level coordination policy of CAVs in mixed traffic environment is trained by the Robust Multi-Agent Proximal Policy Optimization (RMAPPO) method. Though trained without uncertainty, our method leverages a worst-case Q network to ensure the model's robust performances when state uncertainties are present during testing. The low-level controller is implemented using model predictive control (MPC) with robust Control Barrier Functions (CBFs) to guarantee safety through their forward invariance property. We compare our method with baselines in different road networks in the CARLA simulator. Results show that our method provides best evaluated safety and efficiency in challenging mixed traffic environments with uncertainties., Comment: 6 pages, 6 figures
Published: 2023

7. Towards Safe Autonomy in Hybrid Traffic: Detecting Unpredictable Abnormal Behaviors of Human Drivers via Information Sharing

Author: Wang, Jiangwei, Su, Lili, Han, Songyang, Song, Dongjin, and Miao, Fei
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence
Abstract: Hybrid traffic which involves both autonomous and human-driven vehicles would be the norm of the autonomous vehicles practice for a while. On the one hand, unlike autonomous vehicles, human-driven vehicles could exhibit sudden abnormal behaviors such as unpredictably switching to dangerous driving modes, putting its neighboring vehicles under risks; such undesired mode switching could arise from numbers of human driver factors, including fatigue, drunkenness, distraction, aggressiveness, etc. On the other hand, modern vehicle-to-vehicle communication technologies enable the autonomous vehicles to efficiently and reliably share the scarce run-time information with each other. In this paper, we propose, to the best of our knowledge, the first efficient algorithm that can (1) significantly improve trajectory prediction by effectively fusing the run-time information shared by surrounding autonomous vehicles, and can (2) accurately and quickly detect abnormal human driving mode switches or abnormal driving behavior with formal assurance without hurting human drivers privacy. To validate our proposed algorithm, we first evaluate our proposed trajectory predictor on NGSIM and Argoverse datasets and show that our proposed predictor outperforms the baseline methods. Then through extensive experiments on SUMO simulator, we show that our proposed algorithm has great detection performance in both highway and urban traffic. The best performance achieves detection rate of 97.3%, average detection delay of 1.2s, and 0 false alarm., Comment: accepted to ACM Transactions on Cyber-Physical Systems
Published: 2023

8. Robust Electric Vehicle Balancing of Autonomous Mobility-On-Demand System: A Multi-Agent Reinforcement Learning Approach

Author: He, Sihong, Han, Shuo, and Miao, Fei
Subjects: Computer Science - Multiagent Systems, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control
Abstract: Electric autonomous vehicles (EAVs) are getting attention in future autonomous mobility-on-demand (AMoD) systems due to their economic and societal benefits. However, EAVs' unique charging patterns (long charging time, high charging frequency, unpredictable charging behaviors, etc.) make it challenging to accurately predict the EAVs supply in E-AMoD systems. Furthermore, the mobility demand's prediction uncertainty makes it an urgent and challenging task to design an integrated vehicle balancing solution under supply and demand uncertainties. Despite the success of reinforcement learning-based E-AMoD balancing algorithms, state uncertainties under the EV supply or mobility demand remain unexplored. In this work, we design a multi-agent reinforcement learning (MARL)-based framework for EAVs balancing in E-AMoD systems, with adversarial agents to model both the EAVs supply and mobility demand uncertainties that may undermine the vehicle balancing solutions. We then propose a robust E-AMoD Balancing MARL (REBAMA) algorithm to train a robust EAVs balancing policy to balance both the supply-demand ratio and charging utilization rate across the whole city. Experiments show that our proposed robust method performs better compared with a non-robust MARL method that does not consider state uncertainties; it improves the reward, charging utilization fairness, and supply-demand fairness by 19.28%, 28.18%, and 3.97%, respectively. Compared with a robust optimization-based method, the proposed MARL algorithm can improve the reward, charging utilization fairness, and supply-demand fairness by 8.21%, 8.29%, and 9.42%, respectively., Comment: accepted to International Conference on Intelligent Robots and Systems (IROS2023)
Published: 2023

9. Robust Multi-Agent Reinforcement Learning with State Uncertainty

Author: He, Sihong, Han, Songyang, Su, Sanbao, Han, Shuo, Zou, Shaofeng, and Miao, Fei
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Science and Game Theory, Computer Science - Multiagent Systems, Electrical Engineering and Systems Science - Systems and Control
Abstract: In real-world multi-agent reinforcement learning (MARL) applications, agents may not have perfect state information (e.g., due to inaccurate measurement or malicious attacks), which challenges the robustness of agents' policies. Though robustness is getting important in MARL deployment, little prior work has studied state uncertainties in MARL, neither in problem formulation nor algorithm design. Motivated by this robustness issue and the lack of corresponding studies, we study the problem of MARL with state uncertainty in this work. We provide the first attempt to the theoretical and empirical analysis of this challenging problem. We first model the problem as a Markov Game with state perturbation adversaries (MG-SPA) by introducing a set of state perturbation adversaries into a Markov Game. We then introduce robust equilibrium (RE) as the solution concept of an MG-SPA. We conduct a fundamental analysis regarding MG-SPA such as giving conditions under which such a robust equilibrium exists. Then we propose a robust multi-agent Q-learning (RMAQ) algorithm to find such an equilibrium, with convergence guarantees. To handle high-dimensional state-action space, we design a robust multi-agent actor-critic (RMAAC) algorithm based on an analytical expression of the policy gradient derived in the paper. Our experiments show that the proposed RMAQ algorithm converges to the optimal value function; our RMAAC algorithm outperforms several MARL and robust MARL methods in multiple multi-agent environments when state uncertainty is present. The source code is public on \url{https://github.com/sihongho/robust_marl_with_state_uncertainty}., Comment: 50 pages, Published in TMLR, Transactions on Machine Learning Research (06/2023)
Published: 2023

10. Multi-Agent Reinforcement Learning Guided by Signal Temporal Logic Specifications

Author: Wang, Jiangwei, Yang, Shuo, An, Ziyan, Han, Songyang, Zhang, Zhili, Mangharam, Rahul, Ma, Meiyi, and Miao, Fei
Subjects: Computer Science - Artificial Intelligence
Abstract: Reward design is a key component of deep reinforcement learning, yet some tasks and designer's objectives may be unnatural to define as a scalar cost function. Among the various techniques, formal methods integrated with DRL have garnered considerable attention due to their expressiveness and flexibility to define the reward and requirements for different states and actions of the agent. However, how to leverage Signal Temporal Logic (STL) to guide multi-agent reinforcement learning reward design remains unexplored. Complex interactions, heterogeneous goals and critical safety requirements in multi-agent systems make this problem even more challenging. In this paper, we propose a novel STL-guided multi-agent reinforcement learning framework. The STL requirements are designed to include both task specifications according to the objective of each agent and safety specifications, and the robustness values of the STL specifications are leveraged to generate rewards. We validate the advantages of our method through empirical studies. The experimental results demonstrate significant reward performance improvements compared to MARL without STL guidance, along with a remarkable increase in the overall safety rate of the multi-agent systems.
Published: 2023

11. Surrogate Lagrangian Relaxation: A Path To Retrain-free Deep Neural Network Pruning

Author: Zhou, Shanglin, Bragin, Mikhail A., Pepin, Lynn, Gurevin, Deniz, Miao, Fei, and Ding, Caiwen
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Artificial Intelligence, I.2
Abstract: Network pruning is a widely used technique to reduce computation cost and model size for deep neural networks. However, the typical three-stage pipeline significantly increases the overall training time. In this paper, we develop a systematic weight-pruning optimization approach based on Surrogate Lagrangian relaxation, which is tailored to overcome difficulties caused by the discrete nature of the weight-pruning problem. We prove that our method ensures fast convergence of the model compression problem, and the convergence of the SLR is accelerated by using quadratic penalties. Model parameters obtained by SLR during the training phase are much closer to their optimal values as compared to those obtained by other state-of-the-art methods. We evaluate our method on image classification tasks using CIFAR-10 and ImageNet with state-of-the-art MLP-Mixer, Swin Transformer, and VGG-16, ResNet-18, ResNet-50 and ResNet-110, MobileNetV2. We also evaluate object detection and segmentation tasks on COCO, KITTI benchmark, and TuSimple lane detection dataset using a variety of models. Experimental results demonstrate that our SLR-based weight-pruning optimization approach achieves a higher compression rate than state-of-the-art methods under the same accuracy requirement and also can achieve higher accuracy under the same compression rate requirement. Under classification tasks, our SLR approach converges to the desired accuracy $3\times$ faster on both of the datasets. Under object detection and segmentation tasks, SLR also converges $2\times$ faster to the desired accuracy. Further, our SLR achieves high model accuracy even at the hard-pruning stage without retraining, which reduces the traditional three-stage pruning into a two-stage process. Given a limited budget of retraining epochs, our approach quickly recovers the model's accuracy., Comment: arXiv admin note: text overlap with arXiv:2012.10079
Published: 2023

12. Collaborative Multi-Object Tracking with Conformal Uncertainty Propagation

Author: Su, Sanbao, Han, Songyang, Li, Yiming, Zhang, Zhili, Feng, Chen, Ding, Caiwen, and Miao, Fei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Object detection and multiple object tracking (MOT) are essential components of self-driving systems. Accurate detection and uncertainty quantification are both critical for onboard modules, such as perception, prediction, and planning, to improve the safety and robustness of autonomous vehicles. Collaborative object detection (COD) has been proposed to improve detection accuracy and reduce uncertainty by leveraging the viewpoints of multiple agents. However, little attention has been paid to how to leverage the uncertainty quantification from COD to enhance MOT performance. In this paper, as the first attempt to address this challenge, we design an uncertainty propagation framework called MOT-CUP. Our framework first quantifies the uncertainty of COD through direct modeling and conformal prediction, and propagates this uncertainty information into the motion prediction and association steps. MOT-CUP is designed to work with different collaborative object detectors and baseline MOT algorithms. We evaluate MOT-CUP on V2X-Sim, a comprehensive collaborative perception dataset, and demonstrate a 2% improvement in accuracy and a 2.67X reduction in uncertainty compared to the baselines, e.g. SORT and ByteTrack. In scenarios characterized by high occlusion levels, our MOT-CUP demonstrates a noteworthy $4.01\%$ improvement in accuracy. MOT-CUP demonstrates the importance of uncertainty quantification in both COD and MOT, and provides the first attempt to improve the accuracy and reduce the uncertainty in MOT based on COD through uncertainty propagation. Our code is public on https://coperception.github.io/MOT-CUP/., Comment: This paper has been accepted by IEEE Robotics and Automation Letters
Published: 2023

13. Privacy-preserving and Uncertainty-aware Federated Trajectory Prediction for Connected Autonomous Vehicles

Author: Peng, Muzi, Wang, Jiangwei, Song, Dongjin, Miao, Fei, and Su, Lili
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Robotics
Abstract: Deep learning is the method of choice for trajectory prediction for autonomous vehicles. Unfortunately, its data-hungry nature implicitly requires the availability of sufficiently rich and high-quality centralized datasets, which easily leads to privacy leakage. Besides, uncertainty-awareness becomes increasingly important for safety-crucial cyber physical systems whose prediction module heavily relies on machine learning tools. In this paper, we relax the data collection requirement and enhance uncertainty-awareness by using Federated Learning on Connected Autonomous Vehicles with an uncertainty-aware global objective. We name our algorithm as FLTP. We further introduce ALFLTP which boosts FLTP via using active learning techniques in adaptatively selecting participating clients. We consider both negative log-likelihood (NLL) and aleatoric uncertainty (AU) as client selection metrics. Experiments on Argoverse dataset show that FLTP significantly outperforms the model trained on local data. In addition, ALFLTP-AU converges faster in training regression loss and performs better in terms of NLL, minADE and MR than FLTP in most rounds, and has more stable round-wise performance than ALFLTP-NLL.
Published: 2023

14. Shared Information-Based Safe And Efficient Behavior Planning For Connected Autonomous Vehicles

Author: Han, Songyang, Zhou, Shanglin, Pepin, Lynn, Wang, Jiangwei, Ding, Caiwen, and Miao, Fei
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence
Abstract: The recent advancements in wireless technology enable connected autonomous vehicles (CAVs) to gather data via vehicle-to-vehicle (V2V) communication, such as processed LIDAR and camera data from other vehicles. In this work, we design an integrated information sharing and safe multi-agent reinforcement learning (MARL) framework for CAVs, to take advantage of the extra information when making decisions to improve traffic efficiency and safety. We first use weight pruned convolutional neural networks (CNN) to process the raw image and point cloud LIDAR data locally at each autonomous vehicle, and share CNN-output data with neighboring CAVs. We then design a safe actor-critic algorithm that utilizes both a vehicle's local observation and the information received via V2V communication to explore an efficient behavior planning policy with safety guarantees. Using the CARLA simulator for experiments, we show that our approach improves the CAV system's efficiency in terms of average velocity and comfort under different CAV ratios and different traffic densities. We also show that our approach avoids the execution of unsafe actions and always maintains a safe distance from other vehicles. We construct an obstacle-at-corner scenario to show that the shared vision can help CAVs to observe obstacles earlier and take action to avoid traffic jams., Comment: This paper gets the Best Paper Award in the DCAA workshop of AAAI 2023
Published: 2023

15. What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?

Author: Han, Songyang, Su, Sanbao, He, Sihong, Han, Shuo, Yang, Haizhao, Zou, Shaofeng, and Miao, Fei
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computer Science and Game Theory, Computer Science - Multiagent Systems
Abstract: Various methods for Multi-Agent Reinforcement Learning (MARL) have been developed with the assumption that agents' policies are based on accurate state information. However, policies learned through Deep Reinforcement Learning (DRL) are susceptible to adversarial state perturbation attacks. In this work, we propose a State-Adversarial Markov Game (SAMG) and make the first attempt to investigate different solution concepts of MARL under state uncertainties. Our analysis shows that the commonly used solution concepts of optimal agent policy and robust Nash equilibrium do not always exist in SAMGs. To circumvent this difficulty, we consider a new solution concept called robust agent policy, where agents aim to maximize the worst-case expected state value. We prove the existence of robust agent policy for finite state and finite action SAMGs. Additionally, we propose a Robust Multi-Agent Adversarial Actor-Critic (RMA3C) algorithm to learn robust policies for MARL agents under state uncertainties. Our experiments demonstrate that our algorithm outperforms existing methods when faced with state perturbations and greatly improves the robustness of MARL policies. Our code is public on https://songyanghan.github.io/what_is_solution/., Comment: Accepted by Transactions on Machine Learning Research (TMLR)
Published: 2022

16. Data-Driven Distributionally Robust Electric Vehicle Balancing for Autonomous Mobility-on-Demand Systems under Demand and Supply Uncertainties

Author: He, Sihong, Zhang, Zhili, Han, Shuo, Pepin, Lynn, Wang, Guang, Zhang, Desheng, Stankovic, John, and Miao, Fei
Subjects: Mathematics - Optimization and Control, Computer Science - Robotics, Electrical Engineering and Systems Science - Systems and Control
Abstract: Electric vehicles (EVs) are being rapidly adopted due to their economic and societal benefits. Autonomous mobility-on-demand (AMoD) systems also embrace this trend. However, the long charging time and high recharging frequency of EVs pose challenges to efficiently managing EV AMoD systems. The complicated dynamic charging and mobility process of EV AMoD systems makes the demand and supply uncertainties significant when designing vehicle balancing algorithms. In this work, we design a data-driven distributionally robust optimization (DRO) approach to balance EVs for both the mobility service and the charging process. The optimization goal is to minimize the worst-case expected cost under both passenger mobility demand uncertainties and EV supply uncertainties. We then propose a novel distributional uncertainty sets construction algorithm that guarantees the produced parameters are contained in desired confidence regions with a given probability. To solve the proposed DRO AMoD EV balancing problem, we derive an equivalent computationally tractable convex optimization problem. Based on real-world EV data of a taxi system, we show that with our solution the average total balancing cost is reduced by 14.49%, and the average mobility fairness and charging fairness are improved by 15.78% and 34.51%, respectively, compared to solutions that do not consider uncertainties., Comment: 16 pages
Published: 2022

17. Data-Driven Distributionally Robust Electric Vehicle Balancing for Mobility-on-Demand Systems under Demand and Supply Uncertainties

Author: He, Sihong, Pepin, Lynn, Wang, Guang, Zhang, Desheng, and Miao, Fei
Subjects: Mathematics - Optimization and Control, Computer Science - Robotics, Statistics - Applications
Abstract: As electric vehicle (EV) technologies become mature, EV has been rapidly adopted in modern transportation systems, and is expected to provide future autonomous mobility-on-demand (AMoD) service with economic and societal benefits. However, EVs require frequent recharges due to their limited and unpredictable cruising ranges, and they have to be managed efficiently given the dynamic charging process. It is urgent and challenging to investigate a computationally efficient algorithm that provide EV AMoD system performance guarantees under model uncertainties, instead of using heuristic demand or charging models. To accomplish this goal, this work designs a data-driven distributionally robust optimization approach for vehicle supply-demand ratio and charging station utilization balancing, while minimizing the worst-case expected cost considering both passenger mobility demand uncertainties and EV supply uncertainties. We then derive an equivalent computationally tractable form for solving the distributionally robust problem in a computationally efficient way under ellipsoid uncertainty sets constructed from data. Based on E-taxi system data of Shenzhen city, we show that the average total balancing cost is reduced by 14.49%, the average unfairness of supply-demand ratio and utilization is reduced by 15.78% and 34.51% respectively with the distributionally robust vehicle balancing method, compared with solutions which do not consider model uncertainties., Comment: This paper has been published in IROS2020
Published: 2022
Full Text: View/download PDF

18. Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios

Author: Zhang, Zhili, Han, Songyang, Wang, Jiangwei, and Miao, Fei
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems
Abstract: Communication technologies enable coordination among connected and autonomous vehicles (CAVs). However, it remains unclear how to utilize shared information to improve the safety and efficiency of the CAV system in dynamic and complicated driving scenarios. In this work, we propose a framework of constrained multi-agent reinforcement learning (MARL) with a parallel Safety Shield for CAVs in challenging driving scenarios that includes unconnected hazard vehicles. The coordination mechanisms of the proposed MARL include information sharing and cooperative policy learning, with Graph Convolutional Network (GCN)-Transformer as a spatial-temporal encoder that enhances the agent's environment awareness. The Safety Shield module with Control Barrier Functions (CBF)-based safety checking protects the agents from taking unsafe actions. We design a constrained multi-agent advantage actor-critic (CMAA2C) algorithm to train safe and cooperative policies for CAVs. With the experiment deployed in the CARLA simulator, we verify the performance of the safety checking, spatial-temporal encoder, and coordination mechanisms designed in our method by comparative experiments in several challenging scenarios with unconnected hazard vehicles. Results show that our proposed methodology significantly increases system safety and efficiency in challenging scenarios., Comment: This paper has been accepted by the 2023 IEEE International Conference on Robotics and Automation (ICRA 2023). 6 pages, 5 figures
Published: 2022

19. A Robust and Constrained Multi-Agent Reinforcement Learning Electric Vehicle Rebalancing Method in AMoD Systems

Author: He, Sihong, Wang, Yue, Han, Shuo, Zou, Shaofeng, and Miao, Fei
Subjects: Computer Science - Multiagent Systems, Computer Science - Machine Learning, Computer Science - Robotics, Electrical Engineering and Systems Science - Systems and Control
Abstract: Electric vehicles (EVs) play critical roles in autonomous mobility-on-demand (AMoD) systems, but their unique charging patterns increase the model uncertainties in AMoD systems (e.g. state transition probability). Since there usually exists a mismatch between the training and test/true environments, incorporating model uncertainty into system design is of critical importance in real-world applications. However, model uncertainties have not been considered explicitly in EV AMoD system rebalancing by existing literature yet, and the coexistence of model uncertainties and constraints that the decision should satisfy makes the problem even more challenging. In this work, we design a robust and constrained multi-agent reinforcement learning (MARL) framework with state transition kernel uncertainty for EV AMoD systems. We then propose a robust and constrained MARL algorithm (ROCOMA) with robust natural policy gradients (RNPG) that trains a robust EV rebalancing policy to balance the supply-demand ratio and the charging utilization rate across the city under model uncertainty. Experiments show that the ROCOMA can learn an effective and robust rebalancing policy. It outperforms non-robust MARL methods in the presence of model uncertainties. It increases the system fairness by 19.6% and decreases the rebalancing costs by 75.8%., Comment: 8 pages, accepted to IROS2023
Published: 2022

20. Uncertainty Quantification of Collaborative Detection for Self-Driving

Author: Su, Sanbao, Li, Yiming, He, Sihong, Han, Songyang, Feng, Chen, Ding, Caiwen, and Miao, Fei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Sharing information between connected and autonomous vehicles (CAVs) fundamentally improves the performance of collaborative object detection for self-driving. However, CAVs still have uncertainties on object detection due to practical challenges, which will affect the later modules in self-driving such as planning and control. Hence, uncertainty quantification is crucial for safety-critical systems such as CAVs. Our work is the first to estimate the uncertainty of collaborative object detection. We propose a novel uncertainty quantification method, called Double-M Quantification, which tailors a moving block bootstrap (MBB) algorithm with direct modeling of the multivariant Gaussian distribution of each corner of the bounding box. Our method captures both the epistemic uncertainty and aleatoric uncertainty with one inference pass based on the offline Double-M training process. And it can be used with different collaborative object detectors. Through experiments on the comprehensive collaborative perception dataset, we show that our Double-M method achieves more than 4X improvement on uncertainty score and more than 3% accuracy improvement, compared with the state-of-the-art uncertainty quantification methods. Our code is public on https://coperception.github.io/double-m-quantification., Comment: This paper has been accepted by the 2023 IEEE International Conference on Robotics and Automation (ICRA 2023)
Published: 2022

21. Robust Constrained Reinforcement Learning

Author: Wang, Yue, Miao, Fei, and Zou, Shaofeng
Subjects: Computer Science - Machine Learning
Abstract: Constrained reinforcement learning is to maximize the expected reward subject to constraints on utilities/costs. However, the training environment may not be the same as the test one, due to, e.g., modeling error, adversarial attack, non-stationarity, resulting in severe performance degradation and more importantly constraint violation. We propose a framework of robust constrained reinforcement learning under model uncertainty, where the MDP is not fixed but lies in some uncertainty set, the goal is to guarantee that constraints on utilities/costs are satisfied for all MDPs in the uncertainty set, and to maximize the worst-case reward performance over the uncertainty set. We design a robust primal-dual approach, and further theoretically develop guarantee on its convergence, complexity and robust feasibility. We then investigate a concrete example of $\delta$-contamination uncertainty set, design an online and model-free algorithm and theoretically characterize its sample complexity.
Published: 2022

22. Botnets Breaking Transformers: Localization of Power Botnet Attacks Against the Distribution Grid

Author: Pepin, Lynn, Wang, Lizhi, Wang, Jiangwei, Han, Songyang, Pishawikar, Pranav, Herzberg, Amir, Zhang, Peng, and Miao, Fei
Subjects: Computer Science - Cryptography and Security
Abstract: Traditional botnet attacks leverage large and distributed numbers of compromised internet-connected devices to target and overwhelm other devices with internet packets. With increasing consumer adoption of high-wattage internet-facing "smart devices", a new "power botnet" attack emerges, where such devices are used to target and overwhelm power grid devices with unusual load demand. We introduce a variant of this attack, the power-botnet weardown-attack, which does not intend to cause blackouts or short-term acute instability, but instead forces expensive mechanical components to activate more frequently, necessitating costly replacements / repairs. Specifically, we target the on-load tap-changer (OLTC) transformer, which uses a mechanical switch that responds to change in load demand. In our analysis and simulations, these attacks can halve the lifespan of an OLTC, or in the most extreme cases, reduce it to $2.5\%$ of its original lifespan. Notably, these power botnets are composed of devices not connected to the internal SCADA systems used to control power grids. This represents a new internet-based cyberattack that targets the power grid from the outside. To help the power system to mitigate these types of botnet attacks, we develop attack-localization strategies. We formulate the problem as a supervised machine learning task to locate the source of power botnet attacks. Within a simulated environment, we generate the training and testing dataset to evaluate several machine learning algorithm based localization methods, including SVM, neural network and decision tree. We show that decision-tree based classification successfully identifies power botnet attacks and locates compromised devices with at least $94\%$ improvement of accuracy over a baseline "most-frequent" classifier., Comment: 18 pages, 10 figures
Published: 2022

23. Stable and Efficient Shapley Value-Based Reward Reallocation for Multi-Agent Reinforcement Learning of Autonomous Vehicles

Author: Han, Songyang, Wang, He, Su, Sanbao, Shi, Yuanyuan, and Miao, Fei
Subjects: Computer Science - Computer Science and Game Theory
Abstract: With the development of sensing and communication technologies in networked cyber-physical systems (CPSs), multi-agent reinforcement learning (MARL)-based methodologies are integrated into the control process of physical systems and demonstrate prominent performance in a wide array of CPS domains, such as connected autonomous vehicles (CAVs). However, it remains challenging to mathematically characterize the improvement of the performance of CAVs with communication and cooperation capability. When each individual autonomous vehicle is originally self-interest, we can not assume that all agents would cooperate naturally during the training process. In this work, we propose to reallocate the system's total reward efficiently to motivate stable cooperation among autonomous vehicles. We formally define and quantify how to reallocate the system's total reward to each agent under the proposed transferable utility game, such that communication-based cooperation among multi-agents increases the system's total reward. We prove that Shapley value-based reward reallocation of MARL locates in the core if the transferable utility game is a convex game. Hence, the cooperation is stable and efficient and the agents should stay in the coalition or the cooperating group. We then propose a cooperative policy learning algorithm with Shapley value reward reallocation. In experiments, compared with several literature algorithms, we show the improvement of the mean episode system reward of CAV systems using our proposed algorithm., Comment: This paper has been accepted by the 2022 IEEE International Conference on Robotics and Automation (ICRA 2022)
Published: 2022

24. A Secure and Efficient Federated Learning Framework for NLP

Author: Deng, Jieren, Wang, Chenghong, Meng, Xianrui, Wang, Yijue, Li, Ji, Lin, Sheng, Han, Shuo, Miao, Fei, Rajasekaran, Sanguthevar, and Ding, Caiwen
Subjects: Computer Science - Cryptography and Security, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: In this work, we consider the problem of designing secure and efficient federated learning (FL) frameworks. Existing solutions either involve a trusted aggregator or require heavyweight cryptographic primitives, which degrades performance significantly. Moreover, many existing secure FL designs work only under the restrictive assumption that none of the clients can be dropped out from the training protocol. To tackle these problems, we propose SEFL, a secure and efficient FL framework that (1) eliminates the need for the trusted entities; (2) achieves similar and even better model accuracy compared with existing FL designs; (3) is resilient to client dropouts. Through extensive experimental studies on natural language processing (NLP) tasks, we demonstrate that the SEFL achieves comparable accuracy compared to existing FL solutions, and the proposed pruning technique can improve runtime performance up to 13.7x., Comment: Accepted by EMNLP 2021
Published: 2022

25. Hydroa vacciniforme lymphoproliferative disorder：a case report

Author: ZHU Xia, JIN Jingjing, WANG Xin, MIAO Fei, MA Jiexian, ZHANG Jiechen, XIAO Li
Subjects: hydroa vacciniforme lymphoproliferative disorder, skin papules, epstein-barr virus, Medicine
Abstract: In this study, we report a case of hydroa vacciniforme lymphoproliferative disorder (HVLPD). The patient, who was 21 years old at the time of initial consultation，suffering from recurrent papules, vesicular rashes, bleeding and black scabs on the neck, face and trunk. Serum EBV-DNA was significantly increased (2.88×107 copies/mL). The patient underwent skin biopsies twice within 2 years. The pathology of the first skin biopsy showed partial degeneration and loosening of the epidermal stratum spinosum, intraepidermal blister formation, partial epidermal detachment, and multifocal small abscesses seen in the blisters and stratum spinosum. Patchy infiltration of small lymphocytes, plasma cells, histiocytes, and eosinophils in the dermis, with no significant atypia of lymphocytes, EBER in situ hybridization was negative, which made it difficult to make a definitive diagnosis on pathology. The pathology of the second skin biopsy showed blisters visible within the patient's epidermis， and atypical lymphoid cells infiltrate around the hair follicles, sweat glands and blood vessels in the dermis. The immunohistochemical analysis indicated that lymphoid cells were positive for CD3, CD5, CD4, CD8, granzyme B and TIA-1, while CD56 and Perforin were negative, and the proliferation rate of Ki-67 was approximately 10%. EBER was positive by in situ hybridization consistent with clinicopathologic features of HVLPD. More than 1 year after receiving symptomatic treatment, the patient's rash worsened, with sometimes fever and left eyelid edema. The third skin biopsy performed in the other hospital showed that atypical lymphoid cells infiltrated the subcutaneous adipose tissue, and the proliferation rate of Ki-67 was 60%. The disease progressed to EBER-positive T-cell lymphoma. After 2 courses of chemotherapy with gemcitabine, cisplatin, dexamethasone and pegaspargase, the patient's edema subsided and the rash healed. This report demonstrates the clinical and pathologic features of the disease during its evolution and progression, with a view to enriching its diagnostic and therapeutic experience.
Published: 2023
Full Text: View/download PDF

26. Carbon-based peracetic acid activation towards advanced water purification

Author: Miao, Fei, Ren, Wei, Zhou, Hongyu, Ma, Tianyi, Zhang, Hui, Wang, Shaobin, and Duan, Xiaoguang
Published: 2025
Full Text: View/download PDF

27. Enabling Retrain-free Deep Neural Network Pruning using Surrogate Lagrangian Relaxation

Author: Gurevin, Deniz, Zhou, Shanglin, Pepin, Lynn, Li, Bingbing, Bragin, Mikhail, Ding, Caiwen, and Miao, Fei
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Network pruning is a widely used technique to reduce computation cost and model size for deep neural networks. However, the typical three-stage pipeline, i.e., training, pruning and retraining (fine-tuning) significantly increases the overall training trails. In this paper, we develop a systematic weight-pruning optimization approach based on Surrogate Lagrangian relaxation (SLR), which is tailored to overcome difficulties caused by the discrete nature of the weight-pruning problem while ensuring fast convergence. We further accelerate the convergence of the SLR by using quadratic penalties. Model parameters obtained by SLR during the training phase are much closer to their optimal values as compared to those obtained by other state-of-the-art methods. We evaluate the proposed method on image classification tasks, i.e., ResNet-18 and ResNet-50 using ImageNet, and ResNet-18, ResNet-50 and VGG-16 using CIFAR-10, as well as object detection tasks, i.e., YOLOv3 and YOLOv3-tiny using COCO 2014 and Ultra-Fast-Lane-Detection using TuSimple lane detection dataset. Experimental results demonstrate that our SLR-based weight-pruning optimization approach achieves higher compression rate than state-of-the-arts under the same accuracy requirement. It also achieves a high model accuracy even at the hard-pruning stage without retraining (reduces the traditional three-stage pruning to two-stage). Given a limited budget of retraining epochs, our approach quickly recovers the model accuracy.
Published: 2020

28. A Multi-Agent Reinforcement Learning Approach For Safe and Efficient Behavior Planning Of Connected Autonomous Vehicles

Author: Han, Songyang, Zhou, Shanglin, Wang, Jiangwei, Pepin, Lynn, Ding, Caiwen, Fu, Jie, and Miao, Fei
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control
Abstract: The recent advancements in wireless technology enable connected autonomous vehicles (CAVs) to gather information about their environment by vehicle-to-vehicle (V2V) communication. In this work, we design an information-sharing-based multi-agent reinforcement learning (MARL) framework for CAVs, to take advantage of the extra information when making decisions to improve traffic efficiency and safety. The safe actor-critic algorithm we propose has two new techniques: the truncated Q-function and safe action mapping. The truncated Q-function utilizes the shared information from neighboring CAVs such that the joint state and action spaces of the Q-function do not grow in our algorithm for a large-scale CAV system. We prove the bound of the approximation error between the truncated-Q and global Q-functions. The safe action mapping provides a provable safety guarantee for both the training and execution based on control barrier functions. Using the CARLA simulator for experiments, we show that our approach can improve the CAV system's efficiency in terms of average velocity and comfort under different CAV ratios and different traffic densities. We also show that our approach avoids the execution of unsafe actions and always maintains a safe distance from other vehicles. We construct an obstacle-at-corner scenario to show that the shared vision can help CAVs to observe obstacles earlier and take action to avoid traffic jams., Comment: This paper is submitted to IEEE Transactions on Intelligent Transportation Systems
Published: 2020

29. CD38 as a pan-hematologic target for chimeric antigen receptor T cells

Author: Glisovic-Aplenc, Tina, Diorio, Caroline, Chukinas, John A., Veliz, Kimberly, Shestova, Olga, Shen, Feng, Nunez-Cruz, Selene, Vincent, Tiffaney L., Miao, Fei, Milone, Michael C., June, Carl H., Teachey, David T., Tasian, Sarah K., Aplenc, Richard, and Gill, Saar
Published: 2023
Full Text: View/download PDF

30. Clinical evaluation of deep learning–based clinical target volume three-channel auto-segmentation algorithm for adaptive radiotherapy in cervical cancer

Author: Chen-ying Ma, Ju-ying Zhou, Xiao-ting Xu, Song-bing Qin, Miao-fei Han, Xiao-huan Cao, Yao-zong Gao, Lu Xu, Jing-jie Zhou, Wei Zhang, and Le-cheng Jia
Subjects: Cervical cancer CTV, Deep learning, Auto-segmentation, Registration, Medical technology, R855-855.5
Abstract: Abstract Objectives Accurate contouring of the clinical target volume (CTV) is a key element of radiotherapy in cervical cancer. We validated a novel deep learning (DL)-based auto-segmentation algorithm for CTVs in cervical cancer called the three-channel adaptive auto-segmentation network (TCAS). Methods A total of 107 cases were collected and contoured by senior radiation oncologists (ROs). Each case consisted of the following: (1) contrast-enhanced CT scan for positioning, (2) the related CTV, (3) multiple plain CT scans during treatment and (4) the related CTV. After registration between (1) and (3) for the same patient, the aligned image and CTV were generated. Method 1 is rigid registration, method 2 is deformable registration, and the aligned CTV is seen as the result. Method 3 is rigid registration and TCAS, method 4 is deformable registration and TCAS, and the result is generated by a DL-based method. Results From the 107 cases, 15 pairs were selected as the test set. The dice similarity coefficient (DSC) of method 1 was 0.8155 ± 0.0368; the DSC of method 2 was 0.8277 ± 0.0315; the DSCs of method 3 and 4 were 0.8914 ± 0.0294 and 0.8921 ± 0.0231, respectively. The mean surface distance and Hausdorff distance of methods 3 and 4 were markedly better than those of method 1 and 2. Conclusions The TCAS achieved comparable accuracy to the manual delineation performed by senior ROs and was significantly better than direct registration.
Published: 2022
Full Text: View/download PDF

31. A Moving-Horizon Hybrid Stochastic Game for Secure Control of Cyber-Physical Systems

Author: Miao, Fei, Zhu, Quanyan, Pajic, Miroslav, and Pappas, George J.
Subjects: Computer Science - Computer Science and Game Theory
Abstract: In this paper, we establish a zero-sum, hybrid state stochastic game model for designing defense policies for cyber-physical systems against different types of attacks. With the increasingly integrated properties of cyber-physical systems (CPS) today, security is a challenge for critical infrastructures. Though resilient control and detecting techniques for a specific model of attack have been proposed, to analyze and design detection and defense mechanisms against multiple types of attacks for CPSs requires new system frameworks. Besides security, other requirements such as optimal control cost also need to be considered. The hybrid game model we propose in this work contains physical states that are described by the system dynamics, and a cyber state that represents the detection mode of the system composed by a set of subsystems. A strategy means selecting a subsystem by combining one controller, one estimator and one detector among a finite set of candidate components at each state. Based on the game model, we propose a suboptimal value iteration algorithm for a finite horizon game, and prove that the algorithm results an upper bound for the value of the finite horizon game. A moving-horizon approach is also developed in order to provide a scalable and real-time computation of the switching strategies. Both algorithms aims at obtaining a saddle-point equilibrium policy for balancing the system's security overhead and control cost. The paper illustrates these concepts using numerical examples, and we compare the results with previously system designs that only equipped with one type of controller., Comment: Provionally accepted as a regular paper, Automatica, 11 pages
Published: 2017

32. Physical simulation of residual oil displacement production in offshore strong bottom water reservoir

Author: Tan, Jie, Cai, Hui, Li, Yan-lai, Liu, Chun-yan, Miao, Fei-fei, and Liu, Chun-zhi
Published: 2022
Full Text: View/download PDF

33. Clinical evaluation of deep learning–based clinical target volume three-channel auto-segmentation algorithm for adaptive radiotherapy in cervical cancer

Author: Ma, Chen-ying, Zhou, Ju-ying, Xu, Xiao-ting, Qin, Song-bing, Han, Miao-fei, Cao, Xiao-huan, Gao, Yao-zong, Xu, Lu, Zhou, Jing-jie, Zhang, Wei, and Jia, Le-cheng
Published: 2022
Full Text: View/download PDF

34. Fortilin interacts with TGF-β1 and prevents TGF-β receptor activation

Author: Pinkaew, Decha, Martinez-Hackert, Erik, Jia, Wei, King, Matthew D., Miao, Fei, Enger, Nicole R., Silakit, Runglawan, Ramana, Kota, Chen, Shi-You, and Fujise, Ken
Published: 2022
Full Text: View/download PDF

35. Coding Schemes for Securing Cyber-Physical Systems Against Stealthy Data Injection Attacks

Author: Miao, Fei, Zhu, Quanyan, Pajic, Miroslav, and Pappas, George J.
Subjects: Computer Science - Cryptography and Security, Computer Science - Systems and Control
Abstract: This paper considers a method of coding the sensor outputs in order to detect stealthy false data injection attacks. An intelligent attacker can design a sequence of data injection to sensors and actuators that pass the state estimator and statistical fault detector, based on knowledge of the system parameters. To stay undetected, the injected data should increase the state estimation errors while keep the estimation residues small. We employ a coding matrix to change the original sensor outputs to increase the estimation residues under intelligent data injection attacks. This is a low cost method compared with encryption schemes over all sensor measurements in communication networks. We show the conditions of a feasible coding matrix under the assumption that the attacker does not have knowledge of the exact coding matrix. An algorithm is developed to compute a feasible coding matrix, and, we show that in general, multiple feasible coding matrices exist. To defend against attackers who estimates the coding matrix via sensor and actuator measurements, time-varying coding matrices are designed according to the detection requirements. A heuristic algorithm to decide the time length of updating a coding matrix is then proposed., Comment: 12 pages, accepted, IEEE Transactions on Control of Network Systems
Published: 2016
Full Text: View/download PDF

36. Data-Driven Robust Taxi Dispatch under Demand Uncertainties

Author: Miao, Fei, Han, Shuo, Lin, Shan, Wang, Qian, Stankovic, John, Hendawi, Abdeltawab, Zhang, Desheng, He, Tian, and Pappas, George J.
Subjects: Computer Science - Systems and Control
Abstract: In modern taxi networks, large amounts of taxi occupancy status and location data are collected from networked in-vehicle sensors in real-time. They provide knowledge of system models on passenger demand and mobility patterns for efficient taxi dispatch and coordination strategies. Such approaches face new challenges: how to deal with uncertainties of predicted customer demand while fulfilling the system's performance requirements, including minimizing taxis' total idle mileage and maintaining service fairness across the whole city; how to formulate a computationally tractable problem. To address this problem, we develop a data-driven robust taxi dispatch framework to consider spatial-temporally correlated demand uncertainties. The robust vehicle dispatch problem we formulate is concave in the uncertain demand and convex in the decision variables. Uncertainty sets of random demand vectors are constructed from data based on theories in hypothesis testing, and provide a desired probabilistic guarantee level for the performance of robust taxi dispatch solutions. We prove equivalent computationally tractable forms of the robust dispatch problem using the minimax theorem and strong duality. Evaluations on four years of taxi trip data for New York City show that by selecting a probabilistic guarantee level at 75%, the average demand-supply ratio error is reduced by 31.7%, and the average total idle driving distance is reduced by 10.13% or about 20 million miles annually, compared with non-robust dispatch solutions., Comment: Accepted as a regular paper, IEEE Transactions on Control Systems Technology; 15 pages. This version updated as of Oct 2017
Published: 2016

37. Taxi Dispatch with Real-Time Sensing Data in Metropolitan Areas: A Receding Horizon Control Approach

Author: Miao, Fei, Han, Shuo, Lin, Shan, Stankovic, John A., Huang, Hua, Zhang, Desheng, Munir, Sirajum, He, Tian, and Pappas, George J.
Subjects: Computer Science - Systems and Control
Abstract: Traditional taxi systems in metropolitan areas often suffer from inefficiencies due to uncoordinated actions as system capacity and customer demand change. With the pervasive deployment of networked sensors in modern vehicles, large amounts of information regarding customer demand and system status can be collected in real time. This information provides opportunities to perform various types of control and coordination for large-scale intelligent transportation systems. In this paper, we present a receding horizon control (RHC) framework to dispatch taxis, which incorporates highly spatiotemporally correlated demand/supply models and real-time GPS location and occupancy information. The objectives include matching spatiotemporal ratio between demand and supply for service quality with minimum current and anticipated future taxi idle driving distance. Extensive trace-driven analysis with a data set containing taxi operational records in San Francisco shows that our solution reduces the average total idle distance by 52%, and reduces the supply demand ratio error across the city during one experimental time slot by 45%. Moreover, our RHC framework is compatible with a wide variety of predictive models and optimization problem formulations. This compatibility property allows us to solve robust optimization problems with corresponding demand uncertainty models that provide disruptive event information., Comment: Accepted. Key words--Intelligent Transportation System, Real-Time Taxi Dispatch, Receding Horizon Control, Mobility Pattern
Published: 2016
Full Text: View/download PDF

38. Molecular characteristics of Staphylococcus aureus isolates colonizing human nares and skin

Author: Zhao, Na, Cheng, Danhong, Jian, Ying, Liu, Yao, Liu, Junlan, Huang, Qian, He, Lei, Wang, Hua, Miao, Fei, Li, Min, and Liu, Qian
Published: 2021
Full Text: View/download PDF

39. Pancreatic Neuroendocrine Neoplasms: CT Spectral Imaging in Grading

Author: Li, Wei-Xia, Miao, Fei, Xu, Xue-Qin, Zhang, Jing, Wu, Zhi-Yuan, Chen, Ke-Min, Yan, Fu-Hua, and Lin, Xiao-Zhu
Published: 2021
Full Text: View/download PDF

40. Application value of a deep learning method based on a 3D V-Net convolutional neural network in the recognition and segmentation of the auditory ossicles

Author: Xing-Rui Wang, Xi Ma, Liu-Xu Jin, Yan-Jun Gao, Yong-Jie Xue, Jing-Long Li, Wei-Xian Bai, Miao-Fei Han, Qing Zhou, Feng Shi, and Jing Wang
Subjects: auditory ossicles, automatic segmentation, computed tomography, convolutional neural network, deep learning, Neurosciences. Biological psychiatry. Neuropsychiatry, RC321-571
Abstract: ObjectiveTo explore the feasibility of a deep learning three-dimensional (3D) V-Net convolutional neural network to construct high-resolution computed tomography (HRCT)-based auditory ossicle structure recognition and segmentation models.MethodsThe temporal bone HRCT images of 158 patients were collected retrospectively, and the malleus, incus, and stapes were manually segmented. The 3D V-Net and U-Net convolutional neural networks were selected as the deep learning methods for segmenting the auditory ossicles. The temporal bone images were randomized into a training set (126 cases), a test set (16 cases), and a validation set (16 cases). Taking the results of manual segmentation as a control, the segmentation results of each model were compared.ResultsThe Dice similarity coefficients (DSCs) of the malleus, incus, and stapes, which were automatically segmented with a 3D V-Net convolutional neural network and manually segmented from the HRCT images, were 0.920 ± 0.014, 0.925 ± 0.014, and 0.835 ± 0.035, respectively. The average surface distance (ASD) was 0.257 ± 0.054, 0.236 ± 0.047, and 0.258 ± 0.077, respectively. The Hausdorff distance (HD) 95 was 1.016 ± 0.080, 1.000 ± 0.000, and 1.027 ± 0.102, respectively. The DSCs of the malleus, incus, and stapes, which were automatically segmented using the 3D U-Net convolutional neural network and manually segmented from the HRCT images, were 0.876 ± 0.025, 0.889 ± 0.023, and 0.758 ± 0.044, respectively. The ASD was 0.439 ± 0.208, 0.361 ± 0.077, and 0.433 ± 0.108, respectively. The HD 95 was 1.361 ± 0.872, 1.174 ± 0.350, and 1.455 ± 0.618, respectively. As these results demonstrated, there was a statistically significant difference between the two groups (P < 0.001).ConclusionThe 3D V-Net convolutional neural network yielded automatic recognition and segmentation of the auditory ossicles and produced similar accuracy to manual segmentation results.
Published: 2022
Full Text: View/download PDF

41. Accuracy-Improved Fault Diagnosis Method for Rolling Bearing Based on Enhanced ESGMD-CC and BA-ELM Model

Author: Yuan, Wei, primary, Liu, Fuzheng, additional, Gu, Hongbin, additional, Miao, Fei, additional, Zhang, Faye, additional, and Jiang, Mingshun, additional
Published: 2024
Full Text: View/download PDF

42. Full-face ALA-PDT for facial actinic keratosis: Two case reports

Author: Zha, Wenjing, primary, Huang, Jianhua, additional, Lyu, Ting, additional, Miao, Fei, additional, Wu, Minfeng, additional, Shen, Jie, additional, Zhu, Rongyi, additional, Wang, Hongwei, additional, and Shi, Lei, additional
Published: 2024
Full Text: View/download PDF

43. An Intestinal Tumors Detection Model Based on Feature Distillation with Self-correction Mechanism and PathGAN

Author: Zhu, Lingfeng, primary, Liu, Jindong, additional, Zheng, Dongmei, additional, Cao, Ziran, additional, Miao, Fei, additional, Li, Cheng, additional, He, Jian, additional, and Guo, Jing, additional
Published: 2024
Full Text: View/download PDF

44. Engine remaining useful life prediction based on PSO optimized multi-layer long short-term memory and multi-source information fusion.

Author: Yuan, Wei, Li, Xinlong, Gu, Hongbin, Zhang, Faye, and Miao, Fei
Subjects: REMAINING useful life, DEEP learning, PARTICLE swarm optimization, MACHINE learning, ENGINES
Abstract: Engine as the core component of mechanical equipment, its operating state directly affects whether the equipment can operate normally. Predicting the engine remaining useful life (RUL) can monitor the health of the engine in real time and formulate a timely and reasonable maintenance plan. Aiming at the engine monitoring data with various and long time span, we propose a direct prediction method of engine RUL based on particle swarm optimization (PSO) optimized multi-layer Long Short-Term Memory (LSTM) in this paper. Firstly, the monitoring data that can well reflect the engine degradation trend is screened out, and the samples are constructed through a sliding time window. Then, a multi-layer LSTM model is constructed to mine the deep-seated features of the samples for predicting the engine RUL. Finally, the hyperparameters of the multi-layer LSTM model are optimized automatically by the PSO algorithm to optimize the performance of the model. The effectiveness of this method is verified by NASA data set. RMSE, MAE and the scoring function are used as evaluation indexes. RMSE and score of the prediction results are 12.35 and 284.1, respectively. It has higher prediction accuracy compared with traditional deep learning and machine learning methods. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

45. Engine remaining useful life prediction based on PSO optimized multi-layer long short-term memory and multi-source information fusion

Author: Yuan, Wei, primary, Li, Xinlong, additional, Gu, Hongbin, additional, Zhang, Faye, additional, and Miao, Fei, additional
Published: 2023
Full Text: View/download PDF

46. Resectable pancreatic ductal adenocarcinoma: association between preoperative CT texture features and metastatic nodal involvement

Author: Fang, Wei Huan, Li, Xu Dong, Zhu, Hui, Miao, Fei, Qian, Xiao Hua, Pan, Zi Lai, and Lin, Xiao Zhu
Published: 2020
Full Text: View/download PDF

47. A 15-year follow-up report of an elderly diabetic foot with multiple recurrences leading to toe amputation and thoughts on the model of care for diabetic foot ulcer

Author: Jia, Qing, primary, Ming, Yue, additional, Bai, Jiaojiao, additional, Miao, Fei, additional, and Qin, Wen, additional
Published: 2023
Full Text: View/download PDF

48. Safe and Robust Multi-Agent Reinforcement Learning for Connected Autonomous Vehicles under State Perturbations

Author: Zhang, Zhili, Sun, Yanchao, Huang, Furong, Miao, Fei, Zhang, Zhili, Sun, Yanchao, Huang, Furong, and Miao, Fei
Abstract: Sensing and communication technologies have enhanced learning-based decision making methodologies for multi-agent systems such as connected autonomous vehicles (CAV). However, most existing safe reinforcement learning based methods assume accurate state information. It remains challenging to achieve safety requirement under state uncertainties for CAVs, considering the noisy sensor measurements and the vulnerability of communication channels. In this work, we propose a Robust Multi-Agent Proximal Policy Optimization with robust Safety Shield (SR-MAPPO) for CAVs in various driving scenarios. Both robust MARL algorithm and control barrier function (CBF)-based safety shield are used in our approach to cope with the perturbed or uncertain state inputs. The robust policy is trained with a worst-case Q function regularization module that pursues higher lower-bounded reward in the former, whereas the latter, i.e., the robust CBF safety shield accounts for CAVs' collision-free constraints in complicated driving scenarios with even perturbed vehicle state information. We validate the advantages of SR-MAPPO in robustness and safety and compare it with baselines under different driving and state perturbation scenarios in CARLA simulator. The SR-MAPPO policy is verified to maintain higher safety rates and efficiency (reward) when threatened by both state perturbations and unconnected vehicles' dangerous behaviors., Comment: 6 pages, 5 figures
Published: 2023

49. A 15-year follow-up report of an elderly diabetic foot with multiple recurrences leading to toe amputation and thoughts on the model of care for diabetic foot ulcer

Author: Jia, Qing, Ming, Yue, Bai, Jiaojiao, Miao, Fei, and Qin, Wen
Subjects: Communication
Abstract: Diabetic foot ulcer (DFU) is one of the most serious complications of diabetes. Elderly diabetic patients are a high prevalence of diabetic foot ulcers, and their high recurrence, disability, and mortality rates impose a heavy economic burden on families and society. This paper reports a case of an elderly patient with a diabetic foot ulcer who was admitted in April 2007 and discharged after recovery from comprehensive diabetic foot treatment. Due to intermittent foot care and lack of home care, the patient's foot ulcers recurred after repeated healing during home rehabilitation, eventually resulting in the amputation of the right bunion. After the patient was discharged from the hospital with an amputated toe, the whole-process seamless management model of "hospital - community - family" was implemented. The hospital provides specialized foot support and guidance, and the community is responsible for daily disease management and referrals. The family is responsible for the implementation of home rehabilitation programs, and family caregivers need to identify and provide feedback on foot abnormalities promptly. As of May 2022, the patient had not experienced ulcer recurrence. This paper reports the whole process of "ulcer development → ulcer healing → ulcer recurrence healing → toe amputation → continuous care management" experienced by the patient in 15 years, aiming to reflect on the significance of the whole-process seamless foot care management model of "hospital-community-family" for diabetic foot ulcer rehabilitation through the case.
Published: 2023

50. Clinicopathologic characterization and abnormal autophagy of CSF1R-related leukoencephalopathy

Author: Tian, Wo-Tu, Zhan, Fei-Xia, Liu, Qing, Luan, Xing-Hua, Zhang, Chao, Shang, Liang, Zhang, Ben-Yan, Pan, Si-Jian, Miao, Fei, Hu, Jiong, Zhong, Ping, Liu, Shi-Hua, Zhu, Ze-Yu, Zhou, Hai-Yan, Sun, Suya, Liu, Xiao-Li, Huang, Xiao-Jun, Jiang, Jing-Wen, Ma, Jian-Fang, Wang, Ying, Chen, Shu-Fen, Tang, Hui-Dong, Chen, Sheng-Di, and Cao, Li
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

256 results on '"Miao, Fei"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources