Author: "YUAN, Lei" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"YUAN, Lei"' showing total 4,899 results

1. Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation

Author: Li, Yi-Chen, Zhang, Fuxiang, Qiu, Wenjie, Yuan, Lei, Jia, Chengxing, Zhang, Zongzhang, Yu, Yang, and An, Bo
Subjects: Computer Science - Machine Learning
Abstract: Large Language Models (LLMs), trained on large corpora, have demonstrated remarkable abilities. However, it may not be sufficient to directly apply open-source LLMs like Llama to certain real-world scenarios, since most of them are trained for general purposes. Thus, demand for customizing publicly available LLMs is emerging but remains under-studied. In this work, we consider customizing pre-trained LLMs with new human preferences. Specifically, the LLM should not only meet the new preference but also preserve its original capabilities after customization. Drawing inspiration from the observation that human preference can be expressed as a reward model, we propose to cast LLM customization as optimizing the sum of two reward functions, one of which (denoted as $r_1$) was used to pre-train the LLM while the other (denoted as $r_2$) characterizes the new human preference. The obstacle here is that both reward functions are unknown, making the application of modern reinforcement learning methods infeasible. Thanks to the residual Q-learning framework, we can restore the customized LLM with the pre-trained LLM and the residual Q-function without the reward function $r_1$. Moreover, we find that for a fixed pre-trained LLM, the reward function $r_2$ can be derived from the residual Q-function, enabling us to directly learn the residual Q-function from the new human preference data upon the Bradley-Terry model. We name our method Q-Adapter as it introduces an adapter module to approximate the residual Q-function for customizing the pre-trained LLM towards the new preference. Experiments based on the Llama-3.1 model on the DSP dataset and HH-RLHF dataset illustrate the superior effectiveness of Q-Adapter on both retaining existing knowledge and learning new preferences. Code is available at https://github.com/mansicer/Q-Adapter.
Published: 2024
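
Note on entry 1: the abstract refers to learning from preference data under the Bradley-Terry model. As a rough illustration only, the sketch below computes the Bradley-Terry preference probability and the matching negative log-likelihood for plain scalar scores. The function names and the idea of feeding in ready-made scores are assumptions made for this example; it is not the Q-Adapter code, which the abstract says is available at the linked repository.

```python
import math


def bradley_terry_prob(score_preferred, score_rejected):
    """Probability that the first item is preferred: sigmoid(score difference)."""
    diff = score_preferred - score_rejected
    # Branch on the sign so math.exp never receives a large positive argument.
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)


def preference_nll(pairs):
    """Mean negative log-likelihood over (preferred_score, rejected_score) pairs.

    Uses the identity -log(sigmoid(d)) = max(-d, 0) + log1p(exp(-|d|)),
    which stays finite for large score differences.
    """
    total = 0.0
    for preferred, rejected in pairs:
        diff = preferred - rejected
        total += max(-diff, 0.0) + math.log1p(math.exp(-abs(diff)))
    return total / len(pairs)
```

In a learning setup the scores would come from a trainable model and this loss would be minimized over the preference pairs; here they are plain numbers so that the formula can be checked by hand.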

2. Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation

Author: Jia, Chengxing, Zhang, Fuxiang, Li, Yi-Chen, Gao, Chen-Xiao, Liu, Xu-Hui, Yuan, Lei, Zhang, Zongzhang, and Yu, Yang
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Offline meta-reinforcement learning (OMRL) proficiently allows an agent to tackle novel tasks while solely relying on a static dataset. For precise and efficient task identification, existing OMRL research suggests learning separate task representations that are incorporated into the policy input, thus forming a context-based meta-policy. A major approach to training task representations is to adopt contrastive learning using multi-task offline data. The dataset typically encompasses interactions from various policies (i.e., the behavior policies), thus providing a plethora of contextual information regarding different tasks. Nonetheless, amassing data from a substantial number of policies is not only impractical but also often unattainable in realistic settings. Instead, we resort to a more constrained yet practical scenario, where multi-task data collection occurs with a limited number of policies. We observed that learned task representations from previous OMRL methods tend to correlate spuriously with the behavior policy instead of reflecting the essential characteristics of the task, resulting in unfavorable out-of-distribution generalization. To alleviate this issue, we introduce a novel algorithm to disentangle the impact of behavior policy from task representation learning through a process called adversarial data augmentation. Specifically, the objective of adversarial data augmentation is not merely to generate data analogous to offline data distribution; instead, it aims to create adversarial examples designed to confound learned task representations and lead to incorrect task identification. Our experiments show that learning from such adversarial samples significantly enhances the robustness and effectiveness of the task identification process and realizes satisfactory out-of-distribution generalization.
Published: 2024

3. A Simplified single-phase neutral-clamped H-bridge cascade converter FCS-MPC control

Author: Yuan, Lei, Mei, Jia-wei, Xu, An-fei, Wang, Pan, and Cai, Yu-hua
Published: 2024
Full Text: View/download PDF

4. EZH2 Promotes Glioma Cell Proliferation, Invasion, and Migration via Mir-142-3p/KCNQ1OT1/HMGB3 Axis: Running Title: EZH2 Promotes Glioma cell Malignant Behaviors

Author: Zhang, Yiming, Yu, Yong, Yuan, Lei, and Zhang, Baozhong
Published: 2024
Full Text: View/download PDF

5. Harmonic Suppression Strategy of Permanent Magnet Synchronous Motor Based on PCI and ADRC Methods

Author: Xu, An-fei, Zhu, Xue-song, Yuan, Lei, Han, Kun, and Wang, Pan
Published: 2024
Full Text: View/download PDF

6. Reduction Behavior of Lump Ore and Its Applicability During Hydrogen-Based Shaft Furnace Process

Author: Zhao, Zichuan, Tang, Jue, Chu, Mansheng, Feng, Jinge, Li, Sinan, Qin, Jile, Li, Feng, and Yuan, Lei
Published: 2024
Full Text: View/download PDF

7. HCT-Unet: multi-target medical image segmentation via a hybrid CNN-transformer Unet incorporating multi-axis gated multi-layer perceptron

Author: Fan, Yazhuo, Song, Jianhua, Yuan, Lei, and Jia, Yunlin
Published: 2024
Full Text: View/download PDF

8. Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics

Author: Zhang, Xinyu, Qiu, Wenjie, Li, Yi-Chen, Yuan, Lei, Jia, Chengxing, Zhang, Zongzhang, and Yu, Yang
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Developing policies that can adjust to non-stationary environments is essential for real-world reinforcement learning applications. However, learning such adaptable policies in offline settings, with only a limited set of pre-collected trajectories, presents significant challenges. A key difficulty arises because the limited offline data makes it hard for the context encoder to differentiate between changes in the environment dynamics and shifts in the behavior policy, often leading to context misassociations. To address this issue, we introduce a novel approach called Debiased Offline Representation for fast online Adaptation (DORA). DORA incorporates an information bottleneck principle that maximizes mutual information between the dynamics encoding and the environmental data, while minimizing mutual information between the dynamics encoding and the actions of the behavior policy. We present a practical implementation of DORA, leveraging tractable bounds of the information bottleneck principle. Our experimental evaluation across six benchmark MuJoCo tasks with variable parameters demonstrates that DORA not only achieves a more precise dynamics encoding but also significantly outperforms existing baselines in terms of performance.
Published: 2024

9. A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment

Author: Yuan, Lei, Zhang, Ziqian, Li, Lihe, Guan, Cong, and Yu, Yang
Subjects: Computer Science - Multiagent Systems
Abstract: Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and has made progress in various fields. Specifically, cooperative MARL focuses on training a team of agents to cooperatively achieve tasks that are difficult for a single agent to handle. It has shown great potential in applications such as path planning, autonomous driving, active voltage control, and dynamic algorithm configuration. One of the research focuses in the field of cooperative MARL is how to improve the coordination efficiency of the system, while research work has mainly been conducted in simple, static, and closed environment settings. To promote the application of artificial intelligence in real-world, some research has begun to explore multi-agent coordination in open environments. These works have made progress in exploring and researching the environments where important factors might change. However, the mainstream work still lacks a comprehensive review of the research direction. In this paper, starting from the concept of reinforcement learning, we subsequently introduce multi-agent systems (MAS), cooperative MARL, typical methods, and test environments. Then, we summarize the research work of cooperative MARL from closed to open environments, extract multiple research directions, and introduce typical works. Finally, we summarize the strengths and weaknesses of the current research, and look forward to the future development direction and research problems in cooperative MARL in open environments.
Published: 2023

10. Study of thermal shock resistance of HVAF spraying thickness gradient WC-Cr3C2-Ni coating on crystallizer surface

Author: Zhang, Diyao, Hu, Shuming, Peng, Zijun, Liu, Zhenli, Yu, Jingkun, and Yuan, Lei
Published: 2024
Full Text: View/download PDF

11. Relationship between microbial protein and amino acid metabolism in fermented grains of long fermentation period strong-flavor Baijiu

Author: Liu, Xiaogang, Yuan, Lei, Ma, Dongna, Liu, Shuangping, Ji, Zhongwei, Han, Xiao, Shen, Caihong, and Mao, Jian
Published: 2024
Full Text: View/download PDF

12. A risk prediction model for delayed bleeding after ESD for gastric precancerous lesions

Author: Zhu, Yiying, Ji, Mengyao, Yuan, Lei, Yuan, Jingping, and Shen, Lei
Published: 2024
Full Text: View/download PDF

13. Tribological characteristics of WC-Cr3C2-Ni cermet coatings under different wear parameters

Author: Zhang, Diyao, Peng, Zijun, Liu, Zhenli, Yu, Jingkun, and Yuan, Lei
Published: 2024
Full Text: View/download PDF

14. Straw retention and inhibitor application reduce the leaching risk of mineral N in no-tillage systems of Northeast China

Author: Yuan, Lei, Hu, Yanyu, Yang, Miaoyin, Lei, Ningbo, Chen, Huaihai, Ma, Jian, Chen, Xin, Xie, Hongtu, He, Hongbo, Zhang, Xudong, and Lu, Caiyan
Published: 2024
Full Text: View/download PDF

15. Effect of Electric Current Pulse on the Interfacial Reaction Between Molten Steel and SEN During Continuous Casting of Ultra-Low Carbon Steel

Author: Chen, Kaiwang, Yuan, Lei, Gu, Qiang, Liu, Guoqi, Zhi, Jianjun, Yu, Jingkun, and Li, Hongxia
Published: 2024
Full Text: View/download PDF

16. Efficient Human-AI Coordination via Preparatory Language-based Convention

Author: Guan, Cong, Zhang, Lichao, Fan, Chunpeng, Li, Yichen, Chen, Feng, Li, Lihe, Tian, Yunjia, Yuan, Lei, and Yu, Yang
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: Developing intelligent agents capable of seamless coordination with humans is a critical step towards achieving artificial general intelligence. Existing methods for human-AI coordination typically train an agent to coordinate with a diverse set of policies or with human models fitted from real human data. However, the massively diverse styles of human behavior present obstacles for AI systems with constrained capacity, while high quality human data may not be readily available in real-world scenarios. In this study, we observe that prior to coordination, humans engage in communication to establish conventions that specify individual roles and actions, making their coordination proceed in an orderly manner. Building upon this observation, we propose employing the large language model (LLM) to develop an action plan (or equivalently, a convention) that effectively guides both human and AI. By inputting task requirements, human preferences, the number of agents, and other pertinent information into the LLM, it can generate a comprehensive convention that facilitates a clear understanding of tasks and responsibilities for all parties involved. Furthermore, we demonstrate that decomposing the convention formulation problem into sub-problems with multiple new sessions being sequentially employed and human feedback, will yield a more efficient coordination convention. Experimental evaluations conducted in the Overcooked-AI environment, utilizing a human proxy model, highlight the superior performance of our proposed method compared to existing learning-based approaches. When coordinating with real humans, our method achieves better alignment with human preferences and an average performance improvement of 15% compared to the state-of-the-art.
Published: 2023

17. Learning to Coordinate with Anyone

Author: Yuan, Lei, Li, Lihe, Zhang, Ziqian, Chen, Feng, Zhang, Tianyi, Guan, Cong, Yu, Yang, and Zhou, Zhi-Hua
Subjects: Computer Science - Multiagent Systems
Abstract: In open multi-agent environments, the agents may encounter unexpected teammates. Classical multi-agent learning approaches train agents that can only coordinate with seen teammates. Recent studies attempted to generate diverse teammates to enhance the generalizable coordination ability, but were restricted by pre-defined teammates. In this work, our aim is to train agents with strong coordination ability by generating teammates that fully cover the teammate policy space, so that agents can coordinate with any teammates. Since the teammate policy space is too huge to be enumerated, we find only dissimilar teammates that are incompatible with controllable agents, which highly reduces the number of teammates that need to be trained with. However, it is hard to determine the number of such incompatible teammates beforehand. We therefore introduce a continual multi-agent learning process, in which the agent learns to coordinate with different teammates until no more incompatible teammates can be found. The above idea is implemented in the proposed Macop (Multi-agent compatible policy learning) algorithm. We conduct experiments in 8 scenarios from 4 environments that have distinct coordination patterns. Experiments show that Macop generates training teammates with much lower compatibility than previous methods. As a result, in all scenarios Macop achieves the best overall coordination ability while never significantly worse than the baselines, showing strong generalization ability.
Published: 2023

18. Finding Structure in Real Time: An Eye Tracking Study on the Statistical Learning of Multiple Linguistic Structures Simultaneously

Author: Vleugels, Lucile and Yuan, Lei
Subjects: Attention, Language learning, Perception, Statistical learning, Eye tracking
Abstract: Many human-invented compositional systems (e.g., language, mathematics) embody hierarchical relational structures. How exactly these structures are acquired during learning remains an open question. Here, we examine how the structure of a system engages learners' attention and learning. Participants (N=88) learned an artificial language that describes novel combinations of unknown visual symbols while their eye movements were recorded. Participants were randomly assigned to one of two conditions. The 'More' condition had three latent rules that connected components in verbal input to visual input. In contrast, the 'Less' condition had only one latent rule. Despite having more regularities to learn, the 'More' condition performed as well as the 'Less' condition. Eye movement data further revealed that participants in the 'More' condition selectively attended to target symbols more than those in the 'Less' condition. These results suggest a counterintuitive 'More is More' principle: the presence of multiple regularities organizes attention and potentiates learning.
Published: 2024

19. Numerical analysis of multi-scale mechanical theory of microfine magnesite powder molding

Author: Zhang, Ruinan, Liu, Zhaoyang, Pan, Songyang, Yuan, Lei, Wen, Tianpeng, and Yu, Jingkun
Published: 2024
Full Text: View/download PDF

20. The role of male hormones in bacterial infections: enhancing Staphylococcus aureus virulence through testosterone-induced Agr activation

Author: Luo, Zhaoxia, Xi, Huimin, Huang, Wei, Liu, Mei-fang, Yuan, Lei, Chen, Qiang, Xiao, Yanghua, Zhu, Qing, Zhao, Rui, and Sheng, Yi-yun
Published: 2024
Full Text: View/download PDF

21. Author Correction: Clinical application and evaluation of metagenomic next-generation sequencing in pathogen detection for suspected central nervous system infections

Author: Yuan, Lei, Zhu, Xin Yu, Lai, Lan Min, Chen, Qiang, Liu, Yang, and Zhao, Rui
Published: 2024
Full Text: View/download PDF

22. Relationship between paraspinal muscle morphology and function in different directions in a healthy Chinese population at different ages: a cross-sectional study

Author: Liu, Yinhao, Yuan, Lei, Zeng, Yan, and Ni, Jiajun
Published: 2024
Full Text: View/download PDF

23. USP36 promotes tumorigenesis and tamoxifen resistance in breast cancer by deubiquitinating and stabilizing ERα

Author: Zhuang, Ting, Zhang, Shuqing, Liu, Dongyi, Li, Zhongbo, Li, Xin, Li, Jiaoyan, Yang, Penghe, Zhang, Chenmiao, Cui, Jiayao, Fu, Mingxi, Shen, Fangyu, Yuan, Lei, Zhang, Zhao, Su, Peng, Zhu, Jian, and Yang, Huijie
Published: 2024
Full Text: View/download PDF

24. Correction: Evaluation of deep learning-based reconstruction late gadolinium enhancement images for identifying patients with clinically unrecognized myocardial infarction

Author: Lu, Xuefang, Liu, Weiyin Vivian, Yan, Yuchen, Yang, Wenbing, Liu, Changsheng, Gong, Wei, Quan, Guangnan, Jiang, Jiawei, Yuan, Lei, and Zha, Yunfei
Published: 2024
Full Text: View/download PDF

25. Clinical application and evaluation of metagenomic next-generation sequencing in pathogen detection for suspected central nervous system infections

Author: Yuan, Lei, Zhu, Xin Yu, Lai, Lan Min, Chen, Qiang, Liu, Yang, and Zhao, Rui
Published: 2024
Full Text: View/download PDF

26. Correction: Evaluation of clinical characteristics and risk factors associated with Chlamydia psittaci infection based on metagenomic next-generation sequencing

Author: Yuan, Lei, Chen, Qiang, Zhu, Xin Yu, Lai, Lan Min, Zhao, Rui, and Liu, Yang
Published: 2024
Full Text: View/download PDF

27. The mutation of Japanese encephalitis virus envelope protein residue 389 attenuates viral neuroinvasiveness

Author: Huang, Rong, He, Yajing, Zhang, Chenghua, Luo, Yue, Chen, Chen, Tan, Ning, Ren, Yang, Xu, Kui, Yuan, Lei, and Yang, Jian
Published: 2024
Full Text: View/download PDF

28. Evaluation of deep learning-based reconstruction late gadolinium enhancement images for identifying patients with clinically unrecognized myocardial infarction

Author: Lu, Xuefang, Liu, Weiyin Vivian, Yan, Yuchen, Yang, Wenbing, Liu, Changsheng, Gong, Wei, Quan, Guangnan, Jiang, Jiawei, Yuan, Lei, and Zha, Yunfei
Published: 2024
Full Text: View/download PDF

29. A randomized controlled trial to compare short-term outcomes following infragastric and infracolic omentectomy at the time of primary debulking surgery for epithelial ovarian cancer with normal-appearing omentum

Author: Dong, Xuhui, Yuan, Lei, Zou, Ruoyao, and Yao, Liangqing
Published: 2024
Full Text: View/download PDF

30. Evaluation of clinical characteristics and risk factors associated with Chlamydia psittaci infection based on metagenomic next-generation sequencing

Author: Yuan, Lei, Chen, Qiang, Zhu, Xin Yu, Lai, Lan Min, Zhao, Rui, and Liu, Yang
Published: 2024
Full Text: View/download PDF

31. CD47—a novel prognostic predicator in epithelial ovarian cancer and correlations with clinicopathological and gene mutation features

Author: Luo, Xukai, Mo, Jiahang, Zhang, Min, Huang, Wu, Bao, Yiting, Zou, Ruoyao, Yao, Liangqing, and Yuan, Lei
Published: 2024
Full Text: View/download PDF

32. Structural characterization of a low-molecular weight linear O-acetyl-glucomannan in Lilium lancifolium from Tibet and its protected H2O2-induced oxidative stress in HUVEC cells

Author: Yuan, Lei, Zhong, ZhengChang, Liu, Yu, Quan, Hong, and Lan, XiaoZhong
Published: 2024
Full Text: View/download PDF

33. Autophagy dysfunction contributes to NLRP1 inflammasome-linked depressive-like behaviors in mice

Author: Zhu, Ya-Jing, Huang, Jing, Chen, Ru, Zhang, Yu, He, Xin, Duan, Wen-Xin, Zou, Yuan-Lei, Sun, Meng-Mei, Sun, Hui-Li, Cheng, Si-Min, Wang, Hao-Chuan, Zhang, Hao, and Wu, Wen-Ning
Published: 2024
Full Text: View/download PDF

34. Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation

Author: Yuan, Lei, Chen, Feng, Zhang, Zongzhang, and Yu, Yang
Published: 2024
Full Text: View/download PDF

35. Novel predictors for identifying cervical minimal deviation adenocarcinoma patients with poor prognosis: a long-term observational study in a tertiary centre

Author: Bao, Yiting, Zhang, Hao, Huang, Wu, Luo, Xukai, Yao, Liangqing, Feng, Guannan, and Yuan, Lei
Published: 2024
Full Text: View/download PDF

36. Structure–property relations of recrystallized products from magnesium chloride ethanol solution: aiming at recycling magnesium resources from nature

Author: Sun, Qiaoyang, Jiang, Lixin, Wen, Tianpeng, Yuan, Lei, and Yu, Jingkun
Published: 2024
Full Text: View/download PDF

37. Fast Teammate Adaptation in the Presence of Sudden Policy Change

Author: Zhang, Ziqian, Yuan, Lei, Li, Lihe, Xue, Ke, Jia, Chengxing, Guan, Cong, Qian, Chao, and Yu, Yang
Subjects: Computer Science - Multiagent Systems
Abstract: In cooperative multi-agent reinforcement learning (MARL), where an agent coordinates with teammate(s) for a shared goal, it may suffer from non-stationarity caused by the policy change of teammates. Prior works mainly concentrate on policy change during the training phase or on teammates changing across episodes, ignoring the fact that teammates may change policy suddenly within an episode, which might lead to miscoordination and poor performance as a result. We formulate the problem as an open Dec-POMDP, where we control some agents to coordinate with uncontrolled teammates, whose policies could be changed within one episode. Then we develop a new framework, fast teammates adaptation (Fastap), to address the problem. Concretely, we first train versatile teammates' policies and assign them to different clusters via the Chinese Restaurant Process (CRP). Then, we train the controlled agent(s) to coordinate with the sampled uncontrolled teammates by capturing their identifications as context for fast adaptation. Finally, each agent applies its local information to anticipate the teammates' context for decision-making accordingly. This process proceeds alternately, leading to a robust policy that can adapt to any teammates during the decentralized execution phase. We show in multiple multi-agent benchmarks that Fastap outperforms multiple baselines in stationary and non-stationary scenarios. Comment: In: Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence (UAI'23), Pittsburgh, PA, 2023
Published: 2023
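
Note on entry 37: the abstract mentions assigning teammate policies to clusters via the Chinese Restaurant Process (CRP). As a hedged illustration, the sketch below samples cluster assignments from the basic CRP prior, where an item joins an existing cluster with probability proportional to that cluster's size and opens a new one with probability proportional to a concentration parameter. The function name, the `alpha` and `seed` parameters, and the use of the prior alone (with no similarity measure between policies) are assumptions for this example; it is not the Fastap implementation.

```python
import random


def crp_assign(num_items, alpha, seed=0):
    """Sample cluster assignments for num_items items from a CRP prior.

    alpha > 0 is the concentration parameter: larger values open new
    clusters more often. Returns (assignments, cluster_sizes).
    """
    rng = random.Random(seed)
    counts = []       # counts[k] is the current size of cluster k
    assignments = []  # assignments[i] is the cluster index of item i
    for _ in range(num_items):
        # Existing clusters are weighted by size, a new cluster by alpha.
        weights = counts + [alpha]
        cluster = rng.choices(range(len(weights)), weights=weights)[0]
        if cluster == len(counts):
            counts.append(1)  # open a new cluster
        else:
            counts[cluster] += 1
        assignments.append(cluster)
    return assignments, counts
```

The number of clusters is not fixed in advance, which is the usual reason for choosing a CRP when the number of distinct groups is unknown.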

38. Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers

Author: Yuan, Lei, Zhang, Zi-Qian, Xue, Ke, Yin, Hao, Chen, Feng, Guan, Cong, Li, Li-He, Qian, Chao, and Yu, Yang
Subjects: Computer Science - Multiagent Systems, Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing
Abstract: Cooperative multi-agent reinforcement learning (CMARL) has been shown to be promising for many real-world applications. Previous works mainly focus on improving coordination ability via solving MARL-specific challenges (e.g., non-stationarity, credit assignment, scalability), but ignore the policy perturbation issue when testing in a different environment. This issue hasn't been considered in problem formulation or efficient algorithm design. To address this issue, we first model the problem as a limited policy adversary Dec-POMDP (LPA-Dec-POMDP), where some coordinators from a team might accidentally and unpredictably encounter a limited number of malicious action attacks, but the regular coordinators still strive for the intended goal. Then, we propose Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers (ROMANCE), which enables the trained policy to encounter diversified and strong auxiliary adversarial attacks during training, thus achieving high robustness under various policy perturbations. Concretely, to avoid the ego-system overfitting to a specific attacker, we maintain a set of attackers, which is optimized to guarantee high attack quality and behavior diversity among the attackers. The goal of quality is to minimize the ego-system coordination effect, and a novel diversity regularizer based on sparse action is applied to diversify the behaviors among attackers. The ego-system is then paired with a population of attackers selected from the maintained attacker set, and alternately trained against the constantly evolving attackers. Extensive experiments on multiple scenarios from SMAC indicate our ROMANCE provides comparable or better robustness and generalization ability than other baselines. Comment: In: Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI'23), 2023
Published: 2023

39. Communication-Robust Multi-Agent Learning by Adaptable Auxiliary Multi-Agent Adversary Generation

Author: Yuan, Lei, Chen, Feng, Zhang, Zhongzhang, and Yu, Yang
Subjects: Computer Science - Machine Learning
Abstract: Communication can promote coordination in cooperative Multi-Agent Reinforcement Learning (MARL). Existing works mainly focus on improving the communication efficiency of agents, neglecting that real-world communication is much more challenging as there may exist noise or potential attackers. Thus the robustness of the communication-based policies becomes an emerging and serious issue that needs more exploration. In this paper, we posit that the ego system trained with auxiliary adversaries may handle this limitation and propose an adaptable method of Multi-Agent Auxiliary Adversaries Generation for robust Communication, dubbed MA3C, to obtain a robust communication-based policy. Specifically, we introduce a novel message-attacking approach that models the learning of the auxiliary attacker as a cooperative problem under a shared goal to minimize the coordination ability of the ego system, with which every information channel may suffer from distinct message attacks. Furthermore, as naive adversarial training may impede the generalization ability of the ego system, we design an attacker population generation approach based on evolutionary learning. Finally, the ego system is paired with an attacker population and then alternately trained against the continuously evolving attackers to improve its robustness, meaning that both the ego system and the attackers are adaptable. Extensive experiments on multiple benchmarks indicate that our proposed MA3C provides comparable or better robustness and generalization ability than other baselines.
Published: 2023

40. Multi-agent Continual Coordination via Progressive Task Contextualization

Author: Yuan, Lei, Li, Lihe, Zhang, Ziqian, Zhang, Fuxiang, Guan, Cong, and Yu, Yang
Subjects: Computer Science - Multiagent Systems, Computer Science - Machine Learning
Abstract: Cooperative Multi-agent Reinforcement Learning (MARL) has attracted significant attention and shown potential for many real-world applications. Previous works mainly focus on facilitating the coordination ability from different aspects (e.g., non-stationarity, credit assignment) in single-task or multi-task scenarios, ignoring the stream of tasks that appear in a continual manner. This omission leaves continual coordination unexplored in both problem formulation and efficient algorithm design. To tackle this issue, this paper proposes an approach, Multi-Agent Continual Coordination via Progressive Task Contextualization, dubbed MACPro. The key point lies in obtaining a factorized policy, using shared feature extraction layers but separate, independent task heads, each specializing in a specific class of tasks. The task heads can be progressively expanded based on the learned task contextualization. Moreover, to cater to the popular CTDE paradigm in MARL, each agent learns to predict and adopt the most relevant policy head based on local information in a decentralized manner. We show in multiple multi-agent benchmarks that existing continual learning methods fail, while MACPro is able to achieve close-to-optimal performance. More results also disclose the effectiveness of MACPro from multiple aspects like high generalization ability.
Published: 2023

41. Robust Multi-agent Communication via Multi-view Message Certification

Author: Yuan, Lei, Jiang, Tao, Li, Lihe, Chen, Feng, Zhang, Zongzhang, and Yu, Yang
Subjects: Computer Science - Multiagent Systems, Computer Science - Machine Learning
Abstract: Many multi-agent scenarios require message sharing among agents to promote coordination, which makes the robustness of multi-agent communication a pressing concern when policies are deployed in a message perturbation environment. Most relevant works tackle this issue under specific assumptions, for example that only a limited number of message channels sustain perturbations, which limits their efficiency in complex scenarios. In this paper, we take a further step addressing this issue by learning a robust multi-agent communication policy via multi-view message certification, dubbed CroMAC. Agents trained under CroMAC can obtain guaranteed lower bounds on state-action values to identify and choose the optimal action under a worst-case deviation when the received messages are perturbed. Concretely, we first model multi-agent communication as a multi-view problem, where every message stands for a view of the state. Then we extract a certificated joint message representation by a multi-view variational autoencoder (MVAE) that uses a product-of-experts inference network. For the optimization phase, we apply perturbations in the latent space of the state for a certificate guarantee. Then the learned joint message representation is used to approximate the certificated state representation during training. Extensive experiments in several cooperative multi-agent benchmarks validate the effectiveness of the proposed CroMAC.
Published: 2023

42. HMGA1 promotes the progression of esophageal squamous cell carcinoma by elevating TKT-mediated upregulation of pentose phosphate pathway

Author: Meng-Jie Liu, Yuan Zhao, Qiu-Tong Li, Xin-Yuan Lei, Kai-Yue He, Jin-Rong Guo, Jing-Yu Yang, Zhen-Hua Yan, Dan-Hui Wu, Lei Zhang, Yong-Ping Jian, and Zhi-Xiang Xu
Subjects: Cytology, QH573-671
Abstract: Esophageal squamous cell carcinoma (ESCC) possesses a poor prognosis and treatment outcome. Dysregulated metabolism contributes to unrestricted growth of multiple cancers. However, abnormal metabolism, such as highly activated pentose phosphate pathway (PPP) in the progression of ESCC remains largely unknown. Herein, we report that high-mobility group AT-hook 1 (HMGA1), a structural transcriptional factor involved in chromatin remodeling, promoted the development of ESCC by upregulating the PPP. We found that HMGA1 was highly expressed in ESCC. Elevated HMGA1 promoted the malignant phenotype of ESCC cells. Conditional knockout of HMGA1 markedly reduced 4-nitroquinoline-1-oxide (4NQO)-induced esophageal tumorigenesis in mice. Through the metabolomic analysis and the validation assay, we found that HMGA1 upregulated the non-oxidative PPP. With the transcriptome sequencing, we identified that HMGA1 upregulated the expression of transketolase (TKT), which catalyzes the reversible reaction in non-oxidative PPP to exchange metabolites with glycolytic pathway. HMGA1 knockdown suppressed the PPP by downregulating TKT, resulting in the reduction of nucleotides in ESCC cells. Overexpression of HMGA1 upregulated PPP and promoted the survival of ESCC cells by activating TKT. We further characterized that HMGA1 promoted the transcription of TKT by interacting with and enhancing the binding of transcription factor SP1 to the promoter of TKT. Therapeutics targeting TKT with an inhibitor, oxythiamine, reduced HMGA1-induced ESCC cell proliferation and tumor growth. Together, in this study, we identified a new role of HMGA1 in ESCCs by upregulating TKT-mediated activation of PPP. Our results provided a new insight into the role of HMGA1/TKT/PPP in ESCC tumorigenesis and targeted therapy.
Published: 2024

43. Multi-view Heterogeneous Graph Neural Networks for Node Classification

Author: Xi Zeng, Fang-Yuan Lei, Chang-Dong Wang, and Qing-Yun Dai
Subjects: Heterogeneous graphs, Node classification, Multi-view representation, Graph diffusion, Information technology, T58.5-58.64, Electronic computers. Computer science, QA75.5-76.95
Abstract: Recently, with graph neural networks (GNNs) becoming a powerful technique for graph representation, many excellent GNN-based models have been proposed for processing heterogeneous graphs; these are termed heterogeneous graph neural networks (HGNNs). However, existing HGNNs tend to aggregate information from either direct neighbors or those connected by short metapaths, thereby neglecting the higher-order information and global feature similarity information in heterogeneous graphs. In this paper, we propose a Multi-View Heterogeneous graph neural network (MV-HGNN) to aggregate this information. Firstly, two auxiliary views, specifically a global feature similarity view and a graph diffusion view, are generated from the original heterogeneous graph. Secondly, MV-HGNN performs two message-passing strategies to get the representation of different views. Subsequently, a transformer-based aggregator is used to get the semantic information. Finally, the representations of the three views are fused into a final composite representation. We evaluate our method on the node classification task over three commonly used heterogeneous graph datasets, and the results demonstrate that our proposed MV-HGNN significantly outperforms state-of-the-art baselines.
Published: 2024

44. Multi-agent policy transfer via task relationship modeling

Author: Qin, Rongjun, Chen, Feng, Wang, Tonghan, Yuan, Lei, Wu, Xiaoran, Kang, Yipeng, Zhang, Zongzhang, Zhang, Chongjie, and Yu, Yang
Published: 2024

45. Diclofenac sodium effectively inhibits the biofilm formation of Staphylococcus epidermidis

Author: Xi, Huimin, Luo, Zhaoxia, Liu, Mei-fang, Chen, Qiang, Zhu, Qing, Yuan, Lei, Sheng, Yi-yun, and Zhao, Rui
Published: 2024

46. Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning

Author: Guan, Cong, Chen, Feng, Yuan, Lei, Zhang, Zongzhang, and Yu, Yang
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: Utilizing messages from teammates can improve coordination in cooperative Multi-agent Reinforcement Learning (MARL). Previous works typically combine raw messages of teammates with local information as inputs for policy. However, neglecting message aggregation makes policy learning significantly less efficient. Motivated by recent advances in representation learning, we argue that efficient message aggregation is essential for good coordination in cooperative MARL. In this paper, we propose Multi-Agent communication via Self-supervised Information Aggregation (MASIA), where agents can aggregate the received messages into compact representations with high relevance to augment the local policy. Specifically, we design a permutation invariant message encoder to generate a common information-aggregated representation from messages and optimize it via reconstructing and shooting future information in a self-supervised manner. Each agent then utilizes the most relevant parts of the aggregated representation for decision-making through a novel message extraction mechanism. Furthermore, considering the potential of offline learning for real-world applications, we build offline benchmarks for multi-agent communication, which are, to our knowledge, the first of their kind. Empirical results demonstrate the superiority of our method in both online and offline settings. We also release the offline benchmarks built in this paper as a testbed for validating communication ability, to facilitate future research.
Published: 2023

47. Self-Motivated Multi-Agent Exploration

Author: Zhang, Shaowei, Cao, Jiahan, Yuan, Lei, Yu, Yang, and Zhan, De-Chuan
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: In cooperative multi-agent reinforcement learning (CMARL), it is critical for agents to achieve a balance between self-exploration and team collaboration. However, agents can hardly accomplish the team task without coordination and they would be trapped in a local optimum where easy cooperation is accessed without enough individual exploration. Recent works mainly concentrate on agents' coordinated exploration, which brings about the exponentially grown exploration of the state space. To address this issue, we propose Self-Motivated Multi-Agent Exploration (SMMAE), which aims to achieve success in team tasks by adaptively finding a trade-off between self-exploration and team cooperation. In SMMAE, we train an independent exploration policy for each agent to maximize their own visited state space. Each agent learns an adjustable exploration probability based on the stability of the joint team policy. The experiments on highly cooperative tasks in StarCraft II micromanagement benchmark (SMAC) demonstrate that SMMAE can explore task-related states more efficiently, accomplish coordinated behaviours and boost the learning performance.
Published: 2023

48. Exposure to particulate matter (PM2.5) weakens corneal defense by downregulating thrombospondin-1 and tight junction proteins

Author: Liangliang Niu, Jiamin Liu, Huan Xu, Binghui Liu, Maomao Song, Chunchun Hu, Rui Jiang, Xinghuai Sun, and Yuan Lei
Subjects: Fine particulate matter, Corneal epithelial barrier integrity, Tight junction- associated proteins, Thrombospondin-1, Environmental pollution, TD172-193.5, Environmental sciences, GE1-350
Abstract: Background: Fine particulate matter (PM2.5) induces ocular surface toxicity through pyroptosis, oxidative stress, autophagy, and inflammatory responses. However, the precise molecular pathways through which PM2.5 causes corneal damage remain unclear. This study aims to investigate the underlying mechanisms by exposing human corneal epithelial cells (HCECs) to PM2.5. Methods: After the morphology and chemical composition analysis of the PM samples, we conducted both in vivo and in vitro experiments to investigate PM2.5-induced corneal epithelial damage. We assessed corneal barrier function in HCECs using transepithelial electrical resistance (TEER) assays. To explore the molecular mechanisms of PM2.5-induced corneal epithelial damage, we performed whole-transcriptome resequencing, quantitative RT-PCR, and western blotting in vitro. In addition, we analyzed mouse corneas exposed to concentrated ambient PM2.5 through immunofluorescence staining to observe the resulting changes in corneal epithelial protein expression in vivo. Results: Our results showed significant impairment of corneal epithelial barrier function in PM2.5-treated HCECs, as indicated by decreased TEER values. The expression of thrombospondin-1 (THBS1) and claudin-1, both key factors for maintaining corneal epithelial barrier integrity, was markedly reduced at the gene and protein levels in both in vitro and in vivo PM2.5 exposure models. Moreover, the levels of tight junction-associated proteins, including occludin, zonula occludens-1 (ZO-1) and ZO-2, essential components of the corneal epithelial barrier, were significantly diminished in PM2.5-treated HCECs. Conclusion: PM2.5 exposure leads to corneal epithelium damage by disrupting tight junction proteins and THBS1 expression. These findings provide insight into potential pathways for PM2.5-induced ocular toxicity and underscore the need for protective strategies against such environmental pollutants.
Published: 2024

49. The association between airborne particulate matter (PM2.5) exposure level and primary open-angle glaucoma

Author: Yi Ma, Mingxi Shao, Shengjie Li, Yuan Lei, Wenjun Cao, and Xinghuai Sun
Subjects: Particulate matter, Air pollution, Primary open-angle glaucoma, Intraocular pressure, Visual field, Environmental pollution, TD172-193.5, Environmental sciences, GE1-350
Abstract: The eye is vulnerable to the adverse effects of air pollution. A previous experimental study found that fine particulate matter (PM2.5) had a direct toxic effect on intraocular tissues. However, clinical evidence for the impact of air pollutant exposure on functional and structural changes in glaucoma remains scarce. A total of 120 patients with primary open-angle glaucoma (POAG) who met the inclusion criteria were included in this retrospective study. Standardized ophthalmic examinations, including intraocular pressure (IOP), visual field, optical coherence tomography, and comprehensive physical examination, were performed. The air pollution data, including PM2.5 concentration and air quality index (AQI), were collected. PM2.5 and AQI for the day of the medical examination, as well as for one month and three months before the medical examination date, were investigated. In our results, higher average exposure levels over one month and three months were associated with increased IOP (r=0.229, P=0.013; r=0.204, P=0.028, respectively) and decreased visual field mean sensitivity (MS) (r=-0.212, P=0.037; r=-0.305, P=0.002, respectively). PM2.5 concentrations for the day of the medical examination were not significantly associated with ocular parameters. In multiple linear regression analysis adjusted for demographic and clinical factors, higher PM2.5 exposure for one month was associated with elevated IOP (P=0.040, β=0.173, 95% CI=0.008–0.337). We also found an association between PM2.5 and MS (one-month exposure: β=-0.160, P=0.029; three-month exposure: β=-0.238, P=0.002). The logistic regression analysis found that the three-month average PM2.5 exposure level was significantly associated with disease severity (β=0.043, P=0.025, 95% CI=1.005–1.084).
In conclusion, this study is the first to investigate the relationship between air pollution and detailed ocular parameters of POAG patients in Shanghai over a three-year period, and to explore the effects of different exposure times of PM2.5 on glaucoma. This study found that PM2.5 exposure was correlated with elevated IOP and decreased MS. The one-month PM2.5 exposure level had the most significant effects on IOP. The three-month PM2.5 exposure level was an independent risk factor for POAG severity. Current evidence suggests there may be an association between PM2.5 exposure and POAG.
Published: 2024

50. Variation in intensive care unit beds capacity in China from 2007 to 2021

Author: Yuan, Lei, Xu, Siyu, Xu, Jingmin, Cao, Jing, and Qian, Zhaoxin
Published: 2024
