Author: "Yang, Zhuoran" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yang, Zhuoran"' showing total 647 results

Start Over Author "Yang, Zhuoran"

647 results on '"Yang, Zhuoran"'

301. Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret

Author: Fei, Yingjie, Yang, Zhuoran, Chen, Yudong, Wang, Zhaoran, Xie, Qiaomin, Fei, Yingjie, Yang, Zhuoran, Chen, Yudong, Wang, Zhaoran, and Xie, Qiaomin
Abstract: We study risk-sensitive reinforcement learning in episodic Markov decision processes with unknown transition kernels, where the goal is to optimize the total reward under the risk measure of exponential utility. We propose two provably efficient model-free algorithms, Risk-Sensitive Value Iteration (RSVI) and Risk-Sensitive Q-learning (RSQ). These algorithms implement a form of risk-sensitive optimism in the face of uncertainty, which adapts to both risk-seeking and risk-averse modes of exploration. We prove that RSVI attains an $\tilde{O}\big(\lambda(|\beta| H^2) \cdot \sqrt{H^{3} S^{2}AT} \big)$ regret, while RSQ attains an $\tilde{O}\big(\lambda(|\beta| H^2) \cdot \sqrt{H^{4} SAT} \big)$ regret, where $\lambda(u) = (e^{3u}-1)/u$ for $u>0$. In the above, $\beta$ is the risk parameter of the exponential utility function, $S$ the number of states, $A$ the number of actions, $T$ the total number of timesteps, and $H$ the episode length. On the flip side, we establish a regret lower bound showing that the exponential dependence on $|\beta|$ and $H$ is unavoidable for any algorithm with an $\tilde{O}(\sqrt{T})$ regret (even when the risk objective is on the same scale as the original reward), thus certifying the near-optimality of the proposed algorithms. Our results demonstrate that incorporating risk awareness into reinforcement learning necessitates an exponential cost in $|\beta|$ and $H$, which quantifies the fundamental tradeoff between risk sensitivity (related to aleatoric uncertainty) and sample efficiency (related to epistemic uncertainty). To the best of our knowledge, this is the first regret analysis of risk-sensitive reinforcement learning with the exponential utility.
Published: 2020

302. Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory

Author: Zhang, Yufeng, Cai, Qi, Yang, Zhuoran, Chen, Yongxin, Wang, Zhaoran, Zhang, Yufeng, Cai, Qi, Yang, Zhuoran, Chen, Yongxin, and Wang, Zhaoran
Abstract: Temporal-difference and Q-learning play a key role in deep reinforcement learning, where they are empowered by expressive nonlinear function approximators such as neural networks. At the core of their empirical successes is the learned feature representation, which embeds rich observations, e.g., images and texts, into the latent space that encodes semantic structures. Meanwhile, the evolution of such a feature representation is crucial to the convergence of temporal-difference and Q-learning. In particular, temporal-difference learning converges when the function approximator is linear in a feature representation, which is fixed throughout learning, and possibly diverges otherwise. We aim to answer the following questions: When the function approximator is a neural network, how does the associated feature representation evolve? If it converges, does it converge to the optimal one? We prove that, utilizing an overparameterized two-layer neural network, temporal-difference and Q-learning globally minimize the mean-squared projected Bellman error at a sublinear rate. Moreover, the associated feature representation converges to the optimal one, generalizing the previous analysis of Cai et al. (2019) in the neural tangent kernel regime, where the associated feature representation stabilizes at the initial one. The key to our analysis is a mean-field perspective, which connects the evolution of a finite-dimensional parameter to its limiting counterpart over an infinite-dimensional Wasserstein space. Our analysis generalizes to soft Q-learning, which is further connected to policy gradient., Comment: add acknowledgement
Published: 2020

303. Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Author: Xie, Qiaomin, Chen, Yudong, Wang, Zhaoran, Yang, Zhuoran, Xie, Qiaomin, Chen, Yudong, Wang, Zhaoran, and Yang, Zhuoran
Abstract: We develop provably efficient reinforcement learning algorithms for two-player zero-sum finite-horizon Markov games with simultaneous moves. To incorporate function approximation, we consider a family of Markov games where the reward function and transition kernel possess a linear structure. Both the offline and online settings of the problems are considered. In the offline setting, we control both players and aim to find the Nash Equilibrium by minimizing the duality gap. In the online setting, we control a single player playing against an arbitrary opponent and aim to minimize the regret. For both settings, we propose an optimistic variant of the least-squares minimax value iteration algorithm. We show that our algorithm is computationally efficient and provably achieves an $\tilde O(\sqrt{d^3 H^3 T} )$ upper bound on the duality gap and regret, where $d$ is the linear dimension, $H$ the horizon and $T$ the total number of timesteps. Our results do not require additional assumptions on the sampling model. Our setting requires overcoming several new challenges that are absent in Markov decision processes or turn-based Markov games. In particular, to achieve optimism with simultaneous moves, we construct both upper and lower confidence bounds of the value function, and then compute the optimistic policy by solving a general-sum matrix game with these bounds as the payoff matrices. As finding the Nash Equilibrium of a general-sum game is computationally hard, our algorithm instead solves for a Coarse Correlated Equilibrium (CCE), which can be obtained efficiently. To our best knowledge, such a CCE-based scheme for optimism has not appeared in the literature and might be of interest in its own right., Comment: Accepted for presentation at COLT 2020
Published: 2020

304. Effect of polarity‐reversal voltage on charge accumulation and carrier mobility in silicone rubber/silicon carbide composites

Author: Du, Boxue, primary, Han, Chong, additional, Li, Zhonglei, additional, Han, Chenlei, additional, Li, Jin, additional, Xiao, Meng, additional, and Yang, Zhuoran, additional
Published: 2021
Full Text: View/download PDF

305. Group VIII (Fe, Co, and Ni) based water evaporators for solar driven interfacial water evaporation: Performance optimization, applications and recent advances.

Author: Yang, Zhuoran, Li, Wenting, Liu, Menghan, Meng, Yao, Yang, Chunyu, Qu, Fengyu, and Guo, Wei
Subjects: *SUSTAINABILITY, *EVAPORATORS, *WASTEWATER treatment, *LIGHT absorption, *GREEN business, *SALINE water conversion, *SOLAR water heaters
Abstract: Solar-driven interfacial evaporation (SDIE) is a promising technology for sustainable and efficient production of clean water, addressing the pressing issue of freshwater scarcity facing humanity. Given the high level of concern regarding this matter, it is crucial to continuously summarize the research achievements of predecessors to further advance SDIE technology. This review first discusses the key role of solar absorbers containing Group VIII elements, including Fe, Co, and Ni, in SDIE. Second, it explores strategies to improve SDIE performance, such as increasing light absorption, regulating water transport channels, implementing effective thermal management, and reducing the enthalpy of water evaporation. These strategies can effectively optimize the performance of the evaporator and improve energy utilization efficiency. Further, this review details the specific applications of SDIE technology in areas such as seawater desalination, wastewater treatment, and energy reuse. We aim to provide insights for developing stable and efficient SDIE systems, promoting their practical applications, and developing other potential applications. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

306. Fractional cyclic cohesive zone model for time-dependent fatigue behavior of soft adhesives under mode-II loading.

Author: Yang, Zhuoran, Xia, Yan, Yao, Chengbin, Zhu, Zhongmeng, Liu, Zhanli, Jiang, Han, and Wang, Kaijie
Subjects: *FATIGUE cracks, *DAMAGE models, *ADHESIVES, *VISCOELASTICITY, *CREEP (Materials), *CYCLIC loads, *IMPACT loads, *FATIGUE life
Abstract: • Fractional cyclic CZM is developed to analyze time-dependent fatigue of soft adhesive. • Nonlinear viscoelasticity and in-situ response per cycle is crucial to fatigue damage. • CZM is validated through both creep and fatigue tests of single-lap shear specimen. • Effects of cyclic period and stress amplitude on fatigue life are investigated. • As loading period grows, failure shifts from fatigue- to creep-dominating mechanism. Soft adhesives exhibit nonlinear viscoelastic behavior during cyclic loading, suggesting that their cycle-dependent fatigue behavior can be significantly affected by the time-dependent deformation behavior. Soft adhesives are frequently subject to various cyclic conditions in practical applications, presenting potential safety risks. This paper has proposed a fractional cyclic cohesive zone model (CZM) to characterize the time-dependent fatigue behavior of soft adhesives. To capture the nonlinear evolution of fatigue damage under asymmetric stress-controlled cyclic loading, a fatigue damage model that considers the contribution of in-situ stress/strain was incorporated into fractional-order viscoelastic model. The proposed CZM was validated through single-lap shear creep tests at various stress levels and fatigue tests at various stress amplitudes and loading periods. Furthermore, the impacts of stress ratio and peak stress level on fatigue failure under different loading periods were investigated using the proposed CZM. The results indicate that under a short loading period, the failure of the soft adhesives is primarily influenced by cycle-dependent fatigue damage, while under a long loading period, it is primarily affected by time-dependent creep deformation. Overall, this work offers a numerical strategy to describe the fatigue failure of nonlinear viscoelastic soft adhesives and to understand their time-dependent fatigue failure mechanism, with implications for product design and optimization. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

307. Study on Surface Discharge Characteristics of GO-Doped Epoxy Resin–LN 2 Composite Insulation.

Author: Xing, Yunqi, Chen, Yuanyuan, Yuan, Ruiyi, Yang, Zhuoran, Yao, Tianyi, Li, Jiehua, Zhu, Wenbo, and Wang, Xiaoxue
Subjects: EPOXY resins, ELECTRIC insulators & insulation, NATURAL gas pipelines, SUPERCONDUCTING cables, HIGH temperature superconductors, MATERIALS testing, ELECTRIC fields
Abstract: Superconducting power lead equipment for epoxy insulation, such as high-temperature superconducting DC power or liquefied natural gas energy pipelines, as well as high-temperature superconducting cables, has long been used in extreme environments, from liquid nitrogen temperatures to normal temperatures. It is easy to induce surface discharge and flashover under the action of strong electric field, which accelerates the insulation failure of current leads. In this paper, two-dimensional nano-material GO was used to control the electrical properties of epoxy resins. The DC surface discharge and flashover characteristics of the prepared epoxy resin–GO composite insulation materials were tested at room temperature with liquid nitrogen. The surface discharge mechanism of the epoxy resin–GO composite insulation materials was analyzed. The experimental results show that the insulation properties of epoxy composites doped with GO changed. Among them, the surface flashover voltage of 0.05 wt% material is the best, which can inhibit the discharge phenomenon and improve its insulation properties in extreme environments, from room temperature to liquid nitrogen temperature. It is found that the development process of surface discharge of composite insulating materials under liquid nitrogen is quite different from that under room temperature. Before critical flashover, the repetition rate and amplitude of surface discharge remain at a low level until critical flashover. Furthermore, the voltage of the first flashover is significantly higher than that of the subsequent flashover under the action of the desorption gas on the surface of the composite insulating material and the gasification layer produced by the discharge. Given that the surface flashover voltage of 0.05 wt% epoxy composite is the best, the research and analysis of 0.05 wt% composite is emphasized. In the future design of superconducting power lead insulation, the modification method of adding GO to epoxy resin can be considered in order to improve its insulation performance. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

308. Prediction of Resumed Production Trajectories in the Post-Epidemic Area Based on Big Power Data

Author: Li, Haijiang, primary, Wang, Xiran, additional, Zhu, Xiaojun, additional, Xu, Zihan, additional, and Yang, Zhuoran, additional
Published: 2020
Full Text: View/download PDF

309. Efficient and doubly-robust methods for variable selection and parameter estimation in longitudinal data analysis

Author: Fu, Liya, primary, Yang, Zhuoran, additional, Cai, Fengjing, additional, and Wang, You-Gan, additional
Published: 2020
Full Text: View/download PDF

310. UV-mediated synthesis of carboxymethyl cellulose/poly-N-isopropylacrylamide composite hydrogels with triple stimuli-responsive swelling performances

Author: Su, Chen, primary, Liu, Jia, additional, Yang, Zhuoran, additional, Jiang, Lei, additional, Liu, Xiufeng, additional, and Shao, Wei, additional
Published: 2020
Full Text: View/download PDF

311. Synthesis of Antibacterial Gelatin/Sodium Alginate Sponges and Their Antibacterial Activity

Author: Wen, Yanyi, primary, Yu, Bing, additional, Zhu, Zhongjie, additional, Yang, Zhuoran, additional, and Shao, Wei, additional
Published: 2020
Full Text: View/download PDF

312. Diffusion Behavior of Waterproof Glue into Silicone Rubber Insulation of Submarine Cable Joints Based on Molecular Dynamics Simulation

Author: Wang, Heyu, primary, Li, Zhonglei, additional, Du, Boxue, additional, Fan, Mingsheng, additional, Zhou, Shuofan, additional, and Yang, Zhuoran, additional
Published: 2020
Full Text: View/download PDF

313. Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator

Author: Luo, Yuwei, Yang, Zhuoran, Wang, Zhaoran, and Kolar, Mladen
Subjects: FOS: Computer and information sciences, Computer Science::Multiagent Systems, Computer Science - Machine Learning, Optimization and Control (math.OC), Statistics - Machine Learning, FOS: Mathematics, Machine Learning (stat.ML), Mathematics - Optimization and Control, Machine Learning (cs.LG)
Abstract: Multi-agent reinforcement learning has been successfully applied to a number of challenging problems. Despite these empirical successes, theoretical understanding of different algorithms is lacking, primarily due to the curse of dimensionality caused by the exponential growth of the state-action space with the number of agents. We study a fundamental problem of multi-agent linear quadratic regulator (LQR) in a setting where the agents are partially exchangeable. In this setting, we develop a hierarchical actor-critic algorithm, whose computational complexity is independent of the total number of agents, and prove its global linear convergence to the optimal policy. As LQRs are often used to approximate general dynamic systems, this paper provides an important step towards a better understanding of general hierarchical mean-field multi-agent reinforcement learning.
Published: 2019

314. Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Author: Zhang, Kaiqing, Yang, Zhuoran, and Ba��ar, Tamer
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Statistics - Machine Learning, Machine Learning (stat.ML), Computer Science - Multiagent Systems, Machine Learning (cs.LG), Multiagent Systems (cs.MA)
Abstract: Recent years have witnessed significant advances in reinforcement learning (RL), which has registered great success in solving various sequential decision-making problems in machine learning. Most of the successful RL applications, e.g., the games of Go and Poker, robotics, and autonomous driving, involve the participation of more than one single agent, which naturally fall into the realm of multi-agent RL (MARL), a domain with a relatively long history, and has recently re-emerged due to advances in single-agent RL techniques. Though empirically successful, theoretical foundations for MARL are relatively lacking in the literature. In this chapter, we provide a selective overview of MARL, with focus on algorithms backed by theoretical analysis. More specifically, we review the theoretical results of MARL algorithms mainly within two representative frameworks, Markov/stochastic games and extensive-form games, in accordance with the types of tasks they address, i.e., fully cooperative, fully competitive, and a mix of the two. We also introduce several significant but challenging applications of these algorithms. Orthogonal to the existing reviews on MARL, we highlight several new angles and taxonomies of MARL theory, including learning in extensive-form games, decentralized MARL with networked agents, MARL in the mean-field regime, (non-)convergence of policy-based methods for learning in games, etc. Some of the new angles extrapolate from our own research endeavors and interests. Our overall goal with this chapter is, beyond providing an assessment of the current state of the field on the mark, to identify fruitful future research directions on theoretical studies of MARL. We expect this chapter to serve as continuing stimulus for researchers interested in working on this exciting while challenging topic., Invited Chapter in Handbook on RL and Control (Springer Studies in Systems, Decision and Control); Proofread version from the Publisher
Published: 2019

315. Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games

Author: Fu, Zuyue, Yang, Zhuoran, Chen, Yongxin, and Wang, Zhaoran
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Optimization and Control (math.OC), Computer Science - Computer Science and Game Theory, Statistics - Machine Learning, FOS: Mathematics, Machine Learning (stat.ML), Computer Science - Multiagent Systems, Mathematics - Optimization and Control, Computer Science and Game Theory (cs.GT), Machine Learning (cs.LG), Multiagent Systems (cs.MA)
Abstract: We study discrete-time mean-field Markov games with infinite numbers of agents where each agent aims to minimize its ergodic cost. We consider the setting where the agents have identical linear state transitions and quadratic cost functions, while the aggregated effect of the agents is captured by the population mean of their states, namely, the mean-field state. For such a game, based on the Nash certainty equivalence principle, we provide sufficient conditions for the existence and uniqueness of its Nash equilibrium. Moreover, to find the Nash equilibrium, we propose a mean-field actor-critic algorithm with linear function approximation, which does not require knowing the model of dynamics. Specifically, at each iteration of our algorithm, we use the single-agent actor-critic algorithm to approximately obtain the optimal policy of the each agent given the current mean-field state, and then update the mean-field state. In particular, we prove that our algorithm converges to the Nash equilibrium at a linear rate. To the best of our knowledge, this is the first success of applying model-free reinforcement learning with function approximation to discrete-time mean-field Markov games with provable non-asymptotic global convergence guarantees.
Published: 2019

316. Sample Elicitation

Author: Wei, Jiaheng, Fu, Zuyue, Liu, Yang, Li, Xingyu, Yang, Zhuoran, and Wang, Zhaoran
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computer Science and Game Theory, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG), Computer Science and Game Theory (cs.GT)
Abstract: It is important to collect credible training samples $(x,y)$ for building data-intensive learning systems (e.g., a deep learning system). Asking people to report complex distribution $p(x)$, though theoretically viable, is challenging in practice. This is primarily due to the cognitive loads required for human agents to form the report of this highly complicated information. While classical elicitation mechanisms apply to eliciting a complex and generative (and continuous) distribution $p(x)$, we are interested in eliciting samples $x_i \sim p(x)$ from agents directly. We coin the above problem "sample elicitation". This paper introduces a deep learning aided method to incentivize credible sample contributions from self-interested and rational agents. We show that with an accurate estimation of a certain $f$-divergence function we can achieve approximate incentive compatibility in eliciting truthful samples. We then present an efficient estimator with theoretical guarantees via studying the variational forms of the $f$-divergence function. We also show a connection between this sample elicitation problem and $f$-GAN, and how this connection can help reconstruct an estimator of the distribution based on collected samples. Experiments on synthetic data, MNIST, and CIFAR-10 datasets demonstrate that our mechanism elicits truthful samples. Our implementation is available at https://github.com/weijiaheng/Credible-sample-elicitation.git., To appear at AISTATS 2021
Published: 2019

317. Fast Multi-Agent Temporal-Difference Learning via Homotopy Stochastic Primal-Dual Optimization

Author: Ding, Dongsheng, Wei, Xiaohan, Yang, Zhuoran, Wang, Zhaoran, and Jovanovi��, Mihailo R.
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Optimization and Control (math.OC), FOS: Mathematics, Computer Science - Multiagent Systems, Mathematics - Optimization and Control, Machine Learning (cs.LG), Multiagent Systems (cs.MA)
Abstract: We study the policy evaluation problem in multi-agent reinforcement learning where a group of agents, with jointly observed states and private local actions and rewards, collaborate to learn the value function of a given policy via local computation and communication over a connected undirected network. This problem arises in various large-scale multi-agent systems, including power grids, intelligent transportation systems, wireless sensor networks, and multi-agent robotics. When the dimension of state-action space is large, the temporal-difference learning with linear function approximation is widely used. In this paper, we develop a new distributed temporal-difference learning algorithm and quantify its finite-time performance. Our algorithm combines a distributed stochastic primal-dual method with a homotopy-based approach to adaptively adjust the learning rate in order to minimize the mean-square projected Bellman error by taking fresh online samples from a causal on-policy trajectory. We explicitly take into account the Markovian nature of sampling and improve the best-known finite-time error bound from $O(1/\sqrt{T})$ to~$O(1/T)$, where $T$ is the total number of iterations., 29 pages, 4 figures
Published: 2019

318. Robust One-Bit Recovery via ReLU Generative Networks: Near-Optimal Statistical Rate and Global Landscape Analysis

Author: Qiu, Shuang, Wei, Xiaohan, and Yang, Zhuoran
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, FOS: Mathematics, Mathematics - Statistics Theory, Machine Learning (stat.ML), Statistics Theory (math.ST), Machine Learning (cs.LG)
Abstract: We study the robust one-bit compressed sensing problem whose goal is to design an algorithm that faithfully recovers any sparse target vector $\theta_0\in\mathbb{R}^d$ \textit{uniformly} via $m$ quantized noisy measurements. Specifically, we consider a new framework for this problem where the sparsity is implicitly enforced via mapping a low dimensional representation $x_0 \in \mathbb{R}^k$ through a known $n$-layer ReLU generative network $G:\mathbb{R}^k\rightarrow\mathbb{R}^d$ such that $\theta_0 = G(x_0)$. Such a framework poses low-dimensional priors on $\theta_0$ without a known sparsity basis. We propose to recover the target $G(x_0)$ solving an unconstrained empirical risk minimization (ERM). Under a weak \textit{sub-exponential measurement assumption}, we establish a joint statistical and computational analysis. In particular, we prove that the ERM estimator in this new framework achieves a statistical rate of $m=\widetilde{\mathcal{O}}(kn \log d /\varepsilon^2)$ recovering any $G(x_0)$ uniformly up to an error $\varepsilon$. When the network is shallow (i.e., $n$ is small), we show this rate matches the information-theoretic lower bound up to logarithm factors of $\varepsilon^{-1}$. From the lens of computation, we prove that under proper conditions on the network weights, our proposed empirical risk, despite non-convexity, has no stationary point outside of small neighborhoods around the true representation $x_0$ and its negative multiple; furthermore, we show that the global minimizer of the empirical risk stays within the neighborhood around $x_0$ rather than its negative multiple under further assumptions on the network weights.
Published: 2019
Full Text: View/download PDF

319. Neural Policy Gradient Methods: Global Optimality and Rates of Convergence

Author: Wang, Lingxiao, Cai, Qi, Yang, Zhuoran, and Wang, Zhaoran
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Optimization and Control (math.OC), FOS: Mathematics, Machine Learning (stat.ML), Mathematics - Optimization and Control, Machine Learning (cs.LG)
Abstract: Policy gradient methods with actor-critic schemes demonstrate tremendous empirical successes, especially when the actors and critics are parameterized by neural networks. However, it remains less clear whether such "neural" policy gradient methods converge to globally optimal policies and whether they even converge at all. We answer both the questions affirmatively in the overparameterized regime. In detail, we prove that neural natural policy gradient converges to a globally optimal policy at a sublinear rate. Also, we show that neural vanilla policy gradient converges sublinearly to a stationary point. Meanwhile, by relating the suboptimality of the stationary points to the representation power of neural actor and critic classes, we prove the global optimality of all stationary points under mild regularity conditions. Particularly, we show that a key to the global optimality and convergence is the "compatibility" between the actor and critic, which is ensured by sharing neural architectures and random initializations across the actor and critic. To the best of our knowledge, our analysis establishes the first global optimality and convergence guarantees for neural policy gradient methods., Comment: 71 pages. The first two authors contribute equally
Published: 2019
Full Text: View/download PDF

320. Convergent Policy Optimization for Safe Reinforcement Learning

Author: Yu, Ming, Yang, Zhuoran, Kolar, Mladen, and Wang, Zhaoran
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, MathematicsofComputing_NUMERICALANALYSIS, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: We study the safe reinforcement learning problem with nonlinear function approximation, where policy optimization is formulated as a constrained optimization problem with both the objective and the constraint being nonconvex functions. For such a problem, we construct a sequence of surrogate convex constrained optimization problems by replacing the nonconvex functions locally with convex quadratic functions obtained from policy gradient estimators. We prove that the solutions to these surrogate problems converge to a stationary point of the original nonconvex problem. Furthermore, to extend our theoretical results, we apply our algorithm to examples of optimal control and multi-agent reinforcement learning with safety constraints.
Published: 2019
Full Text: View/download PDF

321. Provably Efficient Exploration in Policy Optimization

Author: Cai, Qi, Yang, Zhuoran, Jin, Chi, and Wang, Zhaoran
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Optimization and Control (math.OC), FOS: Mathematics, Machine Learning (stat.ML), Mathematics - Optimization and Control, Machine Learning (cs.LG)
Abstract: While policy-based reinforcement learning (RL) achieves tremendous successes in practice, it is significantly less understood in theory, especially compared with value-based RL. In particular, it remains elusive how to design a provably efficient policy optimization algorithm that incorporates exploration. To bridge such a gap, this paper proposes an Optimistic variant of the Proximal Policy Optimization algorithm (OPPO), which follows an ``optimistic version'' of the policy gradient direction. This paper proves that, in the problem of episodic Markov decision process with linear function approximation, unknown transition, and adversarial reward with full-information feedback, OPPO achieves $\tilde{O}(\sqrt{d^2 H^3 T} )$ regret. Here $d$ is the feature dimension, $H$ is the episode horizon, and $T$ is the total number of steps. To the best of our knowledge, OPPO is the first provably efficient policy optimization algorithm that explores., Comment: We have fixed a technical issue in the first version of this paper. We remark the technical assumption of the linear MDP in this version of the paper is different from that in the first version
Published: 2019
Full Text: View/download PDF

322. Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

Author: Jin, Wanxin, Wang, Zhaoran, Yang, Zhuoran, Mou, Shaoshuai, Jin, Wanxin, Wang, Zhaoran, Yang, Zhuoran, and Mou, Shaoshuai
Abstract: This paper develops a Pontryagin Differentiable Programming (PDP) methodology, which establishes a unified framework to solve a broad class of learning and control tasks. The PDP distinguishes from existing methods by two novel techniques: first, we differentiate through Pontryagin's Maximum Principle, and this allows to obtain the analytical derivative of a trajectory with respect to tunable parameters within an optimal control system, enabling end-to-end learning of dynamics, policies, or/and control objective functions; and second, we propose an auxiliary control system in the backward pass of the PDP framework, and the output of this auxiliary control system is the analytical derivative of the original system's trajectory with respect to the parameters, which can be iteratively solved using standard control tools. We investigate three learning modes of the PDP: inverse reinforcement learning, system identification, and control/planning. We demonstrate the capability of the PDP in each learning mode on different high-dimensional systems, including multi-link robot arm, 6-DoF maneuvering quadrotor, and 6-DoF rocket powered landing., Comment: Published in NeurIPS 2020, Codes are at https://github.com/wanxinjin/Pontryagin-Differentiable-Programming
Published: 2019

323. Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances

Author: Zhang, Kaiqing, Yang, Zhuoran, Başar, Tamer, Zhang, Kaiqing, Yang, Zhuoran, and Başar, Tamer
Abstract: Multi-agent reinforcement learning (MARL) has long been a significant and everlasting research topic in both machine learning and control. With the recent development of (single-agent) deep RL, there is a resurgence of interests in developing new MARL algorithms, especially those that are backed by theoretical analysis. In this paper, we review some recent advances a sub-area of this topic: decentralized MARL with networked agents. Specifically, multiple agents perform sequential decision-making in a common environment, without the coordination of any central controller. Instead, the agents are allowed to exchange information with their neighbors over a communication network. Such a setting finds broad applications in the control and operation of robots, unmanned vehicles, mobile sensor networks, and smart grid. This review is built upon several our research endeavors in this direction, together with some progresses made by other researchers along the line. We hope this review to inspire the devotion of more research efforts to this exciting yet challenging area., Comment: This is a invited submission to a Special Issue of the Journal of Frontiers of Information Technology & Electronic Engineering (FITEE). Most of the contents are based on the Sec. 4 in our recent overview arXiv:1911.10635, with focus on the setting of decentralized MARL with networked agents
Published: 2019

324. Generalized estimating equations for analyzing multivariate survival data.

Author: Liya, Fu, Yang, Zhuoran, Zhang, Jun, Long, Anle, and Zhou, Yan
Subjects: *GENERALIZED estimating equations, *SURVIVAL analysis (Biometry), *CENSORING (Statistics)
Abstract: Generalized estimating equations (GEE) approach has been used to estimate the parameters in semiparametric accelerated failure time (AFT) models with clustered and censored data. However, the working correlation model has a substantial impact on estimator efficiency when using the GEE method. This article proposes a general correlation model to incorporate the correlations among the clustered and censored data and protect against avoidable loss of efficiency associated with misspecified correlation structure. The proposed estimator is consistent and asymptotically normal. Simulation studies are carried out to demonstrate the effectiveness of the proposed method. Finally, a real dataset from a toxicology study is analyzed for illustration. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

325. A Communication-Efficient Multi-Agent Actor-Critic Algorithm for Distributed Reinforcement Learning

Author: Lin, Yixuan, primary, Zhang, Kaiqing, additional, Yang, Zhuoran, additional, Wang, Zhaoran, additional, Basar, Tamer, additional, Sandhu, Romeil, additional, and Liu, Ji, additional
Published: 2019
Full Text: View/download PDF

326. Efficient parameter estimation for multivariate accelerated failure time model via the quadratic inference functions method

Author: Fu, Liya, primary, Yang, Zhuoran, additional, Zhao, Mingtao, additional, and Zhou, Yan, additional
Published: 2019
Full Text: View/download PDF

327. Learning Partially Observable Markov Decision Processes Using Coupled Canonical Polyadic Decomposition

Author: Huang, Kejun, primary, Yang, Zhuoran, additional, Wang, Zhaoran, additional, and Hong, Mingyi, additional
Published: 2019
Full Text: View/download PDF

328. Design of Single Channel Speech Separation System Based on Deep Clustering Model

Author: Zhao, Wei, primary, Sun, Yuanyuan, additional, Yang, Zhuoran, additional, and Li, Haozhen, additional
Published: 2019
Full Text: View/download PDF

329. Generalized estimating equations for analyzing multivariate survival data

Author: Liya, Fu, primary, Yang, Zhuoran, additional, Zhang, Jun, additional, Long, Anle, additional, and Zhou, Yan, additional
Published: 2019
Full Text: View/download PDF

330. Quantitatively analyzing effects of meteorology and PM2.5 sources on low visual distance

Author: Guan, Liao, primary, Liang, Yongli, additional, Tian, Yingze, additional, Yang, Zhuoran, additional, Sun, Yueming, additional, and Feng, Yinchang, additional
Published: 2019
Full Text: View/download PDF

331. Conductivity and Surface Charge Properties of HTV SiR/SiC Composites under Impulse Superimposed on DC Voltage at Different Temperatures

Author: Han, Chong, primary, Li, Zhonglei, additional, Yang, Zhuoran, additional, Xiao, Meng, additional, and Du, Boxue, additional
Published: 2019
Full Text: View/download PDF

332. Improving the Electric Field Distribution in Stress Cone of HTS DC Cable Terminals by Nonlinear Conductive Epoxy/ZnO Composites

Author: Li, Zhonglei, primary, Yang, Zhuoran, additional, Xing, Yunqi, additional, Zhu, Wenbo, additional, Su, Jingang, additional, Kong, Xiaoxiao, additional, Jiang, Jinpeng, additional, and Du, Boxue, additional
Published: 2019
Full Text: View/download PDF

333. Surface Charge Transport Characteristics of ZnO/Silicone Rubber Composites Under Impulse Superimposed on DC Voltage

Author: Li, Zhonglei, primary, Yang, Zhuoran, additional, and Du, Boxue, additional
Published: 2019
Full Text: View/download PDF

334. Contrastive Learning from Pairwise Measurements

Author: Chen, Yi, Yang, Zhuoran, Xie, Yuchen, Wang, Zhaoran, Chen, Yi, Yang, Zhuoran, Xie, Yuchen, and Wang, Zhaoran
Abstract: Learning from pairwise measurements naturally arises from many applications, such as rank aggregation, ordinal embedding, and crowdsourcing. However, most existing models and algorithms are susceptible to potential model misspecification. In this paper, we study a semiparametric model where the pairwise measurements follow a natural exponential family distribution with an unknown base measure. Such a semiparametric model includes various popular parametric models, such as the Bradley-Terry-Luce model and the paired cardinal model, as special cases. To estimate this semiparametric model without specifying the base measure, we propose a data augmentation technique to create virtual examples, which enables us to define a contrastive estimator. In particular, we prove that such a contrastive estimator is invariant to model misspecification within the natural exponential family, and moreover, attains the optimal statistical rate of convergence up to a logarithmic factor. We provide numerical experiments to corroborate our theory. © 2018 Curran Associates Inc.All rights reserved.
Published: 2018

335. Nanocomposite for Space Charge Suppression in HVDC Cable Accessory

Author: Du, Boxue, Li, Jin, Yang, Zhuoran, Du, Boxue, Li, Jin, and Yang, Zhuoran
Abstract: HVDC cable accessories made of ethylene-vinyl acetate copolymer (EVA) by incorporation of specific fillers have to face the problem of space charge accumulation. The effects of doping contents on the space charge behaviors of EVA/ZnO composite are not completely clear. EVA composites are prepared with the fraction of 0, 1, 5 and 10 wt%, respectively, with which 5 wt% nano-sized plus 5 wt% micro-sized ZnO-doped samples are chosen for comparison. Obtained results show that the particles in EVA composite are in homodisperse. The permittivity is increased by ZnO doping and the dissipation factor of EVA composites with 1 and 5 wt% nanoparticles is lower at the lower frequencies. The homocharge injection occurs in cathode instead of anode when ZnO nanoparticles are introduced and 5 wt% nanoparticle doping performs well in suppressing space charge injection. The electric field in the 5 wt% nanoparticle-doped EVA distributes more uniformly under the high electric stress than that of others. During the depolarization procedure, the total remnant charges of 10 wt% doped samples are the least in the final. The above results are well explained by the DC conduction, apparent mobility and trap distribution characteristics.
Published: 2018
Full Text: View/download PDF

336. Networked Multi-Agent Reinforcement Learning in Continuous Spaces

Author: Zhang, Kaiqing, primary, Yang, Zhuoran, additional, and Basar, Tamer, additional
Published: 2018
Full Text: View/download PDF

337. A Finite Sample Analysis of the Actor-Critic Algorithm

Author: Yang, Zhuoran, primary, Zhang, Kaiqing, additional, Hong, Mingyi, additional, and Basar, Tamer, additional
Published: 2018
Full Text: View/download PDF

338. Promotion Effect of Sound Image Location by Unity Audio-Visual Interaction

Author: Zhao, Wei, primary, He, Qifei, additional, Yang, Zhuoran, additional, and Chen, Zihan, additional
Published: 2018
Full Text: View/download PDF

339. Effect of Field-dependent Conductivity on Surface Charge Decay in ZnO/Silicone Rubber Composites

Author: Li, Zhonglei, primary, Yang, Zhuoran, additional, Du, Boxue, additional, Xiao, Mi, additional, Su, Jingang, additional, Zhu, Wenbo, additional, Zhu, Lewei, additional, and Kong, Xiaoxiao, additional
Published: 2018
Full Text: View/download PDF

340. Improving Electric Field Distribution in HTS DC Cable Terminals by Nonlinear Conductive Epoxy/ZnO Composites

Author: Li, Zhonglei, primary, Zhu, Wenbo, additional, Yang, Zhuoran, additional, Su, Jingang, additional, Kong, Xiaoxiao, additional, and Du, Boxue, additional
Published: 2018
Full Text: View/download PDF

341. Improving surface quality in microcutting of 10B/Al composite.

Author: Wang, Xingjun, Yang, Zhuoran, Xu, Feifei, and Wang, Liping
Subjects: *BORON, *GAUSSIAN distribution, *CUTTING tools
Abstract: Purpose: The microcutting performance of the 10B/Al composite is significantly poor because of the existence of hard boron particles. The effects of cutting parameters, including uncut chip thickness and cutting speed, on the material removal mechanism and surface generation are investigated to improve the surface quality. Design/methodology/approach: The 2D finite element model, which includes a rigid cutting tool, a reinforced phase, a matrix and a dense layer, is established. The effects of uncut chip thickness on material removal mechanism and surface generation are analyzed from a probabilistic perspective. The relationship between the uncut chip thickness and the probability in which the machined surface will have a better surface quality is constructed. A Gaussian distribution formula is applied to describe the machined surface quality. Findings: Two representative particle-removal modes, namely, cutting-through and pulling-out modes, are observed. For cutting-through mode, when the relative cutting location is small, better surface quality is obtained. For pulling-out mode, the quality of the machined surface gradually improves because the further increase of the relative cutting location reduces the height of the generated pit and scratches. The microcutting at high cutting speed tends to suppress the scratch phenomenon. The best surface quality will be obtained at small uncut chip thickness and high cutting speed. Originality/value: The surface quality generated in microcutting of the 10B/Al composite can be improved by optimizing the cutting parameters and controlling the particle-removal modes based on the proposed Gaussian distribution formula. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

342. Sparse nonlinear regression: Parameter estimation under nonconvexity

Author: Yang, Zhuoran, Wang, Zhaoran, Liu, Han, Eldar, Yonina C., Zhang, Tong, Yang, Zhuoran, Wang, Zhaoran, Liu, Han, Eldar, Yonina C., and Zhang, Tong
Abstract: We study parameter estimation for sparse nonlinear regression. More specifically, we assume the data are given by y = f(xTβ) + e, where / is nonlinear. To recover β, we propose an ℓ1- regularized least-squares estimator. Unlike classical linear regression, the corresponding optimization problem is nonconvex because of the nonlin- earityof ℓ. In spite of the nonconvexity, we prove that under mild conditions, every stationary point of the objective enjoys an optimal statistical rate of convergence. Detailed numerical results are provided to back up our theory.copyright © 2016 by the author(s).
Published: 2016

343. Enhancing interfacial shear debonding resistance by mechanical mismatch.

Author: Zhu, Zhongmeng, Yang, Zhuoran, Yang, Fan, Yao, Chengbin, and Jiang, Han
Subjects: *DEBONDING, *SHEAR (Mechanics), *INTERFACIAL resistance, *INTERFACIAL bonding, *SOFT robotics, *STRAIN energy
Abstract: • Mechanical matching is not the optimal solution for high shear debonding resistance. • Shear debonding resistance improves monotonically with increasing loading layer stiffness. • Increased strain energy storage of loading layer and shear lag length attribute to the improvement. • Local non-uniform deformation causes insignificant influence on shear debonding resistance. • Mechanical mismatch based strategies for enhancing shear debonding resistance are proposed. Layered systems of similar/dissimilar soft materials are widely used in advanced device applications, including bioelectronics, soft robotics, and energy generators. Device designers strive to minimize the mechanical mismatch between the soft material layers, claiming that the mismatch could be associated with low interfacial debonding resistance and then cause device malfunction. Although strong interfacial bonding has been repeatedly achieved based on the principle of minimizing-mechanical-mismatch, the specific question of whether mechanical matching is the optimal solution for high debonding resistance has never been solidly answered, leaving ambiguity in the further optimization of the layered soft material systems. In this paper, the influence of the stiffness ratio between the soft material layers on the interfacial shear debonding resistance is systematically analyzed through theoretical modeling and finite element simulation approaches. Monotonic improvement of the shear debonding resistance with increasing the loading layer stiffness, even when exceeding that of the substrate layer, is analytically derived by establishing a shear debonding theory of a hyperelastic bilayer. This mechanical-mismatch induced improvement is due to the effective consumption of the external work by the strain energy storage of the loading layer and the large increase in the shear lag length. The theoretical predictions of the debonding resistance agree well with the finite element simulation results, which implies the local non-uniform deformation in the loading and substrate layers near the interfacial crack front causes insignificant influence on the shear debonding. Increasing the modulus or thickness of the loading layer or adding a stiff backing can equivalently improve the shear debonding resistance, which can be quantitatively predicted by the established shear debonding theory. Reasonable applications of the proposed debonding resistance enhancement strategy are also discussed. The findings provide good guidance for the safe service and further optimization of the layered soft material systems to improve the device reliability. [Display omitted] [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

344. Counterintuitive delayed failure of soft adhesive system under shear holding.

Author: Yang, Zhuoran, Zhu, Zhongmeng, Yao, Chengbin, Xia, Yan, and Jiang, Han
Subjects: *SHEAR (Mechanics), *CREEP (Materials), *DEAD loads (Mechanics), *RELAXATION phenomena, *MOLECULAR force constants, *ADHESIVES, *ADHESIVE joints
Abstract: • Viscoelastic CZM used to describe soft adhesive behavior under displacement holding. • Identification of local creep of soft adhesive and macroscopic relaxation phenomena. • Observation of shear lag length expansion during displacement holding. • Mechanism of delayed failure attributed to creep driven by stored energy in adherend. • Parameters discussed to mitigate delayed failure in soft adhesive structures. Soft adhesive structures are widely employed in cutting-edge applications due to their unique mechanical properties. However, the phenomenon of delayed failure resulting from static loading is often overlooked in these scenarios. In particular, the counterintuitive delayed failure of the soft adhesive structure under shear displacement holding, where no additional energy is input into the soft adhesive structure during the macroscopic relaxation process, poses a challenge for comprehension. To address this, this work employs a time-dependent cohesive zone model (CZM) to characterize the viscoelastic behavior of the soft adhesive within the soft adhesive structure during displacement holding. Two adhesive systems, namely the hard adherend bonded to the hard substrate with soft adhesive (HSH) and the soft adherend bonded to the hard substrate with soft adhesive (SSH), were investigated both experimentally and numerically. The adopted CZM was validated through single-lap shear tests on HSH, encompassing single loading, constant force, and displacement holding experiments. Subsequently, the delayed failure of the local soft adhesive in SSH was investigated, and the deformation field of the soft adherend during the holding process was observed experimentally using the particle tracking method. The results reveal that the delayed failure of SSH is attributed to the occurrence of local creep in the soft adhesive at the crack front, despite the evident macroscopic relaxation phenomenon observed during displacement holding. Aiming to eliminate the delayed failure of the soft adhesive structure, the importance of the adherend to adhesive modulus ratio, viscoelastic properties, and loading conditions were discussed to address the delayed failure of the soft adhesive structure. By shedding light on the overlooked aspect of delayed failure in soft adhesive structures, this study presents a novel perspective that extends beyond conventional static loading analyses. The understanding of delayed failure mechanisms provides valuable guidance for enhancing the performance of soft adhesive structures. [Display omitted] [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

345. Androgen-induced miR-135a acts as a tumor suppressor through downregulating RBAK and MMP11, and mediates resistance to androgen deprivation therapy

Author: Wan, Xuechao, primary, Pu, Honglei, additional, Huang, Wenhua, additional, Yang, Shu, additional, Zhang, Yalong, additional, Kong, Zhe, additional, Yang, Zhuoran, additional, Zhao, Peiqing, additional, Li, Ao, additional, Li, Tao, additional, and Li, Yao, additional
Published: 2016
Full Text: View/download PDF

346. Design of AP Network Subsystem Based on WLAN System

Author: Yang, Zhuoran, primary and Zhang, Yanmei, primary
Published: 2016
Full Text: View/download PDF

347. The Sample Complexity of Online Contract Design.

Author: Zhu, Banghua, Bates, Stephen, Yang, Zhuoran, Wang, Yixin, Jiao, Jiantao, and Jordan, Michael I.
Subjects: CONTRACT theory, TRANSACTION costs, CONTRACTS, LEARNING contracts, LIBERTY of contract
Abstract: Contract theory studies the interactions between a principal and an agent when the two parties transact in the presence of private information [Bolton and Dewatripont, 2004, Faure-Grimaud et al., 2001, Salanié, 2005]. The principal would like to achieve her desired outcomes by hiring agents to work for her. The agent wishes to make money by working for the principal. They develop agreements in the form of a contract, which specifies how much the principal would pay under the different possible outcomes of the agent's work. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

348. The Ultrasonic In-line Inspection System of Pipelines Based on Spectrum Estimation on-line

Author: Tian Xiaoping, Yang Zhuoran, Sheng Sha, Dai Bo, and Xie Zurong
Subjects: Pipeline transport, Engineering, Automated X-ray inspection, Channel (digital image), business.industry, Acoustics, Pipeline (computing), Ultrasonic testing, Electronic engineering, Spectral density estimation, Ultrasonic sensor, business, Power (physics)
Abstract: The ultrasonic detection is one of the important ways to inspect the wall-loss defects on-line for oil pipeline. The on-line power spectral estimation what characterized short time serial, higher resolution and anti-noise was a key technology to in-line inspect the pipeline. Based on power spectral estimation on-line, the digital multi-channels ultrasonic inspection system with 20 probes was developed. The system consists of main and sub-structure. Inspection board communicated with embedded computer with PCI bus and the numbers of channel was expended by means of high-speed switch. The ultrasonic echo signal was processed with Burg maximum entropy spectrum estimation on-line. The test shows that the inspection system could inspect the inner and outer defects and is suitable for on-line detection of pipeline corrosion. It will be used widely in practice.
Published: 2006
Full Text: View/download PDF

349. l‐Arginine‐Modified CoWO4/FeWO4S‐Scheme Heterojunction Enhances Ferroptosis against Solid Tumor

Author: Yang, Zhuoran, Yang, Chunyu, Yang, Dan, Zhang, Ye, Yang, Qingzhu, Qu, Fengyu, and Guo, Wei
Abstract: Ferroptosis has recently attracted much attention as an anti‐tumor therapy. Evidence suggests that ferroptosis can induce oxidative stress and accumulation of lethal lipid peroxides in cancer cells, leading to cell damage. However, unsuitable pH, H2O2levels, and high glutathione (GSH) expression in the tumor microenvironment hinder the development of ferroptosis‐mediated therapy. In this study, an l‐arginine (l‐arg)‐modified CoWO4/FeWO4(CFW) S‐scheme heterojunction is strategically designed and constructed for ultrasound (US)‐triggered sonodynamic‐ and gas therapy‐induced ferroptosis. CFW not only has excellent Fenton‐catalytic activity, outstanding GSH consumption capacity, and excellent ability to overcome tumor hypoxia, but its S‐scheme heterostructure can also avoid the rapid combination of electron (e) and hole (h+) pairs, thereby enhancing the sonodynamic effects. As a precursor of nitric oxide (NO), l‐arg is modified on the surface of CFW (CFW@l‐arg) to achieve controlled NO release under US irradiation, thereby enhancing ferroptosis. In addition, poly(allylamine hydrochloride) is further modified on the surface of CFW@l‐arg to stabilize l‐arg and achieve controllable NO release. Both in vitro and in vivo results demonstrate that such a multifunctional therapeutic nanoplatform can achieve high therapeutic efficacy through sonodynamic and gas therapy‐enhanced ferroptosis. This designed oncotherapy nanoplatform provides new inspiration for ferroptosis‐mediated therapy. A CoWO4/FeWO4S‐scheme heterojunction that exhibits excellent Fenton‐catalytic activity, outstanding glutathione consumption capacity, and admirable ability to effectively separate electron and hole pairs is reported. After modification with l‐arginine and polyelectrolyte, such a multifunctional therapeutic nanoplatform can achieve high therapeutic efficacy through sonodynamic and gas therapy‐enhanced ferroptosis.
Published: 2023
Full Text: View/download PDF

350. Friction Reduction and Motion Stability of Setal Array Sliding on Rough Surface.

Author: Cheng, Qian, Yang, Fan, Yang, Zhuoran, Jiang, Han, Zhu, Zhongmeng, Liu, Tianyuan, and Cai, Hongbin
Abstract: Setal arrays provide insects with the good capability to walk fast and stably on rough surfaces, making them important research objects in the field of low-friction bionic design. The low-friction mechanism of the setal array is commonly attributed to the reduction of the actual contact area. However, the effect of the significant structural large deflection deformation of the seta on its friction behavior on rough surfaces has not been comprehensively studied. In this study, an analytical model that can describe the true deformation process and friction behavior of the seta on rough surfaces was developed. Accordingly, the friction behavior of the setal array sliding on rough surfaces was systematically analyzed. It's been found that the large deflection deformation of a single seta reduces friction by decreasing the apparent tangential and normal stiffness. The superposition of the setae asynchronous deformation at different positions in the array reduces the vibration of the array's apparent friction force and improves motion stability. These friction properties can be further optimized by modulating the seta density. The applicability of high-density setal arrays in bionic transportation is also experimentally demonstrated. This paper provides an important guide to revealing the biological friction mechanism and design of the bionic structures. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

647 results on '"Yang, Zhuoran"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources