647 results on '"Yang, Zhuoran"'
Search Results
202. Cohesive zone model to investigate complex soft adhesive failure: state-of-the-art review
203. Being Trustworthy is Not Enough: How Untrustworthy Artificial Intelligence (AI) Can Deceive the End-Users and Gain Their Trust
204. Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics
205. Effect of thermal aging on the scratch behavior of poly (methyl methacrylate)
206. One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration
207. False Correlation Reduction for Offline Reinforcement Learning
208. Understanding Implicit Regularization in Over-Parameterized Single Index Model.
209. Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning.
210. In vivo evaluation of intravascular lithotripsy in a healthy porcine coronary model
211. L‐Arginine‐Modified CoWO 4 /FeWO 4 S‐Scheme Heterojunction Enhances Ferroptosis Against Solid Tumor
212. COMPARISON OF THREE-YEAR OUTCOMES OF DRUG-COATED BALLOON ANGIOPLASTY IN TOTALLY OCCLUSIVE VS. NON-OCCLUSIVE IN-STENT RESTENOSIS OF DRUG-ELUTING STENTS
213. UTILIZATION AND IN-HOSPITAL OUTCOMES OF PERCUTANEOUS LEFT ATRIAL APPENDAGE OCCLUSION IN PATIENTS WITH CANCER
214. Nanomaterials as well their applications and effects in batteries
215. Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
216. A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
217. False Correlation Reduction for Offline Reinforcement Learning
218. Enhanced Interfacial Shear Debonding Resistance of Soft Material Bilayers Based on Mechanical Mismatch
219. Hollow Nanooxidase Enhanced Phototherapy Against Solid Tumors
220. Study of temperature fields and losses in high voltage cables under different layings
221. Accelerate online reinforcement learning for building HVAC control with heterogeneous expert guidances
222. l‐Arginine‐Modified CoWO4/FeWO4 S‐Scheme Heterojunction Enhances Ferroptosis against Solid Tumor.
223. Computing Independent Variable Sets for Polynomial Ideals
224. Time-temperature superposition principle for the shear fracture behaviour of soft adhesive layers: From bulk to interface
225. Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning
226. Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning
227. Relationship Between Electrical Treeing Degradation and DCIC-Q(t) Characteristics of XLPE Insulation
228. Effect of Polycyclic Aromatic Compounds Content on Electrical Tree and Partial Discharge of XLPE
229. Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games
230. Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
231. Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
232. Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning
233. Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
234. The Best of Both Worlds: Reinforcement Learning with Logarithmic Regret and Policy Switches
235. Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
236. Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach
237. Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information
238. Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency
239. Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets
240. Offline Policy Optimization in RL with Variance Regularizaton
241. Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning
242. Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
243. Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes
244. Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
245. Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
246. Study on Surface Discharge Characteristics of GO-Doped Epoxy Resin–LN2 Composite Insulation
247. Electrostatically Controlled ex Situ and in Situ Polymerization of Diacetylene-Containing Peptide Amphiphiles in Living Cells
248. Dynamic multifunctional devices enabled by ultrathin metal nanocoatings with optical/photothermal and morphological versatility
249. Investigating Inter/Intralayer Interface-Triggered Toughening Mechanisms of Three-Dimensional Printed Polylactic Acid Using Double-Notch Four-Point-Bending Method
250. Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.