Search

Your search for "Bedi, P" returned 52 results in total

Search Constraints

You searched for: Author "Bedi, P"; Topic: computer science - machine learning

Search Results

1. Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

2. ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

3. On The Global Convergence Of Online RLHF With Neural Parametrization

4. rECGnition_v1.0: Arrhythmia detection using cardiologist-inspired multi-modal architecture incorporating demographic attributes in ECG

5. On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

6. AIME: AI System Optimization via Multiple LLM Evaluators

7. meds_reader: A fast and efficient EHR processing library

8. CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk

9. SAIL: Self-Improving Efficient Online Alignment of Large Language Models

10. Multi-LLM QA with Embodied Exploration

11. DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

12. Transfer Q Star: Principled Decoding for LLM Alignment

13. FACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?

14. Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

15. PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

16. Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

17. MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

18. REBEL: A Regularization-Based Solution for Reward Overoptimization in Robotic Reinforcement Learning from Human Feedback

19. Towards Realistic Mechanisms That Incentivize Federated Participation and Contribution

20. PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback

21. On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

22. iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning

23. CCE: Sample Efficient Sparse Reward Policy Learning for Robotic Navigation via Confidence-Controlled Exploration

24. On the Possibilities of AI-Generated Text Detection

25. RE-MOVE: An Adaptive Policy Design for Robotic Navigation Tasks in Dynamic Environments via Language-Based Feedback

26. Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

27. STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning

28. SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication

29. DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in Complex Environments

30. Predicting Future Mosquito Larval Habitats Using Time Series Climate Forecasting and Deep Learning

31. FedBC: Calibrating Global and Local Models via Federated Learning Beyond Consensus

32. FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning

33. Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm

34. Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies

35. Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning

36. On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces

37. Projection-Free Algorithm for Stochastic Bi-level Optimization

38. Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

39. Wasserstein-Splitting Gaussian Process Regression for Heterogeneous Online Bayesian Inference

40. On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control

41. Energy-Efficient and Federated Meta-Learning via Projected Stochastic Gradient Ascent

42. MARL with General Utilities via Decentralized Shadow Reward Actor-Critic

43. Conservative Stochastic Optimization with Expectation Constraints

44. Variational Policy Gradient Method for Reinforcement Learning with General Utilities

45. Regret and Belief Complexity Trade-off in Gaussian Process Bandits via Information Thresholding

46. Cautious Reinforcement Learning via Distributional Risk in the Dual Domain

47. Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning

48. Optimally Compressed Nonparametric Online Learning

49. Nonstationary Nonparametric Online Learning: Balancing Dynamic Regret and Model Parsimony

50. GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning
