Search

Your search for author "Bedi, P" in the arXiv database returned 115 results.

Search Results

1. Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

2. EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering

3. ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

4. On The Global Convergence Of Online RLHF With Neural Parametrization

5. rECGnition_v1.0: Arrhythmia detection using cardiologist-inspired multi-modal architecture incorporating demographic attributes in ECG

6. On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

7. AIME: AI System Optimization via Multiple LLM Evaluators

8. Auction-Based Regulation for Artificial Intelligence

9. meds_reader: A fast and efficient EHR processing library

10. Solving the strong CP problem with massless grand-color quarks

11. CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk

12. TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation

13. SAIL: Self-Improving Efficient Online Alignment of Large Language Models

14. Multi-LLM QA with Embodied Exploration

15. DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

16. Transfer Q Star: Principled Decoding for LLM Alignment

17. FACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?

18. Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

19. PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

20. Bayesian modeling of co-occurrence microbial interaction networks

21. Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

22. Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals

23. Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning

24. Highlighting the Safety Concerns of Deploying LLMs/VLMs in Robotics

25. Small instanton-induced flavor invariants and the axion potential

26. MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

27. Beyond Text: Utilizing Vocal Cues to Improve Decision Making in LLMs for Robot Navigation Tasks

28. REBEL: A Regularization-Based Solution for Reward Overoptimization in Robotic Reinforcement Learning from Human Feedback

29. Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey

30. Towards Realistic Mechanisms That Incentivize Federated Participation and Contribution

31. Minimal surfaces over harmonic shears

32. LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments

33. A generalized Bayesian stochastic block model for microbiome community detection

34. PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback

35. On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

36. iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning

37. CCE: Sample Efficient Sparse Reward Policy Learning for Robotic Navigation via Confidence-Controlled Exploration

38. Bayesian Segmentation Modeling of Epidemic Growth

39. Image-based Indian Sign Language Recognition: A Practical Review using Deep Neural Networks

40. On the Possibilities of AI-Generated Text Detection

41. RE-MOVE: An Adaptive Policy Design for Robotic Navigation Tasks in Dynamic Environments via Language-Based Feedback

42. Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

43. STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning

44. SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication

45. DMCA: Dense Multi-agent Navigation using Attention and Communication

46. RTAW: An Attention Inspired Reinforcement Learning Method for Multi-Robot Task Allocation in Warehouse Environments

47. DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in Complex Environments

48. Predicting Future Mosquito Larval Habitats Using Time Series Climate Forecasting and Deep Learning

49. HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm

50. FedBC: Calibrating Global and Local Models via Federated Learning Beyond Consensus
