Search

Your search keyword '"Bedi, P"' showing total 119 results

Search Constraints

Start Over You searched for: Author "Bedi, P" Remove constraint Author: "Bedi, P" Publication Type Reports Remove constraint Publication Type: Reports
119 results on '"Bedi, P"'

Search Results

1. International Students: Poorly Suited Immigration Pathways Stymie Formation of High Growth Businesses. White Paper No. 273

2. Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

3. EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering

4. ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

5. On The Global Convergence Of Online RLHF With Neural Parametrization

6. rECGnition_v1.0: Arrhythmia detection using cardiologist-inspired multi-modal architecture incorporating demographic attributes in ECG

7. On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

8. AIME: AI System Optimization via Multiple LLM Evaluators

9. Auction-Based Regulation for Artificial Intelligence

10. meds_reader: A fast and efficient EHR processing library

11. Solving the strong CP problem with massless grand-color quarks

12. CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk

13. TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation

14. SAIL: Self-Improving Efficient Online Alignment of Large Language Models

15. Multi-LLM QA with Embodied Exploration

16. DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

17. Transfer Q Star: Principled Decoding for LLM Alignment

18. FACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?

19. Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

20. PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

21. Bayesian modeling of co-occurrence microbial interaction networks

22. Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

23. Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals

24. Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning

25. Highlighting the Safety Concerns of Deploying LLMs/VLMs in Robotics

26. Small instanton-induced flavor invariants and the axion potential

27. MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

28. Beyond Text: Utilizing Vocal Cues to Improve Decision Making in LLMs for Robot Navigation Tasks

29. REBEL: A Regularization-Based Solution for Reward Overoptimization in Robotic Reinforcement Learning from Human Feedback

30. Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey

31. Towards Realistic Mechanisms That Incentivize Federated Participation and Contribution

32. Minimal surfaces over harmonic shears

33. LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments

34. A generalized Bayesian stochastic block model for microbiome community detection

35. PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback

36. On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

37. iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning

38. CCE: Sample Efficient Sparse Reward Policy Learning for Robotic Navigation via Confidence-Controlled Exploration

39. Bayesian Segmentation Modeling of Epidemic Growth

40. Image-based Indian Sign Language Recognition: A Practical Review using Deep Neural Networks

41. On the Possibilities of AI-Generated Text Detection

42. RE-MOVE: An Adaptive Policy Design for Robotic Navigation Tasks in Dynamic Environments via Language-Based Feedback

43. Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

44. STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning

45. SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication

46. DMCA: Dense Multi-agent Navigation using Attention and Communication

47. RTAW: An Attention Inspired Reinforcement Learning Method for Multi-Robot Task Allocation in Warehouse Environments

48. DC-MRTA: Decentralized Multi-Robot Task Allocation and Navigation in Complex Environments

49. Predicting Future Mosquito Larval Habitats Using Time Series Climate Forecasting and Deep Learning

50. HTRON:Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm

Catalog

Books, media, physical & digital resources