Search

Your search for author "Bedi, P" returned 4,045 results

Search Results

1. International Students: Poorly Suited Immigration Pathways Stymie Formation of High Growth Businesses. White Paper No. 273

2. Developing Quality Schools: A Content Analysis of Principals' Practices, Stressors, and Support Factors

3. Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

4. EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering

5. ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

6. On The Global Convergence Of Online RLHF With Neural Parametrization

7. rECGnition_v1.0: Arrhythmia detection using cardiologist-inspired multi-modal architecture incorporating demographic attributes in ECG

8. On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

9. AIME: AI System Optimization via Multiple LLM Evaluators

10. Auction-Based Regulation for Artificial Intelligence

11. meds_reader: A fast and efficient EHR processing library

12. Solving the strong CP problem with massless grand-color quarks

13. CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk

14. TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation

15. SAIL: Self-Improving Efficient Online Alignment of Large Language Models

16. Multi-LLM QA with Embodied Exploration

17. DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

19. Transfer Q Star: Principled Decoding for LLM Alignment

20. FACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding?

21. Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

22. PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

23. Bayesian modeling of co-occurrence microbial interaction networks

24. Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

25. Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals

26. Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning

27. Keep Learning: Student Engagement in an Online Environment

32. The Clinical Utility of a 7-Gene Biosignature on Radiation Therapy Decision Making in Patients with Ductal Carcinoma In Situ Following Breast-Conserving Surgery: An Updated Analysis of the DCISionRT® PREDICT Study

33. Highlighting the Safety Concerns of Deploying LLMs/VLMs in Robotics

34. Small instanton-induced flavor invariants and the axion potential

35. MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

36. Beyond Text: Utilizing Vocal Cues to Improve Decision Making in LLMs for Robot Navigation Tasks

37. REBEL: A Regularization-Based Solution for Reward Overoptimization in Robotic Reinforcement Learning from Human Feedback
