597 results on '"Pineau, Joelle"'
Search Results
152. A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
153. Biomedical Research and Informatics Living Laboratory for Innovative Advances of New Technologies in Community Mobility Rehabilitation: Protocol for Evaluation and Rehabilitation of Mobility Across Continuums of Care
154. Adaptive control of epileptiform excitability in an in vitro model of limbic seizures
155. ML Reproducibility Challenge 2021
156. Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs
157. Chapter 11: Imputing missing data from sequential multiple assignment randomized trials
158. Chapter 16: Practical reinforcement learning in dynamic treatment regimes
159. A bistable computational model of recurring epileptiform activity as observed in rodent slice preparations
160. Biomedical Research & Informatics Living Laboratory for Innovative Advances of New Technologies in Community Mobility Rehabilitation: Protocol for a longitudinal evaluation of mobility outcomes (Preprint)
161. Improving Passage Retrieval with Zero-Shot Question Generation
162. The Curious Case of Absolute Position Embeddings
163. Compressed Least-Squares Regression on Sparse Spaces
164. Transparency and reproducibility in artificial intelligence
165. ML Reproducibility Challenge 2020
166. A survey of point-based POMDP solvers
167. Automated Data-Driven Generation of Personalized Pedagogical Interventions in Intelligent Tutoring Systems
168. Improving reproducibility in machine learning research: a report from the NeurIPS 2019 reproducibility program
169. Informing sequential clinical decision-making through reinforcement learning: an empirical study
170. Development and Validation of a Robust Speech Interface for Improved Human-Robot Interaction
171. Improving Sample Efficiency in Model-Free Reinforcement Learning from Images
172. NeurIPS 2019 Reproducibility Challenge
173. The Duality of State and Observation in Probabilistic Transition Systems
174. Constructing evidence-based treatment strategies using methods from computer science
175. Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
176. Exploring the Limits of Few-Shot Link Prediction in Knowledge Graphs
177. UnNatural Language Inference
178. A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
179. Sometimes We Want Ungrammatical Translations
180. Do Encoder Representations of Generative Dialogue Models have sufficient summary of the Information about the task?
181. The Bottleneck Simulator: A Model-Based Deep Reinforcement Learning Approach
182. Building reproducible, reusable, and robust machine learning software
183. Development of a polygenic risk score to improve screening for fracture risk: A genetic risk prediction study
184. On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract)
185. Handling Black Swan Events in Deep Learning with Diversely Extrapolated Neural Networks
186. Literature Mining for Incorporating Inductive Bias in Biomedical Prediction Tasks (Student Abstract)
187. Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking
188. Learning an Unreferenced Metric for Online Dialogue Evaluation
189. Recurrent Boosting for Classification of Natural and Synthetic Time-Series Data
190. Active Learning in Partially Observable Markov Decision Processes
191. AAAI 2008 workshop reports
192. MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions
193. Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
194. No Press Diplomacy: Modeling Multi-Agent Gameplay
195. Learning Causal State Representations of Partially Observable Environments
196. On the Pitfalls of Measuring Emergent Communication
197. Separating value functions across time-scales
198. The Second Conversational Intelligence Challenge (ConvAI2)
199. Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
200. Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift