Source Themes

Boosted Curriculum Reinforcement Learning

Long-Term Visitation Value for Deep Exploration in Sparse-Reward Reinforcement Learning

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

Convex Regularization in Monte-Carlo Tree Search

Self-Paced Deep Reinforcement Learning

Generalized Mean Estimation in Monte-Carlo Tree Search

Multi-agent active information gathering in discrete and continuous-state decentralized POMDPs by policy graph improvement

Probabilistic approach to physical object disentangling

Compatible natural gradient policy search

Projections for Approximate Policy Iteration Algorithms