Selected Key References

For a more complete list, please see Google Scholar.

(2024). Optimistic Multi-Agent Policy Gradient. In Proceedings of the International Conference on Machine Learning (ICML).

(2023). Simplified Temporal Consistency Reinforcement Learning. In Proceedings of the International Conference on Machine Learning (ICML).

(2023). Hierarchical Imitation Learning with Vector Quantized Models. In Proceedings of the International Conference on Machine Learning (ICML).

(2022). Curriculum Reinforcement Learning via Constrained Optimal Transport. In Proceedings of the International Conference on Machine Learning (ICML).

(2022). Boosted Curriculum Reinforcement Learning. In Proceedings of the International Conference on Learning Representations (ICLR).

(2021). Convex Regularization in Monte-Carlo Tree Search. In Proceedings of the International Conference on Machine Learning (ICML).

(2020). Self-Paced Deep Reinforcement Learning. In Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS).

(2020). Generalized Mean Estimation in Monte-Carlo Tree Search. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI).

(2019). Projections for Approximate Policy Iteration Algorithms. In Proceedings of the International Conference on Machine Learning (ICML).

(2011). Periodic Finite State Controllers for Efficient POMDP and DEC-POMDP Planning. In Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS).
