Ilija Bogunovic
Home
News
Publications
Team
Reading Group
Contact
V. Mehta
Latest
Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration
Near-optimal Policy Identification in Active Reinforcement Learning
Cite
×