BDPI

Tag

Denis Steckelmacher, Ph.D. student at the Artificial Intelligence Lab, presented his research on Bootstrapped Dual Policy Iteration at our weekly research meeting last week: Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration In reinforcement learning, we not only want the agent to learn how to perform well in a given environment, we also want it...
Read More