Playing Atari games with Hierarchical Reinforcement Learning
We experimentally demonstrate that by decomposing the learning task into sub-systems and constructing options for the temporally extended planning, the learning process can be dramatically accelerated. Download paper here
Agent(orange one in the video) trained with hierarchical reinforcement learning playing Tennis