ProMP — Proximal MetaPolicy Search

We address the credit assignment problem of two forms of MAML with an RL objective and discuss an efficient and stable meta reinforcement learning algorithm.

5 min read

Hierarchical Guidance

Discussion on an algorithmic framework called hierarchical guidance, which leverages hierarchical structure in imitation learning.

3 min read