Page 11 of 16 for Zero | This blog no longer updates but I’m still in my quest of RL. For anyone interested in discussion of recent advance of AI/RL, please contact me via my emails: 122134545@qq.com/o.xlnwel@gmail.com

MB-MPO — Model-Based Meta-Policy Optimization

Discussion on an algorithm that efficiently learns a robust policy by applying MAML to multiple dynamics model.

ProMP — Proximal MetaPolicy Search

We address the credit assignment problem of two forms of MAML with an RL objective and discuss an efficient and stable meta reinforcement learning algorithm.

Adaptive MAML — Applying MAML-RL to nonstationary environments

Discussion on a variant of MAML-RL for solving tasks that change dynamically due to non-stationary of the environment.

MAML++: Improvements on MAML

Discussion on a series of improvements on MAML

MAML — Model-Agnostic Meta-Learning

Discussion on an optimization algorithm for meta-learning named Model-Agnostic Meta-Learning(MAML)

SNAIL — Simple Neural AttentIve meta-Learner

Discussion on a meta-learning architecture named Simple Neural AttentIve meta-Learner(SNAIL).

HAC — Learning Multi-Level Hierarchies with Hindsight

A norvel hierarchical reinforcement learning frame work that can efficiently learn multiple levels of policies in parallel.

NORL-HRL — Near-Optimal Representation Learning for Hierarchical Reinforcement Learning

Near-Optimal Representation Learning for Hierarchical Reinforcement Learning: An improvement to HIRO

HIRO — HIerarchical Reinforcement learning with Off-policy correction

Discussion on a hierarchical reinforcement learning algorithm for goal-directed tasks.

Hierarchical Guidance

Discussion on an algorithmic framework called hierarchical guidance, which leverages hierarchical structure in imitation learning.