SIL - Self-Imitation Learning

Discussion on self-imitation learning, in which the agent exploits the previous transitions that receives better returnas than it expects

1 min read

AdaNorm

We analyze layer normalization and discuss its improvement AdaNorm.

1 min read

Ape-X DQfD

Discussion on several enhancements on Ape-X DQN.

6 min read