The Deadly Triad

We analyze how different components of DQN play a role in emergence of the deadly triad

2 min read

TPPO ā€” Truly PPO

We investigate the behavior of PPO and introduce new methods that forces the trust region constraint.

3 min read

Anti-Aliasing

Discussion on aliasing in modern convolutional neural networks and address it with low-pass filters.

2 min read

SENet: Squeeze-and Excitation Network

Discussion on Squeeze-and Excitation Network, an architecture that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels.

2 min read

EvoNorm

Discussion on EvoNorm, a set of uniform normalization-activation layers found by AutoML.

4 min read

MobileNet

Discussion on MobileNet families

5 min read

Math

We summarize some mathematical concepts used in deep reinforcement learning

5 min read

Combining EAs with RL

We summarize summarize several recent works that combine evolutionary algorithms with reinforcement learning.

1 min read