Agent57
Discussion on an agent, called Agent57, that outperforms the standard human benchmark on all Atari games.
NGU — Never Give Up
Discussion on the Never-Give-Up(NGV) agent that achieves the state-of-the-art performance in hard exploration games in Atari without any prior knowledge while maintraining a very high score across the remaining games.
From 1st Wasserstein to Kantorovich-Rubinstein Duality
An introduction to the dual of the 1st Wasserstein distance.
Duality in Linear Programm
An introduction to dual linear programs
SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
Discussion on some useful results about how to learn a good representation
GCN, GLU — Gated Convolutional Network
Discussion on Gated Convolutional Network that applies 1D convolution to sequential data.
EC — Episodic Curiosity
Discussion on an exploration method based on episodic memory.
FiLM — Feature-wise Linear Modulation
Discussion on Feature-wise Linear Modulation
DreamerV2
Discussion on DreamerV2, a model-based algorithm reaching promising results on Atari games
Dreamer
Discussion on a model-based reinforcement learning agent called Dreamer