Page 7 of 16 for Zero | This blog no longer updates but I’m still in my quest of RL. For anyone interested in discussion of recent advance of AI/RL, please contact me via my emails: 122134545@qq.com/o.xlnwel@gmail.com

Agent57

Discussion on an agent, called Agent57, that outperforms the standard human benchmark on all Atari games.

Discussion on the Never-Give-Up(NGV) agent that achieves the state-of-the-art performance in hard exploration games in Atari without any prior knowledge while maintraining a very high score across the remaining games.

From 1st Wasserstein to Kantorovich-Rubinstein Duality

An introduction to the dual of the 1st Wasserstein distance.

Duality in Linear Programm

An introduction to dual linear programs

SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

Discussion on some useful results about how to learn a good representation

GCN, GLU — Gated Convolutional Network

Discussion on Gated Convolutional Network that applies 1D convolution to sequential data.

EC — Episodic Curiosity

Discussion on an exploration method based on episodic memory.

FiLM — Feature-wise Linear Modulation

Discussion on Feature-wise Linear Modulation

DreamerV2

Discussion on DreamerV2, a model-based algorithm reaching promising results on Atari games

Dreamer

Discussion on a model-based reinforcement learning agent called Dreamer