Page 10 of 16 for Zero | This blog no longer updates but I’m still in my quest of RL. For anyone interested in discussion of recent advance of AI/RL, please contact me via my emails: 122134545@qq.com/o.xlnwel@gmail.com

Discussion on a method that can learn values across many orders of magnitudes.

Discussion on a multi-agent reinforcement learning algorithm that schedules communication between cooperative agents.

Discussion on a multi-agent reinforcement learning algorithm that recursively reason the opponents’ behavior.

Discussion on a multi-agent reinforcement learning algorithm that follows the framework of centralized training with decentralized execution.

Discussion on a novel exploration method based on representation learning

Discussion on how to solve the web navigation problem using DQN.

Discussion on a new regularization mechanism that leverage an optimal prior to explicitly penalize the mutual information between states and f.

Discussion on several techniques involved in SAGAN, including self-attention, spectral normalization, conditional batch normalization, etc

Discussion on a model-based meta reinforcement learning algorithm that enables the agent to fast adapt to changes of environment.

Discussion on an off-policy meta reinforcement learning algorithm that achieves state-of-the-art performance and sample efficiency.