Page 9 of 16 for Zero | This blog no longer updates but I’m still in my quest of RL. For anyone interested in discussion of recent advance of AI/RL, please contact me via my emails: 122134545@qq.com/o.xlnwel@gmail.com

Discussion on a RL algorithm that exploit off-policy data.

Discussion on several concerns in deep (Q) learning.

Discussion on a scalable reinforcement learning architecture that speeds up both data collection and learning process.

Discussion on a distributed reinforcement learning architecture that incoporates a recurrent network into Ape-X.

Discussion on a distributed reinforcement learning architecture for policy gradient methods.

Discussion on a distributed reinforcement learning architecture for Q-learning methods.

Discussion on several improvements on differentiable neural computer.

Discussion on Differentiable Neural Computer.

Discussion on Neural Turing Machines, an architecture able to utilize an external memory.

Discussion on a policy-gradient method with hindsight experience