Zero

This blog is no longer updated, but I am still pursuing RL. If you are interested in discussing recent advances in AI/RL, please contact me by email: 122134545@qq.com or o.xlnwel@gmail.com

RODE — Learning Roles to Decompose Multi-Agent Tasks

Discussion on RODE, a hierarchical MARL method that decomposes the action space into role action subspaces according to the actions' effects on the environment.

3 min read February 27, 2021

PWIL — Primal Wasserstein Imitation Learning

Discussion on Primal Wasserstein Imitation Learning.

3 min read February 14, 2021

Network Regularization in Policy Optimization

Discussion on the effect of network regularization in policy optimization.

3 min read February 7, 2021

HIDIO — Hierarchical RL by Discovering Intrinsic Options

Discussion on HIDIO, a hierarchical RL method that discovers intrinsic options.

2 min read February 1, 2021

IDAAC — Invariant Decoupled Advantage Actor-Critic

Discussion on IDAAC, which identifies and addresses the problem of using a shared representation for learning the policy and the value function.

4 min read January 27, 2021

DTSIL — Diverse Trajectory-conditioned Self-Imitation Learning

Discussion on Diverse Trajectory-conditioned Self-Imitation Learning.

3 min read January 14, 2021

TAC — Tsallis Actor Critic

Discussion on Tsallis Actor Critic.

3 min read January 7, 2021

MARL — A Survey and Critique

We present an overview of multi-agent reinforcement learning.

12 min read January 1, 2021

C++ Concurrency in Action — Chapter 9

Notes from Williams’ C++ Concurrency in Action

~1 min read January 1, 2021

C++ Concurrency in Action — Chapter 8

Notes from Williams’ C++ Concurrency in Action

4 min read January 1, 2021

© 2022 Zero. Powered by Jekyll & So Simple.