MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

First published at 13:28 UTC on November 21st, 2019.

MuZero harnesses the power of AlphaZero, but without relying on an accurate environment model. This opens up planning-based reinforcement learning to entirely new domains, where such environment models aren't available. The difference to previo…

Category: Science & Technology
Sensitivity: Normal - Content that is suitable for ages 16 and over