AlphaZero is the successor to the famous AlphaGo algorithm, which defeated a world-champion Go player. Unlike AlphaGo, it requires no human games to bootstrap its training and can learn to play other games, such as chess. This talk will break down how AlphaZero works without assuming any prior reinforcement learning knowledge. It will also look at a recent extension, MuZero, which lifts one of AlphaZero's main limitations: the need for an explicit model of the environment. MuZero has achieved stunning results in a variety of reinforcement learning domains, such as learning to play Atari games.