Player of Games: All the games, one algorithm! (w/ author Martin Schmid)
#playerofgames #deepmind #alphazero
Special Guest: First author Martin Schmid
Games have long been used in research as testbeds for AI algorithms, such as reinforcement learning agents. However, different types of games usually require different solution approaches: AlphaZero for perfect information games like Go or Chess, and Counterfactual Regret Minimization (CFR) for imperfect information games like Poker. Player of Games bridges the gap between perfect and imperfect information games with a single algorithm that uses tree search over public information states and is trained via self-play. The resulting algorithm can play Go, Chess, Poker, Scotland Yard, and many more games, and can also be applied to non-game environments.
OUTLINE:
0:00 - Introduction
2:50 - What games can Player of Games be trained on?
4:00 - Tree search algorithms (AlphaZero)
8:00 - What is different in imperfect information games?
15:40 - Counterfactual Value- and Policy-Networks
18:50 - The Player of Games search procedure
28:30 - How to train the network?
34