Decision Transformer: Reinforcement Learning via Sequence Modeling (Research Paper Explained)
#decisiontransformer #reinforcementlearning #transformer
Proper credit assignment over long timespans is a fundamental problem in reinforcement learning. Even methods designed to combat this problem, such as TD-learning, quickly reach their limits when rewards are sparse or noisy. This paper reframes offline reinforcement learning as a pure sequence modeling problem, in which actions are sampled conditioned on the given history and the desired future rewards. This lets the authors apply recent advances in Transformer-based sequence modeling and achieve competitive results on offline RL benchmarks.
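To make "conditioned on the given history and desired future rewards" concrete, here is a minimal Python sketch of how a recorded trajectory could be turned into the kind of sequence such a model reads. The function names (returns_to_go, interleave) and the tuple-based token layout are illustrative assumptions, not the authors' code; the sketch only shows the data preparation, not the Transformer itself.

```python
def returns_to_go(rewards):
    """For each timestep, sum the rewards from that step to the end (undiscounted)."""
    rtg = []
    running = 0.0
    # Walk backwards so each step adds its own reward to the future total.
    for r in reversed(rewards):
        running += r
        rtg.append(running)
    rtg.reverse()
    return rtg


def interleave(rtg, states, actions):
    """Build one flat sequence: (return-to-go, state, action) per timestep.

    The tuple tags are a hypothetical stand-in for real token embeddings.
    """
    tokens = []
    for g, s, a in zip(rtg, states, actions):
        tokens.extend([("return_to_go", g), ("state", s), ("action", a)])
    return tokens


# Toy trajectory with a single sparse reward at the end.
rewards = [0.0, 0.0, 1.0]
states = ["s0", "s1", "s2"]
actions = ["a0", "a1", "a2"]

rtg = returns_to_go(rewards)
sequence = interleave(rtg, states, actions)
```

At test time, the idea is that the first return-to-go entry is replaced by the return you would like to achieve, and the model is asked to predict the action that follows.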
OUTLINE:
0:00 - Intro & Overview
4:15 - Offline Reinforcement Learning
10:10 - Transformers in RL
14:25 - Value Functions and Temporal Difference Learning
20:25 - Sequence Modeling and Reward-to-go
27:20 - Why this is ideal for offline RL
31:30 - The context length problem
34:35 - Toy example: Shortest path from random walks
41:00 - Discount factors
45:50 - Experimental Results
49:25 - Do you need to know the be