Machine Learning Street Talk (MLST)
#114 - Secrets of Deep Reinforcement Learning (Minqi Jiang)
Patreon: https://www.patreon.com/mlst
Discord: https...
more
Apr 16 2023 2h 47m
Chapter 1 3 mins
Tech & Startup BackgroundChapter 2 7 mins
Pursuing PhD in Deep RLChapter 3 56 sec
Startup LessonsChapter 4 6 mins
Serendipity vs PlanningChapter 5 3 mins
Objectives & Decision MakingChapter 6 3 mins
Minimax Regret & UncertaintyChapter 7 7 mins
Robustness in RL & Zero-Sum GamesChapter 8 7 mins
RL vs Supervised LearningChapter 9 5 mins
Exploration & IntelligenceChapter 10 7 mins
Environment, Emergence, AbstractionChapter 11 10 mins
Open-endedness & Intelligence ExplosionChapter 12 11 mins
Language Models & Training DataChapter 13 10 mins
RLHF & Language ModelsChapter 14 13 mins
Creativity in Language ModelsChapter 15 4 mins
Limitations of RLChapter 16 3 mins
Software 2.0 & InterpretabilityChapter 17 3 mins
Language Models & Code ReliabilityChapter 18 4 mins
Robust Prioritized Level ReplayChapter 19 12 mins
Open-ended LearningChapter 20 22 mins
Auto-curriculum & Deep RLChapter 21 5 mins
Robotics & Open-ended LearningChapter 22 5 mins
Learning Potential & MDPsChapter 23 45 sec
Universal Function SpaceChapter 24 1 min
Goal-Directed Learning & Auto-CurriculaChapter 25 2 mins
Advice & Closing Thoughts