Here are some suggested background readings on Reinforcement Learning and Multiagent Reinforcement Learning.
Themes |
Suggested |
|
1. Bayesian RL / Bayesian exploration / Bandit problems |
|
|
2. Bayesian networks |
|
|
3. Factored MDPs and related solution methods |
|
|
4. DecMDPs / DecPOMDPs |
|
|
5. First-order MDPs |
|
|
6. Gaussian Processes (GPs) / GPs +RL / GPs+ Machine Learning |
|
|
7. Inverse RL |
|
|
8. Coordination Games / Learning in Coordination Games |
|
|
9. Hierarchical RL and Abstraction methods |
|
|
10.Hidden Markov Models |
|
|
11.Uncertainty and Learning in Games |
|
|
Date |
Title |
Discussion Leaders
(you can find links to slides here, if available; check links at leaders’
names) |
|
24 Feb. 2009 |
Sutton and |
|
|
4 Mar. 2009 |
Sutton and |
Simon, Enrique, Georgios |
|
25 Mar. 2009 |
1) Bayesian Q-Learning (http://www.cs.huji.ac.il/~nir/Abstracts/DFR1.html) 2) Model-Based Bayesian Exploration (http://robotics.stanford.edu/users/nir/Papers/DFA1.pdf) 3) A Bayesian Framework for Reinforcement Learning (http://citeseer.ist.psu.edu/old/522507.html) People should find the following videotalk on “Model-Based Bayesian RL” by Pascal Poupart very informative (slides can also be downloaded from the site): http://videolectures.net/icml07_poupart_mbbrl/ |
1) Long 2) Archie 3) Kate |
|
7 Apr. 2009 |
Model-based Bayesian Reinforcement Learning in Partially Observable Domains by Pascal Poupart and Nikos Vlassis. AI-Math 2008 |
- Simon (George will also be speaking briefly about the following paper: Poupart, Vlassis, Hoey and Regan: An analytic solution to Discrete Bayesian Reinforcement Learning, 23rd ICML, 2006 ) |
|
28 Apr. 2009 |
Introduction
to Bayesian Networks (D.Heckerman: A tutorial on Bayesian Networks: Sections
1-5) |
|
|
27 May 2009 |
More on
Bayesian Networks: Sections
6-12 |
-
Long, Simon |
|
1 Jul 2009 |
“Optimizing Information
Exchange in Cooperative Multiagent Systems” by
Claudia Goldman and Shlomo Zilberstein |
-
George, Zinovi |
|
29 Jul 2009 |
On the
Value of Communication during Online Multiagent
Planning under Uncertainty |
-
Simon [POSTPONED] |
|
21 Oct 2009 |
DEBATE: Bayesian
vs. Frequentist - http://www-stat.stanford.edu/~ckirby/brad/papers/2005NEWModernScience.pdf - http://nb.vse.cz/kfil/elogos/science/vallverdu08.pdf |
-
Zinovi, Enrique, George et al. |
Page maintained by Georgios Chalkiadakis.