Reinforcement Learning / International Conference on Machine Learning ICML 2009
*Overview [#m9fefdc1]
-International machine learning conference (International Conference on Machine Le...
-June 14-18, 2009
-Montreal
-http://www.cs.mcgill.ca/~icml2009/
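This is the international conference most closely related to reinforcement learning.
This year there are three sessions on reinforcement learning.
(Last year there were as many as six.)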
*1C - Exploration in Reinforcement Learning [#f8839b2b]
-''[[The Adaptive k-Meteorologists Problem and Its Applic...
Carlos Diuk, Lihong Li and Bethany Leffler.
-''[[Near-Bayesian Exploration in Polynomial Time.:http:/...
J. Zico Kolter and Andrew Ng.
-''[[Optimistic Initialization and Greediness Lead to Pol...
Istvan Szita and Andras Lorincz.
-''[[Dynamic Analysis of Multiagent Q-learning with e-gre...
Eduardo Rodrigues Gomes and Ryszard Kowalczyk.
-''[[Hoeffding and Bernstein Races for Selecting Policies...
Verena Heidrich-Meisner and Christian Igel.
*3C - Reinforcement Learning with Temporal Differences [#...
-''[[Proto-Predictive Representation of States with Simpl...
Takaki Makino.
-''[[Regularization and Feature Selection in Least Square...
J. Zico Kolter and Andrew Ng.
-''[[Fast gradient-descent methods for temporal-differenc...
Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh...
-''[[Kernelized Value Function Approximation for Reinforc...
Gavin Taylor and Ronald Parr.
-''[[Constraint Relaxation in Approximate Linear Programs...
Marek Petrik and Shlomo Zilberstein.
*5C - Reinforcement Learning in High Order Environments [...
-''[[Binary Action Search for Learning Continuous-Action ...
Jason Pazis and Michail Lagoudakis.
-''[[Predictive Representations for Policy Gradient in PO...
Abdeslam Boularias and Brahim Chaib-draa.
-''[[Stochastic Search using the Natural Gradient.:http:/...
Sun Yi, Daan Wierstra, Tom Schaul, and Juergen Schmidhuber.
-''[[Approximate Inference for Planning in Stochastic Rel...
Tobias Lang and Marc Toussaint.
-''[[Discovering Options from Example Trajectories.:http:...
Peng Zang, Peng Zhou, David Minnen and Charles Isbell.