Reinforcement Learning/International Conference on Neural Information Processing Systems NIPS
Conference on Advances in Neural Information Processing S...
(Papers are added progressively, starting from the most recent; this is not a complete list...
''Acceptance rate''
-NIPS 2009: ?
-NIPS 2008: 250/1022=24.5%
-NIPS 2007: 217/975=22.3%
*Robotics [#cc3c22b6]
-[[''Policy Search for Motor Primitives in Robotics'':htt...
Jens Kober, Jan Peters~
NIPS 2008, pp. 849-856 (2009).
-[[''An Application of Reinforcement Learning to Aerobati...
Pieter Abbeel, Adam Coates, Andrew Ng, Morgan Quigley~
NIPS 2006, pp. 1-8 (2007).
*Traffic control [#v33be840]
-[[''Natural Actor-Critic for Road Traffic Optimisation''...
Silvia Richter, Douglas Aberdeen, Jin Yu~
NIPS 2006, pp. 1169-1176 (2007).
*Power control [#a7985bf3]
-[[''Managing Power Consumption and Performance of Comput...
Gerry Tesauro, Rajarshi Das, Hoi Chan, Jeffrey Kephart, D...
NIPS 2007, pp. 1497-1504 (2008).
*Apprenticeship learning [#ec21e061]
-[[''Hierarchical Apprenticeship Learning with Applicatio...
J. Zico Kolter, Pieter Abbeel, Andrew Ng~
NIPS 2007, pp. 769-776 (2008).
-[[''A Game-Theoretic Approach to Apprenticeship Learning...
Umar Syed, Robert Schapire~
NIPS 2007, pp. 1449-1456 (2008).
*Meta-learning [#dc360314]
-[[''Stress, noradrenaline, and realistic prediction of m...
Gediminas Lukšys, Carmen Sandi, Wulfram Gerstner~
NIPS 2008, pp. 1001-1008 (2009).
-''Effects of Stress and Genotype on Meta-parameter Dynam...
Gediminas Lukšys, Jérémie Knüsel, Denis Sheynikhovich, Ca...
NIPS 2006, pp. 937-944 (2007).
*Continuous action spaces [#m17e8758]
-[[''Fitted Q-iteration by Advantage Weighted Regression'...
Gerhard Neumann, Jan Peters~
NIPS 2008, pp. 1177-1184 (2009).
-[[''Reinforcement Learning in Continuous Action Spaces t...
Alessandro Lazaric, Marcello Restelli, Andrea Bonarini~
NIPS 2007, pp. 833-840 (2008).
-[[''Fitted Q-iteration in continuous action-space MDPs''...
András Antos, Remi Munos, Csaba Szepesvari~
NIPS 2007, pp. 9-16 (2008).
*Exploration-exploitation dilemma [#c75af9c5]
-[[''Learning to Explore and Exploit in POMDPs'':http://n...
Chenghui Cai, Xuejun Liao, Lawrence Carin~
NIPS 2009.
*Exploration [#o1739031]
-[[''Multi-resolution Exploration in Continuous Spaces'':...
Ali Nouri, Michael Littman~
NIPS 2008, pp. 1209-1216 (2009).
*Analysis of learning [#d02b9143]
-[[''Temporal Difference Updating without a Learning Rate...
Marcus Hutter, Shane Legg~
NIPS 2007, pp. 705-712 (2008).
*Gradient methods [#s665fbf9]
-[[''Signal-to-Noise Ratio Analysis of Policy Gradient Al...
John Roberts, Russ Tedrake~
NIPS 2008, pp. 1361-1368 (2009).
-[[''Bayesian Policy Gradient Algorithms'':http://nips.cc...
Mohammad Ghavamzadeh, Yaakov Engel~
NIPS 2006, pp. 457-464 (2007).
*TD learning [#mc6d8702]
-[[''A Convergent '''O'''('''n''') Temporal-difference Al...
Rich Sutton, Csaba Szepesvari, Hamid Maei~
NIPS 2008, pp. 1609-1616 (2009).
*Actor-critic [#h88d2bf4]
-[[''Temporal Difference Based Actor Critic Learning - Co...
Dotan Di Castro, Dima Volkinshtein, Ron Meir~
NIPS 2008, pp. 385-392 (2009).
-[[''Incremental Natural Actor-Critic Algorithms'':http:/...
Shalabh Bhatnagar, Rich Sutton, Mohammad Ghavamzadeh, Mar...
NIPS 2007, pp. 105-112 (2008).
*Model-based [#s3f15c68]
-[[''Manifold Embeddings for Model-Based Reinforcement Le...
Keith Bush, Joelle Pineau~
NIPS 2009.
-[[''Learning to Use Working Memory in Partially Observab...
Michael Todd, Yael Niv, Jonathan Cohen~
NIPS 2008, pp. 1689-1696 (2009).
*Other / Uncategorized [#a92fb145]
-[[''Training Factor Graphs with Reinforcement Learning f...
Michael Wick, Khashayar Rohanimanesh, Sameer Singh, Andre...
NIPS 2009.
-[[''Optimization on a Budget: A Reinforcement Learning A...
Paul Ruvolo, Ian Fasel, Javier Movellan~
NIPS 2008, pp. 1385-1392 (2009).
-[[''Near-optimal Regret Bounds for Reinforcement Learnin...
Peter Auer, Thomas Jaksch, Ronald Ortner~
NIPS 2008, pp. 89-96 (2009).
-[[''Psychiatry: Insights into depression through normati...
Quentin Huys, Joshua Vogelstein, Peter Dayan~
NIPS 2008, pp. 729-736 (2009).
-''Logarithmic Online Regret Bounds for Undiscounted Rein...
Peter Auer, Ronald Ortner~
NIPS 2006, pp. 49-56 (2007).