強化学習/機械学習国際会議 ICML 2010
をテンプレートにして作成
開始行:
*概要 [#tb4acdf8]
-第27回機械学習国際会議(The 27th International Conferenc...
-2010年6月21日-24日
-イスラエル
-http://www.icml2010.org/
*Reinforcement Learning 1 [#y26c66c6]
-[[''Least-Squares Λ Policy Iteration: Bias-Variance Trad...
Christophe Thiery (Loria); Bruno Scherrer (Loria)
-[[''Finite-Sample Analysis of LSTD'':http://www.icml2010...
Alessandro Lazaric (Inria); Mohammad Ghavamzadeh (Inria);...
-[[''Convergence of Least Squares Temporal Difference Met...
Huizhen Yu (Univ. of Helsinki)
-[[''Should one compute the Temporal Difference fix point...
Bruno Scherrer (Loria)
*Reinforcement Learning 2 [#zd62c780]
-[[''Approximate Predictive Representations of Partially ...
Doina Precup (Mcgill University); Monica Dinculescu (McGi...
-[[''Constructing States for Reinforcement Learning'':htt...
M. M. Mahmud (Australian National University)
-[[''Temporal Difference Bayesian Model Averaging: A Baye...
Carlton Downey (Victoria University of Wellington); Scott...
-[[''Bayesian Multi-Task Reinforcement Learning'':http://...
Alessandro Lazaric (Inria); Mohammad Ghavamzadeh (Inria)
*Reinforcement Learning 3 [#fe04a101]
-[[''Generalizing Apprenticeship Learning across Hypothes...
Thomas Walsh (Rutgers University); Kaushik Subramanian (R...
-[[''Toward Off-Policy Learning Control with Function App...
Hamid Maei (University of Alberta); Csaba Szepesvari (Uni...
-[[''Efficient Reinforcement Learning with Multiple Rewar...
Daniel Lizotte (University of Michigan); Michael Bowling ...
-[[''Internal Rewards Mitigate Agent Boundedness'':http:/...
Jonathan Sorg (University of Michigan); Satinder Singh (U...
*Reinforcement Learning 4 [#b301e15c]
-[[''Analysis of a Classification-based Policy Iteration ...
Alessandro Lazaric (Inria); Mohammad Ghavamzadeh (Inria);...
-[[''Nonparametric Return Distribution Approximation for ...
Tetsuro Morimura (IBM Research - Tokyo); Masashi Sugiyama...
-[[''Inverse Optimal Control with Linearly Solvable MDPs'...
Krishnamurthy Dvijotham (University of Washington); Emanu...
-[[''Feature Selection Using Regularization in Approximat...
Marek Petrik (University of Massachusetts ); Gavin Taylor...
終了行:
*概要 [#tb4acdf8]
-第27回機械学習国際会議(The 27th International Conferenc...
-2010年6月21日-24日
-イスラエル
-http://www.icml2010.org/
*Reinforcement Learning 1 [#y26c66c6]
-[[''Least-Squares Λ Policy Iteration: Bias-Variance Trad...
Christophe Thiery (Loria); Bruno Scherrer (Loria)
-[[''Finite-Sample Analysis of LSTD'':http://www.icml2010...
Alessandro Lazaric (Inria); Mohammad Ghavamzadeh (Inria);...
-[[''Convergence of Least Squares Temporal Difference Met...
Huizhen Yu (Univ. of Helsinki)
-[[''Should one compute the Temporal Difference fix point...
Bruno Scherrer (Loria)
*Reinforcement Learning 2 [#zd62c780]
-[[''Approximate Predictive Representations of Partially ...
Doina Precup (Mcgill University); Monica Dinculescu (McGi...
-[[''Constructing States for Reinforcement Learning'':htt...
M. M. Mahmud (Australian National University)
-[[''Temporal Difference Bayesian Model Averaging: A Baye...
Carlton Downey (Victoria University of Wellington); Scott...
-[[''Bayesian Multi-Task Reinforcement Learning'':http://...
Alessandro Lazaric (Inria); Mohammad Ghavamzadeh (Inria)
*Reinforcement Learning 3 [#fe04a101]
-[[''Generalizing Apprenticeship Learning across Hypothes...
Thomas Walsh (Rutgers University); Kaushik Subramanian (R...
-[[''Toward Off-Policy Learning Control with Function App...
Hamid Maei (University of Alberta); Csaba Szepesvari (Uni...
-[[''Efficient Reinforcement Learning with Multiple Rewar...
Daniel Lizotte (University of Michigan); Michael Bowling ...
-[[''Internal Rewards Mitigate Agent Boundedness'':http:/...
Jonathan Sorg (University of Michigan); Satinder Singh (U...
*Reinforcement Learning 4 [#b301e15c]
-[[''Analysis of a Classification-based Policy Iteration ...
Alessandro Lazaric (Inria); Mohammad Ghavamzadeh (Inria);...
-[[''Nonparametric Return Distribution Approximation for ...
Tetsuro Morimura (IBM Research - Tokyo); Masashi Sugiyama...
-[[''Inverse Optimal Control with Linearly Solvable MDPs'...
Krishnamurthy Dvijotham (University of Washington); Emanu...
-[[''Feature Selection Using Regularization in Approximat...
Marek Petrik (University of Massachusetts ); Gavin Taylor...
ページ名: