Reinforcement Learning/Safe Reinforcement Learning: backup diff (No. 1)

- 1 (2023-12-19 (Tue) 11:56:36)
- 2 (2023-12-19 (Tue) 13:02:51)
- 3 (2023-12-20 (Wed) 09:34:06)
- 4 (2023-12-21 (Thu) 18:07:57)

Notes on safe reinforcement learning

*Survey papers [#s1eda9e8]

-[[García J and Fernández F (2015). A comprehensive survey on safe reinforcement learning. JMLR 16(42):1437–1480.>https://jmlr.org/papers/v16/garcia15a.html]]
--The first survey paper on safe reinforcement learning


*Papers [#ke1f2355]
-[[Ray A, Achiam J, and Amodei D (2019). Benchmarking safe exploration in deep reinforcement learning. Available from https://openai.com/research/benchmarking-safe-exploration-in-deep-reinforcement-learning>https://openai.com/research/benchmarking-safe-exploration-in-deep-reinforcement-learning]]
--The paper on OpenAI Safety Gym


*Web [#ncb28416]
-[[chauncygu (2023). Safe reinforcement learning baselines. GitHub [cited 2023 Dec 19]. Available from https://github.com/chauncygu/Safe-Reinforcement-Learning-Baselines>https://github.com/chauncygu/Safe-Reinforcement-Learning-Baselines]]
--A GitHub page that collects resources on safe reinforcement learning



*Environments [#h1b5e37c]
-[[OpenAI (2019). Safety Gym. [cited 2023 Dec 19]. Available from https://openai.com/research/safety-gym>https://openai.com/research/safety-gym]]
--A benchmark environment for safe reinforcement learning, developed by OpenAI
-[[OpenAI (2019). Safety starter agents. GitHub. [cited 2023 Dec 19]. Available from https://github.com/openai/safety-starter-agents>https://github.com/openai/safety-starter-agents]]
--Safe reinforcement learning agents provided by OpenAI
