安全な強化学習

2023-12-22 (金) 19:55:03 (556d) | Topic path: Top / 強化学習 / 安全な強化学習

安全な強化学習に関するメモ

サーベイ論文 †

García J and Fernández F (2015). A comprehensive survey on safe reinforcement learning. JMLR 16(42):1437–1480.
- 安全な強化学習に関する最初のサーベイ論文
Brunke L, Greeff M, Yuan Z, et al. (2022). Safe learning in robotics: From learning-based control to safe reinforcement learning. Annual Review of Control, Robotics and Autonomous Systems 5:411–444. doi: 10.1146/annurev-control-042920-020211
- 安全な強化学習に関するサーベイ論文
Zhao W, He T, Chen R, et al. (2023). State-wise safe reinforcement learning: A survey. IJCAI 2023 Survey Track:6814–6822. doi: 10.24963/ijcai.2023/763
- 安全な強化学習に関するサーベイ論文。状態ごとに制約付きのMDP (State-wise Constrained MDP)

論文 †

Ray A, Achiam J, and Amodei D (2019). Benchmarking safe exploration in deep reinforcement learning. Available from https://openai.com/research/benchmarking-safe-exploration-in-deep-reinforcement-learning
- OpenAI Safety Gymに関する論文

Workshop †

Safe RL 2022
- The 1st International Workshop on Safe Reinforcement Learning Theory and its Applications, IEEE MFI 2022

Web †

chauncygu (2023). Safe reinforcement learning baselines. GitHub [cited at 2023 Dec 19]. Available from https://github.com/chauncygu/Safe-Reinforcement-Learning-Baselines
- 安全な強化学習に関する情報をまとめたGitHubページ

コード †

OpenAI (2019). Safety Gym. [cited at 2023 Dec 19]. Available from https://openai.com/research/safety-gym
- OpenAIが開発した安全な強化学習に関するベンチマーク環境
OpenAI (2019). Safety starter agents. GitHub. [cited at 2023 Dec 19]. Available from https://github.com/openai/safety-starter-agents
- OpenAIが用意した安全な強化学習エージェント

とうごろう.jp

とうごろぐ（ブログ）

Twitter

Facebook

授業

最新の20件

2025-06-28

Tips For Online Dating Website No Cost

2025-05-12

機械学習/Rで機械学習する

2025-01-11

ColabでCUDAとPyTorchとPythonをダウングレードする

2024-10-02

バイオ・データ・マイニング/ClustalWでペアワイズ・アラインメントを行う

2024-08-06

2023-12-26

金融データ・マイニング/動的クラスタリングとクラスター変化検出

2023-12-22

強化学習/安全な強化学習

2023-12-21

2023-12-19

授業/情報数学

2023-01-11

バイオ・データ・マイニング/Rでロジスティック回帰を使う

2022-11-09

2022-10-14

バイオ・データ・マイニング/HMMERで相同性検索を行う

2020-12-23

バイオ・データ・マイニング/Rで回帰分析する

2020-12-09

バイオ・データ・マイニング/Rで階層クラスタリングを使う

2020-10-21

バイオ・データ・マイニング/BLASTで相同性検索を行う