From 51c059263644cabf1d6d14824fce9c0f86a84d2a Mon Sep 17 00:00:00 2001 From: apo77yon <126520850+apo77yon@users.noreply.github.com> Date: Thu, 30 Mar 2023 13:08:19 -0700 Subject: [PATCH] Create README.md --- rl/README.md | 28 ++++++++++++++++++++++++++++ 1 file changed, 28 insertions(+) create mode 100644 rl/README.md diff --git a/rl/README.md b/rl/README.md new file mode 100644 index 0000000..6b04734 --- /dev/null +++ b/rl/README.md @@ -0,0 +1,28 @@ +## reinforcement learning + +
+ +##### 🤖 *reinforcement learning is learning what to do (how to map situations to actions) so as to maximize a numerical reward signal.* + +##### 🤖 *an autonomous agent is a software program or system that can operate independently and make decisions on its own, without direct intervention from a human.* + +
+ + +--- + +### tools + +* [gymnasium api](https://gymnasium.farama.org/) + +
+ +--- + +### resources + +
+ +* [reinforcement learning with unsupervised auxiliary tasks, by jaderberg et al](https://arxiv.org/abs/1611.05397) +* [advances in financial machine learning, by m. prado](https://github.com/midsumm3rDAO/library/blob/main/books/Advances+in+Financial+Machine+Learning.pdf) +* [reinforcement learning, by r. suton](https://github.com/midsumm3rDAO/library/blob/main/books/SuttonBartoIPRLBook2ndEd.pdf)