Reinforcement Learning Adventure
In this little project, I aim to summarize Reinforcement Learning knowledge step by step. The motivation is to organize what I have learned from the Udacity Deep Reinforcement Learning Nanodegree and to go beyond it. Here I will write blog posts and code covering the basics as well as the important papers in this area, starting with the recommendations from the Nanodegree.
The main 'Adventure' playground will be OpenAI's Gym environments.
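As a taste of the Gym interface, here is a minimal interaction loop. This is only a sketch, assuming the classic pre-0.26 Gym API and the `CartPole-v1` environment with a random placeholder policy:

```python
import gym

# Create a classic control environment.
env = gym.make('CartPole-v1')

state = env.reset()
episode_return = 0.0
done = False
while not done:
    action = env.action_space.sample()  # random policy as a placeholder
    state, reward, done, info = env.step(action)
    episode_return += reward
env.close()
print('Episode return:', episode_return)
```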
Note that much of my code references the Nanodegree contents and other excellent algorithm implementations on GitHub, such as those by ShangtongZhang and rlcode.
- Blog writing (20% completed)
  - Reinforcement Learning Intro (blog 30% completed)
  - Dynamic Programming (0% completed; see the value-iteration sketch after this list)
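For the Dynamic Programming part, the workhorse algorithm is value iteration. Below is a minimal tabular sketch, assuming a transition model `P[s][a]` given as a list of `(prob, next_state, reward, done)` tuples, as exposed by Gym's discrete environments (e.g. FrozenLake's `env.P`):

```python
import numpy as np

def value_iteration(P, gamma=0.99, theta=1e-8):
    """Sweep over all states, replacing V(s) with the best
    one-step lookahead value, until the updates become tiny."""
    V = np.zeros(len(P))
    while True:
        delta = 0.0
        for s in P:
            q_values = [sum(prob * (r + gamma * V[s2] * (not done))
                            for prob, s2, r, done in P[s][a])
                        for a in P[s]]
            best = max(q_values)
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < theta:
            return V
```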
- Classic methods
  - Monte Carlo methods (workable code implemented, blog 0% completed)
  - Temporal-Difference methods (workable code implemented, blog 0% completed; see the TD(0) sketch after this list)
  - RL in Continuous Spaces (blog 0% completed)
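The core of the Temporal-Difference family is the TD(0) update, which moves a value estimate toward a bootstrapped one-step target. Here is a minimal tabular policy-evaluation sketch, assuming a discrete pre-0.26 Gym environment (e.g. FrozenLake) and a `policy(state)` function of your choosing:

```python
import numpy as np

def td0_evaluate(env, policy, episodes=500, alpha=0.1, gamma=0.99):
    """After every step, nudge V(s) toward the bootstrapped
    target r + gamma * V(s')."""
    V = np.zeros(env.observation_space.n)
    for _ in range(episodes):
        state = env.reset()
        done = False
        while not done:
            action = policy(state)
            next_state, reward, done, _ = env.step(action)
            target = reward + gamma * V[next_state] * (not done)
            V[state] += alpha * (target - V[state])
            state = next_state
    return V
```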
- Value-based methods
  - Classic DQN with raw image inputs (workable code implemented, still tuning to reach the target performance)
  - Classic DQN (workable code implemented, blog 0% completed)
  - Double-DQN (workable code implemented, blog 0% completed; see the target-computation sketch after this list)
  - Prioritized Replay DDQN (workable code implemented, blog completed)
  - Dueling-DQN (workable code implemented, blog completed)
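The key difference between classic DQN and Double-DQN is how the bootstrap target is built. A hedged PyTorch sketch, assuming `online_net` and `target_net` are Q-networks mapping a batch of states to per-action values:

```python
import torch

def dqn_targets(rewards, next_states, dones, target_net, gamma=0.99):
    # Classic DQN: the target network both selects and evaluates
    # the next action, which tends to over-estimate Q-values.
    with torch.no_grad():
        next_q = target_net(next_states).max(dim=1).values
    return rewards + gamma * next_q * (1 - dones)

def double_dqn_targets(rewards, next_states, dones,
                       online_net, target_net, gamma=0.99):
    # Double-DQN: the online network selects the action and the
    # target network evaluates it, which reduces that bias.
    with torch.no_grad():
        best = online_net(next_states).argmax(dim=1, keepdim=True)
        next_q = target_net(next_states).gather(1, best).squeeze(1)
    return rewards + gamma * next_q * (1 - dones)
```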
- Policy-based methods
  - Policy-based methods and the hill climbing algorithm (blog and workable code completed)
  - Policy Gradient methods (blog completed; see the REINFORCE sketch after this list)
  - Proximal Policy Optimization (blog in progress)
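As one concrete instance of the policy-gradient idea, here is a hedged REINFORCE loss sketch, assuming you have collected a completed episode's per-step action `log_probs` (PyTorch tensors) and scalar `rewards`:

```python
import torch

def reinforce_loss(log_probs, rewards, gamma=0.99):
    """Scale each step's log-probability by the discounted
    return that followed it; minimizing the negative sum
    ascends the expected return."""
    returns, G = [], 0.0
    for r in reversed(rewards):  # return-to-go, computed backwards
        G = r + gamma * G
        returns.insert(0, G)
    returns = torch.tensor(returns)
    # Normalizing returns is a common variance-reduction trick.
    returns = (returns - returns.mean()) / (returns.std() + 1e-8)
    return -(torch.stack(log_probs) * returns).sum()
```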