| Star | Last Update | Name | Backend |
|---|---|---|---|
| ray-rllib | pytorch, tensorflow-2.x | ||
| baselines | tesorflow-1.x | ||
| dopamine | tensorflow-2.x, tesorflow-1.x | ||
| spinningup | pytorch, tesorflow-1.x | ||
| TensorLayer | tensorflow-2.x | ||
| tianshou | pytorch | ||
| keras-rl | keras | ||
| stable-baselines3 | pytorch | ||
| Deep-Reinforcement-Learning-Algorithms-with-PyTorch | pytorch | ||
| open_spiel | pytorch, tensorflow-2.x | ||
| ReAgent | pytorch | ||
| DouZero | pytorch | ||
| tensorforce | tensorflow-2.x | ||
| acme | jax, tensorflow-2.x | ||
| pytorch-a2c-ppo-acktr-gail | pytorch | ||
| trfl | tensorflow-2.x, tesorflow-1.x | ||
| PARL | paddle, pytorch | ||
| ElegantRL | pytorch | ||
| agents | tensorflow-2.x, tesorflow-1.x | ||
| DI-engine | pytorch | ||
| cleanrl | pytorch | ||
| coach | tesorflow-1.x | ||
| rlcard | pytorch | ||
| rlkit | pytorch | ||
| rlpyt | pytorch | ||
| garage | tensorflow-2.x | ||
| SLM-Lab | pytorch | ||
| chainerrl | chainer | ||
| rl | pytorch | ||
| pfrl | pytorch | ||
| rlax | jax | ||
| batch-ppo | tesorflow-1.x | ||
| scalable_agent | tesorflow-1.x | ||
| d3rlpy | pytorch | ||
| seed_rl | tensorflow-2.x | ||
| mbrl-lib | pytorch | ||
| torchbeast | pytorch | ||
| mushroom-rl | pytorch | ||
| reverb | jax, tensorflow-2.x | ||
| GA3C | tesorflow-1.x | ||
| autonomous-learning-library | pytorch | ||
| CORL | pytorch | ||
| sample-factory | pytorch | ||
| rl-starter-files | pytorch | ||
| deer | tensorflow-2.x | ||
| surreal | pytorch | ||
| rl_algorithms | pytorch | ||
| deep_rl | pytorch | ||
| jaxrl | jax | ||
| Deep-Reinforcement-Learning-Algorithms | pytorch | ||
| rl-agents | pytorch | ||
| batch_rl | tensorflow-2.x | ||
| RLs | pytorch | ||
| salina | pytorch | ||
| rl_games | pytorch | ||
| godot_rl_agents | pytorch | ||
| genrl | pytorch | ||
| tonic | pytorch, tensorflow-2.x | ||
| lagom | pytorch | ||
| malib | pytorch | ||
| machin | pytorch | ||
| JORLDY | pytorch | ||
| rlgraph | pytorch, tesorflow-1.x | ||
| rlmeta | pytorch | ||
| url_benchmark | pytorch | ||
| epymarl | pytorch | ||
| xingtian | tesorflow-1.x | ||
| HandyRL | pytorch | ||
| rlstructures | pytorch | ||
| DeepRL_Algorithms | pytorch, tensorflow-2.x | ||
| pymdp | numpy | ||
| stable-baselines | tesorflow-1.x | ||
| simple_rl | numpy | ||
| alf | pytorch, Tensorflow 2.1 | ||
| tmrl | pytorch | ||
| paac | tesorflow-1.x | ||
| adeptRL | pytorch | ||
| pomdp-baselines | pytorch | ||
| skrl | pytorch | ||
| ape-x | tesorflow-1.x | ||
| mtrl | pytorch | ||
| EasyReinforcementLearning | tesorflow-1.x | ||
| torchrl | pytorch | ||
| TimeChamber | pytorch | ||
| rlds | tensorflow-2.x | ||
| coax | jax | ||
| tleague_projpage | tesorflow-1.x | ||
| rlberry | jax, pytorch | ||
| ILSwiss | pytorch | ||
| deluca | jax | ||
| nnabla-rl | nnabla | ||
| d4pg-pytorch | pytorch | ||
| magi | jax | ||
| mrl | pytorch | ||
| rsl_rl | pytorch | ||
| distributedRL | pytorch | ||
| sbx | jax | ||
| rela | pytorch | ||
| RLHive | torch | ||
| deep_ope | tensorflow-2.x | ||
| rljax | jax | ||
| Explorer | pytorch | ||
| unstable_baselines | tensorflow-2.x | ||
| jax-rl | jax | ||
| deep_reinforcement_learning_gallery | tensorflow-2.x | ||
| cpprb | |||
| simple-reinforcement-learning | tesorflow-1.x | ||
| safeRL | pytorch | ||
| YARR | pytorch | ||
| COBS | pytorch, tensorflow-2.x | ||
| DB-Football | pytorch | ||
| raylab | pytorch | ||
| fastpbrl | jax, pytorch | ||
| QuaRL | tensorflow-2.x | ||
| accel_rl | theano | ||
| apex | pytorch | ||
| embodied | tensorflow | ||
| Rainy | pytorch | ||
| dapo | tesorflow-1.x | ||
| abcdrl | pytorch | ||
| gymnax-blines | jax | ||
| MARS | pytorch | ||
| nxdo | pytorch | ||
| gala | tesorflow-1.x | ||
| coltra-rl | pytorch | ||
| HTS-RL | pytorch | ||
| memoire | |||
| xpag | jax | ||
| fast-marl | pytorch | ||
| haiku-baseline | jax | ||
| reinforcement | mindspore | ||
| sb3_jax | jax | ||
| exarl | tf-2.x | ||
| reinforced-lib | jax | ||
| reproduceRL | tensorflow-1.x | ||
| cause-life-is-a-game | pytorch | ||
| mbrl-jax | jax | ||
| XuanJing | pytorch | ||
| causal-mbrl | pytorch |
| Star | arXiv | Last Update | Name | Accelerate Type | Property |
|---|---|---|---|---|---|
| / | / | / | vec_env | subproc [1] [2] | all |
| EnvPool | cpp | Atari, Mujoco, Compilable environment | |||
| ELF | cpp | Game in cpp, MiniRTS | |||
| Cule | gpu | Atari | |||
| Brax | gpu | robot | |||
| Isaac-gym | gpu | robot | |||
| WarpDrive | gpu | multiagent | |||
| / | griddly | cpp | grid-world game | ||
| / | powderworld | gpu | physics lightweight simulation environment | ||
| / | jumanji | jit+xla | Game / Combinatorial |