Reinforce

This is a Reinforcement Learning (RL) library built on top of Ruby at the University of Ferrara, Italy. The library is in its early stages of development and is not yet ready for production use.

At the moment, it is simply a playground that we set up to learn some technical and/or implementation details of RL algorithms. We hope that in time it could grow and become a mature product.

Reinforce requires the torch.rb gem, which provides Ruby bindings for the PyTorch library.

Prerequisites

The installation of torch.rb might require some tweaks depending on your system. On MacOs you can install libtorch using Homebrew:

$ brew install pytorch

On Linux instead you need to first download libtorch from here, and then extract the archive in a folder of your choice. Then you need add the following configuration to your project:

$ bundle config build.torch-rb --with-torch-dir=/path/to/libtorch

This will configure the bundler to build torch.rb using the libtorch installation in the specified directory.

For more info, please visit the torch.rb's github repo

Installation

Install the gem and add to the application's Gemfile (or gems.rb) by executing:

$ bundle add reinforce

If bundler is not being used to manage dependencies, install the gem by executing:

$ gem install reinforce

Usage

Train a DQN agent to solve the GridWorld environment:

$ bundle exec examples/dqn_gridworld.rb

By default the DQN policy is saved. You can test the trained policy by executing:

$ bundle exec examples/dqn_gridworld_test.rb

Define a new environment

Defining a new environment is fairly simple. Use the examples environment as guide in defining your own. All you need is to wrap your environment in a class that defines the following methods:

initialize - Initialize the environment.
reset - Reset the environment to its initial state.
state_size - Return the size of the state space.
actions - Return the action space; you can retrieve the number of actions by calling actions.size.
step - Execute an action in the environment and return the next state, reward, done, and info.
render - Render the environment on specified output, e.g, $stdout (optional).

Contributing

Many thanks to Prof. Mauro Tortonesi and Filippo Poltronieri who are currently developing the library. All contributions are welcome.

We welcome contributions to this project.

Fork it.
Create your feature branch (git checkout -b my-new-feature).
Commit your changes (git commit -am 'Add some feature').
Push to the branch (git push origin my-new-feature).
Create new Pull Request.

License

This software is available as open source under the terms of the MIT License.

Developer Certificate of Origin

This project uses the Developer Certificate of Origin. All contributors to this project must agree to this document to have their contributions accepted.

Contributor Covenant

This project is governed by Contributor Covenant. All contributors and participants agree to abide by its terms.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
bin		bin
examples		examples
lib		lib
sig		sig
test/reinforce		test/reinforce
.gitignore		.gitignore
Rakefile		Rakefile
changelog.md		changelog.md
code_of_conduct.md		code_of_conduct.md
gems.rb		gems.rb
gridworld_ppo.pth_backup		gridworld_ppo.pth_backup
license.md		license.md
logs		logs
ppo_results_1709558009.csv		ppo_results_1709558009.csv
readme.md		readme.md
reinforce.gemspec		reinforce.gemspec
test-yjit.sh		test-yjit.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforce

Prerequisites

Installation

Usage

Define a new environment

Contributing

License

Developer Certificate of Origin

Contributor Covenant

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

DSG-UniFE/reinforce

Folders and files

Latest commit

History

Repository files navigation

Reinforce

Prerequisites

Installation

Usage

Define a new environment

Contributing

License

Developer Certificate of Origin

Contributor Covenant

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages