Research Scientist, Microsoft Research
- New York
- https://riashat.github.io/
Stars
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
TensorBoard for PyTorch (and Chainer, MXNet, NumPy, ...)
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
A platform for developing AI systems as described in A Roadmap towards Machine Intelligence - http://arxiv.org/abs/1511.08130
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, TensorFlow. Exercises and solutions to accompany Sutton's book and David Silver's course.