Beijing University of Posts and Telecommunications (BUPT)
- Beijing Haidian District, West TuCheng Road 10, Beijing University of Posts and Telecommunications.
Stars
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
A parallel framework for population-based multi-agent reinforcement learning.
✨✨ Latest Advances on Multimodal Large Language Models
Train transformer language models with reinforcement learning.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
A modular RL library to fine-tune language models to human preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Reference implementation for DPO (Direct Preference Optimization)
A collection of recent resources on End-to-End Autonomous Driving [survey accepted in IEEE TIV]
This repository is used to collect NeRF papers on autonomous driving
HuaHuoLabel is a multifunctional AI data label tool, which supports data label of five computer vision tasks, including single-category classification, multi-category classification, semantic segme…
Driving in CARLA using model-free deep reinforcement learning
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Gather AIGC most useful tools, materials, publications and reports
This repository contains the official implementation of the following paper: Lazy and Fast Greedy MAP Inference for Determinantal Point Process
Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning.
A Library for Active Preference-based Reward Learning Algorithms
Tensorflow2.0 is delicious, just eat it!
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
PyTorch implementation of Distributed Prioritized Experience Replay (Ape-X)
Reinforcement learning algorithms for MuJoCo tasks
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Massively Parallel Deep Reinforcement Learning. 🔥