ArbiterGe
  • Beijing University of Posts and Telecommunications (BUPT)
  • 10 West Tucheng Road, Haidian District, Beijing (Beijing University of Posts and Telecommunications)


Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

C++ 17,967 4,851 Updated May 15, 2025

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,598 429 Updated Dec 7, 2025

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 1,269 192 Updated Nov 28, 2024

A parallel framework for population-based multi-agent reinforcement learning.

Python 548 65 Updated Dec 14, 2023

✨✨ Latest Advances on Multimodal Large Language Models

17,369 1,113 Updated Feb 23, 2026

Train transformer language models with reinforcement learning.

Python 17,468 2,513 Updated Feb 27, 2026

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,740 483 Updated Jan 8, 2024

A modular RL library to fine-tune language models to human preferences

Python 2,379 202 Updated Mar 1, 2024

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,377 170 Updated Jul 25, 2023

Reference implementation for DPO (Direct Preference Optimization)

Python 2,858 233 Updated Aug 11, 2024

A collection of recent resources on End-to-End Autonomous Driving [survey accepted in IEEE TIV]

241 21 Updated Feb 16, 2025

This repository is used to collect NeRF papers on autonomous driving

31 2 Updated Apr 12, 2024

HuaHuoLabel is a multifunctional AI data label tool, which supports data label of five computer vision tasks, including single-category classification, multi-category classification, semantic segme…

Python 10 2 Updated Jan 19, 2024

Driving in CARLA using model-free deep reinforcement learning

Python 60 17 Updated Feb 2, 2021

[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving

Python 4,499 515 Updated Oct 29, 2025

Gather AIGC most useful tools, materials, publications and reports

153 21 Updated Apr 23, 2025

This repository contains the official implementation of the following paper: Lazy and Fast Greedy MAP Inference for Determinantal Point Process

C++ 5 Updated Jan 30, 2023

Fast Greedy MAP Inference for DPP

Python 130 31 Updated May 11, 2020

Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning.

Python 247 54 Updated Jun 5, 2024

A Library for Active Preference-based Reward Learning Algorithms

Python 54 12 Updated Dec 16, 2023

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

Python 9,972 2,455 Updated Sep 22, 2022

Jupyter Notebook 3 1 Updated Jul 10, 2020

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Python 837 145 Updated Nov 29, 2022

PyTorch Implementation of Distributed Prioritized Experience Replay(Ape-X)

Python 155 17 Updated Apr 28, 2019

Reinforcement learning algorithms for MuJoCo tasks

Python 448 110 Updated Mar 13, 2025

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Python 4,465 737 Updated Feb 13, 2026

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,295 969 Updated Feb 20, 2026

An open autonomous driving platform

C++ 26,446 9,958 Updated Feb 27, 2026