multi-agent-rl

Here are 9 public repositories matching this topic...

hsvgbkhgbv / SQDDPG

A framework for research on multi-agent reinforcement learning, including the implementation of the experiments in the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".

framework reinforcement-learning openai-gym pytorch policy-gradient multiagent-reinforcement-learning multi-agent-reinforcement-learning marl sqddpg shapley-q-value multi-agent-rl

Updated Nov 4, 2024
Python

Nikelroid / adversarial-coevolution

Adversarial Co-Evolution of RL and LLM Agents: A framework for training high-performance PPO agents against Large Language Models in Gin Rummy, utilizing curriculum learning and knowledge distillation.

reinforcement-learning pytorch knowledge-distillation gin-rummy curriculum-learning ppo multi-agent-rl pettingzoo stable-baselines3 llm ollama

Updated Dec 13, 2025
Python

tk-yasuno / dql-bridge-maintenance

A deep reinforcement learning system for optimizing bridge maintenance decisions across municipal infrastructure fleets, implementing cross-subsidy budget sharing and cooperative multi-agent learning.

reinforcement-learning deep-q-learning decision-support-system predictive-maintenance cbm resource-sharing large-scale-optimization markov-decision-process multi-agent-rl prognostics-health-management budget-allocation bridge-maintenance disaster-resilience infrastructure-maintenance-management infrastructure-resilience municipal-infrastructure hadr-ai cooperative-rl cross-subsidy

Updated Dec 5, 2025
Python

Vaioskn / adversarial-soccer-rl

Deterministic hex-grid soccer environment with two adversarial agents. Implements Q-Learning, Minimax-Q (via LP), and Belief-Q with online belief updates; trains in SE2G/SE6G to reduce state space and evaluates behaviors in the full environment with comprehensive visualizations.

reinforcement-learning linear-programming q-learning game-theory markov-games markov-decision-process multi-agent-rl adversarial-rl minimax-q belief-q state-space-reduction

Updated Sep 28, 2024
Python

julesser / DeepRL-P3-Collaboration-Competition

Project 3 of Udacity's Deep Reinforcement Learning Nanodegree Program

unity-ml-agents multi-agent-rl reinforcment-learning

Updated Oct 25, 2021
Python

alizangeneh / multiagent-warehouse-navigation-dqn

Research-grade reinforcement learning framework for single-agent and multi-agent warehouse navigation, built on Deep Q-Networks (DQN) in PyTorch with a replay buffer, target networks, logging, and a full test suite. Built for PhD-level RL and autonomous systems research.

machine-learning reinforcement-learning robotics decision-making deep-reinforcement-learning path-planning pytorch dqn multiagent-systems gridworld deep-q-network ai-research target-network autonomous-navigation experience-replay multi-agent-rl cooperative-agents multi-agent-navigation warehouse-robotics

Updated Dec 11, 2025
Python

tk-yasuno / dql-multi-equipments-cbm

Multi-Equipment CBM system using QR-DQN with advanced probability distribution analysis. Coordinated maintenance decision-making for 4 industrial equipment units with realistic anomaly rates (1.9-2.2%), comprehensive risk analysis (VaR/CVaR), and 51-quantile distribution visualization.

reinforcement-learning risk-analysis uncertainty-estimation value-at-risk deep-q-learning predictive-maintenance qr-dqn distributional-rl condition-based-maintenance multi-agent-rl cvar prognostics-health-management disaster-resilience equipment-maintenance infrastructure-prognostics

Updated Dec 21, 2025
Python

mwasifanwar / multi_agent_rl

Coordinated multi-agent systems that learn to solve complex collaborative and competitive tasks.

game-theory emergent-behavior game-theory-model game-theory-algorithms game-theory-framework multi-agent-rl cooperative-ai distributed-ai

Updated Nov 4, 2025
Python

tk-yasuno / dql-aged-multi-equipment-cbm

Multi-Equipment CBM (Condition-Based Maintenance) optimization using Deep Q-Learning with cost leveling and scenario comparison. Advanced RL system with QR-DQN, N-step learning, and parallel environments for HVAC equipment predictive maintenance.

reinforcement-learning deep-q-learning cost-optimization predictive-maintenance qr-dqn scenario-analysis distributional-rl condition-based-maintenance multi-agent-rl prognostics-health-management disaster-resilience n-step-learning infrastructure-prognostics hvac-maintenance

Updated Dec 25, 2025
Python
