🎯
Focusing
Stars
Model Parallel
4 repositories
Minimalistic 4D-parallelism distributed training framework for education purpose
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
Toolchain built around the Megatron-LM for Distributed Training

