Skip to content
View devjwsong's full-sized avatar

Highlights

  • Pro

Block or report devjwsong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Why is this running?

Go 9,242 190 Updated Jan 2, 2026

Learn how to design large-scale systems. Prep for the system design interview.

Python 31 4 Updated Mar 18, 2018

The best ChatGPT that $100 can buy.

Python 39,630 5,057 Updated Jan 1, 2026

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 181 19 Updated Mar 18, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,382 2,148 Updated Dec 18, 2025

RLHF implementation details of OAI's 2019 codebase

Python 197 12 Updated Jan 14, 2024

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,378 173 Updated Jul 25, 2023

RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks

Jupyter Notebook 226 20 Updated Jun 20, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 50,567 3,610 Updated Dec 20, 2025

Official inference framework for 1-bit LLMs

Python 24,497 1,921 Updated Jun 3, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,654 126 Updated Apr 17, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,950 288 Updated May 15, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,131 4,676 Updated Jan 2, 2026

Official Github repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"

C++ 73 18 Updated Jul 13, 2024

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 18,997 4,406 Updated Dec 19, 2025

Create characters in Unity with LLMs!

C# 1,425 155 Updated Dec 23, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,583 7,060 Updated Jan 2, 2026

Repository for the paper "Will GPT-4 Run DOOM?"

Python 24 4 Updated Nov 27, 2024

🐍🎮 pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, Ope…

C 8,542 3,956 Updated Nov 1, 2025

Godot Engine – Multi-platform 2D and 3D game engine

C++ 104,701 23,935 Updated Jan 2, 2026

A Data Streaming Library for Efficient Neural Network Training

Python 1,436 181 Updated Oct 27, 2025
C 4,500 392 Updated Dec 27, 2023

Decompilation of The Legend of Zelda: Twilight Princess

C++ 1,489 151 Updated Jan 2, 2026

A list of open source games.

11,406 895 Updated Jan 2, 2026

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`

HTML 9,718 1,510 Updated Apr 15, 2023

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 45,415 7,105 Updated Aug 18, 2024

Fully open reproduction of DeepSeek-R1

Python 25,781 2,406 Updated Nov 24, 2025

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,339 2,125 Updated Oct 27, 2025
Next