devjwsong

Jaewoo (Kyle) Song devjwsong

Applied Scientist @ Amazon

71 followers · 68 following

Achievements

x2 x2

Achievements

x2 x2

Highlights

Stars

pranshuparmar / witr

Why is this running?

Go 9,242 190 Updated Jan 2, 2026

IuryAlves / system-design-primer

Forked from donnemartin/system-design-primer

Learn how to design large-scale systems. Prep for the system design interview.

Python 31 4 Updated Mar 18, 2018

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 39,630 5,057 Updated Jan 1, 2026

SamsungSAILMontreal / TinyRecursiveModels

Python 6,137 938 Updated Dec 2, 2025

seannyD / VideoGameDialogueCorpusPublic

Python 63 8 Updated Oct 29, 2025

raghavc / LLM-RLHF-Tuning-with-PPO-and-DPO

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 181 19 Updated Mar 18, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,382 2,148 Updated Dec 18, 2025

vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Python 197 12 Updated Jan 14, 2024

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,378 173 Updated Jul 25, 2023

ash80 / RLHF_in_notebooks

RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks

Jupyter Notebook 226 20 Updated Jun 20, 2025

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 50,567 3,610 Updated Dec 20, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,497 1,921 Updated Jun 3, 2025

jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,654 126 Updated Apr 17, 2024

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,950 288 Updated May 15, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,131 4,676 Updated Jan 2, 2026

kaist-ina / stellatrain

Official Github repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"

C++ 73 18 Updated Jul 13, 2024

Unity-Technologies / ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …

C# 18,997 4,406 Updated Dec 19, 2025

undreamai / LLMUnity

Create characters in Unity with LLMs!

C# 1,425 155 Updated Dec 23, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,583 7,060 Updated Jan 2, 2026

adewynter / Doom

Repository for the paper "Will GPT-4 Run DOOM?"

Python 24 4 Updated Nov 27, 2024

pygame / pygame

🐍🎮 pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, Ope…

C 8,542 3,956 Updated Nov 1, 2025

godotengine / godot

Godot Engine – Multi-platform 2D and 3D game engine

C++ 104,701 23,935 Updated Jan 2, 2026

mosaicml / streaming

A Data Streaming Library for Efficient Neural Network Training

Python 1,436 181 Updated Oct 27, 2025

snesrev / zelda3

C 4,500 392 Updated Dec 27, 2023

zeldaret / tp

Decompilation of The Legend of Zelda: Twilight Princess

C++ 1,489 151 Updated Jan 2, 2026

bobeff / open-source-games

A list of open source games.

11,406 895 Updated Jan 2, 2026

chiphuyen / machine-learning-systems-design

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`

HTML 9,718 1,510 Updated Apr 15, 2023

GokuMohandas / Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 45,415 7,105 Updated Aug 18, 2024

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,781 2,406 Updated Nov 24, 2025

flairNLP / flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,339 2,125 Updated Oct 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jaewoo (Kyle) Song devjwsong

Achievements

Achievements

Highlights

Block or report devjwsong

Stars

pranshuparmar / witr

IuryAlves / system-design-primer

karpathy / nanochat

SamsungSAILMontreal / TinyRecursiveModels

seannyD / VideoGameDialogueCorpusPublic

raghavc / LLM-RLHF-Tuning-with-PPO-and-DPO

huggingface / peft

vwxyzjn / lm-human-preference-details

openai / lm-human-preferences

ash80 / RLHF_in_notebooks

anthropics / claude-code

microsoft / BitNet

jquesnelle / yarn

deepseek-ai / open-infra-index

deepspeedai / DeepSpeed

kaist-ina / stellatrain

Unity-Technologies / ml-agents

undreamai / LLMUnity

ray-project / ray

adewynter / Doom

pygame / pygame

godotengine / godot

mosaicml / streaming

snesrev / zelda3

zeldaret / tp

bobeff / open-source-games

chiphuyen / machine-learning-systems-design

GokuMohandas / Made-With-ML

huggingface / open-r1

flairNLP / flair