Unispac


Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,724 257 Updated Nov 12, 2025

Official Repo for Open-Reasoner-Zero

Python 2,090 119 Updated Jun 2, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,703 3,662 Updated Apr 15, 2026

A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.

Python 249 9 Updated Apr 15, 2025

s1: Simple test-time scaling

Python 6,642 763 Updated Jun 25, 2025

Princeton University Ph.D. Dissertation Template

TeX 18 20 Updated Apr 9, 2017

Simple RL training for reasoning

Python 3,845 289 Updated Dec 23, 2025

Fully open reproduction of DeepSeek-R1

Python 25,988 2,415 Updated Apr 2, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 13,053 1,583 Updated Feb 27, 2026
Python 58 18 Updated Mar 25, 2026

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,571 171 Updated Apr 8, 2026

Sky-T1: Train your own O1 preview model within $450

Python 3,373 342 Updated Jul 12, 2025

Code release for Best-of-N Jailbreaking

Python 562 96 Updated Feb 5, 2025

O1 Replication Journey

2,000 61 Updated Jan 14, 2025

A series of technical reports on Slow Thinking with LLMs

Python 764 41 Updated Aug 13, 2025
Python 142 18 Updated Dec 23, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 28,874 2,928 Updated Apr 9, 2026

Recipes to scale inference-time compute of open models

Python 1,130 130 Updated Apr 2, 2026

PyTorch library for Active Fine-Tuning

Python 98 9 Updated Sep 27, 2025

NeurIPS 2024 tutorial on LLM Inference

Jupyter Notebook 49 4 Updated Dec 10, 2024

Audio Large Language Models

Python 904 47 Updated Jul 5, 2025

A survey on harmful fine-tuning attacks for large language models (ACM CSUR)

238 7 Updated Feb 25, 2026

[ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs

Python 13 1 Updated Jun 20, 2025

the LLM vulnerability scanner

HTML 7,541 879 Updated Apr 15, 2026

Curated list of data science interview questions and answers

5,581 1,249 Updated Sep 29, 2024

This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS 2024)

Shell 49 5 Updated Jan 15, 2026

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 229 17 Updated Apr 13, 2026

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 8,033 1,438 Updated Nov 28, 2025

Code and example data for the paper: Rule Based Rewards for Language Model Safety

Jupyter Notebook 208 22 Updated Jul 19, 2024