WuJian1995

Follow

Jian Wu WuJian1995

Follow

7 followers · 27 following

Titech
tokyo

Lists (2)

Sort

🔮 Future ideas

✨ Inspiration

Starred repositories

sugarandgugu / Simple-Trl-Training

基于DPO算法微调语言大模型，简单好上手。

Python 48 3 Updated Jul 3, 2024

eltsai / elfs

Code implementation for ICLR 2025 paper: ELFS: Label-Free Coreset Selection with Proxy Training Dynamics

Python 8 1 Updated Feb 11, 2025

ZifanL / TSDS

Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific and task-specific training data to improve LLM finetuning and…

Python 16 1 Updated Dec 25, 2024

PKU-ML / LongPPL

Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"

Python 107 8 Updated Oct 11, 2025

fabiomatricardi / Deepseek-R1-qwen1.5B

how to run DeepSeek-R1-Distill-Qwen-1.5B GGUF locally on your PC

Python 28 4 Updated Jan 24, 2025

GAIR-NLP / LIMO

[COLM 2025] LIMO: Less is More for Reasoning

Python 1,060 52 Updated Jul 30, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,795 2,406 Updated Nov 24, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,040 1,110 Updated Jan 8, 2026

deepseek-ai / DeepSeek-V3

Python 101,024 16,459 Updated Aug 28, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 4,960 477 Updated Jan 6, 2026

SwanHubX / SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,394 180 Updated Jan 8, 2026

hms-dbmi / CHIEF

Clinical Histopathology Imaging Evaluation Foundation Model

Python 683 112 Updated Jan 8, 2026

Relaxed-System-Lab / multi-actor-data-selection

This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.

Python 45 3 Updated Aug 22, 2025

docling-project / docling

Get your documents ready for gen AI

Python 49,385 3,431 Updated Jan 8, 2026

nuprl / MultiPL-E

A multi-programming language benchmark for LLMs

Python 290 53 Updated Jan 5, 2026

zhanshijinwat / Steel-LLM

Train a 1B LLM with 1T tokens from scratch by personal

Jupyter Notebook 782 78 Updated Apr 27, 2025

DigitalHarborFoundation / llm-math-education

Retrieval augmented generation for middle-school math question answering and hint generation.

Jupyter Notebook 43 5 Updated Feb 19, 2025

shibing624 / ChatPDF

RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能，基于本地LLM、embedding模型、reranker模型实现，支持GraphRAG，无须安装任何第三方agent库。

Python 824 143 Updated Apr 2, 2025

Alab-NII / morehopqa

The evaluation code for the paper "MoreHopQA: More Than Multi-hop Reasoning"

Python 14 1 Updated Jun 21, 2024

JingXuTHU / Random-Masking-Finds-Winning-Tickets-for-Parameter-Efficient-Fine-tuning

Python 14 1 Updated May 4, 2024

NVlabs / AFNO-transformer

Adaptive FNO transformer - official Pytorch implementation

Python 273 31 Updated Nov 7, 2022

USTC-StarTeam / ZIP

Python 26 2 Updated Jul 11, 2024

tianyi-lab / Superfiltering

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 184 16 Updated Jun 25, 2025

JIA-Lab-research / Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 389 16 Updated Jan 19, 2025

CAS-SIAT-XinHai / NUMCoT

[ACL 2024] NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models

Python 5 Updated Jun 6, 2024

kiddyboots216 / lottery-ticket-adaptation

Lottery Ticket Adaptation

Python 40 4 Updated Nov 20, 2024

cxcscmu / MATES

Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]

Python 77 9 Updated Nov 14, 2024

Chia-Hsuan-Lee / KaggleDBQA

Introduction page of a challenging text-to-SQL dataset: KaggleDBQA

40 6 Updated Sep 20, 2023

Hanzhang-lang / ALTER

Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"

Python 15 Updated Aug 26, 2024

FloridSleeves / LLMDebugger

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)

Python 574 55 Updated Sep 10, 2024

Starred topics

scientific-papers