Skip to content
View WuJian1995's full-sized avatar

Block or report WuJian1995

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

基于DPO算法微调语言大模型,简单好上手。

Python 48 3 Updated Jul 3, 2024

Code implementation for ICLR 2025 paper: ELFS: Label-Free Coreset Selection with Proxy Training Dynamics

Python 8 1 Updated Feb 11, 2025

Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific and task-specific training data to improve LLM finetuning and…

Python 16 1 Updated Dec 25, 2024

Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"

Python 107 8 Updated Oct 11, 2025

how to run DeepSeek-R1-Distill-Qwen-1.5B GGUF locally on your PC

Python 28 4 Updated Jan 24, 2025

[COLM 2025] LIMO: Less is More for Reasoning

Python 1,060 52 Updated Jul 30, 2025

Fully open reproduction of DeepSeek-R1

Python 25,795 2,406 Updated Nov 24, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,040 1,110 Updated Jan 8, 2026

Democratizing Reinforcement Learning for LLMs

Python 4,960 477 Updated Jan 6, 2026

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,394 180 Updated Jan 8, 2026

Clinical Histopathology Imaging Evaluation Foundation Model

Python 683 112 Updated Jan 8, 2026

This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.

Python 45 3 Updated Aug 22, 2025

Get your documents ready for gen AI

Python 49,385 3,431 Updated Jan 8, 2026

A multi-programming language benchmark for LLMs

Python 290 53 Updated Jan 5, 2026

Train a 1B LLM with 1T tokens from scratch by personal

Jupyter Notebook 782 78 Updated Apr 27, 2025

Retrieval augmented generation for middle-school math question answering and hint generation.

Jupyter Notebook 43 5 Updated Feb 19, 2025

RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能,基于本地LLM、embedding模型、reranker模型实现,支持GraphRAG,无须安装任何第三方agent库。

Python 824 143 Updated Apr 2, 2025

The evaluation code for the paper "MoreHopQA: More Than Multi-hop Reasoning"

Python 14 1 Updated Jun 21, 2024

Adaptive FNO transformer - official Pytorch implementation

Python 273 31 Updated Nov 7, 2022
Python 26 2 Updated Jul 11, 2024

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 184 16 Updated Jun 25, 2025

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 389 16 Updated Jan 19, 2025

[ACL 2024] NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models

Python 5 Updated Jun 6, 2024

Lottery Ticket Adaptation

Python 40 4 Updated Nov 20, 2024

Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]

Python 77 9 Updated Nov 14, 2024

Introduction page of a challenging text-to-SQL dataset: KaggleDBQA

40 6 Updated Sep 20, 2023

Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"

Python 15 Updated Aug 26, 2024

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)

Python 574 55 Updated Sep 10, 2024
Next