Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Code implementation for ICLR 2025 paper: ELFS: Label-Free Coreset Selection with Proxy Training Dynamics
Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific and task-specific training data to improve LLM finetuning and…
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
how to run DeepSeek-R1-Distill-Qwen-1.5B GGUF locally on your PC
Fully open reproduction of DeepSeek-R1
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…
Clinical Histopathology Imaging Evaluation Foundation Model
This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.
Get your documents ready for gen AI
Train a 1B LLM with 1T tokens from scratch by personal
Retrieval augmented generation for middle-school math question answering and hint generation.
RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能,基于本地LLM、embedding模型、reranker模型实现,支持GraphRAG,无须安装任何第三方agent库。
The evaluation code for the paper "MoreHopQA: More Than Multi-hop Reasoning"
Adaptive FNO transformer - official Pytorch implementation
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
[ACL 2024] NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
Lottery Ticket Adaptation
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
Introduction page of a challenging text-to-SQL dataset: KaggleDBQA
Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step (ACL'24)