Stars
kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)
Complete toolkit for developers building LLM applications.
SegEval Segmentation Evaluation Package
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
Building a comprehensive and handy list of papers for GUI agents
SGLang is a high-performance serving framework for large language models and multimodal models.
This repository contains demos I made with the Transformers library by HuggingFace.
A bibliography and survey of the papers surrounding o1
A reading list on LLM based Synthetic Data Generation 🔥
The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean structured JSON files. Presented at WWW 2025 @ Sydn…
This repository contains all the relevant data and code files for DLT Project
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Optimizing inference proxy for LLMs
A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess finan…
A high-throughput and memory-efficient inference and serving engine for LLMs
[TMLR 2024] Efficient Large Language Models: A Survey
https://acl2023-retrieval-lm.github.io/
Must-read Papers on Knowledge Editing for Large Language Models.
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human-written translation and evaluation data.
🔥Highlighting the top ML papers every week.
This is the GitHub page for publicly available emotional speech data.
DSPy: The framework for programming—not prompting—language models
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.



