
Organizations

@moses-smt @accept-project


PersonaPlex code.

Python 4,978 736 Updated Feb 9, 2026

kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)

Python 597 29 Updated Feb 10, 2026

Complete toolkit for developers building LLM applications.

Python 47 3 Updated Dec 5, 2025
Python 8 1 Updated Feb 12, 2026

AllenAI's post-training codebase

Python 3,579 497 Updated Feb 14, 2026

SegEval Segmentation Evaluation Package

Python 57 14 Updated Jun 13, 2023

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 19,923 3,305 Updated Feb 3, 2026

Building a comprehensive and handy list of papers for GUI agents

Python 633 34 Updated Oct 27, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,520 4,440 Updated Feb 14, 2026

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,501 1,712 Updated Jan 13, 2026

A bibliography and survey of the papers surrounding o1

TeX 1,211 51 Updated Nov 16, 2024

A reading list on LLM based Synthetic Data Generation 🔥

1,516 91 Updated Jun 5, 2025

The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice & clean structured JSON files. Presented at WWW 2025 @ Sydn…

Python 477 124 Updated Jul 18, 2025

This repository contains all the relevant data and code files for DLT Project

Jupyter Notebook 2 2 Updated Dec 22, 2023

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 3,092 228 Updated Feb 9, 2026

Optimizing inference proxy for LLMs

Python 3,322 260 Updated Jan 28, 2026

A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.

Python 64 6 Updated Jul 29, 2024

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess finan…

Jupyter Notebook 829 109 Updated Mar 4, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 70,299 13,444 Updated Feb 14, 2026

[TMLR 2024] Efficient Large Language Models: A Survey

1,253 99 Updated Jun 23, 2025

https://acl2023-retrieval-lm.github.io/

JavaScript 156 15 Updated Oct 18, 2023

Must-read Papers on Knowledge Editing for Large Language Models.

1,212 80 Updated Jul 12, 2025

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

Python 177 23 Updated Dec 31, 2024

🔥Highlighting the top ML papers every week.

12,236 767 Updated Jul 20, 2025

This is the GitHub page for publicly available emotional speech data.

381 27 Updated Jan 6, 2022

DSPy: The framework for programming—not prompting—language models

Python 32,201 2,626 Updated Feb 11, 2026

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,081 523 Updated Jul 1, 2025

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,215 3,659 Updated Jul 4, 2024