Skip to content
View data2json's full-sized avatar
💭
Teaching robots to teach themselves to teach humans.
💭
Teaching robots to teach themselves to teach humans.

Block or report data2json

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Code implementation of synthetic continued pretraining

Jupyter Notebook 158 19 Updated Jan 6, 2025

Code for "Reasoning to Learn from Latent Thoughts"

Python 129 4 Updated Mar 28, 2025

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 4,750 1,466 Updated Aug 21, 2024
Python 6 1 Updated Mar 31, 2026

A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require e…

Python 15,114 1,186 Updated Apr 14, 2026

Coding Agent Harness for custom AI agents

TypeScript 22 2 Updated Apr 16, 2026

🛠️ Awesome tools & guides for harness engineering.

1,772 122 Updated Apr 14, 2026
Python 462 38 Updated Apr 17, 2026
Python 664 50 Updated Apr 16, 2026

CrossTrace: A Cross-Domain Dataset of Grounded Scientific Reasoning Traces for Hypothesis Generation

Python 2 1 Updated Mar 11, 2026

DarkSword exploit chain dump.

JavaScript 241 118 Updated Mar 22, 2026

Official Codebase for "Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights"

Python 578 55 Updated Apr 2, 2026

[MIRROR] Common effort to get an official and automated gentoo base docker container

Dockerfile 342 90 Updated Jan 21, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,050 465 Updated Apr 16, 2026

AI agents running research on single-GPU nanochat training automatically

Python 73,473 10,701 Updated Mar 26, 2026

Cited 83-model x 49-benchmark LLM evaluation matrix with 18 matrix completion methods

Python 35 5 Updated Feb 25, 2026

List of Data Science Cheatsheets to rule the world

16,208 4,054 Updated Jul 18, 2024

official code for "Large Language Models as Optimizers"

Python 732 90 Updated Dec 4, 2024

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.

711 41 Updated Apr 10, 2026

TrinityX is the new generation of ClusterVision's open-source HPC, A/I and cloudbursting platform. It is designed from the ground up to provide all services required in a modern HPC and A/I system,…

Jinja 117 47 Updated Feb 20, 2026

[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 10,822 950 Updated Apr 16, 2026

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 13,845 1,657 Updated Apr 17, 2026

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 35,664 2,424 Updated Apr 16, 2026

rep+ — Burp-style HTTP Repeater for Chrome DevTools with built‑in AI to explain requests and suggest attacks

JavaScript 1,564 184 Updated Jan 16, 2026

Machine Learning Systems

JavaScript 23,662 2,840 Updated Apr 16, 2026

Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models

Python 22 1 Updated Dec 21, 2025

Tools for merging pretrained large language models.

Python 6,985 691 Updated Mar 15, 2026
Next