hutm

Maksim Khadkevich hutm

11 followers · 2 following

Achievements

Organizations

Lists (1)

audioML

Stars

NVIDIA / aicr

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 267 33 Updated Apr 18, 2026

kaito-project / airunway

✈️ Kubernetes-native platform for deploying and managing AI inference across multiple providers.

TypeScript 69 23 Updated Apr 18, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,016 585 Updated Mar 13, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 52,087 6,920 Updated Apr 14, 2026

samuelint / langchain-openai-api-bridge

A bridge to use Langchain output as an OpenAI-compatible API

Python 91 19 Updated Jul 11, 2025

emcie-co / parlant

The interaction control harness for customer-facing AI agents - optimized for building controlled, consistent, and predictable customer interactions with LLMs.

Python 17,974 1,524 Updated Apr 18, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 12,976 1,948 Updated Apr 13, 2026

ai-dynamo / grove

Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling

Go 196 52 Updated Apr 16, 2026

inclusionAI / AReaL

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,058 469 Updated Apr 18, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,774 3,688 Updated Apr 17, 2026

NVIDIA / nvidia-resiliency-ext

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 283 49 Updated Apr 17, 2026

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 3,323 277 Updated Apr 8, 2026

NVIDIA / garak

the LLM vulnerability scanner

HTML 7,559 888 Updated Apr 18, 2026

NixOS / nixpkgs

Nix Packages collection & NixOS

Nix 24,325 18,629 Updated Apr 18, 2026

HanGuo97 / flute

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 390 18 Updated Apr 13, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,582 1,036 Updated Apr 18, 2026

NVIDIA / NeMo-Agent-Toolkit

The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

Python 2,200 608 Updated Apr 18, 2026

bentoml / BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 8,587 951 Updated Apr 16, 2026

ai-dynamo / nixl

NVIDIA Inference Xfer Library (NIXL)

C++ 989 292 Updated Apr 18, 2026

simplescaling / s1

s1: Simple test-time scaling

Python 6,646 762 Updated Jun 25, 2025

ungoogled-software / ungoogled-chromium-macos

macOS packaging for ungoogled-chromium

Shell 628 94 Updated Apr 17, 2026

oumi-ai / oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 9,186 750 Updated Apr 17, 2026

cookiecutter / cookiecutter

A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.

Python 24,819 2,223 Updated Apr 1, 2026

salykova / sgemm.cu

High-Performance FP32 GEMM on CUDA devices

Cuda 121 9 Updated Jan 21, 2025

huggingface / candle

Minimalist ML framework for Rust

Rust 20,027 1,529 Updated Apr 16, 2026

langfuse / langfuse-k8s

Community-maintained Kubernetes config and Helm chart for Langfuse

Go Template 236 136 Updated Apr 17, 2026

LLMServe / DistServe

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 800 91 Updated Apr 6, 2025

stefanprodan / podinfo

Go microservice template for Kubernetes

Go 5,878 1,866 Updated Apr 13, 2026

CapSoftware / Cap

Open source Loom alternative. Beautiful, shareable screen recordings.

TypeScript 18,201 1,410 Updated Apr 15, 2026

EricLBuehler / mistral.rs

Fast, flexible LLM inference

Rust 6,999 574 Updated Apr 15, 2026