
Organizations

@vertxai

Tooling for optimized, validated, and reproducible GPU-accelerated AI runtime in Kubernetes

Go 267 33 Updated Apr 18, 2026

✈️ Kubernetes-native platform for deploying and managing AI inference across multiple providers.

TypeScript 69 23 Updated Apr 18, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,016 585 Updated Mar 13, 2026

The best ChatGPT that $100 can buy.

Python 52,087 6,920 Updated Apr 14, 2026

A bridge to use Langchain output as an OpenAI-compatible API

Python 91 19 Updated Jul 11, 2025

The interaction control harness for customer-facing AI agents - optimized for building controlled, consistent, and predictable customer interactions with LLMs.

Python 17,974 1,524 Updated Apr 18, 2026

Nano vLLM

Python 12,976 1,948 Updated Apr 13, 2026

Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling

Go 196 52 Updated Apr 16, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,058 469 Updated Apr 18, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,774 3,688 Updated Apr 17, 2026

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 283 49 Updated Apr 17, 2026

Tile primitives for speedy kernels

Cuda 3,323 277 Updated Apr 8, 2026

the LLM vulnerability scanner

HTML 7,559 888 Updated Apr 18, 2026

Nix Packages collection & NixOS

Nix 24,325 18,629 Updated Apr 18, 2026

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 390 18 Updated Apr 13, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,582 1,036 Updated Apr 18, 2026

The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

Python 2,200 608 Updated Apr 18, 2026

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 8,587 951 Updated Apr 16, 2026

NVIDIA Inference Xfer Library (NIXL)

C++ 989 292 Updated Apr 18, 2026

s1: Simple test-time scaling

Python 6,646 762 Updated Jun 25, 2025

macOS packaging for ungoogled-chromium

Shell 628 94 Updated Apr 17, 2026

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 9,186 750 Updated Apr 17, 2026

A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.

Python 24,819 2,223 Updated Apr 1, 2026

High-Performance FP32 GEMM on CUDA devices

Cuda 121 9 Updated Jan 21, 2025

Minimalist ML framework for Rust

Rust 20,027 1,529 Updated Apr 16, 2026

Community-maintained Kubernetes config and Helm chart for Langfuse

Go Template 236 136 Updated Apr 17, 2026

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 800 91 Updated Apr 6, 2025

Go microservice template for Kubernetes

Go 5,878 1,866 Updated Apr 13, 2026

Open source Loom alternative. Beautiful, shareable screen recordings.

TypeScript 18,201 1,410 Updated Apr 15, 2026

Fast, flexible LLM inference

Rust 6,999 574 Updated Apr 15, 2026