Lists (15)
Sort Name ascending (A-Z)
Stars
Heterogeneous AI Computing Virtualization Middleware(Project under CNCF)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Fast OS-level support for GPU checkpoint and restore
ValueCell is a community-driven, multi-agent platform for financial applications.
AI Trading OS: Multi-AI, multi-exchange trading infrastructure with Strategy Studio.
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
Ancillary open source software to support confidential computing on NVIDIA GPUs
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
GPU Admin Tools. Includes Confidential Computing controls for H100, and other functionality
a.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
Serverless LLM Serving for Everyone.
resource used in video clip on code reading
This repo is a mirror of the official lttng-ust git found at git://git.lttng.org/lttng-ust.git. LTTng-UST, the Linux Trace Toolkit Next Generation Userspace Tracer, is port of the low-overhead trac…
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
DeepEP: an efficient expert-parallel communication library
Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workl…
NVIDIA Linux open GPU with P2P support