- bitsandbytes Public
  Forked from bitsandbytes-foundation/bitsandbytes
  8-bit CUDA functions for PyTorch
  Python, MIT License, Updated Dec 13, 2023
- booru_yolo Public
  Forked from aperveyev/booru_yolo
  YOLOv8 models and code for CG / art image processing
  Python, Updated Feb 12, 2024
- cpu-regions-diag Public
  Forked from NVIDIA/cpu-code-locality-tool
  Scripts to identify an application that will benefit from code locality optimization on ARM architecture and to generate an optimized linker script for re-building the application.
  Python, Apache License 2.0, Updated Jan 26, 2024
- dalle3-eval-samples Public
  Forked from openai/dalle3-eval-samples
  Text-to-image samples collected for the evaluation of DALL-E 3 in the whitepaper.
  MIT License, Updated Oct 17, 2023
- nvmath-python Public
  Forked from NVIDIA/nvmath-python
  NVIDIA Math Libraries for the Python Ecosystem
  Cython, Apache License 2.0, Updated Jul 8, 2024
- sd-scripts Public
  Forked from KohakuBlueleaf/sd-scripts
  Python, Apache License 2.0, Updated Dec 21, 2023
- SGEMM_CUDA Public
  Forked from siboehm/SGEMM_CUDA
  Fast CUDA matrix multiplication from scratch
  Cuda, MIT License, Updated Feb 3, 2024
- SIMDString Public
  Forked from Roblox/SIMDString
  Fast string implementation for graphics.
  C++, MIT License, Updated Jan 24, 2024
- stable-fast Public
  Forked from chengzeyi/stable-fast
  Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
  Python, MIT License, Updated Feb 8, 2024
- SUM Public
  Forked from Arhosseini77/SUM
  [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
  Python, MIT License, Updated Apr 13, 2025