Skip to content
View darkbuck's full-sized avatar

Block or report darkbuck

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The best ChatGPT that $100 can buy.

Python 42,615 5,508 Updated Feb 8, 2026

Reproduction study of Grassmann Flows for sequence modeling (arXiv 2512.19428). Shows 22.6% gap vs claimed 10-15%, includes CUDA kernels with 2x speedup.

Python 19 3 Updated Dec 26, 2025
Cuda 1 2 Updated Oct 30, 2023

Statistics on GPUs

HTML 33 2 Updated Sep 8, 2025

An implementation of NeRF acceleration using RTX cores to compute ray-grid intersections

C 3 Updated Aug 25, 2023

Masked Depth Modeling for Spatial Perception

Python 822 60 Updated Jan 29, 2026

A Heterogeneous GPU Platform for Chipyard SoC

Scala 42 2 Updated Feb 8, 2026

The Turkish Sieve Methodology: Deterministic Computation of Twin and Cousin Prime Pairs Using an N/6 Bit Data Structure

9 Updated Feb 4, 2026

Sutskever 30 implementations inspired by https://papercode.vercel.app/

Jupyter Notebook 3,085 415 Updated Feb 8, 2026

TRELLIS (Microsoft's Image-to-3D generator) running on AMD GPUs with ROCm. Includes Gaussian splatting, mesh extraction, and GLB export. Tested on RX 7800 XT.

Jupyter Notebook 16 4 Updated Jan 2, 2026

RISC-V XV6/Linux SoC, marchID: 0x2b

Verilog 1,063 77 Updated Feb 7, 2026

Sample viewer implementing several rendering methods for 3D gaussians using Vulkan API

C++ 342 32 Updated Oct 29, 2025

A fast framework for writing baseline compiler back-ends in C++

LLVM 619 32 Updated Jan 27, 2026

Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.

Python 757 114 Updated Feb 2, 2026

PhotonVision is the free, fast, and easy-to-use computer vision solution for the FIRST Robotics Competition.

Java 393 281 Updated Feb 8, 2026

Rust library to manipulate CUDA fatbinary format

Rust 11 Updated Dec 9, 2025
Jupyter Notebook 713 44 Updated Feb 7, 2026

Official code repository of UTMIST's AI^2 Tournament (2024-2025 version)

Jupyter Notebook 2 Updated Aug 14, 2025

Official code repository of UTMIST's AI^2 Tournament

Python 6 98 Updated Nov 10, 2025

A framework that support executing unmodified CUDA source code on non-NVIDIA devices.

C++ 141 15 Updated Jan 3, 2025

Metal-based implementation of D3D11 and D3D10 for macOS / Wine

C++ 976 47 Updated Feb 6, 2026

A complete neural network built entirely in x86 assembly language that learns to recognize handwritten digits from the MNIST dataset. No frameworks, no high-level languages - just pure assembly - ~…

Assembly 161 4 Updated Nov 1, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,346 988 Updated Feb 1, 2026

A machine learning accelerator core designed for energy-efficient AI at the edge.

Emacs Lisp 2,046 230 Updated Feb 6, 2026

An open-source, GPU-accelerated physics simulation engine built upon NVIDIA Warp, specifically targeting roboticists and simulation researchers.

Python 2,486 247 Updated Feb 8, 2026

Pure LLM agent powered card game w/ TTS and live dashboard.

Python 10 1 Updated Jul 28, 2025

Allo Accelerator Design and Programming Framework (PLDI'24)

Python 343 64 Updated Feb 8, 2026

Intel® Open Image Denoise library

C++ 1,996 187 Updated Feb 3, 2026

FlashInfer: Kernel Library for LLM Serving

Python 4,917 698 Updated Feb 8, 2026
Next