Skip to content
View ethany21's full-sized avatar

Block or report ethany21

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Hands-On System Programming with C++, published by Packt

C++ 157 65 Updated Jan 18, 2023

Patterns and resources of low latency programming.

1,214 64 Updated Jul 30, 2025

A curated list of resources on operating system design and implementation.

201 13 Updated Jun 5, 2024

An implementation of the Raft distributed consensus protocol using the Tokio framework.

Rust 1,091 87 Updated Feb 12, 2023

Accelerating MoE with IO and Tile-aware Optimizations

Python 635 73 Updated Apr 15, 2026
Python 247 29 Updated Apr 5, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,000 1,265 Updated Apr 14, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,118 682 Updated Apr 17, 2026

rvLLM: High-performance LLM inference in Rust. Drop-in vLLM replacement.

Rust 428 48 Updated Apr 17, 2026

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 17,917 1,600 Updated Apr 17, 2026

Mamba SSM architecture

Python 17,999 1,698 Updated Apr 16, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,431 903 Updated Apr 17, 2026

Extract tables from PDF files

Java 2,023 451 Updated Mar 19, 2025

Algorithms for latent compaction

Python 207 24 Updated Mar 31, 2026

LLM KV cache compression made easy

Python 1,042 134 Updated Apr 14, 2026

Systems Programming UIUC FA 2016

C 13 34 Updated Feb 24, 2017

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 4,096 307 Updated Apr 13, 2026

Sandboxes for every agent. Embeddable, stateful, snapshots, and hardware isolation.

Rust 1,779 100 Updated Apr 15, 2026

Fast and memory-efficient exact attention

Python 23,399 2,621 Updated Apr 17, 2026

A course to build the SQL layer of a distributed database.

Go 2,041 548 Updated Sep 27, 2023

A course to build distributed key-value service based on TiKV model

Go 3,907 1,082 Updated May 3, 2025

High-performance wait-free memory reclamation for wait-free data structures (ASMR). Bounded memory usage, predictable latency.

Rust 100 1 Updated Apr 14, 2026
C++ 3 1 Updated Dec 12, 2020

Raft distributed consensus algorithm implemented in Rust.

Rust 3,322 454 Updated Apr 14, 2026

Implementation of Chandy–Lamport snapshot algorithm for recording a consistent global state of an asynchronous distributed system

Java 5 3 Updated Jan 27, 2018

Chandy-Lamport distributed snapshot implementation

Python 6 2 Updated Apr 1, 2014

My UCLA ECE M116C Fall 2022 Resources

C++ 3 1 Updated Jan 3, 2023

A circular buffer written in C using Posix calls to create a contiguously mapped memory space. BSD Licensed.

C 270 42 Updated May 10, 2021
Next