Stars
The AI-native database built for LLM applications, providing fast hybrid search across dense vectors, sparse vectors, tensors (multi-vector), and full text.
Fast and Lightweight Observability Data Collector
An industrial-grade C++ implementation of the Raft consensus algorithm based on brpc, widely used inside Baidu to build highly available distributed systems.
A complete computer science study plan to become a software engineer.
An open-source implementation of the Bw-Tree used in SQL Server Hekaton
Create, operate and scale self-healing MySQL clusters in Kubernetes
Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Alibaba Tair, Redpanda, YDB and StarRocks
Papers from the computer science community to read and discuss.
Class materials for a distributed systems lecture series
C/C++ JSON parser/generator benchmark
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
LogCabin is a distributed storage system built on Raft that provides a small amount of highly replicated, consistent storage. It is a reliable place for other distributed systems to store their cor…
High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.
Protocol Buffers - Google's data interchange format
Fastsocket is a highly scalable socket and its underlying networking implementation for the Linux kernel. With straight linear scalability, Fastsocket can provide extremely good performance in multi…
Framework and Library for Distributed Online Machine Learning


