Stars
Your go-to microservice framework for any situation, from the creator of Netty et al. You can build any type of microservice leveraging your favorite technologies, including gRPC, Thrift, Kotlin, R…
A repository of data on coronavirus cases and deaths in the U.S.
The open source AI engineering platform. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI agents, LLM applications, and ML models while controlling …
Apache Druid: a high performance real-time analytics database.
Open Source ML Model Versioning, Metadata, and Experiment Management
FoundationDB - the open source, distributed, transactional key-value store
A Scala API for Apache Beam and Google Cloud Dataflow.
Apache Beam is a unified programming model for Batch and Streaming data processing.
Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Vitess is a database clustering system for horizontal scaling of MySQL.
twitter-forks / presto
Forked from prestodb/presto
Distributed SQL query engine for running interactive analytic queries against big data sources.
Papers from the computer science community to read and discuss.
A library that provides an embeddable, persistent key-value store for fast storage.
Streaming MapReduce with Scalding and Storm
A curated collection of papers on streaming algorithms
rangadi / zkclient
Forked from sgroschupf/zkclient
A ZooKeeper client that makes life a little easier.
A platform for visualization and real-time monitoring of data workflows
Data-Intensive Text Processing with MapReduce

