Always know what to expect from your data.

Python 11,322 1,706 Updated Apr 1, 2026

ClickHouse® is a real-time analytics database management system

C++ 46,663 8,260 Updated Apr 2, 2026

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 21,004 5,121 Updated Apr 2, 2026

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,685 957 Updated Apr 2, 2026

Jitsu is an open-source Segment alternative. Fully scriptable data ingestion engine for modern data teams. Set up a real-time data pipeline in minutes, not days.

TypeScript 4,683 343 Updated Apr 2, 2026

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python 3,011 370 Updated Apr 2, 2026

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 12,503 2,335 Updated Apr 2, 2026

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 46,720 6,353 Updated Apr 2, 2026

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 22,021 2,196 Updated Apr 2, 2026

Data Contracts engine for the modern data stack. https://www.soda.io

Python 2,319 260 Updated Apr 2, 2026

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 9,246 1,719 Updated Apr 2, 2026

The Metadata Platform for your Data and AI Stack

Java 11,754 3,417 Updated Apr 2, 2026

Secure and fast microVMs for serverless computing.

Rust 33,409 2,329 Updated Apr 1, 2026

A simple, fast translation API deployable in minutes!

JavaScript 55 7 Updated Mar 11, 2026

A collective list of free APIs

Python 418,313 45,441 Updated Mar 18, 2026

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

JavaScript 6,117 333 Updated Feb 18, 2026

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

Python 5,165 488 Updated Apr 2, 2026

ELT Data Pipeline implementation in Data Warehousing environment

Jupyter Notebook 30 10 Updated May 2, 2025

📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transfor…

Python 98 52 Updated Apr 1, 2026

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

Python 1,501 247 Updated Mar 9, 2020

A simple EDI 835 file format parser.

Python 105 45 Updated Jun 3, 2024

Simple and rapid application development framework, built on top of Flask. Includes detailed security, auto CRUD generation for your models, Google Charts and much more. Demo (login with guest/welc…

Python 4,945 1,443 Updated Mar 25, 2026