diptanu

Diptanu Choudhury diptanu

170 followers · 4 following

Facebook
San Francisco

Achievements

x3 x3

Achievements

x3 x3

Stars

bruin-data / bruin

Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

Go 1,340 52 Updated Jan 23, 2026

tensorlakeai / indexify

A realtime serving engine for Data-Intensive Generative AI Applications

Rust 1,093 142 Updated Jan 25, 2026

VectifyAI / PageIndex

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Python 8,684 643 Updated Jan 25, 2026

codelion / adaptive-classifier

A flexible, adaptive classification system for dynamic text classification

Python 526 36 Updated Oct 7, 2025

Sanster / OhMyTable

Table Structure Recognition

Python 28 2 Updated Jul 25, 2024

sparkfish / augraphy

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Python 496 60 Updated Jul 20, 2025

philschmid / document-ai-transformers

Jupyter Notebook 391 58 Updated Jan 7, 2024

hikvision-research / DAVAR-Lab-OCR

Forked from hikopensource/DAVAR-Lab-OCR

OCR toolbox from Davar-Lab

Python 9 2 Updated Jan 8, 2024

MathamPollard / awesome-table-structure-recognition

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

224 11 Updated Sep 9, 2024

slawlor / ractor

Rust actor framework

Rust 1,934 116 Updated Dec 16, 2025

tqwewe / kameo

Fault-tolerant async actors for Rust that scale seamlessly

Rust 1,192 64 Updated Jan 19, 2026

AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,814 200 Updated Apr 9, 2025

khuangaf / Awesome-Chart-Understanding

A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Mod…

231 23 Updated Dec 17, 2025

hitz-zentroa / GoLLIE

Guideline following Large Language Model for Information Extraction

Python 425 27 Updated Oct 27, 2024

opendatalab / DocLayout-YOLO

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,949 147 Updated Apr 14, 2025

quininer / cbor4ii

CBOR: Concise Binary Object Representation

Rust 83 10 Updated Nov 30, 2025

slatedb / slatedb

A cloud native embedded storage engine built on object storage.

Rust 2,668 181 Updated Jan 24, 2026

tom-doerr / dspy_nodes

WIP - Allows you to create DSPy pipelines using ComfyUI

Python 200 11 Updated Dec 1, 2024

autokitteh / autokitteh

Durable workflow automation in just a few lines of code

Go 1,084 41 Updated Jan 25, 2026

lsnvoid / query-doc

Query your PDF documents and get more insights from them

Python 5 Updated Apr 28, 2024

edwinkys / oasysdb

In-memory vector store with efficient read and write performance for semantic caching and retrieval system. Redis for Semantic Caching.

Rust 376 13 Updated Nov 29, 2024

nilsherzig / LLocalSearch

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…

Go 5,965 372 Updated Dec 11, 2025

orbitinghail / sqlsync

SQLSync is a collaborative offline-first wrapper around SQLite. It is designed to synchronize web application state between users, devices, and the edge.

Rust 2,864 42 Updated Nov 19, 2025

LaurentMazare / tch-rs

Rust bindings for the C++ api of PyTorch.

Rust 5,237 411 Updated Jan 22, 2026

microsoft / CodeXGLUE

CodeXGLUE

C# 1,800 390 Updated Apr 23, 2024

hydradatabase / columnar

Postgres-native columnar storage extension

C 3,010 99 Updated Feb 10, 2025

fluxninja / aperture

Rate limiting, caching, and request prioritization for modern workloads

Go 725 35 Updated Dec 21, 2025

PsiACE / riteraft

RiteRaft - A raft framework, for regular people

Rust 333 24 Updated Feb 18, 2024

a327ex / blog

gamedev blog

3,313 147 Updated Mar 8, 2021

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 333,281 54,188 Updated Nov 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Diptanu Choudhury diptanu

Achievements

Achievements

Block or report diptanu

Stars

bruin-data / bruin

tensorlakeai / indexify

VectifyAI / PageIndex

codelion / adaptive-classifier

Sanster / OhMyTable

sparkfish / augraphy

philschmid / document-ai-transformers

hikvision-research / DAVAR-Lab-OCR

MathamPollard / awesome-table-structure-recognition

slawlor / ractor

tqwewe / kameo

AlibabaResearch / AdvancedLiterateMachinery

khuangaf / Awesome-Chart-Understanding

hitz-zentroa / GoLLIE

opendatalab / DocLayout-YOLO

quininer / cbor4ii

slatedb / slatedb

tom-doerr / dspy_nodes

autokitteh / autokitteh

lsnvoid / query-doc

edwinkys / oasysdb

nilsherzig / LLocalSearch

orbitinghail / sqlsync

LaurentMazare / tch-rs

microsoft / CodeXGLUE

hydradatabase / columnar

fluxninja / aperture

PsiACE / riteraft

a327ex / blog

donnemartin / system-design-primer