firefighter-eric

👓

Focusing

Qifan Zhang firefighter-eric

👓

Focusing

20 followers · 94 following

Earth

Achievements

Stars

Document

52 repositories

jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 9,772 864 Updated Jan 28, 2026

claird / PyPDF4

A utility to read and write PDFs with Python

Python 338 61 Updated Nov 24, 2021

pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 9,114 692 Updated Feb 24, 2026

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,791 553 Updated Jul 11, 2024

microsoft / table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,866 306 Updated Jun 24, 2024

microsoft / UDOP

249 6 Updated Jan 22, 2023

cv-small-snails / Awesome-Table-Recognition

A curated list of resources dedicated to table recognition

406 51 Updated Dec 12, 2024

buptlihang / CDLA

CDLA: A Chinese document layout analysis (CDLA) dataset

Python 288 34 Updated Sep 13, 2021

ibm-aur-nlp / PubLayNet

Jupyter Notebook 1,038 165 Updated Jul 9, 2025

breezedeus / Pix2Text

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…

Jupyter Notebook 3,019 262 Updated Feb 7, 2026

Mathpix / mathpix-markdown-it

Markdown rendering + Latex extras (equations, tables, ...), with conversion features, for the scientific community

JavaScript 656 61 Updated Feb 24, 2026

allenai / pdffigures2

Given a scholarly PDF, extract figures, tables, captions, and section titles.

Scala 726 131 Updated Mar 10, 2024

breezedeus / CnSTD

CnSTD: 基于 PyTorch/MXNet 的中文/英文场景文字检测（Scene Text Detection）、数学公式检测（Mathematical Formula Detection, MFD）、篇章分析（Layout Analysis）的Python3 包

Python 781 115 Updated Feb 7, 2026

breezedeus / CnMFD_Dataset

Chinese Mathematical Formula Detection (MFD) Dataset 中文文档数学公式检测数据集

Python 34 2 Updated Dec 21, 2022

Ucas-HaoranWei / Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,897 143 Updated Dec 30, 2024

felix-schmitt / FormulaNet

FormulaNet is a new large-scale Mathematical Formula Detection dataset.

Python 20 11 Updated Nov 21, 2022

Yuxiang1995 / ICDAR2021_MFD

1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection（公式检测冠军方案）

Python 133 42 Updated Sep 4, 2023

DS4SD / DocLayNet

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

412 23 Updated Feb 1, 2023

HCIILAB / M6Doc

156 6 Updated May 8, 2025

shabie / docformer

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

Python 287 40 Updated Feb 13, 2023

AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,820 200 Updated Apr 9, 2025

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 5,886 623 Updated Feb 9, 2026

cndplab-founder / ICDAR2019_cTDaR

The ICDAR 2019 cTDaR is to evaluate the performance of methods for table detection (TRACK A) and table recognition (TRACK B). For the first track, document images containing one or several tables a…

178 68 Updated Aug 10, 2022

hikopensource / DAVAR-Lab-OCR

OCR toolbox from Davar-Lab

Python 759 155 Updated Nov 16, 2023

harrytea / Awesome-Document-Understanding

Document Artifical Intelligence

201 8 Updated Sep 28, 2025

google-research / tapas

End-to-end neural table-text understanding models.

Python 1,204 217 Updated Jul 22, 2024

gydpku / PPTC

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Python 59 9 Updated Feb 29, 2024

nttmdlab-nlp / SlideVQA

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Python 105 8 Updated Mar 31, 2025

mayubo2333 / MMLongBench-Doc

Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations

Python 124 6 Updated Sep 28, 2025

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,086 699 Updated Feb 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qifan Zhang firefighter-eric

Achievements

Achievements

Block or report firefighter-eric

Document

jsvine / pdfplumber

claird / PyPDF4

pymupdf / PyMuPDF

clovaai / donut

microsoft / table-transformer

microsoft / UDOP

cv-small-snails / Awesome-Table-Recognition

buptlihang / CDLA

ibm-aur-nlp / PubLayNet

breezedeus / Pix2Text

Mathpix / mathpix-markdown-it

allenai / pdffigures2

breezedeus / CnSTD

breezedeus / CnMFD_Dataset

Ucas-HaoranWei / Vary

felix-schmitt / FormulaNet

Yuxiang1995 / ICDAR2021_MFD

DS4SD / DocLayNet

HCIILAB / M6Doc

shabie / docformer

AlibabaResearch / AdvancedLiterateMachinery

mindee / doctr

cndplab-founder / ICDAR2019_cTDaR

hikopensource / DAVAR-Lab-OCR

harrytea / Awesome-Document-Understanding

google-research / tapas

gydpku / PPTC

nttmdlab-nlp / SlideVQA

mayubo2333 / MMLongBench-Doc

Ucas-HaoranWei / GOT-OCR2.0