firefighter-eric

Stars

Document

52 repositories

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 9,772 864 Updated Jan 28, 2026

A utility to read and write PDFs with Python

Python 338 61 Updated Nov 24, 2021

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 9,114 692 Updated Feb 24, 2026

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,791 553 Updated Jul 11, 2024

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,866 306 Updated Jun 24, 2024
249 6 Updated Jan 22, 2023

A curated list of resources dedicated to table recognition

406 51 Updated Dec 12, 2024

CDLA: A Chinese document layout analysis (CDLA) dataset

Python 288 34 Updated Sep 13, 2021
Jupyter Notebook 1,038 165 Updated Jul 9, 2025

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…

Jupyter Notebook 3,019 262 Updated Feb 7, 2026

Markdown rendering + LaTeX extras (equations, tables, ...), with conversion features, for the scientific community

JavaScript 656 61 Updated Feb 24, 2026

Given a scholarly PDF, extract figures, tables, captions, and section titles.

Scala 726 131 Updated Mar 10, 2024

CnSTD: A Python3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis

Python 781 115 Updated Feb 7, 2026

Chinese Mathematical Formula Detection (MFD) Dataset: a dataset for detecting mathematical formulas in Chinese documents

Python 34 2 Updated Dec 21, 2022

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,897 143 Updated Dec 30, 2024

FormulaNet is a new large-scale Mathematical Formula Detection dataset.

Python 20 11 Updated Nov 21, 2022

1st-place solution for the ICDAR 2021 Competition on Mathematical Formula Detection

Python 133 42 Updated Sep 4, 2023

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

412 23 Updated Feb 1, 2023
156 6 Updated May 8, 2025

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

Python 287 40 Updated Feb 13, 2023

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,820 200 Updated Apr 9, 2025

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 5,886 623 Updated Feb 9, 2026

The ICDAR 2019 cTDaR is to evaluate the performance of methods for table detection (TRACK A) and table recognition (TRACK B). For the first track, document images containing one or several tables a…

178 68 Updated Aug 10, 2022

OCR toolbox from Davar-Lab

Python 759 155 Updated Nov 16, 2023

Document Artificial Intelligence

201 8 Updated Sep 28, 2025

End-to-end neural table-text understanding models.

Python 1,204 217 Updated Jul 22, 2024

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Python 59 9 Updated Feb 29, 2024

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Python 105 8 Updated Mar 31, 2025

Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations

Python 124 6 Updated Sep 28, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,086 699 Updated Feb 10, 2025