Skip to content
View devatwork's full-sized avatar

Block or report devatwork

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
8 stars written in Python
Clear filter

Node.js native addon build tool

Python 10,484 1,867 Updated Jan 17, 2026

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 9,539 860 Updated Jan 5, 2026

Python PDF Parser (Not actively maintained). Check out pdfminer.six.

Python 5,302 1,120 Updated Dec 7, 2022

A Python library to extract tabular data from PDFs

Python 3,570 524 Updated Jan 15, 2026

Named Entity Recognition (LSTM + CRF) - Tensorflow

Python 1,956 701 Updated Oct 16, 2020

Text Classification Library in Keras

Python 423 97 Updated Jun 4, 2018

spaCy REST API, wrapped in a Docker container.

Python 268 98 Updated Jan 11, 2023

A library and command line utility for diffing xml

Python 224 54 Updated Jul 15, 2025