Stars
8
stars
written in Python
Clear filter
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Python PDF Parser (Not actively maintained). Check out pdfminer.six.
A Python library to extract tabular data from PDFs
Named Entity Recognition (LSTM + CRF) - Tensorflow
spaCy REST API, wrapped in a Docker container.
A library and command line utility for diffing xml


