A tiny library for Python text normalisation. Useful for ad-hoc text processing.
Updated Sep 12, 2025 · Python
A CSS reset removes the default browser styling so that a website looks the same in all browsers.
URL normalization for Python
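As a rough illustration of what URL normalization typically involves, here is a minimal sketch using only the standard library's `urllib.parse`. It is not the API of the package listed above; the function name `normalize_url` and the `DEFAULT_PORTS` table are hypothetical, and the rules chosen (lowercase scheme and host, drop default ports, default empty paths to `/`, drop fragments) are just one reasonable subset.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical table of default ports for this sketch only.
DEFAULT_PORTS = {"http": 80, "https": 443}


def normalize_url(url: str) -> str:
    """Return a simplified canonical form of an http(s) URL.

    Assumptions of this sketch: userinfo is discarded, the fragment is
    dropped, and IPv6 literals are not handled (brackets would be lost).
    """
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()  # hostname is already lowercased
    port = parts.port

    # Keep the port only when it differs from the scheme's default.
    netloc = host
    if port is not None and port != DEFAULT_PORTS.get(scheme):
        netloc = f"{host}:{port}"

    path = parts.path or "/"  # an empty path is equivalent to "/"
    return urlunsplit((scheme, netloc, path, parts.query, ""))
```

Two URLs that differ only in letter case of the host, an explicit default port, or a fragment compare equal after passing through this function, which is the usual reason to normalize before deduplicating.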
Text normalizer module that converts Bangla and English digits to textual form, normalizes dates, and extracts dates.
A missing toolkit for Khmer Natural Language Processing.
Unicode normalization forms (NFC, NFD, NFKC, NFKD). A pure-Python implementation independent of Python’s core Unicode database, supporting version 17.0 of the Unicode Standard.
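To make the four forms concrete, the sketch below applies each of them to one short string. It uses Python's built-in `unicodedata` module, not the pure-Python package described above, so the Unicode version it follows depends on the interpreter; the helper name `normalization_forms` is hypothetical.

```python
import unicodedata


def normalization_forms(text: str) -> dict:
    """Map each normalization form name to the normalized text."""
    return {
        form: unicodedata.normalize(form, text)
        for form in ("NFC", "NFD", "NFKC", "NFKD")
    }


# U+00E9 is a precomposed "e with acute"; U+FB01 is the "fi" ligature.
forms = normalization_forms("\u00e9\ufb01")

# Canonical forms (NFC, NFD) compose or decompose the accent but keep
# the ligature; compatibility forms (NFKC, NFKD) also expand it to "fi".
for name, value in forms.items():
    print(name, [f"U+{ord(ch):04X}" for ch in value])
```

The printed code points show the difference: NFD and NFKD split the accented letter into `U+0065 U+0301`, while only NFKC and NFKD replace the ligature with the two letters `f` and `i`.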
Segmenter and Orthography Standardizer (SOS) for Classical Arabic (CA)
Executor that reads, resizes, crops and normalizes images.
RFC-aware email parsing, normalization, extraction, and DNS health checks with env-config and a phonenumbers-like API.
Normalization of country names in most languages
Project to convert raw, unstructured insurance claim notes into a standardized JSON format. It features a Hugging Face T5 fine-tuned model and an alternative LLM API prompting approach, wrapped in a high-performance FastAPI service.