- Built fuzzy merging algorithms with edit-distance constraints to find similar string pairs for data cleaning.
- Implemented the Pass-Join algorithm to perform string similarity joins using C++ and Python.
- Improved the classic fuzzy merging algorithm by 90% in speed.
-
Notifications
You must be signed in to change notification settings - Fork 0
sapphire921/Fuzzy_Merge
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
Use C++ and Python to implement the passjoin algorithm ⚔️
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published