Skip to content

A few programs to profile tables in a database. Most common value for an attribute, number of occurences etc

Notifications You must be signed in to change notification settings

NJW80/Python_Database_Profiling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

wip Python Database Profiling

A few programs to profile tables in a database. Mostly to demonstrate my knowledge and a simple use case of the mysql.connector Python library. I will be adding some more advanced code and queries to this repo over the coming days

Converted from SAS code run in hadoop to Python querying a MySQL database (Hadoop version may follow at a later date)

The programs are designed to give a quick overview of the data in a database. The summary information includes things like the most common values for every field in a table and the number of occurences, referential integrity and missing values which could feed into a Data Quality Analysis ahead of conducting a full analysis on new data

About

A few programs to profile tables in a database. Most common value for an attribute, number of occurences etc

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages