Model Tracker

Lightweight predictive modeling machine learning model tracking system.

Summary

ModelTracker is a system designed to streamline the experimentation and tracking of machine learning models. It allows data scientists to record and manage their experiments by automatically storing models, configurations, and evaluation metrics in a SQLite database. The system provides robust functionality for retrieving and analyzing experimental results, including the ability to fit final models on full datasets and visualize the performance distribution across experiments. With features such as automated tracking of model performance, runtime analysis, and kernel density estimates for score distributions, this system enables efficient management and insightful analysis of machine learning workflows, ensuring reproducibility and facilitating the selection of optimal models for deployment.

Developed in collaboration with ChatGPT4o in August 2024.

Usage

Setup and Installation

Clone the Repository: Clone the GitHub repository to your local machine.
```
git clone git@github.com:Jason2Brownlee/ModelTracker.git
```
Install Dependencies: Ensure that you have Python 3 installed, along with the required libraries:
```
pip install -r requirements.txt
```
Create the Database: Initialize the SQLite database by running:
```
make create-database
```
Add Sample Experiments: Add sample experiments using default classifier parameters:
```
make add-experiments
```
Run Experiments: Execute all pending experiments:
```
make run-experiments
```
Show Top Results: Display the top 5 experiments based on accuracy:
```
make show-results
```
Plot Experiment Results: Visualize the runtime vs. accuracy for the top 3 models and the distribution of all scores:
```
make plot-scores
```
Fit the Final Model: Fit a final model using the best experiment ID:
```
make final-model
```
Clean the Database (optional): Remove the current database to start fresh:
```
make clean
```

Customization

To customize your dataset loading, resampling method, and evaluation metric, edit the custom_config.py file located in the src directory.

load_dataset():
- Purpose: Loads the dataset used for experiments.
- Customization: Modify this function to load your specific dataset.
default_resampling_method(X, y):
- Purpose: Defines the resampling method (e.g., cross-validation) used during model evaluation.
- Customization: Update this function to reflect your preferred resampling strategy.
default_evaluation_metric():
- Purpose: Specifies the evaluation metric (e.g., accuracy, F1 score) used to assess model performance.
- Customization: Change this function to return the appropriate metric for your problem domain.

TODO

Use 10-fold repeated stratified cross validation to evaluate models as the default.
Add a script that inserts a grid search experiments of common hyperparameters for common classifications algorithms.
Write a script/bash snippet/make target that polls the database for top scores that can be run in a shell while experiments are running.
Perhaps add a one-age website that gives a live review scores and plots.
Plot of model score vs time.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
dev		dev
src		src
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Model Tracker

Summary

Usage

Setup and Installation

Customization

TODO

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Model Tracker

Summary

Usage

Setup and Installation

Customization

TODO

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages