Robust Intelligence Public Examples

This repository tracks sample datasets and models. Due to size constraints, the actual data and model files are stored externally.

This repository can be installed as a pip package (e.g. pip install https://github.com/RobustIntelligence/ri-public-examples/archive/master.zip). To pull the data and model(s) for a specific example, run the following module script as follows from within the top-level directory:

from ri_public_examples.download_files import download_files
download_files('tabular/nyc_tlc', 'nyc_tlc')

This will download the NYC TLC datasets/models/configs.

Tabular

1. Income (`tabular/income/`)

Based on the Adult Census Income dataset, this directory contains a basic Catboost binary classification model, reference set, and evaluation set.

2. Fraud (`tabular/fraud/`)

This is a proprietary fraud detection dataset created by Robust Intelligence. This directory contains a basic Catboost binary classification model, reference set, and evaluation set.

3. NYC TLC (`tabular/nyc_tlc/`)

This is based on public NYC Taxi and Limousine Commission data (https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page). This directory contains a basic Catboost regression model, reference set, evaluation set and test set (representing production data).

Natural Language Processing

Classification

1. ArXiv (`nlp/classification/arxiv/`)

This is based on the public ArXiv dataset. This directory contains an NLP topic classification model, reference set, evaluation set, and test sets representing production data.

2. Twitter Sentiment Analysis (`nlp/classification/sentiment_analysis/`)

This is based on the CARER Emotion Recognition dataset. This directory contains a RoBERTA-based model trained on tweets and used for sentiment analysis. It also contains the reference set and test sets.

Computer Vision

Classification

1. Animals with Attributes 2 (`images/classification/awa2`)

This is based on the Animals with Attributes dataset. This directory contains an image classification model, a reference and evaluation set for stress testing, as well as a test set representing production data.

Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
generative/question_answering		generative/question_answering
images/classification/awa2		images/classification/awa2
nlp/classification		nlp/classification
ri_public_examples		ri_public_examples
tabular-2.0		tabular-2.0
tabular		tabular
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Robust Intelligence Public Examples

Tabular

1. Income (`tabular/income/`)

2. Fraud (`tabular/fraud/`)

3. NYC TLC (`tabular/nyc_tlc/`)

Natural Language Processing

Classification

1. ArXiv (`nlp/classification/arxiv/`)

2. Twitter Sentiment Analysis (`nlp/classification/sentiment_analysis/`)

Computer Vision

Classification

1. Animals with Attributes 2 (`images/classification/awa2`)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 12

Uh oh!

Languages

License

RobustIntelligence/ri-public-examples

Folders and files

Latest commit

History

Repository files navigation

Robust Intelligence Public Examples

Tabular

1. Income (tabular/income/)

2. Fraud (tabular/fraud/)

3. NYC TLC (tabular/nyc_tlc/)

Natural Language Processing

Classification

1. ArXiv (nlp/classification/arxiv/)

2. Twitter Sentiment Analysis (nlp/classification/sentiment_analysis/)

Computer Vision

Classification

1. Animals with Attributes 2 (images/classification/awa2)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 12

Uh oh!

Languages

1. Income (`tabular/income/`)

2. Fraud (`tabular/fraud/`)

3. NYC TLC (`tabular/nyc_tlc/`)

1. ArXiv (`nlp/classification/arxiv/`)

2. Twitter Sentiment Analysis (`nlp/classification/sentiment_analysis/`)

1. Animals with Attributes 2 (`images/classification/awa2`)

Packages