datasets

Datasets that I generally use for trainings, workshops

Public data sets links

Facebook post data text-mining, social-media
Medicare hospitals data
Uber trips data
Food products data

Code to plot decision tree

def draw_tree(model, columns):
    import pydotplus
    from sklearn.externals.six import StringIO
    from IPython.display import Image
    import os
    from sklearn import tree
    
    graphviz_path = 'C:\Program Files (x86)\Graphviz2.38/bin/'
    os.environ["PATH"] += os.pathsep + graphviz_path

    dot_data = StringIO()
    tree.export_graphviz(model,
                         out_file=dot_data,
                         feature_names=columns)
    graph = pydotplus.graph_from_dot_data(dot_data.getvalue())  
    return Image(graph.create_png())

Code to calculate Root Mean Square Percentage Error (RMSPE)

# Credit: kaggle.com
def ToWeight(y):
    w = np.zeros(y.shape, dtype=float)
    ind = y != 0
    w[ind] = 1./(y[ind]**2)
    return w

def rmspe(y, yhat):
    w = ToWeight(y)
    rmspe = np.sqrt(np.mean( w * (y - yhat)**2 ))
    return rmspe

Name		Name	Last commit message	Last commit date
Latest commit History 120 Commits
json		json
Advertising.csv		Advertising.csv
Automobile.csv		Automobile.csv
Complications and Deaths - Hospital.zip		Complications and Deaths - Hospital.zip
ENERNOC.csv		ENERNOC.csv
Guidelines.docx		Guidelines.docx
HR Analytics.csv		HR Analytics.csv
Latitude-and-Longitude-Add-In.xlam		Latitude-and-Longitude-Add-In.xlam
RCdata.zip		RCdata.zip
README.md		README.md
Sample - Superstore.xls		Sample - Superstore.xls
abcnews.csv.zip		abcnews.csv.zip
amazon_cells_labelled.csv		amazon_cells_labelled.csv
amazon_reviews.csv		amazon_reviews.csv
amazon_reviews_11.zip		amazon_reviews_11.zip
amazon_reviews_big.7z		amazon_reviews_big.7z
atm_iot.csv.zip		atm_iot.csv.zip
bank-full.csv		bank-full.csv
car_data.csv		car_data.csv
credit-default.csv		credit-default.csv
cricinfo_ind_vs_aus_2018_dec.csv		cricinfo_ind_vs_aus_2018_dec.csv
data_correlations.csv		data_correlations.csv
data_iot_temperature.zip		data_iot_temperature.zip
data_pca.csv		data_pca.csv
deliveries.csv		deliveries.csv
dontgobackmodi.csv		dontgobackmodi.csv
ellipse.csv		ellipse.csv
employees.csv		employees.csv
employees_attrition.csv.zip		employees_attrition.csv.zip
exercises.md		exercises.md
flipkart.csv		flipkart.csv
github_subscribers.csv		github_subscribers.csv
github_subscribers.csv.zip		github_subscribers.csv.zip
hotstar.allreviews_Sentiments.csv		hotstar.allreviews_Sentiments.csv
hotstar_tweets.csv		hotstar_tweets.csv
imdb_sentiment.csv		imdb_sentiment.csv
insurance.csv		insurance.csv
ipl.zip		ipl.zip
matches.csv		matches.csv
movies.zip		movies.zip
mushroom.csv		mushroom.csv
mushroom_full.csv		mushroom_full.csv
narendramodi_tweets.csv		narendramodi_tweets.csv
narendramodi_tweets.zip		narendramodi_tweets.zip
naukri_jobs_datascience.csv.zip		naukri_jobs_datascience.csv.zip
odi-batting.csv		odi-batting.csv
odi-batting.zip		odi-batting.zip
parliament.csv		parliament.csv
parliament.zip		parliament.zip
purchase_sample_1.csv		purchase_sample_1.csv
ratings.csv		ratings.csv
restaurants.csv		restaurants.csv
sample-text.txt		sample-text.txt
sample_locations.txt		sample_locations.txt
states_locations.csv		states_locations.csv
stock-prices.csv		stock-prices.csv
student_marks.csv		student_marks.csv
test.csv		test.csv
tips.csv		tips.csv
train.csv		train.csv
tweets_flipkart.zip		tweets_flipkart.zip
tweets_sentiment.csv		tweets_sentiment.csv
twitter_followers.csv		twitter_followers.csv
voted-kaggle-dataset.csv		voted-kaggle-dataset.csv
yelp_labelled.csv		yelp_labelled.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

datasets

Public data sets links

Code to plot decision tree

Code to calculate Root Mean Square Percentage Error (RMSPE)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

datasets

Public data sets links

Code to plot decision tree

Code to calculate Root Mean Square Percentage Error (RMSPE)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages