A quick demo illustrating a few data science techniques, and all in 30 minutes! We're going to illustrate what you can do in 30 minutes with a little bit of programming and science. This is an open-source contribution to the community so please add as much as you want!
NOTE: All code written below are done in python2, using python3 will not work.
You'll need to install the requirements
pip install -r requirements.txt
Next, you will also need to sign up for Twitter, obtain API credentials, and add those credentials to twitter_secrets.json.nogit (this file is intentionally ignored by git). You can obtain your API credentials on the Twitter API page. This file must be in JSON format so that the SimpleJSON package can read it, which will look something like:
{
"api_key": "XXXX",
"api_secret": "XXXX",
"access_token": "XXXX",
"access_token_secret": "XXXX"
}
where the XXXX values are yours.
To run this demo, you will need to startup an ipython notebook instance:
ipython notebook
Then go to http://localhost:8888 and click on DS30.ipynb to view the example
This is meant to just give you a brief guided tour of a few topics in machine-learning.
If you enjoyed this and want to learn more about doing data science in industry, consider applying to be a fellow at The Data Incubator
If you would like to hire data scientists, introduce data science corporate training, or partner to bring The Data Incubator to your country, reach out here.