Skip to content

Schamnad/Data-Science

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data Science

My foray into the world of Data Science. I am following the curriculum provided in OSDSM and am working through the courses one by one. Presently I am completing the course thought by Bill Howe and his Coursera course, Introduction to Data Science.

Project 1

###Tweet Sentiment Analysis

Introduction

Scripts for capturing tweets, creating sentiment dictionary, processing & scoring tweet sentiments, written in Python. These scripts were written to facilitate the clustering of tweet length & sentiment scores in this research paper.

Twitter app authentication credentials are required for use of twitterstream.py. Acquire these here.

Project 2

Data Manipulation at Scale: Systems and Algorithms

These are my assignments for Data Manipulation at Scale: Systems and Algorithms - Bill Howe -- Spring 2016 offered on Coursera.

Topics: Python NLP on Twitter API, Distributed Computing Paradigm, MapReduce/Hadoop & Pig Script, SQL/NoSQL, Relational Algebra, Experiment design, Statistics, Graphs, Amazon EC2, Visualization.

About

My foray into the world of Data Science

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published