Skip to content

derek-johns/Youtube-Recommendation-System

 
 

Repository files navigation


Overview of Project

Using a Docker container to host our Airflow allowed us to run the standard build on each machine. Airflow was scheduled to run daily to pull YouTube's API to grab the top trending videos. The data was then cleaned and exported to MySQL Database hosted on AWS Lightsail. Which was then exported to be used as data for visualization and machine learning models.

Summary of Data Analysis

  • NFL produced the most trending videos in January
  • Most disliked video was by Mini Ladd with a 71% dislike ratio
  • Most popular category videos in order Entertainment, Spirts, Music
  • Most popular video is The Weeknd - Save Your Tears
  • Video that was on top trending the longest, MrBeastGaming - If You Build a House, I'll Pay For It! for 7 days


Technologies Involved:

  • Python
  • AWS Lightsail MySQL Database
  • Docker
  • Airflow
  • Jupyter Notebook
  • Pandas
  • Natural Language Toolkit(NLTK)
  • Seaborn
  • Matplotlib
  • SKLearn TF-IDF
  • Django

API:

  • YouTube Data API v3

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 97.1%
  • Python 2.5%
  • HTML 0.4%