A prediction system that can automatically guess the star ratings trained from 100,000 reviews on Amazon.com.

upennyayang/amazon-data-mining

Amazon Data Mining

About

Course: CIS 520, Machine Learning, Fall 2011, University of Pennsylvania
Team: Yayang Tian, Tao Feng, Wenbin Zhao
Skills: MATLAB, Python, machine learning

Contribution

  1. Increased accuracy from 40.1% to 81.3% and decreased RMSE from 1.460 to 0.853.
  2. Implemented a range of machine learning methods: feature extraction and selection techniques such as PCA, stemming, metadata features, part-of-speech tagging, and information gain, and models such as Naive Bayes, AdaBoost, logistic regression, SVM, intersection kernels, and EM.
  3. Held the top performance in the class for a long period and received an award from Prof. Ben Taskar.
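
To illustrate the kind of pipeline listed above, here is a minimal Python sketch of one of the named models, Naive Bayes, together with the two reported metrics, accuracy and RMSE. It is not the team's implementation. The class name `NaiveBayesRater`, the helper functions, the whitespace tokenizer, and the toy reviews are all hypothetical and chosen for illustration, and it assumes a multinomial bag-of-words model with Laplace smoothing.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for real preprocessing such as stemming)."""
    return text.lower().split()


class NaiveBayesRater:
    """Hypothetical multinomial Naive Bayes over word counts with Laplace smoothing."""

    def fit(self, reviews, stars):
        self.class_counts = Counter(stars)        # reviews seen per star rating
        self.word_counts = defaultdict(Counter)   # star rating -> word -> count
        self.vocab = set()
        for text, star in zip(reviews, stars):
            tokens = tokenize(text)
            self.word_counts[star].update(tokens)
            self.vocab.update(tokens)
        self.total = len(stars)
        return self

    def predict(self, text):
        # Words never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        best_star, best_score = None, -math.inf
        for star, n in self.class_counts.items():
            # Add-one smoothing: every vocabulary word gets one extra count per class.
            denom = sum(self.word_counts[star].values()) + len(self.vocab)
            score = math.log(n / self.total)      # log prior
            for t in tokens:
                score += math.log((self.word_counts[star][t] + 1) / denom)
            if score > best_score:
                best_star, best_score = star, score
        return best_star


def accuracy(y_true, y_pred):
    """Fraction of ratings predicted exactly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error, treating star ratings as numbers."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


# Toy data, invented for this example only.
train_reviews = [
    "great product love it",
    "terrible broke quickly",
    "okay works fine",
    "love it great value",
    "terrible waste of money",
]
train_stars = [5, 1, 3, 5, 1]

model = NaiveBayesRater().fit(train_reviews, train_stars)
print(model.predict("great love"))
print(accuracy([5, 1, 3, 4], [5, 1, 3, 2]), rmse([5, 1, 3, 4], [5, 1, 3, 2]))
```

Accuracy counts only exact matches, while RMSE also penalizes how far off a wrong rating is, which is presumably why the results report both.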
