- This repo contains a curated list of R tutorials and packages for Data Science, NLP and Machine Learning. This also serves as a reference guide for several common data analysis tasks.
- Curated list of Python tutorials for Data Science, NLP and Machine Learning.
##General Stuff
- awesome-R
- R Reference Card
- R bloggers
- R vs Python
- R Resources GitHub
- Awesome R resources
- Data Mining with R
- Rob J Hyndman's R Blog
- Simple R Tricks and Tools (Video)
- RStudio GitHub Repo
- Tidying Messy Data in R Video
- Baseball Research with R
- 600 websites about R
- Implementation of 17 classification algorithms in R
- Cohort Analysis and LifeCycle Grids mixed segmentation with R
- Using R and Tableau
- COMPREHENSIVE VIEW ON CRAN PACKAGES
- Using R for Statistical Tables and Plotting Distributions
- Extended Model Formulas in R: Multiple Parts and Multiple Responses
##QUESTIONS
- In R, why is bracket better than subset?
- Subsetting Data
- Quickly reading very large tables as dataframes in R
- Using R to show data
- How can I view the source code for a function?
##Learning R
- Data Analysis and Visualization Using R
- MANY R PROGRAMMING TUTORIALS
- A Handbook of Statistical Analyses Using R, Find Other Chapters
- **Cookbook for R**
##Caret Package
- rf
- Ensembling Models with caret
- Model Training and Tuning
- Caret Model List
- Relationship between data splitting and trainControl
- Specify model generation parameters
- Tutorial, Paper
- Ensembling models with R, Ensembling Regression Models in R
##Cheatsheets
##Reference Slides
- Awesome R Reference Card
- Association Rule Mining
- Time Series Analysis
- Data Exploration and Visualisation
- Regression and Classification
- Text Mining on Twitter Data
##Neural Networks in R
##Using R for Multivariate Analysis
- Little Book of R for Multivariate Analysis!
- THE FREQPARCOORD PACKAGE FOR MULTIVARIATE VISUALIZATION
- Use of freqparcoord for Regression Diagnostics
##Time Series Analysis
- Time Series Forecasting (Online Book)
- A Little Book of Time Series Analysis in R
- Quick R: Time Series and Forecasting
- Components of Time Series Data
- Unobserved Component Models using R
- The Holt-Winters Forecasting Method
- CRAN Task View: Time Series Analysis
##Bayesian Inference
##Machine Learning
- Using R for Multivariate Analysis (Online Book)
- CRAN Task View: Machine Learning & Statistical Learning
- Machine Learning Using R (Online Book)
- Linear Regression and Regularization Code
- Cheatsheet
- Multinomial and Ordinal Logistic Regression in R
##Sentiment Analysis
##Imputation
- Imputation in R
- Imputation with Random Forests
- How to Identify and Impute Multiple Missing Values using R
- MICE
##NLP and Text Mining
- What algorithm I need to find n-grams?
- NLP R Tutorial
- Introduction to the tm Package: Text Mining in R
- Adding stopwords in R tm
- Text Mining
- Word Stemming in R
- Classification of Documents using Text Mining Package “tm”
- Text mining tools techniques and applications
- Text Mining: Overview, Applications and Issues
- Text Mining pdf
- Text Mining Another pdf
- Good PPT
- Scraping Twitter and Web Data Using R
##Visualisation
##Statistics
##Useful R Packages
- TIDY DATA HADLEY PAPER
- Package ‘tidyr’: tidyr is an evolution of reshape2. It is designed specifically for data tidying (not general reshaping or aggregating) and works well with dplyr data pipelines.
- BROOM
- plyr, stringr, reshape2 tutorial Video, CODE
- dplyr
- ggplot2
- A speed test comparison of plyr, data.table, and dplyr
- data.table
- Other Packages
- Package 'e1071'
- Package ‘AppliedPredictiveModeling’
- Package ‘stringr’: stringr is a set of simple wrappers that make R's string functions more consistent, simpler and easier to use.
- Package ‘stringdist’: Implements an approximate string matching version of R's native 'match' function. It can calculate various string distances based on edits (Damerau-Levenshtein, Hamming, Levenshtein, optimal string alignment), q-grams, or heuristic metrics.
- Package ‘FSelector’: This package provides functions for selecting attributes from a given dataset
- Ryacas – an R interface to the yacas computer algebra system
- Scatterplot3d – an R package for Visualizing Multivariate Data
- tm.plugin.webmining intro
- Solving Differential Equations in R - ODE examples
- Structural Equation Modeling With the sem Package in R
- prettyScree - prettyGraphs
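
The stringdist entry above describes approximate matching built on edit distances. As a rough illustration of that idea, here is a minimal sketch in base R only, so it runs without installing anything. It uses `utils::adist()` (generalized Levenshtein distance) as a stand-in and does not call stringdist itself. The helper `approx_match()`, its `max_dist` argument, and the sample vectors are hypothetical names made up for this example.

```r
# Approximate version of match(): for each element of x, return the index
# of the closest entry in `table`, or NA if nothing is within max_dist edits.
# NOTE: approx_match() is a hypothetical helper, not a stringdist function.
approx_match <- function(x, table, max_dist = 2) {
  # adist() returns a matrix of edit distances: rows = x, columns = table
  d <- utils::adist(x, table)
  vapply(seq_along(x), function(i) {
    j <- which.min(d[i, ])                 # closest candidate for x[i]
    if (d[i, j] <= max_dist) j else NA_integer_
  }, integer(1))
}

# Made-up sample data for illustration
lookup  <- c("levenshtein", "hamming", "qgram")
queries <- c("levenstein", "haming", "cosine")

# Classic textbook pair: three edits turn "kitten" into "sitting"
dist_kitten <- drop(utils::adist("kitten", "sitting"))
print(dist_kitten)   # 3

matches <- approx_match(queries, lookup)
print(matches)       # 1 2 NA
```

The two misspelled queries resolve to their intended entries, while "cosine" is too far from every candidate and yields `NA`, mirroring how `match()` reports a miss. For real work, the stringdist package listed above is likely the better choice, since it supports more distance metrics than this sketch.
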
##Code Snippets
##Market Basket