Highlights
- Pro
Stars
Statistical Rethinking course winter 2022
a curated list of R tutorials for Data Science, NLP and Machine Learning
Import public NYC taxi and for-hire vehicle (Uber, Lyft) trip data into a PostgreSQL or ClickHouse database
An R package for causal inference in time series
Data, Benchmarks, and methods submitted to the M4 forecasting competition
Journal of Statistical Education Paper on Using OkCupid Data for Data Science Courses
Loan-level analysis of Fannie Mae and Freddie Mac data
Advanced High Performance Data Science Toolbox for R by Laurae
PHESANT - PHEnome Scan ANalysis Tool (pheWAS, Mendelian randomisation (MR)-pheWAS etc.) in UK Biobank
An end-to-end tutorial creating an R Shiny app that uses the reticulate package with Python 3
An R package to calculate indices and theoretical physicochemical properties of peptides and protein sequences.
Open dataset of counties from the United States
Webscraping data about editors of scientific journals.
🧬 Toolkit for generating various numerical features of protein sequences
outbreaks: an R package compiling disease outbreak data
US EPA's Toxicity Forecaster (ToxCast) Pipeline. More information on the ToxCast program available here: https://www.epa.gov/comptox-tools/toxicity-forecasting-toxcast
Solution for Kaggle Rossmann Store Sales Competition
An implementation of the Rotation Forest algorithm from Rodriguez et al. 2006
Discovering anomalies in wikipedia with R
map and analyze common Milwaukee architectural styles
An R package to explore and quality check data



