Highlights
- Pro
Stars
Library for fast text representation and classification.
A curated list of awesome warez and piracy links
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
An Open-Source Collection of Flash Cards to Help You Preparing Your Algorithms & Data Structures and System Design Interviews 💯
The official online compendium for Mining the Social Web, 2nd Edition (O'Reilly, 2013)
Source Code for the book: Machine Learning in Action published by Manning
A collaboratively written review paper on deep learning, genomics, and precision medicine
A frictionless, pipeable approach to dealing with summary statistics
Data sets created for stories on The Pudding, open to the public.
Quickly and securely turn your code projects into LLM prompts, all locally on your own machine!
Get protein embeddings from protein sequences
Official repository for the ProteinGym benchmarks
Utilities and scripts developed as part of Microsoft's Team Data Science Process for productive data science
Med-BERT, contextualized embedding model for structured EHR data
Performance of various open source GBM implementations
Achoo uses a Raspberry Pi to predict if my son will need his inhaler on any given day using weather, pollen, and air quality data. If the prediction for a given day is above a specified threshold, …
Electricity load forecasting with LSTM (Recurrent Neural Network)
Code for the Actuarial Data Science Tutorials published at https://actuarialdatascience.org.
Comparing fairness-aware machine learning techniques.
mwydmuch / extremeText
Forked from facebookresearch/fastTextLibrary for fast text representation and extreme classification.
UKB RAP Notebooks contains a collection of examples of how to use UK Biobank Research Analyses Platform (RAP).
An ongoing fun challenge where I'll try to post one Python benchmark per day.
Welcome to Keras Deep Learning on Graphs (Keras-DGL) http://vermaMachineLearning.github.io/keras-deep-graph-learning
Python port of "Common statistical tests are linear models" by Jonas Kristoffer Lindeløv.
Project to understand pharmaceutical spending, currently focused on US government programs.



