Skip to content
View eric-wu's full-sized avatar

Block or report eric-wu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TextBoxes++: A Single-Shot Oriented Scene Text Detector

C++ 958 275 Updated Oct 22, 2023

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 33,138 4,637 Updated Nov 27, 2025

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,975 3,622 Updated Jul 28, 2024

Keras Attention Layer (Luong and Bahdanau scores).

Python 2,814 661 Updated Nov 17, 2023

Parallel computing with task scheduling

Python 13,725 1,837 Updated Jan 29, 2026

Simple examples to introduce PyTorch

Python 4,864 919 Updated Feb 20, 2022

Lime: Explaining the predictions of any machine learning classifier

JavaScript 12,093 1,857 Updated Jul 25, 2024

NLP, before and after spaCy

Python 2,232 249 Updated Sep 22, 2023

A topic-centric list of HQ open datasets.

72,509 11,116 Updated Jan 27, 2026

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

5,968 992 Updated Feb 15, 2023

Visualizations for machine learning datasets

Jupyter Notebook 7,374 889 Updated May 24, 2023

A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.

Python 10,215 1,153 Updated Jan 28, 2026

A library for efficient similarity search and clustering of dense vectors.

C++ 38,926 4,204 Updated Jan 29, 2026

Facebook AI Research Sequence-to-Sequence Toolkit

Lua 3,733 611 Updated Sep 17, 2021

Package for evaluating word embeddings

Python 441 109 Updated Jan 4, 2021

A curated list of data engineering tools for software developers

8,218 1,418 Updated Jan 5, 2026
Shell 2 Updated Nov 18, 2024
Shell 26 3 Updated Jan 9, 2026

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 70,349 16,612 Updated Jan 30, 2026

DEPRECATED in favor of BridgeServer2

Java 11 22 Updated May 30, 2019

mhealthx is a software pipeline that automates feature extraction from mobile health data saved as a Synapse project (synapse.org).

Python 13 11 Updated Jan 22, 2018

Programmatic interface to Synapse services for Python

Python 78 73 Updated Jan 29, 2026

PredictionIO, a machine learning server for developers and ML engineers.

Scala 12,536 1,918 Updated Jan 9, 2021

Interactive visualizations of time series using JavaScript and the HTML canvas tag

JavaScript 3,227 593 Updated Mar 15, 2025

A well lit place for tips, scripts, and tools written by the Cloud Foundry community

HTML 153 31 Updated Aug 4, 2021

A visualization grammar.

JavaScript 11,789 1,560 Updated Jan 29, 2026

JavaScript toolkit for creating interactive real-time graphs

JavaScript 6,522 933 Updated Jan 17, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,721 29,039 Updated Jan 30, 2026

The Community Maintained High Velocity Web Framework For Java and Scala.

Scala 12,622 4,067 Updated Jan 29, 2026

A reusable charting library written in d3.js

JavaScript 7,244 2,099 Updated Sep 15, 2023
Next