Skip to content
View RothLuo's full-sized avatar

Block or report RothLuo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)

Python 176 23 Updated Sep 16, 2020

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,254 193 Updated Mar 27, 2024

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 866 116 Updated Aug 20, 2024

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,225 1,662 Updated Feb 4, 2026

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,092 190 Updated Jun 30, 2025

Fast Inference Solutions for BLOOM

Python 566 112 Updated Oct 9, 2024

Making large AI models cheaper, faster and more accessible

Python 41,339 4,538 Updated Jan 19, 2026

Large Language Model Text Generation Inference

Python 10,752 1,254 Updated Jan 8, 2026

The simplest way to run LLaMA on your local machine

CSS 12,995 1,364 Updated Jun 18, 2024

LLM inference in C/C++

C++ 94,413 14,764 Updated Feb 5, 2026

C++ implementation for BLOOM

C 809 58 Updated May 13, 2023

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,384 591 Updated Oct 28, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 7,935 818 Updated Jan 22, 2026

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 184,496 26,291 Updated Feb 2, 2026

a library for audio and music analysis

C 3,624 408 Updated Nov 20, 2025

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`

HTML 9,956 1,553 Updated Apr 15, 2023

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 334,454 54,315 Updated Nov 3, 2025

C++那些事

C++ 42,844 8,834 Updated Jun 14, 2024

Interview questions to ponder related to computer vision.

117 26 Updated Jan 2, 2019

CV算法岗知识点及面试问答汇总,主要分为计算机视觉、机器学习、图像处理和 C++基础四大块,一起努力向offers发起冲击!

1,767 268 Updated Nov 2, 2021

根据网易云音乐的歌单, 下载flac无损音乐到本地. Download the FLAC music from Internet according to your NeteaseCloudMusic playlist.

Python 3,128 542 Updated May 22, 2023

《Linux高性能服务器编程》上的例子

C 52 29 Updated Apr 20, 2014

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,616 658 Updated Feb 4, 2026

AutoML tools chain

Python 852 178 Updated Feb 15, 2023
Python 120 20 Updated Jun 13, 2020

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 14,082 2,186 Updated Feb 5, 2026

Boosting your Web Services of Deep Learning Applications.

Python 1,244 188 Updated May 13, 2021

Tensorflow implementation of S4L: Self-Supervised Semi-Supervised Learning

Python 95 19 Updated Nov 6, 2019

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 70,211 9,785 Updated Feb 4, 2026
Next