Skip to content
View efrantar's full-sized avatar

Organizations

@IST-DASLab

Block or report efrantar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
14 stars written in Python
Clear filter

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,251 193 Updated Mar 27, 2024

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…

Python 1,314 191 Updated Aug 8, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 1,006 86 Updated Sep 4, 2024

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 865 116 Updated Aug 20, 2024

Solve Rubik's Cube in less than 19 moves on average with Python.

Python 722 105 Updated Mar 3, 2024

Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

Python 280 24 Updated Nov 3, 2023
Python 234 20 Updated Feb 12, 2025

Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".

Python 129 15 Updated Jul 11, 2023

A generic rubiks cube solver

Python 103 27 Updated Jun 11, 2024

The world's fastest Lego Rubik's Cube solving robot, averaging 1 second flat.

Python 80 4 Updated Feb 11, 2020

Use python3 to program your LEGO EV3. Communicate via Bluetooth, WiFi or USB. Send direct commands.

Python 50 16 Updated Oct 22, 2024

Code for ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees"

Python 20 4 Updated May 3, 2023

Efficient reference implementations of the static & dynamic M-FAC algorithms (for pruning and optimization)

Python 17 3 Updated Feb 23, 2022

The first ever Lego robot to solve a random Rubik's Cube in under 1 second.

Python 13 Updated Feb 11, 2020