
[ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding

Python 28 Updated Jan 27, 2026

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 8,840 695 Updated Feb 1, 2026

PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005

Python 45 1 Updated Nov 8, 2024

A Framework of Small-scale Large Multimodal Models

Python 959 96 Updated Apr 26, 2025

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Python 162 10 Updated Sep 27, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 28,493 2,880 Updated Apr 30, 2025

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 615 78 Updated Sep 11, 2024

Official PyTorch implementation of "Which Tokens to Use? Investigating Token Reduction in Vision Transformers", presented at the ICCV 2023 NIVT workshop

Python 35 5 Updated Aug 10, 2023

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters.

Python 886 54 Updated Jan 3, 2025

The official implementation of Latte: Latent Diffusion Transformer for Video Generation.

Python 35 3 Updated Feb 26, 2025

[ICCV 2025] QuEST: Efficient Finetuning for Low-bit Diffusion Models

Python 55 5 Updated Jun 26, 2025

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Python 343 21 Updated Sep 24, 2024

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,327 86 Updated Apr 15, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,416 2,725 Updated Aug 12, 2024

Activation-aware Singular Value Decomposition for Compressing Large Language Models

Python 85 16 Updated Oct 22, 2024

[Work in progress] PyTorch implementation of supervised and deep Q-learning EWC (Elastic Weight Consolidation), introduced in "Overcoming Catastrophic Forgetting in Neural Networks"

Python 30 4 Updated May 25, 2018

PB-LLM: Partially Binarized Large Language Models

Python 157 8 Updated Nov 20, 2023

[ICCV2023] Dataset Quantization

Python 263 18 Updated Jan 6, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 12,073 939 Updated Mar 11, 2025

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Python 713 49 Updated Aug 13, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,430 291 Updated Jul 17, 2025

Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)

Python 141 5 Updated Apr 1, 2023

Reorder-based post-training quantization for large language models

Python 198 15 Updated May 17, 2023

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,250 193 Updated Mar 27, 2024

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,597 193 Updated Jul 12, 2024

The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization

Python 127 17 Updated Sep 23, 2025

Code for "Network Binarization via Contrastive Learning", accepted to ECCV 2022.

Python 14 1 Updated Jul 13, 2022

PyTorch 1.0 implementation of the approximate Earth Mover's Distance

Cuda 140 12 Updated Jun 5, 2019

Using pre-trained Diffusion models as priors for inference tasks

Jupyter Notebook 210 11 Updated Feb 9, 2023