ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Languag…

Python 226 11 Updated Sep 18, 2025

withinmiaov / A-Survey-on-Mixture-of-Experts-in-LLMs

[TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".

484 22 Updated Jul 23, 2025

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,413 996 Updated Mar 5, 2026

sjtug / SJTUBeamer

Beamer template for Shanghai Jiao Tong University

TeX 735 68 Updated Mar 5, 2026

alipay / L3TC-leveraging-rwkv-for-learned-lossless-low-complexity-text-compression

Python 17 2 Updated Apr 14, 2025

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,183 1,099 Updated Nov 18, 2024

github / gitignore

A collection of useful .gitignore templates

172,984 82,767 Updated Feb 12, 2026

swordlidev / Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

389 21 Updated Apr 29, 2025

Atomic-man007 / Awesome_Multimodel_LLM

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-contex…

364 23 Updated Mar 19, 2025

qwopqwop200 / GPTQ-for-LLaMa

4-bit quantization of LLaMA using GPTQ

Python 3,073 455 Updated Jul 13, 2024

Infrasys-AI / AISystem

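AISystem mainly covers AI systems, including full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks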

Jupyter Notebook 16,440 2,335 Updated Sep 3, 2025

tongdaxu / YAECL-Yet-Another-Entropy-Coding-Library

YAECL: Yet Another Entropy Coding Library for Neural Compression Research, with Arithmetic Coding and Asymmetric Numeral Systems support

C++ 39 5 Updated Jul 23, 2023

nayuki / Reference-arithmetic-coding

Clear implementation of arithmetic coding for educational purposes in Java, Python, C++.

Java 392 106 Updated Jan 22, 2023

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,279 3,522 Updated Jan 26, 2025

0voice / linux_kernel_wiki

Linux kernel study materials: 200+ classic kernel articles, 100+ kernel papers, 50+ kernel projects, 500+ kernel interview questions, 80+ kernel videos

7,447 2,032 Updated May 20, 2024

google-deepmind / language_modeling_is_compression

Python 175 19 Updated Aug 28, 2024

zhshi0816 / Video-Frame-Interpolation-Transformer

Python 103 15 Updated Mar 29, 2022

f / prompts.chat

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 152,419 20,033 Updated Mar 14, 2026

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,850 868 Updated Jun 10, 2024

coulsonlee / STDO-CVPR2023

[CVPR2023] Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting

Python 90 4 Updated Jun 20, 2023

sniklaus / sepconv-slomo

an implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch

Python 1,024 167 Updated May 26, 2025

zhaoyan adminasmi

Stars

QwenLM / Qwen3-VL

openai / CLIP

adminasmi / Efficient-Bitrate-Ladder-Construction-for-Per-shot-Adaptive-Encoding

adminasmi / OmniZip-CVPR2026

faymek / MPCompress

youngyangyang04 / leetcode-master

krahets / hello-algo

yikangshen / MoA

OI-wiki / OI-wiki

IBM / ModuleFormer