Pinned Loading
-
EAGLE
EAGLE PublicForked from SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
Python
-
LLMSpeculativeSampling
LLMSpeculativeSampling PublicForked from feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
Python
-
Medusa
Medusa PublicForked from FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
