😆
PhD Candidate, LLM Researcher, Machine Learning System, GPU, FPGA
- University of Sydney
- Sydney NSW, Australia
- https://summer-summer.github.io/
Pinned
- usyd-fsalab/fp6_llm (Public): Efficient GPU support for LLM inference with x-bit quantization (e.g., FP6, FP5).
- AlibabaResearch/flash-llm (Public): Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
- SpInfer (Public, forked from xxyux/SpInfer): SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs. Language: Cuda
- ComputerArchitectureLab (Public): This repository releases the experimental assignments of the Computer Architecture course at USTC.



