SwayamInSync

Swayam ❤️ Open Source

Swayam SwayamInSync

Swayam ❤️ Open Source

देखा एक ख्वाब तो ये सिलसिले हुए ✨

132 followers · 197 following

Achievements

x3 x2 x2

Achievements

x3 x2 x2

Organizations

Stars

Optimized Kernels

14 repositories

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 50,398 4,163 Updated Jan 6, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,931 12,412 Updated Jan 6, 2026

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 6,012 458 Updated Jan 5, 2026

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C 454 34 Updated May 30, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,957 925 Updated Dec 15, 2025

OpenMathLib / OpenBLAS

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 7,197 1,632 Updated Jan 6, 2026

huggingface / kernels

Load compute kernels from the Hub

Python 357 29 Updated Dec 17, 2025

LaurieWired / BenchmarkCustomPTX

Custom PTX Instruction Benchmark

Cuda 137 10 Updated Feb 27, 2025

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,231 909 Updated Jan 4, 2026

MekkCyber / TritonAcademy

A repository to unravel the language of GPUs, making their kernel conversations easy to understand

Python 195 7 Updated Jun 1, 2025

MekkCyber / CutlassAcademy

A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS

247 12 Updated May 6, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 4,428 378 Updated Jan 6, 2026

NVIDIA / CUDALibrarySamples

CUDA Library Samples

C++ 2,270 439 Updated Jan 5, 2026

gpu-mode / ring-attention

ring-attention experiments

Python 161 14 Updated Oct 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Swayam SwayamInSync

Achievements

Achievements

Organizations

Block or report SwayamInSync

Optimized Kernels

unslothai / unsloth

vllm-project / vllm

linkedin / Liger-Kernel

microsoft / vattention

deepseek-ai / FlashMLA

OpenMathLib / OpenBLAS

huggingface / kernels

LaurieWired / BenchmarkCustomPTX

xlite-dev / LeetCUDA

MekkCyber / TritonAcademy

MekkCyber / CutlassAcademy

tile-ai / tilelang

NVIDIA / CUDALibrarySamples

gpu-mode / ring-attention