Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
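For a flavor of what such a from-scratch build starts with, here is a minimal sketch of the next-token-prediction objective and training loop a GPT-style model is trained on. It is an illustrative example, not code from the repository; the vocabulary size, toy model, and random data are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy "bigram" language model: the simplest from-scratch starting point,
# predicting the next token from the current one. Vocabulary size, batch
# shape, and the random training data are placeholder assumptions.
vocab_size = 256
model = nn.Embedding(vocab_size, vocab_size)   # per-token logits for the next token
opt = torch.optim.AdamW(model.parameters(), lr=1e-3)

data = torch.randint(0, vocab_size, (4, 33))   # fake byte-level batch
inputs, targets = data[:, :-1], data[:, 1:]    # shift by one: predict token t+1 from token t

for step in range(100):
    logits = model(inputs)                     # (batch, time, vocab)
    loss = F.cross_entropy(logits.reshape(-1, vocab_size), targets.reshape(-1))
    opt.zero_grad()
    loss.backward()
    opt.step()

print(f"final loss: {loss.item():.3f}")
```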
DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding applications in Kubernetes.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Demo project showing a single Rust codebase running on CPU and directly on GPUs
Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
MCP Server for kubernetes management commands
A Datacenter Scale Distributed Inference Serving Framework
Fast and memory-efficient exact attention
FlashMLA: Efficient Multi-head Latent Attention Kernels
llama3 implementation one matrix multiplication at a time
Work through llama3 inference step by step: grasp the core concepts, master the derivation, and implement the code.
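In that spirit, a minimal sketch of a single causal attention head built one matrix multiplication at a time. The shapes and random weights below are toy placeholders for illustration, not values from the repository.

```python
import torch

torch.manual_seed(0)
seq_len, d_model, d_head = 6, 32, 16

x = torch.randn(seq_len, d_model)        # token embeddings (toy values)
w_q = torch.randn(d_model, d_head)       # projection weights (toy values)
w_k = torch.randn(d_model, d_head)
w_v = torch.randn(d_model, d_head)

q = x @ w_q                              # matmul 1: queries  (seq, d_head)
k = x @ w_k                              # matmul 2: keys
v = x @ w_v                              # matmul 3: values

scores = (q @ k.T) / d_head ** 0.5       # matmul 4: scaled attention scores
mask = torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))
scores = scores.masked_fill(~mask, float("-inf"))   # causal mask: no peeking ahead
weights = torch.softmax(scores, dim=-1)

out = weights @ v                        # matmul 5: weighted sum of values
print(out.shape)                         # torch.Size([6, 16])
```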
SGLang is a high-performance serving framework for large language models and multimodal models.
Analyzes resource usage and performance characteristics of running containers.
Add-on agent to generate and expose cluster-level metrics.
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
A Cloud Native Batch System (Project under CNCF)
🍃 MINT-1T: A one trillion token multimodal interleaved dataset.
A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% of tasks (pass@1) on SWE-bench lite and 46.2% of tasks (pass@1) on SWE-bench verified with…
Efficient platform for inference and serving of local LLMs, including an OpenAI-compatible API server.
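As an illustration of what an OpenAI-compatible server enables, the standard openai Python client can simply be pointed at the local endpoint. The base URL, API key, and model name below are placeholder assumptions, not values from the project.

```python
from openai import OpenAI

# Point the official OpenAI client at a local OpenAI-compatible server.
# URL, key, and model name are placeholders; substitute your own.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

response = client.chat.completions.create(
    model="local-model",
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```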
Templates to deploy a serverless Minecraft Server on demand in AWS


