Skip to content
View deepanshut041's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Canada
  • 20:26 (UTC -05:00)

Block or report deepanshut041

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
deepanshut041/README.md

Hi, I'm Deepanshu Tyagi πŸ‘‹

πŸš€ Software Engineer / AI Engineer (5+ years) focused on CUDA/C++, ML Systems / Distributed Training, and LLM infrastructure.
I like building systems that connect low-level performance with real-world ML products β€” from GPU kernels to scalable AI pipelines.


πŸ”₯ What I'm Working On (May 2026)

⚑ NanoTorch (C++/CUDA + Python)

A minimalist deep learning framework built from scratch:

  • Dynamic autograd + tensor ops (CPU/CUDA)
  • Core layers + optimizers
  • Serialization + checkpointing
  • ONNX export + benchmarks vs PyTorch
    πŸ”— https://github.com//nanotorch

🧠 CUDA + ML Systems

  • CUDA kernel optimization (matmul/elementwise/fused ops)
  • Profiling + memory efficiency
  • Distributed training experiments + benchmarking

πŸ€– LLM Agents + RAG

  • LangGraph/LangChain multi-agent workflows
  • Hybrid retrieval (pgvector + keyword)
  • Enterprise ingestion (Microsoft Graph / Google Drive)
  • Evaluation + grounding for reliability

πŸ† Highlights

  • Built production systems: microservices, Kubernetes, PostgreSQL, AWS
  • AI systems: agents, RAG pipelines, retrieval + evaluation
  • Strong GPU/C++ focus (CUDA, performance engineering)
  • Ontario Graduate Certificate (16 months) β€” Wireless Information Networking
    GPA: 3.33/4.0 (4 semesters)

πŸ› οΈ Tech Stack

C++ | CUDA | Python | PostgreSQL | Docker | Kubernetes | AWS | FastAPI | gRPC | LangGraph/LangChain | pgvector | Prometheus/Grafana


πŸ“Œ Featured


πŸ“« Connect

πŸ“§ deepanshut041@gamil.com
πŸ’Ό LinkedIn:
πŸ™ GitHub: https://github.com/deepanshut041

Pinned Loading

  1. PeerTube PeerTube Public

    Peer Tube Unofficial Client

    Kotlin 1

  2. feature-detection feature-detection Public

    Oriented FAST and RotatedΒ BRIEF using opencv

    Jupyter Notebook 221 88

  3. Reinforcement-Learning Reinforcement-Learning Public

    Implementations of Deep Reinforcement Learning Algorithms and Bench-marking with PyTorch

    Jupyter Notebook 143 39

  4. MlAgents MlAgents Public archive

    This repository contains the implementation of deep reinforcement learning algorithms to solve various unity The Environments.

    Jupyter Notebook 14 4

  5. Machine-Learning Machine-Learning Public

    Python 2