nikam-shreyas/README.md

Hi there, I'm Shreyas Nikam 👋

Full-Stack Software Engineer | Architecting Scalable AI Solutions

I earned senior responsibility by owning hard problems.

I am a Tech Lead at QuantUniversity based in Boston, MA, with 4+ years of experience building production-grade AI systems, LLM-powered tools, and scalable cloud infrastructure. I specialize in reducing inference costs and architecting end-to-end AI platforms.


🚀 What I Do

  • AI & LLM Systems: Architecting RAG pipelines, AI agents, and tool routing using LangChain, LlamaIndex, and Hugging Face.
  • Engineering Leadership: Mentoring engineers, designing system architecture, and leading CI/CD strategies for zero-downtime deployments.
  • Cloud & DevOps: Managing production MLOps and infrastructure using AWS, Docker, GitHub Actions, and Airflow.
  • Full-Stack Development: Building high-performance applications with FastAPI, Streamlit, Node.js, and Redis.

🛠️ Tech Stack

Languages: Python, Node.js, SQL

AI & Machine Learning: LangChain, Hugging Face, Pandas, Azure ML

Backend & Infrastructure: FastAPI, Docker, AWS, Redis, Apache Airflow


📊 Key Highlights

  • Cost Optimization: Reduced LLM infrastructure costs by 65% and inference costs by 50%+ through intelligent model routing and cost-aware selection.
  • Workflow Automation: Reduced course production time by 85% by developing AI-driven content platforms.
  • Compliance & Ethics: Built bias auditing tools compliant with NYC Local Law 144.
  • Scalability: Orchestrated real-time systems ingesting data from industrial sensors and scaled data collection to 400k+ profiles.

🎓 Education

  • MS in Artificial Intelligence | Northeastern University (GPA 4.0)
  • BE in Computer Science | Pune Institute of Computer Technology

📫 Connect with Me

Feel free to reach out. I respond fast!

Pinned

  1. github-package-health (Public)

    React app, powered by FastAPI and Google's PaLM, that visualizes repository health (using Snyk and LLMs) and offers AI-driven insights, including a PaLM-powered Package Expert chat.

    JavaScript

  2. WarcParser (Public)

    The WARC Processor project efficiently generates multimodal datasets from Common Crawl's WARC files for training advanced language models.

    Python 1

  3. WUTechathon (Public)

    A one-stop shop for all your currency trading needs, with live and historical rates, trend analysis, and next-day predictions.

    JavaScript 1 1

  4. MediCheck (Public)

    Electron-based healthcare system that provides a diagnosis based on symptoms.

    HTML 4

  5. VyakaranFrontend (Public)

    React-based front end for a Hindi grammar checker, inspired by Grammarly. Runs a series of grammatical checks on the document.

    JavaScript 2 2

  6. LeetCode-Daily (Public)

    Collection of LeetCode questions to ace the coding interview! - Created using [LeetHub](https://github.com/QasimWani/LeetHub)

    Python