Full-Stack Software Engineer | Architecting Scalable AI Solutions
I earned senior responsibility by owning hard problems.
I am a Tech Lead at QuantUniversity based in Boston, MA, with 4+ years of experience building production-grade AI systems, LLM-powered tools, and scalable cloud infrastructure. I specialize in reducing inference costs and architecting end-to-end AI platforms.
- AI & LLM Systems: Architecting RAG pipelines, AI agents, and tool routing using LangChain, LlamaIndex, and Hugging Face.
- Engineering Leadership: Mentoring engineers, designing system architecture, and leading CI/CD strategies for zero-downtime deployments.
- Cloud & DevOps: Managing production MLOps and infrastructure using AWS, Docker, GitHub Actions, and Airflow.
- Full-Stack Development: Building high-performance applications with FastAPI, Streamlit, Node.js, and Redis.
- Cost Optimization: Reduced LLM infrastructure costs by 65% and inference costs by 50%+ through intelligent model routing and cost-aware selection.
- Workflow Automation: Reduced course production time by 85% by developing AI-driven content platforms.
- Compliance & Ethics: Built bias auditing tools compliant with NYC Local Law 144.
- Scalability: Orchestrated real-time systems ingesting data from industrial sensors and scaled data collection to 400k+ profiles.
- MS in Artificial Intelligence | Northeastern University (GPA 4.0)
- BE in Computer Science | Pune Institute of Computer Technology
- π Portfolio: www.nikam-shreyas.com
- πΌ LinkedIn: linkedin.com/in/nikam-shreyas
- π§ Email: shreyas.s.nikam@gmail.com
Feel free to reach out. I respond fast!
