- I’m a graduate student at Texas A&M University, College Station.
- Passionate about Natural Language Processing (NLP), Large Language Models (LLMs), AI Agents and Reinforcement Learning (RL).
- My research and projects focus on:
- Reasoning and Self-Correction Abilities of Large Language Models
- Alignment of LLMs using Reinforcement Learning
- Fine-tuning LLMs to instill better domain knowledge
- Developing intelligent AI agents capable of using external tools and knowledge to generate more accurate responses
- With a solid foundation in Machine Learning, Deep Learning, and Data Science, I am enthusiastic about solving complex business problems and making impactful contributions through innovative AI solutions.
-
Texas A&M University
- College Station, TX
-
19:39
(UTC -06:00) - in/ankitbasu
Pinned Loading
-
RLHF-PPO
RLHF-PPO PublicRLHF (Reinforcement Learning from Human Feedback) with PPO (Proximal Policy Optimization) is a reinforcement learning method where a model is fine-tuned using human feedback, optimized through PPO …
Python
-
CCT-ViT
CCT-ViT PublicA Compact Convolution Transformer is a neural network architecture that combines the efficiency of convolutional layers with the long-range dependency modeling capabilities of transformers, designe…
Python
-
Multimodal-Fusion-Model
Multimodal-Fusion-Model PublicAn architecture capable of taking different types inputs (images and sound) and fuse the models to build a multimodal classifier.
Python
-
FashionMNIST-Classification
FashionMNIST-Classification PublicForked from archit28-tamu/fashion_mnist_project
End-to-end image classification with conventional machine learning and deep learning models.
Jupyter Notebook
-
PDF-Insight
PDF-Insight PublicPDF Insight is an interactive chatbot application that allows users to upload PDF documents and engage in natural language conversations about their content.
Python
If the problem persists, check the GitHub status page or contact support.
