  • Northeastern University
  • United States
  • LinkedIn in/ved-kulk

VedKulkarni01/README.md

Hey, I'm Vedant 👋

Bioinformatics grad student @ Northeastern University · Building computational tools to bridge biology and data science.

I work at the intersection of genomics, machine learning, and reproducible pipelines — from analyzing sequencing data to building interactive tools that make complex biological results accessible. Currently a Teaching Assistant for Computational Biology at Northeastern.


Genomics & NGS

scRNA-seq snRNA-seq GWAS Scanpy scvi-tools Seurat DESeq2 STAR featureCounts FastQC

Programming & Machine Learning

Python R Bash Pandas NumPy scikit-learn TensorFlow tidyverse ggplot2

Pipelines & Infrastructure

Nextflow Docker Conda Git Linux SLURM

Visualization & Tools

Streamlit R Shiny RDKit ChEMBL UMAP


Featured Projects

EGFR Bioactivity Prediction — Ensemble ML pipeline (Random Forest, XGBoost, Neural Net) achieving 94.5% ROC-AUC on 20K+ compounds. Deployed as a Streamlit app with batch prediction for non-technical users.

COVID-19 snRNA-seq Lung Atlas — Integrated 81K nuclei across 27 lung samples using Scanpy & scvi-tools. Identified immune cell subpopulations and 150+ dysregulated genes linked to inflammatory pathways. Fully reproducible with Docker + SLURM.

β-Cell scRNA-seq in Type 2 Diabetes — End-to-end Nextflow pipeline (QC → STAR → featureCounts → DESeq2) profiling β-cell transcriptomes and revealing stress response signatures in T2D.

GWAS Interactive Dashboard — R Shiny dashboard for exploring GWAS results on 282 potato varieties, with Manhattan plots, QQ plots, and population structure analysis, built for interdisciplinary teams.


📬 Let's Connect

Email LinkedIn GitHub

Pinned

  1. egfr-bioactivity-ml Public

    Drug discovery ML project demonstrating cheminformatics (RDKit), ensemble modeling (RF/XGBoost/NN), and production deployment (Streamlit). Predicts EGFR bioactivity on 20K compounds from ChEMBL wit…

    Jupyter Notebook 1

  2. covid19-snrnaseq-reproduction Public

    Reproducibility project for Melms et al. (Nature 2021) COVID-19 single-nucleus RNA-seq lung atlas

    Jupyter Notebook

  3. potato-gwas-project Public

    Genome-Wide Association Study identifying genetic loci controlling yield and canopy traits in 282 potato varieties using multi-model GWAS, population structure analysis, and an interactive Shiny da…

    R

  4. scRNAseq-Beta-Cell-T2D-Analysis Public

  5. GeneLab_Data_Processing Public

    Forked from nasa/GeneLab_Data_Processing

    Jupyter Notebook 1