I'm Armaan Gohil, a passionate Data Engineer with over 2 years of experience in developing and optimizing data pipelines and ETL processes. My expertise lies in Python, PySpark, SQL, and cloud technologies, with a focus on delivering actionable insights and automating workflows.
- Programming Languages: Python, SQL
- Data Engineering: Databricks, Apache Airflow, DBT, DLTHub, PySpark, ETL, Data Warehousing, Hive, Hadoop, Shell Scripting
- Cloud Platforms: AWS, GCP
- Tools: Docker, Git, CI/CD, Tableau, Terraform
- Project: Samsung US
- Designed and implemented an ETL pipeline for data ingestion from Eloqua, doubling fortnightly deliverables.
- Automated Monthly Business Reviews (MBRs) by integrating Adobe Analytics data with Python, enhancing insights delivery.
- Developed a KPI-driven dashboard, reducing reporting time by 50% and boosting productivity.
- Automated Overhead Cables Material calculations, streamlining processes.
- Optimized simulation studies for major projects, including the Mauritius Metro, achieving significant cost reductions.
- Web App: Developed a web app using Dash and Pandas to visualize metro schedules, integrated with Microsoft Clarity and Google Analytics.
- Data Visualization & Analysis: Conducted comprehensive data analysis and visualization using Pandas, Beautiful Soup, NumPy, and Matplotlib.
- π Iβm currently working on NDAP Indian Government Data Engineering Project and NYC Taxi Data Project.
- π± Iβm currently learning advanced machine learning techniques and big data processing with Apache Spark.
- π― Iβm looking to collaborate on innovative data engineering and data analytics projects.
- π€ Iβm looking for help with enhancing my skills in real-time data streaming and processing.
- Email: armaangohil@hotmail.com
- LinkedIn: linkedin.com/in/armaan-gohil
Feel free to explore my repositories and connect with me for collaboration or any inquiries!

