Skip to content
View McNamara10's full-sized avatar

Block or report McNamara10

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
McNamara10/README.md

Hi, I'm Marcello Orru

Data Engineer | Full-Stack Developer

Macerata, Italy | Master's in Data Engineering

LinkedIn Email


About Me

Data Engineer with expertise in building scalable data pipelines and real-time processing systems. Strong foundation in full-stack development with 10+ years of backend experience (PHP, Yii2, Laravel) and deep knowledge of SQL and relational database design, now specialized in modern data infrastructure and distributed systems.

Currently completing Master's degree in Data Engineering with focus on real-time analytics, cloud-native architectures, and machine learning integration. I combine software engineering best practices with cutting-edge data technologies to deliver production-ready, enterprise-scale solutions.

Core expertise: Real-time data processing (Kafka, Spark Streaming) • ETL pipelines (Airflow, AWS Glue, Azure Data Factory) • Cloud data warehouses (Snowflake, Redshift) • NoSQL & distributed databases (MongoDB, DynamoDB, Redis) • SQL optimization • Web scraping & data collection • Cloud data services (AWS S3/EMR, Azure Data Lake) • Backend APIs (FastAPI, PHP) • Desktop applications (Delphi, VB) • ML deployment


Technical Skills

Data Engineering & Big Data

Python Apache Kafka Apache Spark Apache Airflow Snowflake SQL MongoDB Redis

Cloud Data Services

AWS Azure Amazon S3 DynamoDB

Backend Development & Databases

PHP Yii2 Laravel FastAPI JavaScript React PostgreSQL MySQL MariaDB

Desktop Application Development

Delphi Visual Basic

DevOps & Tools

Docker Git GitHub Actions


Featured Work

Data Engineering Portfolio

Real-time Analytics Pipeline

  • Kafka + Spark Streaming architecture for live data processing
  • Event-driven data ingestion and transformation
  • Deployed on AWS with automated monitoring

Cloud Data Warehouse on Snowflake

  • Designed and implemented scalable data warehouse on Snowflake
  • Multi-layered architecture (raw, staging, production layers)
  • Star schema dimensional modeling with optimized SQL queries
  • Query performance tuning with Snowflake-specific optimizations
  • Cost optimization through virtual warehouses and clustering

ETL Framework

  • Modular data transformation system built with Apache Airflow
  • Orchestrates complex workflows loading data into Snowflake and traditional databases
  • Automated data quality checks and error handling
  • Support for incremental and full-load strategies

Web Scraping & Data Collection

  • Automated data extraction from multiple web sources using Python (Beautiful Soup, Scrapy)
  • Robust scraping pipelines with error handling and retry logic
  • Integration with cloud storage (S3) and databases (MongoDB)
  • Scheduling and monitoring with Airflow

Cloud Data Infrastructure

  • AWS data services: S3 (data lake), DynamoDB (NoSQL), EMR (Spark clusters), Glue (ETL), Redshift (warehouse)
  • Azure data services: Data Lake Storage, Cosmos DB (NoSQL), Databricks (Spark), Data Factory (ETL)
  • Multi-cloud data architecture design and implementation

ML Deployment Pipeline

  • End-to-end machine learning model lifecycle management
  • Automated training, validation, and deployment workflows
  • Production monitoring with performance tracking

Backend Development

REST API Suite

  • Production-ready APIs built with Yii2 framework and FastAPI and Laravel
  • MySQL/MariaDB and PostgreSQL database integration
  • JWT authentication and role-based access control
  • Complex SQL queries and database optimization
  • Comprehensive API documentation and testing

Microservices Architecture

  • Scalable backend services with Docker containerization
  • Inter-service communication patterns
  • Distributed system design and implementation

Desktop Applications

  • Legacy desktop applications built with Embarcadero Delphi
  • Excel integration and data processing tools
  • Windows GUI applications with Visual Basic
  • Database connectivity and reporting features

Currently Learning

  • Advanced Apache Spark optimization and performance tuning
  • MLOps practices and model deployment strategies
  • Cloud-native data architectures on AWS
  • Distributed systems design patterns
  • LLM integration for intelligent data automation

Professional Goals

  • Complete Master's degree in Data Engineering
  • Build comprehensive portfolio of production-grade data engineering projects
  • Expand expertise in cloud-native architectures and real-time systems
  • Transition fully into Data Engineering role at enterprise level

Get In Touch

I'm open to discussing data engineering opportunities, collaboration on interesting projects, or technical conversations about data infrastructure and scalable systems.

LinkedIn: Marcello Orru
Email: marcelorru@gmail.com
Portfolio: Coming soon


Profile Views

Building data solutions that scale

Popular repositories Loading

  1. delphi-excel delphi-excel Public

    import file excel

    Pascal 3 1

  2. ZendSkeletonApplication ZendSkeletonApplication Public

    Forked from zendframework/ZendSkeletonApplication

    Sample application skeleton using the ZF2 MVC layer

    PHP

  3. acoustic-italian acoustic-italian Public

    Perl

  4. dataset dataset Public

    dataset ai

  5. McNamara10 McNamara10 Public

  6. todo_tracker todo_tracker Public

    Command-line software to manage tasks (to-do items) with CSV file persistence.

    Python