Mesa-17/README.md

Hi, I'm Sandeep Menon 👋

Data Engineer specializing in large-scale streaming pipelines, cloud data platforms, and modern analytics infrastructure.

I build systems that move, transform, and make sense of data at scale, from real-time event ingestion to analytics-ready data models.

📍 Tampa, FL | 🔗 LinkedIn | 💼 Open to Data Engineering roles


🛠️ Tech Stack

Cloud & Platforms

Azure · Databricks · AWS · Snowflake

Streaming & Pipelines

Apache Kafka · Apache Spark · Azure Event Hubs · Airflow

Transformation & Modeling

dbt · Delta Lake · Python · SQL

DevOps & Infra

Docker · GitHub Actions


🚀 Featured Projects

Azure Event Hubs · Databricks · Delta Live Tables · Medallion Architecture · Star Schema

Multi-source streaming platform on Azure that ingests live ride events (WebApp) and historical activity (GitHub) through independent Event Hubs, unifies them via Spark Structured Streaming on Databricks using declarative Delta Live Tables pipelines, and delivers a Gold-layer Star Schema for analytics.

Key decisions: Dual Event Hubs for source isolation → DLT over raw Spark for declarative, self-healing pipelines → Medallion over direct ingestion to preserve full event history and enable safe reprocessing.
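To make the Medallion decision concrete, here is a minimal plain-Python sketch of the three layers. It is not the project's Delta Live Tables code: the payload fields (`ride_id`, `city`, `fare`), the sample records, and the function names are all assumptions chosen for illustration. The point it shows is that Bronze keeps every raw payload, so Silver and Gold can be rebuilt from it at any time.

```python
import json

# Hypothetical raw payloads; field names are illustrative assumptions.
RAW_EVENTS = [
    '{"ride_id": "r1", "source": "webapp", "city": "Tampa", "fare": 12.5}',
    '{"ride_id": "r1", "source": "webapp", "city": "Tampa", "fare": 12.5}',  # duplicate delivery
    '{"ride_id": "r2", "source": "history", "city": "Miami", "fare": 30.0}',
    'not-json',  # malformed record
]


def to_bronze(raw_events):
    """Bronze: keep every payload untouched, tagged with its arrival order."""
    return [{"offset": i, "raw": raw} for i, raw in enumerate(raw_events)]


def to_silver(bronze):
    """Silver: parse JSON, skip malformed rows, de-duplicate on ride_id."""
    seen, rows = set(), []
    for record in bronze:
        try:
            event = json.loads(record["raw"])
        except json.JSONDecodeError:
            continue  # the bad row still exists in Bronze for later inspection
        if event["ride_id"] in seen:
            continue
        seen.add(event["ride_id"])
        rows.append(event)
    return rows


def to_gold(silver):
    """Gold: split cleaned events into a city dimension and a ride fact table."""
    dim_city, fact_rides = {}, []
    for event in silver:
        # Assign a surrogate key the first time a city is seen.
        city_key = dim_city.setdefault(event["city"], len(dim_city) + 1)
        fact_rides.append(
            {"ride_id": event["ride_id"], "city_key": city_key, "fare": event["fare"]}
        )
    return dim_city, fact_rides


bronze = to_bronze(RAW_EVENTS)
silver = to_silver(bronze)
dim_city, fact_rides = to_gold(silver)
```

Because `to_silver` and `to_gold` are pure functions of the Bronze layer, changing a cleaning rule only requires re-running them over the stored history, which is the "safe reprocessing" property the Medallion layout is meant to provide.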


Kafka · Debezium · Airflow · dbt · Python · Docker

CDC-based streaming pipeline for banking transaction data. Uses Debezium to capture row-level change events from a source database, streams them through Kafka, orchestrates ingestion via Airflow, and applies dbt transformations for analytics-ready output. Fully containerized with Docker Compose.
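As a rough illustration of what consuming row-level change events involves, the sketch below replays a simplified Debezium-style envelope (`op`, `before`, `after`) onto an in-memory table. The `op` codes `c`, `u`, `d`, and `r` follow Debezium's convention, but the transaction fields (`id`, `account`, `amount`), the sample events, and the `apply_change` helper are hypothetical and not taken from the project.

```python
def apply_change(table, event, key="id"):
    """Apply one simplified Debezium-style change event to a dict keyed by primary key."""
    op = event["op"]
    if op in ("c", "r", "u"):  # create, snapshot read, update: upsert the new row image
        row = event["after"]
        table[row[key]] = row
    elif op == "d":  # delete: remove using the old row image
        table.pop(event["before"][key], None)
    else:
        raise ValueError(f"unsupported op: {op!r}")
    return table


# Hypothetical change events for a banking transactions table.
events = [
    {"op": "c", "before": None,
     "after": {"id": 1, "account": "A-100", "amount": 50.0}},
    {"op": "u", "before": {"id": 1, "account": "A-100", "amount": 50.0},
     "after": {"id": 1, "account": "A-100", "amount": 75.0}},
    {"op": "c", "before": None,
     "after": {"id": 2, "account": "B-200", "amount": 20.0}},
    {"op": "d", "before": {"id": 2, "account": "B-200", "amount": 20.0},
     "after": None},
]

transactions = {}
for event in events:
    apply_change(transactions, event)
```

Replaying the events in order leaves only the updated row for `id` 1, which is the current-state view a downstream dbt model would typically build from the change stream.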


📊 GitHub Stats

Sandeep's GitHub stats


Building reliable data infrastructure, one pipeline at a time.

Pinned

  1. Uber_Streaming_Project_Azure (Public)

    Jupyter Notebook

  2. Banking-modern-data-pipeline (Public)

    Python

  3. Identification-of-Frost-in-Martian-HiRISE-Images (Public)

    Jupyter Notebook

  4. Facial-Emotion-Recognition-using-CNN (Public)

    Jupyter Notebook

  5. Cryptocurrency-Analytics-Dashboard (Public)

    JavaScript