Data Engineer
Send a job offer directly to this candidate
Data Engineer and Computer Science student specializing in building scalable, real-time data pipelines and cloud-native architectures. Expert in Azure, Microsoft Fabric, Databricks, and dbt, with a proven track record of developing metadata-driven frameworks and containerized ELT workflows. Skilled in transforming complex datasets into actionable insights through Medallion architectures and automated orchestration.
Real-Time Data Engineering (Azure & Databricks) at Uber
Engineering a high-throughput, low-latency streaming architecture to handle concurrent ride-sharing events while ensuring data integrity across a Medallion architecture. Real-time GPS and transaction streams ingested via FastAPI and Azure Event Hubs, processed into Delta Tables.
Advanced Azure Data Engineering (Synapse Analytics Integration) at Enterprise
Integrating diverse enterprise data sources into a unified analytics platform while optimizing cost and performance for massive-scale querying. Large-scale enterprise datasets moved through Synapse Pipelines and stored in Dedicated SQL Pools for high-performance BI.
ShoppingMart Data Warehouse (Microsoft Fabric) at Retail
Migrating traditional retail ETL processes to a modern SaaS data lakehouse environment to simplify management and scale-out performance. Multi-source retail transaction data ingested into OneLake using Fabric Notebooks and Data Factory.
Airflow Config-Driven Pipelines (Orchestration Scalability) at Enterprise
Overcoming "DAG sprawl" by developing a system to programmatically generate Airflow workflows based on central configuration files. Enterprise-grade orchestration capable of scaling across hundreds of independent data tasks.
Real-Time Data Lakehouse on Azure (FastAPI & Medallion) at Azure
Building a unified platform that supports both real-time ingestion and historical analytics without data duplication. Streaming event data processed through a Medallion Architecture with sub-second ingestion latency.
Car Sales Data Warehouse (AWS Native Stack) at AWS
Building a cost-effective, serverless data warehousing solution that can handle high volumes of semi-structured automotive sales data while ensuring the final dataset is optimized for complex analytical queries and business intelligence reporting. Massive-scale automotive sales records, inventory logs, and customer interaction data stored as raw files in Amazon S3, refined through multiple architectural layers to support a production-grade dimensional model.
Bachelor Of Computer Science in Computer Science – Superior University, Lahore, Pakistan