
Senior Data Engineering Leader @ Amazon | Apache Spark & Distributed Systems Specialist | Data Platform Architect | AWS EMR & Glue | Performance Engineering | Staff/Principal Opportunities
Send a job offer directly to this candidate
I am a Senior Data Engineering Leader with 14+ years of experience building and scaling distributed data platforms across Amazon, Gartner, and TCS.
✔ Apache Spark Internals & Performance Engineering
✔ AWS EMR, Glue & Large-Scale Data Platforms
✔ Distributed Systems Architecture
✔ Financial & Regulatory Reporting Systems
✔ Cloud Modernization & Data Platform Transformation
Recently, I led a large-scale optimization initiative that reduced a 945M-record (~189GB) reporting workload from 51.5 hours to 18.5 hours while preserving audit and compliance guarantees.
I also architected a reusable compliance platform that reduced jurisdiction onboarding time by 63% and enabled scalable multi-country reporting capabilities.
I enjoy solving problems involving scale, performance bottlenecks, distributed systems, and platform engineering.
I have extensive experience leading cross-functional initiatives involving Finance, Tax, Product, and Engineering organizations, defining technical strategy, mentoring engineers, and driving large-scale platform transformations.
Architected reusable compliance reporting platforms supporting multi-jurisdiction tax and statutory reporting.
Defined scalable Spark and AWS patterns for processing billion-record workloads.
Established engineering best practices for Spark optimization, performance tuning, and data quality.
Influenced technical roadmap across Finance, Tax, and Engineering organizations.
Mentored engineers and drove adoption of scalable platform design principles.
✔ Reduced reporting runtime by 60–76%
✔ Optimized 945M records from 51.5h to 18.5h
✔ Reduced onboarding timelines by 63%
✔ Enabled 15 business launches
✔ Eliminated 117 manual processes
✔ Delivered 1,300+ hours annual savings