Senior Data Engineer
Send a job offer directly to this candidate
Senior Data Engineer with 5+ years of experience in architecting, building, and operating cloud-native, distributed data platforms in banking and enterprise environments. Extensive hands-on expertise in Python, SQL, Apache Spark, Databricks, Azure, AWS, and Snowflake, with deep focus on end-to-end data architecture, including source ingestion, change data capture (CDC), batch and real-time streaming pipelines, distributed processing, and optimized storage layers. Proven ability to design fault-tolerant, scalable data pipelines using Spark, Kafka, and cloud-native orchestration frameworks, ensuring high availability, data consistency, and SLA compliance.
Strong background in data modeling (dimensional, denormalized, and analytical schemas), partitioning strategies, and file format optimization (Parquet, ORC, AVRO, Delta) to support large-scale analytical workloads. Experienced in implementing data quality frameworks, validation rules, and monitoring/alerting mechanisms to ensure data reliability across complex pipelines. Adept at performance tuning through optimized joins, caching strategies, indexing, and resource configuration across Spark and cloud data warehouses.
Hands-on experience with data governance and security, including role-based access control (RBAC), row-level security (RLS), data masking, and auditability to meet regulatory and compliance requirements. Skilled in establishing CI/CD pipelines for data platforms, automating deployments, and enforcing coding and architectural standards. Recognized for partnering closely with product owners, data scientists, analysts, and business stakeholders to translate complex requirements into robust, production-grade data solutions, while mentoring engineers and driving best practices in Agile delivery environments.
Data Engineer - Molina Healthcare
(2023-08)
Data Analyst - HDFC Bank
(2020-04 - 2021-12)