Senior Data Engineer | 10+ Years Experience
Highly skilled Data Engineer with 10+ years of experience designing and delivering scalable, high-performance data solutions. Proven expertise in Apache Spark, Databricks, Kafka, PySpark, Scala, Snowflake, and cloud platforms (AWS and Azure). Demonstrated ability to optimize data pipelines, achieving up to a 67% reduction in job run times and significant cost savings through cluster and resource optimization.
Experienced in real-time streaming, lakehouse architecture, Delta Lake, and end-to-end pipeline development across Healthcare, Finance, and Retail domains. Strong communicator who has successfully engaged with 10+ enterprise clients as a Databricks Solutions Engineer.
Data Engineer at Vrentin Tech (2025-04 – Present)
The project involved building a robust, real-time financial data pipeline to ingest, transform, and deliver financial data to end-users and reporting dashboards. The system processes transactional and market data from multiple sources in near real-time, ensuring data freshness, accuracy, and reliability for financial reporting and compliance purposes.
Sr. Technical Solutions Engineer at Databricks Pvt. Ltd. (2022-07 – 2025-04)
As a Senior Technical Solutions Engineer at Databricks, worked as an individual contributor with 10+ enterprise customers across Finance, Retail, and E-commerce domains. Engagements were project- or need-based, focusing on Data Lakehouse migration, real-time streaming pipeline development, performance tuning, and enabling customers to adopt new Databricks platform features. Handled data volumes of up to 12 TB and reduced job run times by up to 50% across client engagements.
Senior Software Engineer — Data Engineer at Optum Global Solutions (2020-06 – 2022-07)
Commercial Cradle is a healthcare claims intelligence platform at Optum, designed to analyze, prioritize, and flag insurance claims using a rule-based recommendation engine. The system ingests large volumes of claims data from Oracle, applies rule-based scoring and prioritization logic, and stores processed data in Cassandra and Azure Cosmos DB for downstream consumption. The platform improved fraud detection accuracy and enabled analysts to focus on high-risk claims first.
Technology Analyst / Big Data Developer at Infosys Limited (2014-06 – 2020-04)
Prime Therapeutics is a leading pharmacy benefits management (PBM) company in the US healthcare space. The project involved designing and building a reusable, scalable ETL pipeline framework using Apache Spark and the Hadoop ecosystem to process large volumes of pharmaceutical and claims data. The framework performed data cleansing, change data capture (CDC), historical data purging, and analytical data loading, significantly reducing data processing time and improving data quality for downstream healthcare analytics.
B.Tech – NIT Hamirpur (2014)