Azure Data Engineer
Send a job offer directly to this candidate
Having 8+ Years of IT experience, which incorporates designing and developing Data Models and orchestrating Data Pipelines for ingesting and transforming structured and semi-structured data to various destinations like Azure SQL, Synapse or Datalake gen2 which can be further used for BI, data science or machine-learning workloads. Good understanding of Data Modeling, Data warehousing and performance tunning ETL processes. Extensive knowledge in SSIS, ADFv2, Databricks, PySpark, SparkSQL for building data ingestion and processing frameworks which can be reused.
Have good exposure to data governance and implementation of data governance in datalakehouse architecture using unity catalogue in azure databricks. Worked with micro batch processing using DLT pipelines for high frequency data refresh. Along with data ingestion and transformation built ADFv2 pipelines to refresh power bi datasets, built a automated workday integration monitoring system to raise service now tickets (used databricks) and ADFv2 and alert systems to send email notifications (used logic apps).
All secretes in ADFv2 pipelines were secured in Azure Key Vault. Extensive knowledge on cloud technologies Azure Data Factory, Azure Key Vault, Azure Data Lake Storage Gen2, Azure Synapse Analytics (Warehouse), Azure Synapse (pipelines), Azure SQL Database, Azure Databricks, Azure Active Directory, SQL. Expertise in Microsoft T-SQL, DDL, DML, views, stored procedure, functions and modifying database schemas.
Fair handson on cutting edge analytics reporting tools like Power BI, Tableau and other power platform tools like Power apps and Power flows. Committed to give impeccable result and performing under minimal supervision, multi-tasking, meet deadlines as an individual contributor and a committed team player.
Azure Data Engineer - Northern Trust - Chicago
(2024-06)
Azure Data Engineer - Optum - 1325 Boylston St, Boston, MA 02215
(2020-08 - 2024-05)
Metadata Driven Framework to Build a Lakehouse: Developed metadata driven Data pipelines using ADFv2 to load data from different file formats to data Lakehouse. Developed generic databricks notebooks to load data from landing layer to curated layer. Also built generic databricks notebooks to populate SCD Type 1 and SCD Type 2-dimension tables. Framework has self-logging mechanism to catch run details and failure details. In case of failure, framework is intelligent enough to raise a service-now ticket based on configured capability. Wrote custom fact notebooks to built fact. All custom and generic notebooks execute vacuum command for better space management. Enabled cluster pooling for better startup time. Helped analytics teams to create tokens and configure simba spark connector for power bi.
Workday Data ingestion Using Azure SFTP: Existing workday to ADLS gen2 connectivity was through Mulesoft which needed additional development efforts. Explored and configured ADLS Gen2's SFTP to get data files directly from Workday which saved 5 FTEs for each data pull considering Mulesoft developers efforts. Data pulled from Workday further processed by Metadata framework explained above. Wo
Sr. SQL BI Developer - HealthX - 9225 Priority Way W Dr #100, Indianapolis, IN 46240
(2020-05 - 2020-07)
Software Engineer/Data Engineer - Indiana University Health - 1633 N Capitol Ave, Indianapolis, IN 46202
(2019-10 - 2020-05)
Data Engineer - Kroger - 1014 Vine St, Cincinnati, OH 45202
(2019-01 - 2019-10)