Responsibilities Build and maintain ingestion pipelines from operational systems, files, APIs and events Design batch and streaming data pipelines that are resilient, observable and testable Normalize, enrich and transform raw data into reusable, well-structured datasets Enforce schemas, validate in