Key Responsibilities

Develop and Optimize: Design, implement, and optimize robust PySpark pipelines for large-scale data processing tasks.

Data Transformation: Collaborate with data teams to transform and aggregate data from various sources, ensuring data quality and integrity.