Responsibilities

Analyze, manipulate, and process large sets of structured and semi-structured data, including healthcare datasets (claims, patient, and provider data), using Python (PySpark), SQL, and Spark SQL within the Azure data platform. Apply data mining, data modeling, and statistical analysis