• 1-3 Years in Python, including data structures, algorithms, and libraries for data manipulation (e.g., Pandas). • Deep understanding of Apache Spark, its architecture, and components (RDDs, DataFrames, Datasets). • Strong knowledge of SQL for data querying and manipulation. • Experience in ETL (Ex