1-3 Years in Python, including data structures, algorithms, and libraries for data manipulation (e.g., Pandas). Deep understanding of Apache Spark, its architecture, and components (RDDs, DataFrames, Datasets). Strong knowledge of SQL for data querying and manipulation. Experience in ETL (Extract, T