Must Have:
- Python‑based ML development (production ML pipelines & models)
- ML frameworks: Scikit‑learn, XGBoost/LightGBM/CatBoost, and PyTorch or TensorFlow
- Feature engineering & feature pipelines (large‑scale, automated where applicable)
- Big data processing: Apache Spark / PySpark, Databricks, Delta