Must Have:
- Python-based ML development (production ML pipelines & models)
- ML frameworks: Scikit-learn, XGBoost/LightGBM/CatBoost, and PyTorch or TensorFlow
- Feature engineering & feature pipelines (large-scale, automated where applicable)
- Big data processing: Apache Spark / PySpark, Databricks, Delta