An AI infrastructure company in San Francisco is seeking a Data Scientist to lead the strategy and creation of massive datasets for foundation models. You will own every aspect of the data lifecycle and innovate high-throughput data processing scripts in Python. The ideal candidate has over 5 years