- PhD in Computer Science, Machine Learning, Systems, or a related field
- Strong foundation in machine learning systems, distributed systems, or large-scale data processing
- Experience with Python and working with data-intensive workloads
- Familiarity with ML frameworks (e.g., PyTorch, TensorFlow) and/or distributed systems (e.g., Ray, Spark)
- Experience (academic or applied) with data pipelines, model training workflows, or large datasets
- Strong problem-solving skills and ability to translate research ideas into practical systems
- Interest in building scalable, reliable infrastructure for machine learning
- Experience with workflow orchestration systems (Airflow, Flyte, etc.) (Nice to Have)
- Exposure to large-scale data platforms (data lakes, warehouses, streaming systems) (Nice to Have)
- Publications or research in ML systems, distributed systems, or related areas (Nice to Have)