US
Unlimited PTO
- Design and improve data pipelines that process large, multi-modal datasets from internal and external sources into AI model training datasets.
- Evolve our data storage layer to support analytics, schema evolution, reproducibility, and efficient data access.
- Collaborate with ML engineers to improve the performance and reliability of Python-based data processing workflows.