- Improve the quality of pretraining datasets by leveraging your previous experience, intuition and training experiments.
- Focus on generating synthetic data at scale and determining the best strategies for incorporating such data into the training of large models.
- Collaborate closely with other teams, such as Pretraining, Post-training, Evals, and Product, to define high-quality data needs.
Poolside aims to be the company that builds a world where AI will be the engine behind economically valuable work and scientific progress. They are a remote-first team across Europe and North America that values the quality of their systems.