Job Description

Shape the foundation of AI systems by managing and optimizing data pipelines.

  • Design and build scalable pipelines and curate high-quality datasets.
  • Structure data for optimal training efficiency.
  • Bridge research and engineering, enabling faster, more reliable model training.

Work with diverse data sources.

  • Includes web data, code repositories, and multilingual corpora.
  • Conduct data ablations and experiments to assess quality and improve model performance.
  • Ensure datasets are diverse, reliable, and optimized for throughput.

Collaborate in a fast-paced environment.

  • Work with researchers, engineers, and cross-functional teams globally.
  • Contribute directly influence AI model performance and innovation.
  • Remote options are available.

About Jobgether

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities.

Apply for This Position