Source Job

Europe, North America, 7w PTO

  • Improve the quality of pretraining datasets by leveraging your previous experience, intuition and training experiments.
  • Focus on generating synthetic data at scale and determining the best strategies for incorporating such data into the training of large models.
  • Collaborate closely with other teams such as Pretraining, Post-training, Evals, and Product to define high-quality data needs.

Python, LLM, Prompt Engineering, Machine Learning

3 jobs similar to Data Team Member

Jobs ranked by similarity.

Europe, 5w PTO

  • Improve model performance through data quality, curation, labeling, and evaluation.
  • Work on the data layer of Generative AI products involving images, video, or audio.
  • Design, build, and operate workflow orchestration systems and large-scale data processing pipelines.

Synthesia is on a mission to make video easy for everyone with their AI video communications platform. They simplify the entire video production process, making it easy for everyone to create, collaborate, and share high-quality videos, and are trusted by leading brands such as Heineken, Zoom, Xerox, and McDonald’s.

$230,000–$322,000/yr
US

  • Define technical strategy & architecture for data curriculum pipelines powering next-gen foundation models.
  • Design & execute dynamic curriculum learning strategies, improving model stability & reasoning.
  • Engineer logic for serializing Reddit’s complex conversational trees into optimal training contexts.

Reddit is a community-driven platform where users submit, vote, and comment on what interests them. With over 100,000 active communities and 116 million daily active users, they foster open conversations and shared interests.

US

  • Design, train, and evaluate machine learning models from first principles.
  • Develop and maintain production-quality Python code for data processing.
  • Build natural language processing systems for document understanding.

Alpha7X is a technology company. They seem to be a growing company with a focus on innovation within the AI and machine learning space.