Remote Data Jobs · Reinforcement Learning

Job listings

  • Design and evaluate reinforcement learning systems for agentic AI workflows, including RL environments, reward models, and post-training pipelines for LLM-based agents.
  • Develop simulation environments, reward functions, and evaluation frameworks for enterprise workflows.
  • Collaborate with researchers to translate research into practical enterprise solutions, with opportunities to publish and present findings.

Centific is a frontier AI data foundry that curates diverse, high-quality data to empower clients with safe, scalable AI deployment. Their team includes over 150 PhDs and data scientists, along with 4,000 AI practitioners and engineers, fostering a culture of innovation and excellence.

  • Own the full data science engine for a priority vertical, from business problem to deployed model to live ROAS performance.
  • Build buying models that maintain positive ROAS and drive lead quality improvements across a portfolio of brands.
  • Establish a direct partnership with the vertical's business stakeholders and identify net-new modeling opportunities.

Launch Potato is a digital media company that reaches over 30M monthly visitors through brands like FinanceBuzz and All About Cookies. Headquartered in South Florida with a remote-first team spanning over 15 countries, they have a high-growth, high-performance culture.

  • Own the full data science engine for a priority vertical, from business problem to deployed model to live ROAS performance, driving measurable revenue and media efficiency.
  • Deliver buying models that maintain positive ROAS and improve lead quality across a portfolio of brands.
  • Establish a trusted, direct partnership with business stakeholders and identify net-new modeling opportunities.

Launch Potato is a profitable digital media company that connects consumers with leading brands through data-driven content and technology. With a remote-first team spanning over 15 countries and reaching 30M+ monthly visitors, they have a high-growth, high-performance culture focused on speed, ownership, and measurable impact.