Design and evaluate reinforcement learning systems for agentic AI workflows, including RL environments, reward models, and post-training pipelines for LLM-based agents.
Develop simulation environments, reward functions, and evaluation frameworks for enterprise workflows.
Collaborate with researchers to translate research into practical enterprise solutions, with opportunities to publish and present findings.
Centific is a frontier AI data foundry that curates diverse, high-quality data to empower clients with safe, scalable AI deployment. Its team includes over 150 PhDs and data scientists, along with 4,000 AI practitioners and engineers, fostering a culture of innovation and excellence.
Own the full data science engine for a priority vertical, from business problem to deployed model to live return on ad spend (ROAS) performance.
Build buying models that maintain positive ROAS and drive lead quality improvements across a portfolio of brands.
Establish a direct partnership with vertical business stakeholders and identify net-new modeling opportunities.
Launch Potato is a digital media company that reaches over 30M monthly visitors through brands like FinanceBuzz and All About Cookies. Headquartered in South Florida with a remote-first team spanning over 15 countries, the company has a high-growth, high-performance culture.
Own the full data science engine for a priority vertical, from business problem to deployed model to live ROAS performance, driving measurable revenue and media efficiency.
Deliver buying models that maintain positive ROAS and improve lead quality across a portfolio of brands.
Establish a trusted, direct partnership with business stakeholders and identify net-new modeling opportunities.
Launch Potato is a profitable digital media company that connects consumers with leading brands through data-driven content and technology. With a remote-first team spanning over 15 countries and reaching 30M+ monthly visitors, the company has a high-growth, high-performance culture focused on speed, ownership, and measurable impact.