PhD Research Intern - Applied Reinforcement Learning

Centific

Remote regions

US

Benefits

Similar Jobs

See all

Scope of Work:

  • Design and evaluate RL systems for agentic AI, including RL environments and reward models.
  • Develop simulation environments and scalable training pipelines for enterprise workflows.

Minimum Qualifications:

  • PhD candidate in CS/ML with research in RL or agentic AI.
  • Strong Python, PyTorch, and GPU training experience.
  • Experience with LLMs and post-training techniques (RLHF, PPO, etc.).

What We Offer:

  • Competitive stipend and impactful projects.
  • Mentorship and access to GPU infrastructure.
  • Opportunities to publish and present research.

Centific

Centific is a frontier AI data foundry that curates diverse, high-quality data to empower clients with safe, scalable AI deployment. Their team includes over 150 PhDs and data scientists, along with 4,000 AI practitioners and engineers, fostering a culture of innovation and excellence.

Apply for This Position