Source Job

$340,000–$425,000/yr

  • Lead research efforts to improve how human preferences are specified and learned at scale.
  • Develop novel architectures and training methodologies for RLHF.
  • Research techniques to identify and mitigate reward hacking.

RLHF, Python, Machine Learning, LLM

4 jobs similar to Senior Research Scientist, Reward Models

Jobs ranked by similarity.

$315,000–$340,000/yr
US

  • Design and build infrastructure that enables researchers to rapidly iterate on reward signals.
  • Develop systems for automated quality assessment of rewards, including detection of reward hacks and other pathologies.
  • Collaborate with researchers to translate science requirements into platform capabilities.

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.

Europe

  • Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
  • Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.
  • Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.

JetBrains is passionate about code and strives to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in their IDEs.

$160,000–$190,000/yr

  • Design, implement, and deploy AI-powered features, including model training, fine-tuning, and prompt engineering workflows.
  • Translate product requirements into robust, production-ready AI solutions, working with Product Managers, Software Engineers, and Data Scientists.
  • Optimize models and infrastructure for scalability, latency, and cost efficiency, partnering with DevOps and MLOps to ensure reliable and maintainable AI pipelines.

Paper is reimagining how schools support students so that every learner can reach their full potential.

$265,800–$365,100/yr
US

  • Lead Reddit’s efforts in building ML systems that keep our platform safe.
  • Drive the strategy, development, and deployment of machine learning models that detect and prevent harmful content and behavior at scale.
  • Partner cross-functionally across Product, Engineering, Safety Operations, Trust & Community, and AI/ML Platform to innovate on real-time detection, automation, and user protection systems.

Reddit is built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet.