Jobs Similar to Senior Research Scientist, Reward Models

Senior Research Scientist, Reward Models

Anthropic 16 days ago

$340,000–$425,000/yr

Lead research efforts to improve how human preferences are specified and learned at scale.
Develop novel architectures and training methodologies for RLHF.
Research techniques to identify and mitigate reward hacking.

RLHF Python Machine Learning LLM

View details

4 jobs similar to Senior Research Scientist, Reward Models

Jobs ranked by similarity.

Research Engineer, Reward Models Platform

Anthropic 16 days ago

$315,000–$340,000/yr

Design and build infrastructure that enables researchers to rapidly iterate on reward signals.
Develop systems for automated quality assessment of rewards, including detection of reward hacks and other pathologies.
Collaborate with researchers to translate science requirements into platform capabilities.

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems to be safe and beneficial for users and society.

View details Similar jobs

Research Engineer (Agentic Models)

JetBrains 10 days ago

Europe

Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.
Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.

At JetBrains, code is their passion and they strive to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in their IDEs.

View details Similar jobs

Senior AI Engineer

Paper 23 days ago

$160,000–$190,000/yr

Design, implement, and deploy AI-powered features, including model training, fine-tuning, and prompt engineering workflows.
Translate product requirements into robust, production-ready AI solutions, working with Product Managers, Software Engineers, and Data Scientists.
Optimize models and infrastructure for scalability, latency, and cost efficiency, partnering with DevOps and MLOps to ensure reliable and maintainable AI pipelines.

Paper is reimagining how schools support students so that every learner can reach their full potential.

View details Similar jobs

Director of Machine Learning, Safety & Mods

Reddit 30 days ago

$265,800–$365,100/yr

Lead Reddit’s efforts in building ML systems that keep our platform safe.
Drive the strategy, development, and deployment of machine learning models that detect and prevent harmful content and behavior at scale.
Partner cross-functionally across Product, Engineering, Safety operations, Trust & Community, and AI/ML Platform to innovate on real-time detection, automation, and user protection systems.

Reddit is built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet.

View details Similar jobs

Source Job

Senior Research Scientist, Reward Models

Research Engineer, Reward Models Platform

Research Engineer (Agentic Models)

Senior AI Engineer

Director of Machine Learning, Safety & Mods