Why this role:

Improve Large Language Model’s performance for iterations to come, thus having a lasting impact on Cohere’s tech.

-Bring strong contextual judgment, cultural and bias sensitivity, and experience applying nuanced guidelines to complex or ambiguous content.

-Successful candidates will be highly detail-oriented, comfortable evaluating safety risks across different user intents and scenarios.

Responsibilities:

Evaluate and improve model safety: Label, rank, audit, and refine human- and model-generated text to improve safety, quality, and policy alignment, including content that may be sexual, violent, or psychologically disturbing.
Apply nuanced safety judgment: Assess model outputs against detailed safety guidelines, rubrics, and style standards, making consistent decisions across ambiguous, sensitive, and context-dependent cases.
Support quality and calibration: Identify annotation inconsistencies or unclear guidelines, and provide actionable feedback on recurring edge cases, model failures, and opportunities to improve data quality.

You may be a good fit if you have:

1+ years of experience in Content Moderation,Trust and Safety, AI data annotation, LLM evaluation, or a related analytical role, with exposure to quality assurance, red teaming, and/or prompt engineering preferred.
Experience applying detailed guidelines to complex and sensitive content, with strong contextual and sociocultural judgment and the ability to recognize and manage personal bias.
Emotional resilience: Comfort working with content that contains unsafe, explicit, and/or toxic content, including content of a sexual, violent, or psychologically disturbing nature.

Cohere

Cohere's mission is to scale intelligence to serve humanity by training and deploying frontier models for developers and enterprises. They are a team of researchers, engineers, and designers passionate about their craft, believing that a diverse range of perspectives is a requirement for building great products.

Apply for This Position