Why this role:
- Improve Large Language Model’s performance for iterations to come, thus having a lasting impact on Cohere’s tech.
-Bring strong contextual judgment, cultural and bias sensitivity, and experience applying nuanced guidelines to complex or ambiguous content.
-Successful candidates will be highly detail-oriented, comfortable evaluating safety risks across different user intents and scenarios.
Responsibilities:
- Evaluate and improve model safety: Label, rank, audit, and refine human- and model-generated text to improve safety, quality, and policy alignment, including content that may be sexual, violent, or psychologically disturbing.
- Apply nuanced safety judgment: Assess model outputs against detailed safety guidelines, rubrics, and style standards, making consistent decisions across ambiguous, sensitive, and context-dependent cases.
- Support quality and calibration: Identify annotation inconsistencies or unclear guidelines, and provide actionable feedback on recurring edge cases, model failures, and opportunities to improve data quality.
You may be a good fit if you have:
- 1+ years of experience in Content Moderation,Trust and Safety, AI data annotation, LLM evaluation, or a related analytical role, with exposure to quality assurance, red teaming, and/or prompt engineering preferred.
- Experience applying detailed guidelines to complex and sensitive content, with strong contextual and sociocultural judgment and the ability to recognize and manage personal bias.
- Emotional resilience: Comfort working with content that contains unsafe, explicit, and/or toxic content, including content of a sexual, violent, or psychologically disturbing nature.
Cohere
Cohere's mission is to scale intelligence to serve humanity by training and deploying frontier models for developers and enterprises. They are a team of researchers, engineers, and designers passionate about their craft, believing that a diverse range of perspectives is a requirement for building great products.