Source Job

US Europe

  • Serve as a core safety partner embedded across product and research teams, providing Trust & Safety engineering support for all launches from early design through post-launch monitoring.
  • Build and maintain safety infrastructure ensuring Runway's models have a positive impact as they reach millions of users.
  • Design, execute, and continuously improve red teaming systems to proactively surface harmful outputs before production.

Python TypeScript AWS GCP

20 jobs similar to Trust & Safety Engineer

Jobs ranked by similarity.

  • Define policies across accounts, payments, and API usage.
  • Build a risk-based system driven by aggregated signals.
  • Handle investigations, enforcement decisions, and appeals.

OpenRouter provides the AI routing and infrastructure layer that AI builders, AI-native startups, and enterprises use to access, manage, and optimize their AI usage through a unified API, billing interface, and analytics platform. They empower some of the most advanced AI builders in the world by giving them the flexibility to move fast and scale confidently. A fully remote team with a strong culture of autonomy and trust.

US

  • Interact with generative AI models and project guidelines.
  • Create prompts to test model behavior across safety categories.
  • Document model breakability and effort level.

Welo Data provides AI services and specializes in data annotation. We foster a collaborative and innovative culture where employees contribute to cutting-edge AI safety evaluation.

Latin America

  • Design and implement guardrails for agentic AI systems, including tool access controls and step-level validation.
  • Build runtime security controls like interceptors, policy enforcement, and kill-switches for AI behavior.
  • Implement non-human identity access controls, observability, and threat modeling for AI-driven activity.

Backblaze is the object storage leader in the open cloud movement, offering cloud storage built to unlock budgets and unburden administrators. Founded in 2007, the company has over $100m in revenue and manages over three billion gigabytes of data for 500K+ customers across 175+ countries, with a culture of innovation and inclusion.

$3,850–$3,850/yr
US UK Canada

  • Fellows will use external infrastructure to work on an empirical project aligned with research priorities.
  • Projects aim to produce a public output, such as a paper submission.
  • Fellows receive mentorship and can access a shared workspace in Berkeley or London.

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. Their team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

$165,000–$250,000/yr
US Japan

  • Lead a multi-disciplinary team across systems, test execution, and safety risk and process/SMS.
  • Drive the Driver-Out (DO) gated deployment plan with strategic ride-hail partners (Lyft, Uber) — internal gating, partner safety onsites, and recommendation packages for go/no-go decisions.
  • Own the Safety Case Framework for May's L4 systems, including HARA, FTA, FMEA, and runtime-monitor coverage for the MPDM-based decision-making stack, with ML-aware safety analysis.

May Mobility is transforming cities through autonomous technology to create a safer, greener, more accessible world. Based in Ann Arbor, Michigan, May develops and deploys autonomous vehicles (AVs) powered by our innovative Multi-Policy Decision Making (MPDM) technology. We’re hiring people who share our passion for building the future, today, solving real-world problems and seeing the impact of their work.

US

  • Interact with generative AI models using project-provided guidelines, safety taxonomies, and attack-vector guidance.
  • Create and evaluate prompts designed to test model behavior across safety-related categories.
  • Identify where model responses become unsafe, noncompliant, inconsistent, or otherwise problematic.

Welo Data is an AI services company that specializes in data annotation. They deliver multilingual content transformation services in translation, localization, and adaptation for over 250 languages with a growing network of over 400,000 in-country linguistic resources.

Europe North America 7w PTO

  • Design a Python framework for implementing internal and public benchmarks.
  • Build and maintain a pipeline that runs distributed evaluations at scale.
  • Collaborate with modeling and product teams to improve experimentation and evaluation tooling.

Poolside aims to be the leading company in building a world where AI drives economically valuable work and scientific progress. They are a remote-first team across Europe and North America, gathering monthly in person for 3 days and twice a year for longer offsites.

US Unlimited PTO

  • Contribute to strategic direction for Trust & Safety and translate risk insights into measurable OKRs.
  • Own end-to-end risk mitigation across Critical Response, Compliance, and Claims.
  • Partner cross-functionally with Legal, Product, Engineering, Operations, and CX leadership.

Roadie, a UPS company, is a leading logistics and delivery platform that helps businesses tackle the complexities of modern retail. They offer flexible delivery solutions with a network of more than 310,000 independent drivers nationwide.

US

  • Shape technical direction and architecture: Define the foundational architecture for enterprise agentic AI at Benchling.
  • Build and ship the early portfolio yourself: Write production code at least half your time, particularly during the team's first year.
  • Design for enterprise from day one: Build for multi-tenant isolation, secrets management, audit logging, payload encryption, role-based access controls, and human-in-the-loop controls calibrated to risk.

Benchling is the AI platform for biotech R&D. Scientists use Benchling to design experiments, capture structured data, and run AI agents and models directly in their workflows. They have over 200,000 scientists around the world, from academic labs to Sanofi and Moderna.

Europe

  • Responsible for the foundational security posture of our organization.
  • Architect and build preventative guardrails and mitigate new risks introduced by first and third-party AI agents in our Enterprise.
  • Develop and set the long term roadmap for agentic AI identity and posture management, ensuring cohesive strategies for reducing risk from agentic AI use.

Twilio is shaping the future of communications, delivering innovative solutions to hundreds of thousands of businesses and empowering millions of developers worldwide to craft personalized customer experiences. Our dedication to remote-first work, and strong culture of connection and global inclusion means that no matter your location, you’re part of a vibrant team with diverse experiences making a global impact each day.

Latin America

  • Partner with full-stack and backend engineers on the features they are shipping, write tests that prove it works, and flag gaps early.
  • Help build and run evaluation pipelines for non-deterministic LLM outputs, prompt regression, model drift detection, and output quality scoring across the LiteLLM routing layer.
  • Test the Nango-based integration layer across connectors and the file ingestion pipeline including encryption, formatting edge cases, and audit trail continuity.

Peach Pilot transforms how businesses run with a platform that ingests everything about how a company operates and constructs a Company Brain. It is a funded early-stage AI startup headquartered in Atlanta, Georgia, with a working platform on live infrastructure.

US

  • Secure AI Systems and Use AI to Scale Security.
  • Deliver Application Security Reviews.
  • Advance CI/CD Pipeline Security.

Smartsheet helps people and teams achieve their goals with seamless work management and scalable solutions. They empower teams to automate tasks, uncover insights, and scale smarter, fostering a culture of innovation and impact with a focus on challenge and purpose.

US

  • Understand real security workflows across threat modeling, privacy, and vendor risk.
  • Integrate Clearly AI into those workflows alongside Jira, ServiceNow, Confluence, and GitHub.
  • Drive disciplined implementation from contract to production.

Clearly AI automates the most painful bottleneck in the enterprise: security and privacy reviews. We help security teams complete high-quality threat models, privacy impact assessments, and vendor risk evaluations in minutes instead of weeks. We are early and deeply technical, backed by Y Combinator and live with Fortune 500s and global brands.

US Unlimited PTO

  • Drive architecture and technical strategy for core platform systems, APIs, and data pipelines
  • Hire, manage, and develop a high-performing engineering team
  • Partner with Product and Data teams to define scope, timelines, and tradeoffs

VulnCheck is transforming exploit intelligence by helping security teams act faster. They deliver exploit intelligence, asset correlation, and contextual insights. Founded in 2021 in Lexington, Massachusetts, they have a transparent, collaborative, and supportive culture.

$150,000–$170,000/yr
US

  • Design, implement, and maintain reliable, scalable, and secure infrastructure, applications, and tooling, with a focus on our ML/AI pipelines and workloads
  • Write clean, maintainable code, and perform peer code-reviews
  • Write clear and concise documentation and engage in cross-team communication and knowledge sharing

Bright Machines is a next-generation, AI-enabled manufacturer focused on data center infrastructure assembly operations. The company utilizes AI-based robotics and software to assemble AI infrastructure hardware products for hyperscalers and leading OEMs, employing under 500 employees, with a culture rooted in innovation and expertise.

$217,000–$303,900/yr
US

  • Lead the measurement strategy for user safety, quantifying safety experiences across Reddit.
  • Use data-backed methods to inform strategic direction of Safety product development.
  • Design and execute experiments to estimate the impact and ROI of safety initiatives.

Reddit is a community-based platform built on shared interests and authentic conversations. With over 100,000 active communities and millions of daily active users, it fosters information sharing and discussions across diverse topics.

US

  • Own and maintain strong relationships with our top API partners, serving as their primary technical point of contact to ensure long-term alignment and collaboration.
  • Own full post-sale success: onboarding, adoption, renewal, and expansion for key accounts.
  • Conduct technical discovery sessions to bridge Runway's API capabilities with our partners' most complex challenges.

Runway is building AI to simulate the world through merging art and science. They believe that world models are at the frontier of progress in artificial intelligence. Runway's team consists of creative, open minded, caring and ambitious people who are determined to change the world.

US

  • Lead and mentor a high-performing team of security engineers, setting technical direction and standards for excellence.
  • Define and execute the security roadmap for infrastructure, remote access, endpoints, and M&A.
  • Design and implement security controls across cloud, production, and corporate environments.

Anduril Industries is a defense technology company transforming U.S. and allied military capabilities with advanced technology, powered by Lattice OS. They bring the expertise and business model of innovative companies to the defense industry, focusing on autonomy, AI, and networking.

US

  • Manage customer inquiries across email, chat, Slack, focusing on complex cases.
  • Provide technical and creative support to enterprise customers via written communication and calls.
  • Investigate technical issues and create detailed bug reports that save engineering time.

Runway is building AI to simulate the world through merging art and science. Their team consists of creative, open-minded, caring, and ambitious people determined to change the world.

$0–$0/yr
US Canada

  • Design, build, and ship agentic workflows across multiple domains.
  • Build multi-step agents capable of autonomous planning, context tracking, memory, tool use, and API orchestration.
  • Drive technical and architectural decisions to meet product requirements while also anticipating and designing for future needs

Cority helps customers see and prevent risks across their operations in real time. Our EHS+ platform converges people, data, and AI agents to provide a clear view of information people can trust. For 40 years, Cority has been the market leader in EHS+, recognized by top analysts and trusted by more than 1,500 of the most complex organizations worldwide.