Site Reliability Engineer

Cohere

Remote regions

Global

Benefits

6w PTO 26w maternity

Similar Jobs

See all

WHY THIS ROLE:

  • Energized by building high-performance, scalable and reliable machine learning systems?
  • Help define and build the next generation of AI platforms powering advanced NLP applications.
  • Work closely with many teams to deploy optimized NLP models to production.

As a Site Reliability Engineer you will:

  • Build self-service systems that automate managing, deploying and operating services.
  • Automate environment observability and resilience.
  • Build strong relationships with internal developers.

You may be a good fit if you have:

  • 5+ years of engineering experience running production infrastructure at a large scale.
  • Experience designing large, highly available distributed systems with Kubernetes.
  • Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving.

Cohere

Cohere is focused on scaling intelligence to serve humanity by training and deploying frontier models for developers and enterprises. They are a team of researchers, engineers, and designers. They value diversity and strive to create an inclusive work environment.

Apply for This Position