Role Overview:

  • Own the reliability, availability, and operational excellence of business-critical production systems.
  • This is a dedicated Site Reliability Engineering role, not a general DevOps or Infrastructure position.

Key Responsibilities:

  • Define, implement, and continuously improve Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets.
  • Lead Incident Command during production incidents and coordinate cross-functional response efforts.
  • Automate operational processes and reliability improvements using software engineering best practices.

Qualifications:

  • 5+ years of experience in Site Reliability Engineering or similar roles.
  • Proven experience defining and managing SLOs, SLIs, and Error Budgets.
  • Strong automation skills using Python, Go, or TypeScript.

Benefits & Perks:

  • Home office and remote work flexibility.
  • Competitive compensation based on experience.
  • Career plans and professional development opportunities.

About Oowlish:

Oowlish is a rapidly expanding software development company in Latin America. It is certified as a Great Place to Work and offers a nurturing environment with professional development opportunities.
