Own the reliability, availability, and operational excellence of business-critical production systems.
This is a dedicated Site Reliability Engineering role, not a general DevOps or Infrastructure position.

Key Responsibilities:

Define, implement, and continuously improve Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets.
Lead Incident Command during production incidents and coordinate cross-functional response efforts.
Automate operational processes and reliability improvements using software engineering best practices.

Qualifications:

5+ years of experience in Site Reliability Engineering or similar roles.
Proven experience defining and managing SLOs, SLIs, and Error Budgets.
Strong automation skills using Python, Go, or TypeScript.

Benefits & Perks:

Home office and remote work flexibility.
Competitive compensation based on experience.
Career plans and professional development opportunities.

Oowlish

Oowlish is a rapidly expanding software development company in Latin America. It is certified as a Great Place to Work and offers a nurturing environment with professional development opportunities.

Apply for This Position