Senior Site Reliability Engineer

ScienceLogic ☁️🤖💡

Remote regions

US

Benefits

Unlimited PTO

Job Description

Lead design reviews and the buildout of secure systems for delivering a new Artificial Intelligence product in SaaS, aiming for 99.99% uptime. Design, automate, test, and monitor the use of cloud-native technologies as a foundation for a service platform. Spend 75% of your time on forward-looking priorities, designing and building SaaS systems, while continuing to support the operations and maintenance of the current SaaS infrastructure. Investigate and resolve customer and operational issues with a mindset of fixing issues, not just mitigating them. Identify and automate the measurement of operations SLAs and SLOs. Triage incident response, document SOPs and runbooks, and train NOC team members. Write automation that can be easily supported and extended by others. Collaborate across the organization to design, build, and operationalize SaaS services that conform to security standards such as FedRAMP, SOC 2, and ISO. Participate in the on-call rotation as assigned. Take full responsibility for the availability and performance of the platform. Work on special projects as assigned.

About ScienceLogic

ScienceLogic is redefining IT operations for the modern enterprise with its AIOps platform that empowers organizations to achieve Autonomic IT.
