Job Description
As a Site Reliability Engineer (SRE), you will provide relief and sustainable resolution to issues within our infrastructure, conduct root cause analysis of incidents and implement preventive measures, and participate in troubleshooting bridges and provide support during critical incidents. You'll use your experience in software development, systems engineering, and networking to proactively prevent repeatable issues. Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design while driving a culture of intolerance to manual activity which results in a highly automated environment delivering scalable solutions. The SRE will also design, develop, and maintain scalable and reliable systems; implement and manage monitoring, alerting, and incident response processes; collaborate with development teams to ensure the reliability and performance of new features; automate repetitive tasks to improve efficiency and reduce human error; and innovate and continuously improve system reliability, performance, and capacity.
About ServiceNow
ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®.