Job Description
As a Staff Site Reliability Engineer, you will set and drive a multi-year reliability strategy that enables Zapier engineers to deliver reliable products at scale, shaping how engineers instrument, monitor, debug, and learn from their systems. You'll define standards and best practices for instrumentation, telemetry, alerting, and visibility, and partner with product and platform teams to remove friction in adopting golden paths for observability, service ownership, and incident response. You will also mentor and guide engineers, champion AI integration, and act as a reliability steward for Zapier. That includes helping shape what “owning a service” means at Zapier, improving how engineers measure and understand reliability, and improving how they use AI tools to optimize development, debugging, analysis, and documentation workflows.
About Zapier
Zapier is a platform that helps millions of businesses around the world scale with automation and AI. Its mission is to make automation work for everyone.