Job Description
Develop comprehensive observability best practices and standardize guidelines for metrics, traces, and logs, ensuring consistent implementation and adoption across all engineering teams. Partner closely with various engineering teams to enhance overall system reliability and performance by defining clear Service Level Indicators (SLIs) and Service Level Objectives (SLOs), and seamlessly integrating observability practices into continuous integration and continuous deployment (CI/CD) pipelines to promote a culture of "observability by design." Drive the automation of monitoring infrastructure through Infrastructure-as-Code (e.g., leveraging the Terraform Datadog provider) and develop intuitive self-service observability tools. Continuously refine and optimize alerting mechanisms by meticulously tuning thresholds and implementing intelligent noise reduction strategies. Provide comprehensive training and ongoing support to product teams, enabling them to effectively utilize observability tools. Proactively research, evaluate, and integrate new and emerging observability tools and technologies as needed.
About Aircall
Aircall is a unicorn AI-powered customer communications platform used by 22,000+ companies worldwide to drive revenue, faster resolutions, and scale.