Work with customers and implement Observability solutions. Build and maintain scalable systems and robust automation that supports engineering goals. Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance. Proactively gather and analyze both metric and log data from systems and applications to perform anomaly detection, performance tuning, capacity planning and fault isolation.
Job listings
Planning is managed using a backlog tool and change requests are related to listed tools. The main task is to set up components and integrate interfaces in an automated way. The role involves executing tasks, participating in daily team calls, and maintaining, operating, and debugging enterprise tools to troubleshoot and resolve incidents, handle support tickets, and assist users. Integrating data sources into dashboards and log archives, developing predictive analytics models, and analyzing time-series data for forecasting is required.