Job Description
As an Infrastructure Engineer focusing on Observability, you'll be responsible for designing, building, and maintaining observability platforms spanning metrics, logs, traces, and events. You will create dashboards and alerting for internal stakeholders and scoped visibility for external customers.
The role includes ingesting and correlating telemetry from various sources like GPUs, CPUs, networking components, containers, APIs, and BMC/Redfish. Your responsibilities will also involve implementing noise-resistant alerting pipelines to improve detection and reduce operational load. Collaboration with infrastructure, platform, and customer-facing teams will be crucial to embed observability into workflows. The role also involves contributing to broader infrastructure engineering projects beyond observability.
About Voltage Park
Voltage Park designs and operates the systems that manage thousands of bare-metal servers, GPUs, and high-performance networks across multiple data centers.