As a Sr. Platform Engineer, you'll lead modernization efforts across Kubernetes environments to improve scalability, reliability, and developer productivity. Drive the transition from legacy infrastructure, reducing operational complexity and aligning with standardized deployment practices. Enhance and expand the Backstage developer portal to unify CI/CD pipelines, observability, and service management tools. Define and implement observability practices including metrics, logging, and tracing that reduce incident response times and improve system insights.
Partner with AI/ML teams to ensure infrastructure readiness for scalable model deployment and reproducibility of pipelines. Design and maintain self-service templates, infrastructure blueprints, and rollout guides to promote engineering best practices. Collaborate across platform teams to troubleshoot systemic issues and continuously improve service health and performance.