Key Responsibilities:

  • Investigate, troubleshoot, and resolve complex production issues across cloud and customer environments with a strong focus on root cause analysis.
  • Debug across Linux systems, Kubernetes clusters, networking layers, storage systems, and GPU-accelerated workloads.
  • Act as a senior escalation point for critical incidents, ensuring fast and effective resolution of high-impact issues.

Requirements:

  • Strong hands-on experience with Linux system administration and troubleshooting in production environments.
  • Solid expertise in Kubernetes and containerized application environments.
  • Good understanding of cloud infrastructure platforms such as AWS, GCP, Azure, or OpenStack.

Benefits:

  • Competitive compensation package aligned with experience and expertise.
  • Strong focus on career development, learning, and technical growth opportunities.
  • Flexible working arrangements with high autonomy and ownership.

Jobgether

The partner company provides advanced AI and cloud infrastructure solutions that support large-scale distributed computing and AI workloads. It operates in a fast-moving, collaborative environment, with highly skilled engineering teams focused on cutting-edge technology and operational excellence.
