Source Job

Global

  • Act as the final escalation point for complex Cloud infrastructure issues, analyzing logs and metrics to identify root causes.
  • Own high-severity incidents, coordinate resolution with Engineering, DevOps, and SRE teams, and contribute to preventive actions.
  • Mentor L1 and L2 support engineers, create runbooks and SOPs, and collaborate with Product teams to reproduce issues.

Cloud Infrastructure Networking Kubernetes Scripting

20 jobs similar to Principal Support Engineer (L3, Edge Cloud)

Jobs ranked by similarity.

  • Maintain the reliability and performance of customer environments remotely, supporting Mirantis Opensack/k0s layers.
  • Diagnose and resolve system-level issues, requiring hands-on Linux administration experience.
  • Troubleshoot customer environments based on Linux, OpenStack, Kubernetes, networking, and other cloud technologies; detect, report, and resolve issues.

Mirantis helps enterprises move to the cloud on their terms, delivering a true cloud experience on any infrastructure, powered by Kubernetes. They serve many of the world’s leading enterprises and value openness, collaboration, risk-taking, and continuous growth.

Global

  • Act as the 3rd-level escalation point for complex technical issues related to CDN and Edge Network products.
  • Diagnose and resolve advanced issues involving caching, DNS, routing, load balancing, SSL/TLS, and web security.
  • Take ownership of high-severity incidents (P1/P2) and drive resolution in collaboration with Engineering, Network, and Operations teams.

Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They have 550+ professionals and offer a global team environment.

US 4w PTO 14w maternity 14w paternity

  • Own Render's core network infrastructure across multiple data centers and cloud providers, shaping how networking evolves as Render rapidly scales.
  • Design and build customer-facing networking capabilities that give users greater flexibility in how their services connect and communicate, and how traffic is routed.
  • Investigate complex networking issues across the stack, from the kernel and data plane to distributed systems and edge networking.

Render is building a modern cloud platform for developers creating AI-native, full-stack, multi-service applications, eliminating the tradeoff between hyperscaler power and developer-friendliness. They are a diverse and talented team that values craft, velocity, and user experience.

Europe

  • Support and improve hybrid production infrastructure for 15+ development teams handling 100+ products, 10K+ domains, and billions of hits per day.
  • Architect and plan improvements of a multi-datacenter development environment, advocating for migration to automated, elastic infrastructures using cloud, Kubernetes, and serverless technologies.
  • Document processes, monitor performance metrics, promote CICD practices, and mentor junior DevOps engineers.

Aylo is a tech pioneer that offers world-class adult entertainment and games on safe, popular platforms. With an international team of dynamic innovators, the company focuses on trust-and-safety protocols and has offices in Montreal, Austin, and Nicosia.

US

  • Serve as the top-tier technical expert to resolve complex issues across endpoints, identity, networks, and core business applications, documenting root causes and scalable solutions.
  • Maintain strong security and compliance across all IT workflows, applying data security principles (e.g., MFA, least-privilege access) and ensuring all records (tickets, assets, changes) are audit-ready.
  • Mentor junior team members, providing guidance and stepping in to handle frontline support during high-volume periods.

Equip is a virtual eating disorder treatment program that aims to ensure everyone with an eating disorder can access effective treatment. Founded in 2019, Equip has a highly-engaged, passionate, and diverse culture, operating in all 50 states and partnering with most major health insurance plans.

$200,000–$225,000/yr

  • Lead the evaluation, adoption, and execution of technology initiatives.
  • Recruit, mentor, and motivate a high-performance operations staff.
  • Drive operational excellence through structured incident, problem, and change management practices.

Business Wire is a press release distribution company. The company's total rewards include remote work, health benefits, fitness allotment, and a 401(k) plan.

Europe

  • Lead the investigation and resolution of complex infrastructure, networking, and platform-related incidents.
  • Provide technical leadership for Kubernetes platform operations and supporting infrastructure services.
  • Mentor and support AI Infrastructure & Platform Operations Engineers, sharing technical knowledge through documentation and training.

Mirantis helps organizations ship code faster on public and private clouds, providing a public cloud experience on any infrastructure from the data center to the edge. The company serves many of the world's leading enterprises, including Adobe, DocuSign, Liberty Mutual, and PayPal, and is a leader in container management.

$110,000–$140,000/yr
US

  • Perform systems administration and maintenance including patching and vulnerability scanning.
  • Primarily support AWS environments, including Windows and Linux virtual machines.
  • Troubleshoot issues across network, compute, application, and identity layers.

Tyto Athene delivers mission-focused digital transformation through IT services and solutions. They have over 50 years of experience and foster a collaborative, innovative, and mission-driven environment.

US Unlimited PTO

  • Provide technical support to customers through email, screen sharing, and chat within established SLAs.
  • Own and resolve complex technical customer issues, partnering with Technical Support Specialists.
  • Problem-solve and troubleshoot in a repeatable manner, documenting in the Support CRM to identify trends.

Vanta helps businesses earn and prove trust by enabling companies to practice better security. They have a talented team and empower companies to improve and prove their security.

US

  • Design, implement, and automate mission-critical cloud infrastructure solutions for enterprise customers globally.
  • Architect and support highly available, resilient, and disaster recovery-enabled infrastructure environments.
  • Automate infrastructure provisioning and operational activities using Terraform, scripting, and Infrastructure-as-Code methodologies.

Axway has been shaping the future of enterprise integration for over 25 years. They are a global industry leader, helping organizations drive digital transformation with secure, mission-critical software that powers impactful business outcomes. As part of 74Software, Axway is backed by a group of software companies and focuses on delivering long-term value, leveraging cutting-edge technology, and fostering strong client partnerships.

US

  • Working with clients to understand their requirements and technical challenges.
  • Developing solutions using cloud technologies and evangelize the value of your solution to the client team.
  • Acting as the lead technical member of the project team, providing delivery and technical oversight.

Zencore is a fast-growing company founded by former Google Cloud leaders, architects, and engineers. They aim to eliminate obstacles, reduce risk, and accelerate timelines for customers transitioning to Google and seeking assistance with data and application modernization.

Europe

  • Design, implement, and maintain GCP Landing Zone components.
  • Manage infrastructure using Terraform and CI/CD pipelines.
  • Configure and troubleshoot networking (VPC, VPN, routing, DNS).

Software Mind develops impactful solutions for companies globally, working with tech giants and on transformative projects. They foster a culture of openness, respect, and enjoyment, building cross-functional engineering teams that value passion and creativity.

US

  • Design, build, and operate core cloud infrastructure across compute, storage, databases, and networking layers.
  • Own and improve the reliability, scalability, and security of Valon’s production systems as we scale to support major enterprise deployments.
  • Evaluate, adopt, and operationalize new infrastructure technologies (e.g., Vitess, Clickhouse, Redis) to meet evolving product and scale requirements.

Valon is building the AI-native operating system for regulated finance, starting with mortgage servicing. They're a Series C company backed by a16z, transforming industries that others have written off as too complex to innovate.

$29,000–$36,000/yr
India

  • Design, build, and maintain scalable, reliable systems on GCP.
  • Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager.
  • Manage incident response, conduct postmortems, and implement improvements to reduce recurrence.

SupplyHouse.com is an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004. They value every individual team member and cultivate a community where people come first with Generosity, Respect, Innovation, Teamwork, and GRIT.

Global 16w maternity 16w paternity

  • Lead the design and implementation of self-service platform infrastructure for provisioning, deployment, and observability across engineering teams.
  • Evolve multi-tenant EKS foundations toward better reliability, security, scale, and multi-region connectivity.
  • Set delivery standards using Terraform, GitOps, and progressive rollout, while improving SLOs and alerting on Grafana Cloud.

Docker is a developer tooling company trusted by over 20 million monthly users and 20 billion container image pulls. They are a globally distributed, remote-first team building tools that define how software gets built and delivered.

US

  • Provide prompt and courteous support to internal users, resolving complex issues and documenting root causes.
  • Use CLI and system logs to diagnose issues, performing HTTP/DNS/network checks.
  • Maintain SOPs and knowledge articles to reduce resolution times and re-opens.

Equip is a virtual eating disorder treatment program that aims to make effective treatment accessible to everyone. They offer dedicated care teams and are partnered with major health insurance plans, operating in all 50 states.

$188,550–$212,150/yr
Global Unlimited PTO

  • Own the technical direction of Remote's SRE/Platform domain.
  • Define and drive the reliability strategy across the platform.
  • Identify and lead AI enablement initiatives across the engineering organisation.

Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.

$120,000–$170,000/yr
Global Unlimited PTO

  • Own and evolve Quansight's cloud infrastructure across AWS, Azure, and GCP.
  • Build, deploy, and maintain internal dashboards and reporting for operations and project management.
  • Lead infrastructure engagements for clients from scoping and architecture through delivery, upskilling client teams.

Quansight is rooted in the Python and PyData ecosystems. They provide services ranging from open-source software development to training and consulting, believing in a culture of do-ers, learners, and collaborators.

US

  • Own and operate end-to-end infrastructure for backend services, frontend systems and databases.
  • Build and maintain reliable deployment workflows including CI/CD pipelines and rollback procedures.
  • Improve system-wide observability through metrics, logging, alerting, and monitoring to ensure uptime.

Jito Labs builds a high-performance trading terminal on Solana. They are a lean, high-output team building something that sits at the intersection of execution quality, user experience, and on-chain infrastructure.

$115,000–$130,000/yr
US Unlimited PTO

  • Develop and maintain scalable automation and integrations across cloud platforms and services.
  • Design, implement, and operate CI/CD pipelines using Jenkins, Dagger, Terraform, and Docker.
  • Build, operate, and troubleshoot workloads on Kubernetes, using Kustomize and Helm.

People Inc. is America’s largest digital and print publisher. Our brands harness the best intent-driven content, the fastest sites, and the fewest ads to help nearly 200 million people every month make decisions.