Source Job

UK 5w PTO

  • Own the design, deployment, and operation of OpenStack and Kubernetes environments to ensure performance, scalability, and resilience for GPU workloads.
  • Build and improve infrastructure using infrastructure-as-code and GitOps practices, driving automation across provisioning, deployment, and operational workflows.
  • Optimize GPU workload scheduling using Kubernetes and NVIDIA tooling, and implement monitoring, logging, and alerting to ensure platform stability.

OpenStack Kubernetes Linux Networking Infrastructure Automation

17 jobs similar to Senior Infrastructure Engineer

Jobs ranked by similarity.

UK 5w PTO

  • Own and drive the design, deployment, and operation of OpenStack and Kubernetes clusters optimized for GPU workloads.
  • Lead and develop a team of 4-5 infrastructure engineers, setting clear direction and standards through automation and incident management.
  • Collaborate closely with DevOps, Product, and Support teams to align infrastructure with customer needs and communicate performance to leadership.

NexGen Cloud is the company behind Hyperstack, a full-stack AI cloud serving tens of thousands of customers from AI researchers to enterprises. It is a fast-moving team working at the cutting edge of AI cloud infrastructure, with a collaborative and international culture built on trust and ownership.

Australia 5w PTO

  • Own the design, deployment and operation of OpenStack and Kubernetes environments.
  • Build and improve infrastructure using infrastructure-as-code and GitOps practices.
  • Optimise GPU workload scheduling using Kubernetes and NVIDIA tooling.

NexGen Cloud is building next-generation GPU cloud infrastructure, and is the company behind Hyperstack, a high-performance cloud platform designed for compute-intensive workloads. We're a scale-up by design, solving complex infrastructure challenges at pace, with real-world impact.

Australia

  • Own and drive the design, deployment, and operation of OpenStack and Kubernetes clusters optimised for GPU workloads
  • Lead and develop a team of 4–5 infrastructure engineers, setting clear direction and standards
  • Build and improve infrastructure through automation (IaC, GitOps, CI/CD pipelines)

NexGen Cloud is a fast-growing company building next-generation GPU cloud infrastructure. At the core of NexGen Cloud is a team of curious, driven people who care deeply about quality, ownership and collaboration.

Europe

  • Develop new applications and services that expand the ecosystem of our cloud platform.
  • Code, test, and deploy frequently — embracing modern development practices.
  • Automate with common automation frameworks to eliminate manual work.

Deutsche Telekom IT Solutions is a subsidiary of the Deutsche Telekom Group, ranked as Hungary’s most attractive employer in 2025. The company provides a wide portfolio of IT and telecommunications services with more than 5300 employees, serving hundreds of large customers in Germany and other European countries.

US

  • Lead the design and implementation of scalable, secure, and resilient cloud infrastructure across AWS and Azure.
  • Drive the architectural vision and strategy, ensuring alignment with long-term business goals.
  • Take the lead on automating and accelerating SDLC processes by identifying bottlenecks.

Candidly flips the script on planning, borrowing, repaying, and saving for college and is a category leader with an AI-driven student debt and savings optimization platform. They partner with hundreds of top employers and have a fully remote, international team of 70+ including alumni from Google, UBS, and Twitter.

Global

  • Design and implement infrastructure and tools that empower our product teams to rapidly and securely iterate, emphasizing reliability and automation.
  • Influence the strategic direction of our infrastructure and operational practices, ensuring that we are well-positioned to scale and support our growing organization.
  • Take a proactive role in the resolution of production issues, ensuring that we are well-prepared to handle incidents and that we learn from them in a blameless manner.

SSV Labs is the core team behind the SSV Network - pioneering decentralized infrastructure for Ethereum staking. They are building tools, protocols, and standards to make staking more secure, scalable, and trustless.

Europe 5w PTO

  • Work with other Engineering teams to design sustainable infrastructure and microservice solutions.
  • Automate tools and infrastructure to reduce manual work.
  • Monitor applications and participate in an on-call rotation as required.

Bloomreach is building the world’s premier agentic platform for personalization, revolutionizing how businesses connect with their customers by building and deploying AI agents to personalize the entire customer journey. They power personalization for more than 1,400 global brands.

Spain 6w PTO

  • Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure.
  • Diagnosing and eliminating cross-layer failure modes.
  • Designing safe upgrade and rollout strategies at scale.

Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana, its open source visualization tool. Grafana Labs helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and its team thrives in an innovation-driven environment.

$163,339–$204,743/yr
Europe Unlimited PTO

  • Support and enable internal business units through shared engineering services.
  • Build relationships with multiple stakeholders across the organization.
  • Automate upgrades for managed services and VMs.

Tailscale is building the new Internet by delivering software that makes it easy to securely interconnect people and their devices, no matter where they are. Founded in 2019 and fully distributed, they're backed by Accel, CRV, Insight, Heavybit, and Uncork Capital.

Europe

  • Improve the platform architecture while taking IT security seriously.
  • Design and develop innovative platform and software-as-a-service features.
  • Be a trusted advisor for OpenStack-based public cloud technology.

Deutsche Telekom IT Solutions is a subsidiary of the Deutsche Telekom Group and was ranked as Hungary’s most attractive employer in 2025. As a company, they provide a wide portfolio of IT and telecommunications services with more than 5300 employees.

Global Unlimited PTO

  • Lead a platform engineering team delivering managed Kubernetes and cloud infrastructure across multiple providers and deployment models.
  • Own the platform delivery roadmap, coordinating with Cloud Organization, Security, and Professional Services to manage dependencies.
  • Drive foundational infrastructure programs in private networking and cloud governance to establish Ditto's deployment baseline.

Ditto redefines data movement at the edge by providing a peer-to-peer sync engine for building resilient, real-time applications in any network condition. This venture-backed, globally distributed startup is trusted by major enterprises across aviation, retail, and defense, and is committed to building a diverse and inclusive team.

US

  • Design, build, and operate core cloud infrastructure across compute, storage, databases, and networking layers.
  • Own and improve the reliability, scalability, and security of Valon’s production systems as we scale to support major enterprise deployments.
  • Evaluate, adopt, and operationalize new infrastructure technologies (e.g., Vitess, Clickhouse, Redis) to meet evolving product and scale requirements.

Valon is building the AI-native operating system for regulated finance, starting with mortgage servicing. They are a Series C company backed by a16z, transforming industries that others have written off as too complex to innovate.

$160,000–$200,000/yr
US Unlimited PTO

  • Maintain, optimize, and enhance on-premises and cloud computing environments.
  • Execute technical aspects of implementation projects, ensuring seamless software integration and customization.
  • Automate Infrastructure-as-Code (IaC) to manage virtual machines and deploy containers, services, and other infrastructure.

Striveworks helps organizations harness AI to solve national security and business challenges, acting as a command center for data and models. Founded by data scientists and engineers, they aim to simplify the deployment and optimization of AI systems, ensuring reliability and scalability.

Global

  • Deliver a scalable internal infrastructure platform on public cloud environments.
  • Establish and evolve Kubernetes-based platform capabilities to support high-availability, production-grade workloads at scale.
  • Build a secure and reliable foundation that supports CI/CD pipelines and minimizes operational risk across engineering teams.

Chainlink is the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance (DeFi). Since inventing decentralized oracle networks, Chainlink has enabled tens of trillions in transaction value and now secures the vast majority of DeFi.

$145,000–$170,000/yr
US Unlimited PTO 12w maternity 12w paternity

  • Learn platform infrastructure, developer tooling, and deployment patterns.
  • Own and drive at least one architecture decision that improves platform reliability.
  • Ship infrastructure improvements that measurably improve developer experience or platform stability.

Homebot is a homeownership platform for lenders and real estate, title & insurance agents that drives client retention and partner referrals. They maintain a clear focus on culture, engagement, and creating an environment where people are valued and can thrive.

Global

  • Resolving complex customer problems related to Ubuntu, OpenStack, Kubernetes, and other open source software
  • Maintaining a close working relationship with Canonical's field, support and product engineering teams
  • Developing fixes, backporting patches, and working with upstream for inclusion

Canonical is a leading provider of open source software and operating systems. With 1200+ colleagues in 75+ countries, they are a pioneer of global distributed collaboration and have very few office-based roles.

Hungary

  • Tackle complex challenges in the day‑to‑day operations of a hyperscaler’s cloud backend
  • Work hands‑on at the console with a strong hardware focus
  • Design, build, and operate monitoring and quality‑assurance solutions

Deutsche Telekom IT Solutions is a subsidiary of the Deutsche Telekom Group, providing IT and telecommunications services. With over 5300 employees, they serve hundreds of large customers in Germany and other European countries, fostering an ethical and collaborative culture.