Source Job

US

  • Design and build the orchestration layer using Kubernetes, Slurm, or comparable technologies.
  • Build customer-facing platform APIs, CLIs, web portals, and SDKs.
  • Drive infrastructure-as-code, multi-tenant isolation, and platform reliability.

Kubernetes Go Python Infrastructure As Code

20 jobs similar to Senior Platform Engineer

Jobs ranked by similarity.

Europe

  • Lead the investigation and resolution of complex infrastructure, networking, and platform-related incidents.
  • Provide technical leadership for Kubernetes platform operations and supporting infrastructure services.
  • Mentor and support AI Infrastructure & Platform Operations Engineers, sharing technical knowledge through documentation and training.

Mirantis helps organizations ship code faster on public and private clouds, providing a public cloud experience on any infrastructure from the data center to the edge. The company serves many of the world's leading enterprises, including Adobe, DocuSign, Liberty Mutual, and PayPal, and is a leader in container management.

SRE

Fal
$180,000–$250,000/yr
US

  • Own and operate our Kubernetes infrastructure.
  • Build and maintain CI/CD pipelines and deployment infrastructure.
  • Leverage AI to automate analysis and resolution of production issues.

Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production.

Global Unlimited PTO

  • Lead the architecture and implementation of managed Kubernetes infrastructure across AWS, Azure, and GCP for enterprise customer deployments.
  • Own the systems that provision, organize, and manage cloud accounts, including resource governance and multi-tenant isolation.
  • Mentor P3/P4 engineers and define architectural patterns that scale across the company's infrastructure.

Ditto builds the world's leading edge sync platform, enabling applications to share data peer-to-peer with or without internet connectivity. With over $145 million in funding and trusted by organizations like Chick-fil-A and Delta Airlines, Ditto is a fast-growing, globally distributed startup committed to building a diverse and inclusive team.

Canada

  • Build and maintain infrastructure platforms for over 200 backend services running on Kubernetes clusters with 40,000+ cores.
  • Lead and mentor other engineers, own complex infrastructure failures, and participate in a shared on-call rotation.
  • Drive cloud cost efficiency, estimate schedules, and use AI tools as a first-class collaborator in daily workflows.

Life360's mission is to keep people close to the ones they love through location sharing, safe driver reports, and crash detection. The company serves approximately 97.8 million monthly active users across more than 180 countries and has more than 500 remote-first employees.

UK

  • Manage and optimise Kubernetes clusters in GKE through Terraform.
  • Design and implement automation strategies that empower developers to self-serve.
  • Serve as the technical point-of-contact for GCP and Kubernetes-related queries.

Prolific builds the human data infrastructure that reshapes AI development by enabling collection of high-quality, ethically sourced human behavioral data. They are a mission-driven company with a competitive salary and benefits, offering remote working within a culture focused on impact and innovation.

Europe

  • Monitor, operate, and support production AI infrastructure platforms.
  • Investigate and resolve infrastructure, networking, hardware, and platform-related incidents.
  • Collaborate with engineering teams, hardware vendors, and datacenter personnel to resolve technical issues.

Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure infrastructure for AI and data-intensive applications. The company is growing and invests heavily in AI infrastructure and platform services.

India

  • Design, deploy, and manage Kubernetes-based platforms in production.
  • Implement and manage automation frameworks for infrastructure provisioning and operations.
  • Administer and optimize VMware environments (vSphere, ESXi, vCenter).

EPlus believes technology is a people business and delivers solutions that make a real difference. Their team is passionate, skilled, and driven, valuing collaboration, innovation, and extraordinary results and dedicated to fostering, cultivating, and preserving a culture that represents diversity, enables inclusion.

US

  • Design, deploy, and manage production Kubernetes clusters with workload scheduling, resource quotas, network policies, and RBAC.
  • Build and optimize CI/CD pipelines using Infrastructure as Code and GitOps principles.
  • Implement observability solutions using Prometheus, Grafana, and OpenTelemetry for performance tuning and reliability.

VerTALENTS is a subsidiary of VerSprite Cybersecurity, specializing in technology staffing. The company connects top technical talent with industry clients through various methods, adding value to both clients and candidates for full-time and contracting opportunities.

$200,000–$225,000/yr
Unlimited PTO

  • Partner with strategic customers during onboarding to maximize value from our platform.
  • Design tailored Kubernetes deployment and integration solutions.
  • Write scripts and tooling to accelerate customer time-to-value.

Edera is dedicated to making secure computing simple. The products and innovations we release will change everything. We operate as a team and value diversity in all its forms, understanding that different perspectives drive our success.

United States

  • Design and build core platform infrastructure for large-scale cloud-native data and analytics systems.
  • Own and improve CI/CD pipelines, testing frameworks, and deployment in a high-scale PaaS environment.
  • Contribute to reliability engineering, observability, and operational excellence across distributed systems.

Jobgether uses an AI-powered matching process to connect candidates with roles. The company is a growing platform focused on efficient job matching and data privacy compliance.

Global 16w maternity 16w paternity

  • Lead the design and implementation of self-service platform infrastructure for provisioning, deployment, and observability across engineering teams.
  • Evolve multi-tenant EKS foundations toward better reliability, security, scale, and multi-region connectivity.
  • Set delivery standards using Terraform, GitOps, and progressive rollout, while improving SLOs and alerting on Grafana Cloud.

Docker is a developer tooling company trusted by over 20 million monthly users and 20 billion container image pulls. They are a globally distributed, remote-first team building tools that define how software gets built and delivered.

Singapore Unlimited PTO

  • Lead forward-deployed engagements end to end, from scoping customer goals to production deployment of Stacklok's Enterprise platform.
  • Design and deliver changes to the platform's Kubernetes deployment, contributing fixes back to the product and unblocking enterprise adoption.
  • Act as technical SME for enterprise adoption, answering deep platform and Kubernetes questions while coaching teammates in your areas of expertise.

Stacklok is building the control plane for enterprise AI agents, enabling organizations to run, govern, and secure them on Kubernetes. The company is an early-stage startup founded by two of the creators of Kubernetes, with a strong open source focus.

Global

  • Make high-quality, data-driven decisions on building the next generation of our production platform and deliver results.
  • Own how we test, build, and deploy code in a high-scale PaaS environment, collaborating across the company on design and technology choices.
  • Blaze a trail as part of a small platform engineering team, driving reliability practices and directly influencing what we work on and how we work.

Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life with its unified DataOps platform Astro, built on Apache Airflow. Trusted by over 800 enterprises, the company is a leader in data orchestration and innovation.

US

  • Develop and maintain core messaging, positioning, and value propositions for Mirantis’ AI infrastructure and cloud-native platform portfolio.
  • Translate technical capabilities like GPU orchestration and MLOps into compelling narratives for practitioners, platform engineers, and executives.
  • Produce high-quality technical content including solution briefs, white papers, blog posts, and enable sales teams with battlecards and objection-handling guides.

Mirantis is a Kubernetes-native AI infrastructure company that enables organizations to build scalable, secure, and sovereign infrastructure for AI and data-intensive workloads. The company fosters a culture of openness, collaboration, risk-taking, and continuous growth, working with passionate colleagues to help Fortune 500 customers implement next-generation cloud technologies.

Global

  • Own and evolve Webshare's production infrastructure by leading migration from Docker Swarm to Kubernetes and maintaining high availability across hundreds of servers and ~50 services.
  • Drive observability, establish IaC practices, CI/CD pipeline reliability, and participate in on-call rotation alongside backend developers.
  • Contribute platform tooling to improve developer experience and reduce infrastructure toil, ensuring no silos and shared infrastructure ownership.

We develop cutting-edge proxy and web data scraping solutions for thousands of the world's best known businesses, including Fortune 500 companies. We are a team of 500+ professionals with a culture focused on growth, learning, and shared infrastructure ownership.

United States 6w PTO

  • Build and operate the internal engineering platform that provides application engineers with the tools, systems, and Kubernetes clusters they need to deploy and run their workloads.
  • Focus on cloud infrastructure, capacity management, security, engineering productivity, monitoring, and US Federal compliance across squads.
  • Participate in on-call rotations to ensure the health of the system and understand how people use our products.

Grafana Labs, the company behind the open observability cloud, is founded on the principles of open source, open standards, open ecosystems, and open culture. We are a 100% remote company with 1,600+ team members across 40+ countries, backed by leading investors including Lightspeed Venture Partners, Sequoia Capital, GIC, Coatue, J.P. Morgan, CapitalG, and Lead Edge Capital.

United States Unlimited PTO

  • Own full-stack design and delivery of platform capabilities from architecture to deployment and observability.
  • Build open source infrastructure packages for airgap and cloud-native environments and write comprehensive tests.
  • Work directly with product and customers to translate mission problems into platform capabilities and mentor team members.

Defense Unicorns delivers mission value by streamlining software delivery for defense and civil agencies, focusing on speed, security, and optionality. The team includes innovators, software engineers, and veterans with decades of experience delivering technology programs across the federal market.

US

  • Owning cloud infrastructure on Azure, data pipeline orchestration, CI/CD, and observability to ensure production-grade reliability.
  • Building and maintaining foundational infrastructure that enables fast engineering velocity without breaking things.
  • Applying SRE principles such as SLOs, capacity planning, incident response, and eliminating toil through automation.

Terzo's platform processes enterprise-scale document corpora, powers real-time AI agents, and serves the Financial Intelligence Graph to Fortune 500 customers. As a small, senior team with strong ownership and minimal bureaucracy, we foster a culture of collaboration, mentorship, and continuous improvement.

Brazil

  • Develop high-performance cloud and container technologies using Python as the main programming language.
  • Actively participate in all aspects of an agile software development process and contribute to a highly available orchestration platform.
  • Collaborate effectively across remote teams and time zones to deliver high-quality solutions and code.

Coforge is a global IT solutions company that hires professionals based solely on their skills. They foster a diverse and inclusive culture, focusing on cloud, container, and 5G technologies.

Global

  • Build the self-improvement machine for integrations that automatically learn and fix APIs.
  • Own integration quality end to end, catching issues before customers do.
  • Build and run the tool gateway, persistent sandboxes, and multi-tenant data layer.

Viktor builds a general AI agent that integrates with thousands of tools and APIs to automate workflows for companies. They are a fast-growing, small team with a high-trust, low-process culture.