Source Job

EU

  • Design, build, and maintain platform infrastructure using IaC principles with tools like Terraform.
  • Develop memory services, vector storage patterns, and semantic search capabilities.
  • Operate a Kubernetes-based platform for the Data & AI department, enabling deployment and scaling via GitOps.

Kubernetes Terraform Python Go Databricks

20 jobs similar to Senior Platform Engineer (m/f/d)

Jobs ranked by similarity.

Canada

  • Build and maintain infrastructure platforms for over 200 backend services running on Kubernetes clusters with 40,000+ cores.
  • Lead and mentor other engineers, own complex infrastructure failures, and participate in a shared on-call rotation.
  • Drive cloud cost efficiency, estimate schedules, and use AI tools as a first-class collaborator in daily workflows.

Life360's mission is to keep people close to the ones they love through location sharing, safe driver reports, and crash detection. The company serves approximately 97.8 million monthly active users across more than 180 countries and has more than 500 remote-first employees.

US Unlimited PTO

  • Own and scale AI compute and deployment platforms including Kubernetes and GitOps pipelines.
  • Build inference infrastructure and observability stacks for LLM-powered workflows.
  • Drive security, compliance, and governance at the systems level in a regulated healthcare environment.

Hims & Hers is a leading health and wellness platform focused on making healthcare accessible and personal. As a publicly traded company on the NYSE (HIMS), it offers flexible/remote work and a culture centered on innovation and employee well-being.

Global

  • Develop, deploy, maintain, operate, and support an Agentic AI Developer Platform.
  • Strongly oriented towards technical implementation and operation of the platform with hands-on experience.
  • Collaborate and lend experience to less experienced team members as needed.

We build modern Machine Learning systems for demand planning and budget forecasting, offering custom AI solutions to optimize cloud-based systems. We are a remote startup with a culture that values being data nerds, open team players, ownership, and a positive mindset.

US Unlimited PTO

  • Design and operate the developer platform powering services, focusing on AI-native tooling like agentic service catalogs and MCP-backed APIs.
  • Build and scale agent golden paths, treating AI agents as first-class platform users, and drive Istio service mesh adoption across the fleet.
  • Establish platform guardrails with scorecards and CI policies, and own core components like Kubernetes, IaC, and CI/CD pipelines.

Hims & Hers is the leading health and wellness platform on a mission to help the world feel great through better health. The company is public, traded on the NYSE as "HIMS," and offers a talent-first flexible/remote work approach with outstanding benefits and culture.

  • Own reliability, latency, and performance for AI platform services and data infrastructure on AWS.
  • Design and maintain CI/CD pipelines, infrastructure-as-code, and observability frameworks across the stack.
  • Partner with AI and data engineers to ensure secure, cost-optimized, and scalable deployment of platform components.

HHAeXchange is the leading technology platform for home and community-based care, providing an end-to-end homecare solution for people who are aging or have disabilities. Founded in 2008, the company is passionate about transforming healthcare by connecting patients, providers, managed care organizations, and states.

Latin America

  • Build and operate the self-service infrastructure platform for developers and AI agents.
  • Own core platform layers including CI/CD, GitOps, IaC module catalog, and golden-path scaffolding.
  • Build internal tooling, observability, and metrics to make pipelines observable and improvable.

Luxury Presence is building the AI growth platform for real estate. Backed by top investors like Bessemer Venture Partners, we're a Series C company with over $100M in ARR and more than 90,000 real estate professionals using our platform.

US

  • Harden, simplify, and operationalize the production platform on Google Cloud Platform for enterprise customers.
  • Own and evolve core infrastructure, CI/CD pipelines, and infrastructure as code practices using Terraform or Pulumi.
  • Drive observability, developer productivity, and engineering culture to raise the bar across the team.

GC AI is the fastest-growing legal AI platform for in-house legal teams, building the future of legal work. With over 1,700 companies using the platform, including 150+ public companies and 25+ unicorns, the team has 10x'd revenue in 12 months and raised a $60 million Series B.

US Unlimited PTO 16w maternity 4w paternity

  • Build and operate the ML lifecycle platform, including tooling for experiment tracking, model registry, and versioned pipelines.
  • Own CI/CD and deployment for ML workloads, building automated pipelines from notebook to production.
  • Make models observable and reliable in production with monitoring for latency, drift, data quality, and cost signals.

dv01 provides a data analytics platform for the structured finance market, offering transparency into investment performance and risk for lenders and Wall Street investors. With over 400 clients and coverage of over 100 million loans, dv01 is a data-first company with a diverse and innovative culture.

US

  • Owning cloud infrastructure on Azure, data pipeline orchestration, CI/CD, and observability to ensure production-grade reliability.
  • Building and maintaining foundational infrastructure that enables fast engineering velocity without breaking things.
  • Applying SRE principles such as SLOs, capacity planning, incident response, and eliminating toil through automation.

Terzo's platform processes enterprise-scale document corpora, powers real-time AI agents, and serves the Financial Intelligence Graph to Fortune 500 customers. As a small, senior team with strong ownership and minimal bureaucracy, we foster a culture of collaboration, mentorship, and continuous improvement.

US

  • Design, deploy, and manage production Kubernetes clusters with workload scheduling, resource quotas, network policies, and RBAC.
  • Build and optimize CI/CD pipelines using Infrastructure as Code and GitOps principles.
  • Implement observability solutions using Prometheus, Grafana, and OpenTelemetry for performance tuning and reliability.

VerTALENTS is a subsidiary of VerSprite Cybersecurity, specializing in technology staffing. The company connects top technical talent with industry clients through various methods, adding value to both clients and candidates for full-time and contracting opportunities.

UK

  • Manage and optimise Kubernetes clusters in GKE through Terraform.
  • Design and implement automation strategies that empower developers to self-serve.
  • Serve as the technical point-of-contact for GCP and Kubernetes-related queries.

Prolific builds the human data infrastructure that reshapes AI development by enabling collection of high-quality, ethically sourced human behavioral data. They are a mission-driven company with a competitive salary and benefits, offering remote working within a culture focused on impact and innovation.

Europe

  • Design, build, and maintain scalable cloud infrastructure for an AI-powered platform.
  • Manage and optimize AWS environments, develop Infrastructure as Code using Terraform, and build CI/CD pipelines.
  • Troubleshoot production issues and implement security best practices across infrastructure and deployment pipelines.

North America

  • Design, build, and maintain cloud infrastructure across Azure, GCP, and AWS, including landing zones, Kubernetes, and CI/CD pipelines.
  • Implement monitoring, security, and hybrid connectivity for enterprise-scale cloud environments.
  • Collaborate cross-functionally, mentor engineers, and leverage AI tools to accelerate infrastructure development.

Applied is an Insurtech company that builds technology solutions for insurance professionals. With over 40 years of experience, they foster a culture of trust, inclusion, and growth.

Brazil

  • Evolve and maintain our Kubeflow, Feast, and Spark-on-Kubernetes ML infrastructure.
  • Design tools and APIs empowering teams to transition from centralized bottlenecks to self-service excellence.
  • Collaborate with Data Science teams to apply software engineering best practices to ML workflows.

Wellhub revolutionizes workplace wellness by connecting employees to partners for fitness, mindfulness, therapy, nutrition, and sleep in one subscription. Headquartered in NYC with team members across the globe, we value wellbeing, collaboration, and different perspectives.

US Unlimited PTO

  • Build end-to-end automation solutions using GitLab CI, AKS, Terraform, and Ansible with security controls built in from the start.
  • Design, deploy, and secure MCP servers on Azure, exposing tools and data for AI agents with attention to access boundaries.
  • Integrate AI agent skills, orchestrate multi-step workflows, and enable autonomous interactions within defined security guardrails.

General Dynamics Mission Systems engineers a diverse portfolio of high technology solutions for defense and scientific missions. With a global team of 12,000+ professionals, they value trust, honesty, and transparency, offering a flexible work environment and competitive benefits.

US

  • Work as part of a small, cross-functional XP team installing Imogen into client cloud environments, partnering with client infosec, infrastructure, and IT teams.
  • Pair program with other engineers and collaborate closely with product managers and designers.
  • Lead technical discovery efforts for existing customer systems and adapt Imogen to their public cloud estate.

Mechanical Orchard specializes in safely rewriting critical business applications using a unique method that eliminates modernization risks. The company is known for its expertise in Agile practices and has a small, cross-functional team culture focused on collective ownership and continuous improvement.

United States Unlimited PTO

  • Design and build scalable backend systems powering AI agents in real-time enterprise environments.
  • Develop agent orchestration frameworks and low-latency inference pipelines integrating LLMs and SLMs.
  • Build robust APIs and work with cross-functional teams to productionize agentic AI at scale.

Level AI is an AI-native platform that helps enterprises transform contact centers into engines of customer intelligence and operational efficiency. The company is a Series C startup backed by Battery Ventures and ENIAC, based in Mountain View, California, with a globally distributed team.

Europe

  • Lead the investigation and resolution of complex infrastructure, networking, and platform-related incidents.
  • Provide technical leadership for Kubernetes platform operations and supporting infrastructure services.
  • Mentor and support AI Infrastructure & Platform Operations Engineers, sharing technical knowledge through documentation and training.

Mirantis helps organizations ship code faster on public and private clouds, providing a public cloud experience on any infrastructure from the data center to the edge. The company serves many of the world's leading enterprises, including Adobe, DocuSign, Liberty Mutual, and PayPal, and is a leader in container management.

US Unlimited PTO

  • Engineer security infrastructure across AWS and Kubernetes including telemetry pipelines, cryptographic lifecycle, and compliance automation.
  • Build and maintain agentic AI workflows using tools like Claude Code and MCP integrations to automate security engineering tasks.
  • Embed security controls into deployment pipelines and develop threat models that inform architecture decisions.

Lumin Digital creates cutting-edge digital banking solutions for credit unions and banks as a 100% cloud-native company. Their culture is built on trust, respect, and boldness in a fully remote environment.

US Canada

  • Build and maintain cloud infrastructure across GCP, Kubernetes, and Terraform.
  • Own CI/CD pipelines and deploy fully automated, locked-down systems.
  • Strengthen security, access control, and observability for a growing platform.

Gauntlet builds the financial systems of the future, operating across the entire stack to offer best-in-class vault products. The team serves over $1.5B in client TVL and brings together traditional finance and crypto-native expertise.