Source Job

Global Unlimited PTO

  • Improve the reliability, performance, and scalability of our production platform.
  • Operate reliable infrastructure, improve observability, and drive incident response.
  • Use data-driven reliability practices such as SLIs, SLOs, SLAs, and DORA metrics.

Terraform MongoDB Elasticsearch Redis Linux

20 jobs similar to Senior/Staff Platform Engineer

Jobs ranked by similarity.

$188,550–$212,150/yr
Global Unlimited PTO

  • Own the technical direction of Remote's SRE/Platform domain.
  • Define and drive the reliability strategy across the platform.
  • Identify and lead AI enablement initiatives across the engineering organisation.

Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.

US

  • Own and operate end-to-end infrastructure for backend services, frontend systems and databases.
  • Build and maintain reliable deployment workflows including CI/CD pipelines and rollback procedures.
  • Improve system-wide observability through metrics, logging, alerting, and monitoring to ensure uptime.

Jito Labs builds a high-performance trading terminal on Solana. They are a lean, high-output team building something that sits at the intersection of execution quality, user experience, and on-chain infrastructure.

$210,000–$278,000/yr
US Unlimited PTO

  • Architect future iterations of core systems, addressing scaling requirements.
  • Design and implement developer tools to enhance deployment safety and reproducibility.
  • Drive excellence in monitoring and guide incident response for quick issue resolution.

Found provides tools for self-employed individuals, offering a business bank account that automates taxes and expense tracking. They aim to give self-employed people the security and peace of mind historically available only at large corporations and are looking for kind, resourceful, and passionate people.

$113,850–$126,500/yr
Europe 5w PTO

  • Design, build, and maintain infrastructure using Infrastructure as Code tools such as Terraform.
  • Improve system reliability, scalability, resilience, and performance across the Mast platform.
  • Build systems and tooling that automate infrastructure management and operational workflows wherever possible.

Mast is on a mission to make complex lending simple by building modern, cloud-native lending technology purpose-built for specialist lenders. It is a high-performance team of engineers and lending experts that values radical honesty, transparency, and speed.

US Global

  • Performing day-to-day operational/DevOps tasks on Wikimedia’s public facing infrastructure.
  • Implementing and utilizing configuration management and deployment tools.
  • Leading continuous improvement by automating the installation, configuration, and maintenance of services on our platform.

The Wikimedia Foundation operates Wikipedia and other Wikimedia free knowledge projects with the vision of a world where every single human can freely share in the sum of all knowledge. As a charitable, not-for-profit organization, it relies on donations and has staff members based in 40+ countries.

Canada US 4w PTO

  • Lead and grow high-performing platform engineering teams that deliver reliable, scalable infrastructure and operational excellence for Vanta’s products and customers.
  • Set technical direction and drive multi-quarter platform initiatives spanning infrastructure reliability, security, scalability, and developer experience across shared systems and services.
  • Partner closely with product engineering, security, and engineering leadership to identify organizational needs and deliver scalable platform solutions.

Vanta helps businesses earn and prove trust by empowering companies to practice better security and prove it with ease. They have a kind and talented team, and while some have prior security experience, many have been successful without it.

Mexico Brazil

  • Lead the design, implementation, and ongoing improvement of reliable, scalable, performant, and secure production platforms and services.
  • Work closely with cross-functional teams to build and maintain resilient infrastructure and deployment patterns.
  • Provide technical leadership and mentorship to engineers across the organisation, promoting strong engineering standards and operational best practice.

Cision empowers individuals to make an impact and values diverse perspectives. They foster curiosity, collaboration, and innovation while driving meaningful contributions to brands; they have offices in 24 countries throughout the Americas, EMEA and APAC.

$29,000–$36,000/yr
India

  • Design, build, and maintain scalable, reliable systems on GCP.
  • Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager.
  • Manage incident response, conduct postmortems, and implement improvements to reduce recurrence.

SupplyHouse.com is an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004. They value every individual team member and cultivate a community where people come first with Generosity, Respect, Innovation, Teamwork, and GRIT.

US Unlimited PTO

  • Lead software engineering teams providing infrastructure-as-code to manage cloud infrastructure.
  • Hire experienced site reliability staff, and a line manager to grow and oversee the SRE team.
  • Establish design-before-build discipline; facilitate lightweight design documents, architectural decision records, and working group reviews.

Horizon3.ai is a cybersecurity company dedicated to enabling organizations to proactively find, fix, and verify exploitable attack vectors. They are a fast-growing company with a culture of respect, collaboration, ownership, and results.

US

  • Design, build, and operate core cloud infrastructure across compute, storage, databases, and networking layers.
  • Own and improve the reliability, scalability, and security of Valon’s production systems as we scale to support major enterprise deployments.
  • Evaluate, adopt, and operationalize new infrastructure technologies (e.g., Vitess, Clickhouse, Redis) to meet evolving product and scale requirements.

Valon is building the AI-native operating system for regulated finance, starting with mortgage servicing. They're a Series C company backed by a16z, transforming industries that others have written off as too complex to innovate.

US

  • Oversee a specialized SRE team focused on the design, deployment, and maintenance of automation toolsets.
  • Establish and enforce standards for IaC to ensure consistent, repeatable, and secure deployments.
  • Drive the automated lifecycle of both physical and virtual assets, from initial template creation/deployment to automated patching, scaling, and decommissioning.

Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress in finance and artificial intelligence. Led by CEO and Founder Michael Novogratz, their team blends deep crypto expertise with institutional experience and a shared commitment to shaping the future of Web3 and AI.

SRE

Fal
$180,000–$250,000/yr
US

  • Own and operate our Kubernetes infrastructure.
  • Build and maintain CI/CD pipelines and deployment infrastructure.
  • Leverage AI to automate analysis and resolution of production issues.

Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production.

Global

  • Deploy and maintain infrastructure using Terraform on AWS.
  • Operate and govern production-grade platforms running on Kubernetes / EKS.
  • Build and maintain CI/CD pipelines using GitHub Actions.

Muttdata is a dynamic startup committed to crafting innovative systems using cutting-edge Big Data and Machine Learning technologies. They are looking for a hands-on DevOps engineer to join a strategic initiative focused on deploying and operating Data & AI platforms.

$120,000–$170,000/yr
Global Unlimited PTO

  • Own and evolve Quansight's cloud infrastructure across AWS, Azure, and GCP.
  • Build, deploy, and maintain internal dashboards and reporting for operations and project management.
  • Lead infrastructure engagements for clients from scoping and architecture through delivery, upskilling client teams.

Quansight is rooted in the Python and PyData ecosystems. They provide services ranging from open-source software development to training and consulting, believing in a culture of do-ers, learners, and collaborators.

$205,000–$235,000/yr
US

  • Provide technical leadership for infrastructure, reliability, and observability.
  • Own the observability stack using Datadog and CloudWatch.
  • Design and evolve AWS infrastructure for reliability, security, scalability, and cost efficiency.

Topstep offers an engaging working environment that ranges from fully remote to hybrid. They foster a culture of collaboration by keeping cameras on during meetings and maintaining a robust Slack environment for communication.

$148,750–$201,250/yr
US Unlimited PTO

  • Designing and operating always-on product environments for customer demos, internal use, and stakeholder access.
  • Building feature branch / preview environments to support UX and rapid feedback loops.
  • Integrating core system components across Fleet Management, Edge Management, OS, and related services.

Defense Unicorns delivers mission value by streamlining software delivery. They are composed of innovators, software engineers, and veterans with decades of experience delivering technology programs across the federal market.

$160,000–$190,000/yr
US

  • Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture.
  • Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
  • Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.

Launch Potato is a digital media company that connects consumers with leading brands through data-driven content and technology. They are headquartered in South Florida with a remote-first team spanning over 15 countries, with a high-growth, high-performance culture.

$145,000–$250,000/yr
US Unlimited PTO

  • Construct infrastructure as code, developing and enforcing best practice across configurations while preventing drift between Terraform configurations and infrastructure deployments.

SentiLink provides innovative identity and risk solutions, empowering institutions and individuals to transact with confidence. They are building the future of identity verification in the United States, replacing a clunky, ineffective, and expensive status quo with solutions that are 10x faster, smarter, and more accurate.

$4,313–$5,391/mo
Europe

  • Own 5 AWS accounts across the organisation.
  • Architect and maintain infrastructure as code with Terraform.
  • Set up monitoring, alerting, and incident response.

We're a UK fintech building high-throughput digital infrastructure for the mortgage and property space. We recently acquired Trussle and are taking our platform to the next level. The company values innovation and building high-quality products.

India

  • Design, deploy, and manage Kubernetes-based platforms in production.
  • Implement and manage automation frameworks for infrastructure provisioning and operations.
  • Administer and optimize VMware environments (vSphere, ESXi, vCenter).

EPlus believes technology is a people business and delivers solutions that make a real difference. Their team is passionate, skilled, and driven, valuing collaboration, innovation, and extraordinary results, and is dedicated to fostering, cultivating, and preserving a culture that represents diversity and enables inclusion.