Source Job

Global

  • You will plan and execute infrastructure deployments, using automation to ensure a stable platform.
  • You will manage operations, troubleshoot, and optimize workflows to maintain high availability.
  • You will own backend features supporting our platforms and interface with users for feedback.

AWS Kubernetes Terraform Go Security

20 jobs similar to SRE Engineer

Jobs ranked by similarity.

$165,000–$195,000/yr
US

  • Support and operate Legion’s AWS-based cloud platform and Kubernetes (EKS) environments.
  • Build and maintain infrastructure-as-code using Terraform.
  • Improve CI/CD pipelines to increase deployment safety and velocity.

Legion Technologies delivers the industry’s most innovative workforce management platform. The AI-driven Legion WFM platform maximizes labor efficiency and employee engagement. They are a remote, mission-driven team that embraces a collaborative, fast-paced, and entrepreneurial culture.

US EMEA

  • Design and implement the complex distributed infrastructure that powers our core AI engine and distributed analysis systems.
  • Tune and optimize cloud services across compute, storage, networking, and observability to drive performance and reliability.
  • Develop our core services, written in TypeScript, Kotlin and Go to support our unique deployment and infrastructure requirements.

XBOW is building the future of offensive security. They create the platform that puts security ahead in the arms race, using AI to autonomously discover, validate, and exploit vulnerabilities. Founded by Oege de Moor, the company is backed by Sequoia, Altimeter, and other leading investors.

Europe

  • Maintaining and updating Glia’s core infrastructure.
  • Troubleshooting and resolving infrastructure-related issues.
  • Improving our security posture.

Glia provides AI customer service solutions for banks and credit unions, unifying AI and human agents across all conversations via their ChannelLess® Architecture. They are valued at over $1 billion, have been named a Deloitte Technology Fast 500™ company for five years, power over 700 financial institutions, and maintain an industry-leading NPS of 72.

US Global

  • Automate infrastructure provisioning and configuration using Infrastructure-as-Code (Terraform)
  • Develop, implement, and optimize CI/CD pipelines (GitHub Actions, ArgoCD)
  • Manage Kubernetes clusters (EKS, GKE)

Verve For Advertisers empowers brands and agencies to connect moments of discovery and drive measurable outcomes across screens. They bring together the largest on-site search intent dataset outside of walled gardens, direct SDK integrations with top apps, and data partnerships with 3M+ websites and LLMs.

Global

  • Build Reliable Cloud Infrastructure: Implement and maintain AWS infrastructure using Terraform across EKS, Lambda, EC2, and S3.
  • Improve Developer Workflows: Contribute to CI/CD pipelines, starter kits, and internal tooling that reduce manual effort and improve deployment confidence.
  • Strengthen Observability & Operations: Add monitoring, logging, and alerting (DataDog) to platform services and participate in an on-call rotation.

Spreetail helps brands increase their ecommerce market share globally while improving operational costs. They are building one of the fastest-growing ecommerce companies in history with a focus on innovation.

Europe

  • Implement SLI/SLO frameworks with error budgets to drive reliability decisions
  • Design release strategies including blue/green deployments and version tracking
  • Lead incident response and develop automated runbooks to reduce MTTR

Jobgether is a company that helps connect individuals with jobs through an AI-powered matching process. They ensure applications are reviewed quickly, objectively, and fairly against roles' core requirements.

Europe 5w PTO

  • Responsible for security and integrity of the underlying infrastructure, safeguarding the platform from potential vulnerabilities, threats, and attacks.
  • Developing and maintaining tools for Global Security in order to deliver vulnerability management platforms for application triaging and continuous compliance.
  • Making sure that the platform is compliant with the best industry practices and standards for security (ISO27001, C5, SOC2).

Docplanner empowers patients by letting them leave and read reviews about their visits, and provides doctors with the technology to manage bookings easily and save time. They are a leader in 13 countries with over 2,900 employees globally and maintain a startup mindset with a diverse group of ~325 people in Docplanner Tech.

$205,000–$270,000/yr
US Unlimited PTO

  • Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
  • Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
  • Focus on automation so we can spend energy where it matters.

Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices.

$120,000–$140,000/yr
US Unlimited PTO

  • Architect and manage scalable cloud infrastructure within AWS.
  • Implement and maintain infrastructure using Terraform.
  • Develop automation scripts to improve operational efficiency.

Attune empowers insurance agents with their technology solutions. They foster a remote-first culture and value employee development.

$230,000–$250,000/yr
US Unlimited PTO 12w paternity

  • Define and evolve reliability standards for the SmarterDx platform.
  • Enhance observability systems (metrics, logs, traces, alerting) to provide actionable insights and reduce mean time to detect (MTTD) and resolve (MTTR).
  • Reduce operational toil through automation, self-healing systems, and improved deployment and rollback mechanisms.

SmarterDx, a Smarter Technologies company, builds clinical AI that is transforming how hospitals translate care into payment. Founded by physicians in 2020, their platform connects clinical context with revenue intelligence, helping health systems recover millions in missed revenue, improve quality scores, and appeal every denial.

South America

  • Own the end‑to‑end lifecycle of core platform components, including cloud infrastructure primitives and Kubernetes clusters.
  • Design platform components to be resilient by default, applying SRE principles like fault isolation and capacity planning.
  • Drive Infrastructure‑as‑Code and GitOps‑first practices to ensure platform components are reproducible and auditable.

Pismo, founded in 2016, provides a comprehensive processing platform for banking, card issuing, and financial market infrastructure, helping customers innovate in banking and payments. With over 500 employees across 10+ countries, Pismo joined Visa in 2024, leveraging Visa’s solutions to advance financial technology.

US Canada Ireland UK Mexico Argentina

  • Perform infrastructure security reviews across cloud services, network design, IAM, and platform components.
  • Design and build internal security services, APIs, and tools that automate infrastructure vulnerability detection, triage, reporting, and remediation.
  • Develop security automation that integrates with CI/CD, cloud control planes, and developer workflows to shift detection and remediation earlier in the lifecycle.

Webflow is building the world’s leading AI-native Digital Experience Platform as a remote-first company. They empower teams to design, launch, and optimize for the web without barriers, from entrepreneurs to global enterprises, and believe the future of the web, and work, is more open, more creative, and more equitable.

Global

  • Own the end-to-end lifecycle (design, provisioning, upgrades, and decommissioning) of core platform components.
  • Lead the design and implementation of infrastructure bootstrap orchestration, including automated cluster and environment provisioning.
  • Apply and promote SRE practices across the platform, including clear ownership and runbooks for platform components.

Pismo provides a comprehensive processing platform for banking, card issuing and financial market infrastructure and helps customers innovate and build the next generation of banking and payment solutions. Pismo’s 500+ employees are located in more than 10 countries around the world.

Europe 5w PTO

  • Responsible for security and integrity of the underlying infrastructure.
  • Developing and maintaining tools for Global Security.
  • Optimize system scalability and cost efficiency.

Docplanner empowers patients by letting them leave and read reviews about their visits. They provide doctors with technology to manage bookings easily and save time. Docplanner employs over 2,900 people globally and has maintained a startup mindset.

Canada

  • Designing and implementing SLI/SLO frameworks with error budgets to guide reliability and performance decisions.
  • Building and maintaining AWS-based production infrastructure using Infrastructure as Code (Terraform, CloudFormation), including ECS, EKS/Kubernetes, and microservices orchestration.
  • Developing internal tools, automation frameworks, and reliability services in TypeScript, Python, or similar languages to enhance operational efficiency.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

Europe Unlimited PTO 13w maternity 9w paternity

  • Collaborate with cross-functional teams to translate product requirements into technical solutions.
  • Develop and maintain core services for Chainguard.
  • Practice continuous improvement by iterating on how services are deployed, configured, monitored, and maintained on our platform.

Chainguard provides a secure foundation for software development and deployment by offering guarded open source software that is built from source and continuously updated. Founded by industry experts, they aim to be the safe source for open source and have built the largest library of open source software that is secure by default.

Europe

  • Design, deploy, and manage cloud infrastructure.
  • Build and maintain ETL pipelines.
  • Develop and manage APIs, databases, and middleware.

The Starknet Foundation stewards Starknet, a permissionless validity rollup scaling blockchains. They pioneered ZK-STARK technology and are entering a new era settling on both Bitcoin and Ethereum, aiming to build a unified execution layer for secure assets.

Europe

  • Build and maintain CI/CD pipelines and GitOps workflows across a diverse set of engineering teams.
  • Own observability — monitoring, alerting, logging — and support development teams in instrumenting their services.
  • Optimise infrastructure for security, cost, performance and reliability.

1inch is a decentralized finance (DeFi) platform. They empower users to access the best rates and execute efficient and secure trades across multiple liquidity sources.

Global Unlimited PTO

  • Build and maintain Infrastructure as Code to power our production systems, Python tools to automate toil, and monitoring systems to detect problems early.
  • Independently execute on large DevOps projects such as major migrations, product rollouts, and infrastructure enhancements.
  • Participate in the infrastructure on-call rotation & incident response process, including triaging alerts, coordinating responders, and contributing to blame-free RCAs. Leverage senior-level expertise to drive rapid resolutions.

Super.com aims to maximize the lives of both customers and employees, providing opportunities to unlock potential through learning and impact. They are a fast-paced, high-growth tech company that values career progression and supports employees through various programs.

Europe

  • Design and maintain scalable, fault-tolerant infrastructure that supports our SaaS platform and keeps pace with business growth.
  • Define, document, and maintain SLIs, SLOs, and SLAs in partnership with product engineering, translating business commitments into technical guardrails.
  • Lead incident response with steady judgment, facilitate blameless postmortems, and drive remediation efforts that prevent recurrence.

Fixify is on a mission to reimagine how IT teams support companies. They need a Senior Site Reliability Engineer who finds joy in building systems that fade into the background, empowering product engineers to ship with confidence and their customers to work without interruption.