Source Job

Canada

  • Designing and implementing SLI/SLO frameworks with error budgets to guide reliability and performance decisions.
  • Building and maintaining AWS-based production infrastructure using Infrastructure as Code (Terraform, CloudFormation), including ECS, EKS/Kubernetes, and microservices orchestration.
  • Developing internal tools, automation frameworks, and reliability services in TypeScript, Python, or similar languages to enhance operational efficiency.

Terraform CloudFormation AWS TypeScript Python

20 jobs similar to Senior SRE DevOps Engineer

Jobs ranked by similarity.

Europe

  • Implement SLI/SLO frameworks with error budgets to drive reliability decisions
  • Design release strategies including blue/green deployments and version tracking
  • Lead incident response and develop automated runbooks to reduce MTTR

Jobgether is a company that helps connect individuals with jobs through an AI-powered matching process. They ensure applications are reviewed quickly, objectively, and fairly against roles' core requirements.

US Canada

  • Maintain tooling, libraries, and infrastructure leveraged by core service teams
  • Develop and maintain infrastructure services that enable engineers to manage, deploy, and scale systems
  • Act as a technical leader, guiding core service teams to design robust and reliable software

StackAdapt is a technology company that empowers marketers to reach, engage, and convert audiences with precision. They are an AI-powered platform connecting brand and performance marketing, recognized for their diverse workplace and high-performing campaigns.

$150,000–$167,000/yr
US

  • Lead reliability-focused design and readiness reviews.
  • Build, operate, and continuously improve our observability stack.
  • Own and evolve incident management practices.

Transcend is building the privacy platform that easily embeds privacy into your entire tech stack. They are growing quickly, backed by top-tier investors and are proud to serve some of the world's most iconic brands.

Global

  • Design, build, and maintain scalable backend services primarily using Python
  • Develop and operate cloud-native systems on AWS, ensuring reliability, security, and performance
  • Contribute to infrastructure design and automation using Terraform

Smart Working connects skilled professionals with global teams for full-time, long-term roles, breaking down geographic barriers. They value growth and well-being, fostering a genuine community and empowering individuals to thrive in a remote-first world.

  • Maximize the velocity of our product engineering team.
  • Ensure platform scalability, reliability, and security.
  • Champion best practices and shape the engineering culture.

They are building a robust, scalable trading platform to serve high-traffic, latency-sensitive applications. They leverage state-of-the-art technologies to support real-time trading while providing unparalleled reliability and performance.

US

  • Leverage infrastructure as code (Terraform) to build and maintain complex production and analytics workflows including networking and containerized services.
  • Rapidly diagnose and resolve faults in system services as part of a 24/7 on-call rotation focused on actionable alerting and eliminating toil.
  • Improve speed of delivery by developing and maintaining CI/CD pipelines.

Linus Health is a Boston-based digital health company transforming brain health worldwide. They combine cutting-edge neuroscience, clinical expertise, and AI to advance early detection and intervention for cognitive and brain disorders, empowering people to live longer, healthier lives. With 100+ team members and growing, they’re entering a phase of accelerated growth and looking for top talent to help shape their future.

Global

  • Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
  • Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
  • Metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.

Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices.

$120,000–$150,000/yr
US

  • Design, build, and maintain automated CI/CD pipelines to enable fast, secure, and reliable deployments.
  • Provision, manage, and optimize core AWS services to support scalable, highly available applications.
  • Implement and maintain IaC frameworks to ensure infrastructure is version-controlled, repeatable, and auditable.

Arine is a healthcare technology and clinical services company dedicated to ensuring individuals receive the safest and most effective treatment. They are backed by leading healthcare investors and collaborate with top healthcare organizations, managing more than 18 million lives across prominent health plans.

Europe

  • Own the reliability, scalability, and performance of Peec AI’s core systems and infrastructure
  • Design, build, and maintain the tooling, automation, and monitoring that keep our services fast, secure, and highly available
  • Partner closely with product and engineering teams to ensure new features are reliable, observable, and easy to operate from day one

Peec AI is one of Europe’s fastest-growing Series A startups (no employee count/culture details given). They provide exciting and challenging work in the AI space.

  • Design, develop, and implement platform solutions that enhance the reliability, security, and scalability of the Database Platform infrastructure.
  • Provide technical leadership in AWS cloud infrastructure, networking, CI/CD, and security for cloud infrastructure solutions.
  • Mentor and coach team members, fostering a culture of knowledge sharing, technical excellence, and continuous improvement.

SYSTABUILD is building a shared cloud and platform foundation for a group of leading software companies in the construction, CAD and ERP domain. They are looking for a Lead Cloud Infrastructure Engineer to take a key role in designing, operating, and evolving their central cloud infrastructure and platform services.

$80,300–$109,500/yr
Canada 3w PTO

  • Lead and mentor a team of DevOps engineers.
  • Design, implement, and manage scalable cloud infrastructure.
  • Automate and optimize infrastructure management tasks.

Rival Group is a forward-thinking, results-driven organization obsessed with helping innovative brands get closer to their customers. They have a fast-growing tech company with award-winning market research agency with offices in Chicago, Toronto, and Vancouver.

$150,000–$220,000/yr
US

  • Design, build, and maintain scalable cloud infrastructure with a focus on reliability, security, and performance
  • Define and manage infrastructure as code using tools like Terraform to ensure consistent and auditable environments
  • Develop, improve, and support CI/CD pipelines to accelerate development cycles and ensure high code quality

Medallion provides a provider operations platform to eliminate healthcare administrative bottlenecks. They are ranked No. 3 on Inc. Magazine’s 2024 Fastest-Growing Private Companies in the Pacific Region and have been featured on The Today Show.

Global Unlimited PTO

  • Build and maintain Infrastructure as Code to power our production systems, Python tools to automate toil, and monitoring systems to detect problems early.
  • Independently execute on large DevOps projects such as major migrations, product rollouts, and infrastructure enhancements
  • Participate in the infrastructure on-call rotation & incident response process, including triaging alerts, coordinating responders, and contributing to blame-free RCAs. Leverage senior level expertise to drive rapid resolutions.

Super.com aims to maximize the lives of both customers and employees, providing opportunities to unlock potential through learning and impact. They are a fast-paced, high-growth tech company that values career progression and supports employees through various programs.

US

  • Own and scale AWS and Kubernetes infrastructure.
  • Build and maintain CI/CD pipelines and infrastructure-as-code.
  • Lead observability and monitoring initiatives.

Truelogic is a nearshore staff augmentation services provider headquartered in New York. They deliver technology solutions to companies of all sizes, helping them achieve their digital transformation goals with a team of 600+ highly skilled tech professionals based in Latin America.

US

  • Design, build, and maintain our core cloud infrastructure on AWS/GCP using Infrastructure as Code.
  • Manage and scale our mission-critical services on Kubernetes, ensuring high availability and resilience.
  • Enhance and operate our CI/CD systems and developer tools within a GitLab-based workflow.

Mambu is a leading SaaS cloud banking platform that is on a mission to make banking better for a billion people. They empower customers to build innovative and secure financial products, and power billions of transactions for millions of end-users.

$113,082–$175,725/yr
Canada

  • Operate and maintain large-scale data systems, ensuring stability and performance.
  • Design, implement, and optimize deployment processes using virtualization.
  • Monitor system health, analyze failures, and identify instability sources.

Jobgether is a platform that uses AI-powered matching to connect candidates with companies. They ensure applications are reviewed quickly, objectively, and fairly, then share a shortlist of top candidates directly with the hiring company.

$100,000–$165,000/yr
Europe Latin America 3w PTO

  • You’ll lead the initial setup of our DevOps and platform engineering practices
  • You’ll design and deliver an internal platform for personal or feature environments to boost developer velocity
  • You’ll build and maintain AWS-based infrastructure for performance, scale, and security

DualEntry, founded in 2024, is a rapidly growing AI startup focused on revolutionizing the finance industry. Our AI-native ERP platform helps accounting teams achieve more with less effort, automating manual data entry using AI for businesses ranging from $5M-ARR to NYSE-listed companies.

Egypt

  • Deploy and maintain highly available AWS environments using Terraform.
  • Build and manage automated pipelines in GitLab CI or Jenkins, utilizing Ansible.
  • Manage the full lifecycle of containerized applications using Docker and Kubernetes.

Jobgether is a platform that connects job seekers with companies. They use an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly, identifying top-fitting candidates for hiring companies.

US Unlimited PTO 12w maternity 12w paternity

  • Design, implement, and maintain cloud-based infrastructure using AWS, Azure, or GCP.
  • Build, optimize, and manage continuous integration and continuous deployment (CI/CD) pipelines.
  • Integrate AI-powered tooling into engineering workflows to accelerate delivery and improve code quality.

Givebutter is a nonprofit fundraising and CRM platform. They empower millions to raise more, pay less, and give better by offering tools like fundraisers, donation forms, donor management, emails, and text blasts all in one place.

Global Unlimited PTO

  • Design, build, and operate cloud infrastructure for Polygon Labs’ payments platform.
  • Implement and maintain infrastructure as code using Terraform.
  • Partner with payments application engineers to define infrastructure requirements.

Polygon Labs is a global blockchain payments company building and operating infrastructure. They aim to move money instantly, reliably, and at internet scale, with the mission to move all money onchain. They are a fast-moving, remote-first team that values collaboration.