Source Job

$115,200–$172,800/yr
US 8w paternity

  • Build internal tooling to help other engineers and the rest of the company understand and operate our system.
  • Design and implement security best practices for our team and infrastructure.
  • Reduce toil through automation, including building and maintaining CI/CD infrastructure.

Terraform Python Go VueJS

20 jobs similar to Site Reliability Engineer II

Jobs ranked by similarity.

$29,000–$36,000/yr
India

  • Design, build, and maintain scalable, reliable systems on GCP.
  • Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager.
  • Manage incident response, conduct postmortems, and implement improvements to reduce recurrence.

SupplyHouse.com is an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004. They value every individual team member and cultivate a community where people come first with Generosity, Respect, Innovation, Teamwork, and GRIT.

$165,000–$165,000/yr
US

  • Design, build, and maintain scalable cloud infrastructure services in AWS and GCP.
  • Contribute production-quality Go and Python code to existing cloud services.
  • Develop and own automation and software deployment pipelines with maximum efficiency.

Dragos is dedicated to arming customers with best-in-class technology, threat intelligence, and services to protect their systems. They embody core values of authenticity, transparency, and trust and are a remote-first culture with operations in North America, Europe, the Middle East, and APAC.

US

  • Oversee a specialized SRE team focused on the design, deployment, and maintenance of automation toolsets.
  • Establish and enforce standards for IaC to ensure consistent, repeatable, and secure deployments.
  • Drive the automated lifecycle of both physical and virtual assets, from initial template creation/deployment to automated patching, scaling, and decommissioning.

Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress in finance and artificial intelligence. Led by CEO and Founder Michael Novogratz, their team blends deep crypto expertise with institutional experience and a shared commitment to shaping the future of Web3 and AI.

US Unlimited PTO

  • Design, build, and maintain secure CI/CD pipelines supporting cloud-native applications and services.
  • Implement Infrastructure as Code using tools such as Terraform to provision and manage cloud resources.
  • Integrate security controls and best practices into the software development lifecycle (DevSecOps).

540 is a forward-thinking company that the government turns to in order to #getshitdone. They break down barriers, build impactful technology, and solve mission-critical problems.

$160,000–$200,000/yr
US

  • Drive the stability and reliability of Epic's GCP infrastructure.
  • Manage and harden our Docker and GKE container platform.
  • Maintain and improve CI/CD pipelines.

Epic is the leading digital reading platform for kids ages 12 and under, used by millions of children, families, and educators around the world. As Epic continues to grow, we are reimagining what reading can be through thoughtful technology, data, and global collaboration to make learning more engaging, accessible, and impactful.

$115,000–$130,000/yr
US Unlimited PTO

  • Develop and maintain scalable automation and integrations across cloud platforms and services.
  • Design, implement, and operate CI/CD pipelines using Jenkins, Dagger, Terraform, and Docker.
  • Build, operate, and troubleshoot workloads on Kubernetes, using Kustomize and Helm.

People Inc. is America’s largest digital and print publisher. Our brands harness the best intent-driven content, the fastest sites, and the fewest ads to help nearly 200 million people every month make decisions.

$188,550–$212,150/yr
Global Unlimited PTO

  • Own the technical direction of Remote's SRE/Platform domain.
  • Define and drive the reliability strategy across the platform.
  • Identify and lead AI enablement initiatives across the engineering organisation.

Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.

US

  • Responsible for overall health, availability, performance, security, cost and day-to-day operations of the GCP platform and toolset.
  • Build and maintain Azure DevOps pipelines for infrastructure and application deployment.
  • Design, implement, maintain, operate GCP infrastructure across DEV, QA, STAGE, PROD etc.

Resultant is a consulting firm that helps clients make technology a strategic asset and use data to guide better decisions. They employ over 350 team members who operate remotely and from offices and hubs around the United States.

$120,000–$170,000/yr
Global Unlimited PTO

  • Own and evolve Quansight's cloud infrastructure across AWS, Azure, and GCP.
  • Build, deploy, and maintain internal dashboards and reporting for operations and project management.
  • Lead infrastructure engagements for clients from scoping and architecture through delivery, upskilling client teams.

Quansight is rooted in the Python and PyData ecosystems. They provide services ranging from open-source software development to training and consulting, believing in a culture of do-ers, learners, and collaborators.

US Unlimited PTO

  • Work with IaC tools like Terraform to ensure configurations are steady and change-managed.
  • Design and deploy endpoint security measures aligned with industry standards.
  • Ensure a strong security posture for corporate SaaS applications by configuring vendor capabilities.

OnePay is a consumer fintech company trusted by millions of Americans to make money better, providing an all-in-one financial services platform. Backed by Walmart and Ribbit Capital, they offer banking, savings, credit cards, lending, investing, and crypto services.

$127,160–$205,700/yr
North America Unlimited PTO

  • Own the delivery of developer platform capabilities end-to-end, including design, implementation, rollout, and iteration.
  • Build and evolve paved roads that make it easy to deploy, operate, and scale services.
  • Drive improvements to GitOps workflows and harden CI/CD to improve pipeline performance and developer ergonomics.

Phaidra is building the future of industrial automation with AI-powered control systems. They are a 100% remote company with employees located throughout the USA, Canada, UK, Sweden, Spain, Portugal, the Netherlands, Singapore, Australia, and India.

$145,000–$250,000/yr
US Unlimited PTO

  • Construct infrastructure as code, developing and enforcing best practice across configurations while preventing drift between Terraform configurations and infrastructure deployments.

SentiLink provides innovative identity and risk solutions, empowering institutions and individuals to transaction with confidence. They are building the future of identity verification in the United States replacing a clunky, ineffective, and expensive status quo with solutions that are 10x faster, smarter, and more accurate.

Europe

  • Design, implement, and maintain GCP Landing Zone components.
  • Manage infrastructure using Terraform and CI/CD pipelines.
  • Configure and troubleshoot networking (VPC, VPN, routing, DNS).

Software Mind develops impactful solutions for companies globally, working with tech giants and on transformative projects. They foster a culture of openness, respect, and enjoyment, building cross-functional engineering teams that value passion and creativity.

$113,850–$126,500/yr
Europe 5w PTO

  • Design, build, and maintain infrastructure using Infrastructure as Code tools such as Terraform.
  • Improve system reliability, scalability, resilience, and performance across the Mast platform.
  • Build systems and tooling that automate infrastructure management and operational workflows wherever possible.

Mast is on a mission to make complex lending simple by building modern, cloud-native lending technology purpose-built for specialist lenders. It is a high-performance team of engineers and lending experts that values radical honesty, transparency, and speed.

$160,000–$190,000/yr
US

  • Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture.
  • Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
  • Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.

Launch Potato is a digital media company that connects consumers with leading brands through data-driven content and technology. They are headquartered in South Florida with a remote-first team spanning over 15 countries, with a high-growth, high-performance culture.

US

  • Own and operate end-to-end infrastructure for backend services, frontend systems and databases.
  • Build and maintain reliable deployment workflows including CI/CD pipelines and rollback procedures.
  • Improve system-wide observability through metrics, logging, alerting, and monitoring to ensure uptime.

Jito Labs builds a high-performance trading terminal on Solana. They are a lean, high-output team building something that sits at the intersection of execution quality, user experience, and on-chain infrastructure.

Unlimited PTO

  • Assess and improve visibility by identifying gaps in dashboards, metrics, and logs.
  • Refine alerts and dashboards for critical services to catch issues earlier.
  • Automate routine checks and monitoring tasks to free up engineers.

PlayOn is where high school sports come to life through platforms like GoFan, NFHS Network, and MaxPreps. As a growth-stage company backed by KKR, we build the technology that powers high school athletics from ticketing and streaming to fundraising and merchandise.

US Unlimited PTO

  • Lead software engineering teams providing infrastructure-as-code to manage cloud infrastructure.
  • Hire experienced site reliability staff, and a line manager to grow and oversee the SRE team.
  • Establish design-before-build discipline; facilitate lightweight design documents, architectural decision records, and working group reviews.

Horizon3.ai is a cybersecurity company dedicated to enabling organizations to proactively find, fix, and verify exploitable attack vectors. They are a fast-growing company with a culture of respect, collaboration, ownership, and results.

Global

  • Build and maintain our host provisioning stack to bring new bare metal online quickly and confidently.
  • Evolve our homegrown orchestration engine to manage clusters, containers, and VMs.
  • Build out internal observability and alerting so we catch fleet problems before customers feel them.

Railway's core mission is to make software engineers higher leverage. They provide powerful tools so engineers can spend less time setting up and more time doing. The team is small, with high ownership, and they are passionate about being exceptional.

Germany

  • Build and maintain end-to-end observability with ELK, Prometheus, and Grafana.
  • Own and improve CI/CD pipelines (CircleCI, GitLab CI, GitHub Actions, ArgoCD).
  • Lead incident response and postmortems in a blameless culture.

Redcare Pharmacy is Europe’s No.1 e-pharmacy, powered by passionate teams and cutting-edge innovation. They strive to create a healthy, collaborative work environment where every employee feels valued and inspired to contribute to their vision “Until every human has their health”.