Jobs Similar to Site Reliability Engineer II

Senior Site Reliability Engineer

Transcend 20 days ago

$150,000–$167,000/yr

US

Lead reliability-focused design and readiness reviews.
Build, operate, and continuously improve our observability stack.
Own and evolve incident management practices.

Transcend is building the privacy platform that easily embeds privacy into your entire tech stack. They are growing quickly, are backed by top-tier investors, and are proud to serve some of the world's most iconic brands.


Site Reliability Engineer

Infiterra 16 days ago

Global

Maintain and continuously improve production uptime, supporting our ≥99.9% target for 2026.
Monitor systems proactively and respond effectively to production incidents.
Drive improvements in MTTR (Mean Time to Resolution).

Infiterra's B2B SaaS platform simplifies subscription service delivery, helping IT Distributors and Managed Service Providers (MSPs) automate and grow their subscription business. With 100+ customers in 75 countries, Infiterra is known for its collaborative and growth-oriented culture.


Senior DevOps & Platform Engineer

About Us 5 days ago

Maximize the velocity of our product engineering team.
Ensure platform scalability, reliability, and security.
Champion best practices and shape the engineering culture.

They are building a robust, scalable trading platform to serve high-traffic, latency-sensitive applications. They leverage state-of-the-art technologies to support real-time trading while providing unparalleled reliability and performance.


Senior Site Reliability Engineer

Enumerate 17 days ago

$4,000–$5,000/mo

Latin America

Design and evolve production environments, define standards and best practices.
Partner with engineering and IT teams to build scalable, reliable systems.
Lead incident response practices, and set guardrails around security, reliability, and cost management.

They are looking for a Senior Site Reliability Engineer who can own the architecture, governance, and cost efficiency of their cloud and platform infrastructure. This is a remote contractor role for candidates located in LATAM.


Site Reliability Operations

Truelogic 28 days ago

US

Lead incident response as Incident Commander, coordinating teams, communications, and service restoration
Produce executive-level incident reports, run RCAs, and drive continuous improvement
Enforce change management and risk assessment for production changes

Truelogic is a leading provider of nearshore staff augmentation services headquartered in New York, delivering top-tier technology solutions to companies of all sizes. Their team of 600+ highly skilled tech professionals, based in Latin America, drives digital disruption by partnering with U.S. companies on their most impactful projects.


Site Reliability Engineer

Peec AI 25 days ago

Europe

Own the reliability, scalability, and performance of Peec AI’s core systems and infrastructure
Design, build, and maintain the tooling, automation, and monitoring that keep our services fast, secure, and highly available
Partner closely with product and engineering teams to ensure new features are reliable, observable, and easy to operate from day one

Peec AI is one of Europe’s fastest-growing Series A startups. They provide exciting and challenging work in the AI space.


Senior Software Engineer - Grafana Databases, SRE

Grafana Labs 10 days ago

Europe 6w PTO

Partner closely with product engineering squads (embedded model)
Own production reliability for high-SLA and complex customer environments
Design and implement automation to scale our reliability practices

Grafana Labs is a remote-first, open-source powerhouse that helps more than 3,000 companies manage their observability strategies. They are scaling fast and staying true to what makes them different: an open-source legacy, a global collaborative culture, and a passion for meaningful work.


Site Reliability Engineer

Jobgether 19 days ago

LATAM

Monitor production systems, dashboards, logs, and alerts to ensure high availability and performance across distributed environments.
Assist in incident detection, triage, escalation, and resolution, following structured on-call rotations with mentorship support.
Maintain, follow, and continuously improve runbooks, operational procedures, and incident response workflows.

Jobgether is a platform that helps job seekers find the right opportunities. They use an AI-powered matching process to ensure applications are reviewed quickly and fairly.


Site Reliability Engineer

Moniepoint 13 days ago

Nigeria

Detect and triage service and reliability issues.
Develop automation to eliminate manual and repetitive operational tasks.
Investigate and resolve customer complaints escalated beyond L1 and L2 support.

Moniepoint is an all-in-one financial services platform for emerging markets. Since 2019, Moniepoint’s technology has powered over 3 million people, offering personal and business banking, payment, credit and business management tools to help them succeed.


Staff Site Reliability Engineer

Juniper Square 28 days ago

US Canada Europe Asia

Automate the provisioning of all of Juniper Square’s infrastructure in code.
Partner with our Platform Engineering team on building developer tooling / improving developer experiences via joint initiatives and enhancements.
Partner with our Data Engineering team on improving our data posture and driving operational excellence.

Juniper Square's mission is to unlock the full potential of private markets by digitizing them to bring efficiency, transparency, and access. They are a values-driven organization with a hybrid workplace strategy, allowing employees to collaborate effectively across multiple countries and offering physical offices in several major cities.


Staff Software Engineer - Grafana Cloud k6

Grafana Labs 5 days ago

$174,986–$209,983/yr

US 6w PTO

Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.
Drive mature DevOps/SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
Guide teams in the design, development, evolution, and operation of large-scale, distributed cloud systems.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana, the open source visualization tool, around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack.


Senior Site Reliability Engineer

Jobgether 26 days ago

$113,082–$175,725/yr

Canada

Operate and maintain large-scale data systems, ensuring stability and performance.
Design, implement, and optimize deployment processes using virtualization.
Monitor system health, analyze failures, and identify instability sources.

Jobgether is a platform that uses AI-powered matching to connect candidates with companies. They ensure applications are reviewed quickly, objectively, and fairly, then share a shortlist of top candidates directly with the hiring company.


Infrastructure Engineer IV

HackerOne 18 days ago

$165,000–$200,000/yr

US Unlimited PTO

Contribute to building and operating the infrastructure that supports the HackerOne platform.
Improve the reliability, security, and scalability of our systems.
Design and operate highly available cloud systems and apply best practices for reliability, observability, and security.

HackerOne is a global leader in Continuous Threat Exposure Management (CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world’s largest community of security researchers to continuously discover, validate, prioritize, and remediate exposures across code, cloud, and AI systems. It is trusted by the world’s top organizations.


Sr Site Reliability Engineer

Dataiku 18 days ago

Europe Middle East Africa

Design, deploy and maintain a cloud infrastructure to support a Dataiku SaaS offering, mainly on AWS, Azure, and GCP
Continuously improve the infrastructure, deployment and configuration to deliver more reliable, resilient, scalable and secure services
Automate all technical operations as much as possible

Dataiku is The Universal AI Platform™, giving organizations control over their AI talent, processes, and technologies to unleash the creation of analytics, models, and agents. They connect many data science technologies and integrate the best of data and AI tech.


Sr. DevOps Engineer

Jobgether 3 days ago

Europe

Implement SLI/SLO frameworks with error budgets to drive reliability decisions
Design release strategies including blue/green deployments and version tracking
Lead incident response and develop automated runbooks to reduce MTTR

Jobgether is a company that helps connect individuals with jobs through an AI-powered matching process. They ensure applications are reviewed quickly, objectively, and fairly against roles' core requirements.


Senior SRE DevOps Engineer

Jobgether 1 day ago

Canada

Designing and implementing SLI/SLO frameworks with error budgets to guide reliability and performance decisions.
Building and maintaining AWS-based production infrastructure using Infrastructure as Code (Terraform, CloudFormation), including ECS, EKS/Kubernetes, and microservices orchestration.
Developing internal tools, automation frameworks, and reliability services in TypeScript, Python, or similar languages to enhance operational efficiency.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates, and this shortlist is then shared directly with the hiring company.


Site Reliability Engineering Manager

Customer.io 23 days ago

$175,000–$195,000/yr

Americas Unlimited PTO 16w maternity

Lead effective squad rituals and ensure production readiness.
Partner with engineers to ensure solutions are scalable, architecturally sound, flexible, and secure.
Provide timely, specific coaching and development opportunities for your direct reports.

Customer.io's platform allows over 8,000 companies to send messages using real-time behavioral data. Their team uses Go, React, Ember, and AI to ship fast and scale with confidence, and they value ownership, leadership, and healthy skepticism.


Senior AWS DevOps Engineer (Remote, Poland) Contract

Nearform 9 days ago

Europe

Developing infrastructure to support cloud-based applications.
Creating deployment architecture and continuous delivery pipelines.
Designing high-availability approaches, and implementing monitoring architecture.

Nearform is a digital and AI engineering consultancy with a reputation for experience-led modernization. They focus on creating transformative digital products for enterprise customers across the UK and Ireland. Nearformers form a close-knit community built on trust and camaraderie.


Staff Site Reliability Engineer, DevOps

Pismo 2 days ago

Global

Lead the implementation and optimization of CI/CD pipelines.
Develop and maintain Infrastructure as Code (IaC) scripts to automate infrastructure provisioning and management.
Identify and implement automation opportunities to improve efficiency and reduce manual effort.

Pismo, founded in 2016, provides a comprehensive processing platform for banking, card issuing, and financial market infrastructure, helping customers innovate and build next-generation banking and payment solutions. Pismo joined Visa in 2024 and has over 500 employees in more than 10 countries.


Infrastructure Engineer/SRE

Cresta 21 days ago

Global

Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
Provide metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.

Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices.

