Jobs Similar to Senior Site Reliability Engineer - Infrastructure | TangerineFeed

Senior Site Reliability Engineer - Infrastructure

Underdog 1 day ago

$160,000–$240,000/yr

US Unlimited PTO 11w maternity

Own and maintain the incident response process, including defining procedures, tools, and best practices
Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems
Lead capacity planning initiatives, focusing on both short and long-term scalability while optimizing costs

AWS Kubernetes PostgreSQL Datadog

20 jobs similar to Senior Site Reliability Engineer - Infrastructure

Jobs ranked by similarity.

Sr. Infrastructure Engineer

VGS 24 days ago

$140,000–$190,000/yr

US Canada Unlimited PTO

Architect and maintain scalable, reliable infrastructure: Design and optimize infrastructure for high availability, fault tolerance, and performance across distributed systems.
Lead incident management and root cause analysis: Own incident response processes, ensure swift resolution of issues, and drive post-incident improvements to prevent recurrences.
Service monitoring and automation: Build and maintain automated monitoring, alerting, and healing systems that improve system health, reduce manual intervention, and minimize downtime.

VGS is the world's leader in payment tokenization, empowering clients and partners by tokenizing sensitive payment data and limiting compliance scope. They embed a universal token vault into their technology stack to manage the complexities of payment data tokenization across processors and networks and more. While the job posting doesn't specify size, they appear to have a culture that values transparency, collaboration, grit, and humility.

View details Similar jobs

Senior Site Reliability Engineer

DexCare 21 days ago

$125,000–$169,000/yr

Unlimited PTO

Design, scale, and operate resilient, cloud-native infrastructure in AWS with an emphasis on EKS, IAM, RBAC, and modern security-first practices.
Build and optimize CI/CD pipelines with GitHub Actions and GitHub Advanced Security enabling velocity without compromising safety.
Own observability across the stack using Datadog (metrics, logging, alerting, and tracing).

DexCare optimizes time in healthcare, streamlining patient access, reducing waits, and enhancing overall experiences. They are committed to creating an inclusive workplace where diversity drives innovation and belonging strengthens collaboration, enabling everyone to thrive.

View details Similar jobs

Staff Database Reliability Engineer

Boulevard 8 days ago

US Canada Unlimited PTO

Own and improve database reliability, performance, and scalability; participate in incident response.
Partner with engineering teams to design, build, and operate scalable, fault-tolerant, and secure distributed systems.
Build tools, automation, and frameworks that eliminate toil, reduce operational overhead, and establish best practices.

Boulevard provides a client experience platform for appointment-based, self-care businesses, empowering customers to enhance client interactions. They value diversity, curiosity, and simple solutions, fostering an inclusive and open environment for employees to perform their best work.

View details Similar jobs

Site Reliability Engineer

Truelogic 20 days ago

US

Designs, implements, and continuously improves observability strategies across services.
Focuses on understanding system behavior in production, identifying failure modes, performance bottlenecks, and reliability risks.
Evolves and maintains shared AWS CDK and CDK8s constructs, with emphasis on observability, autoscaling, and operational safeguards.

Truelogic is a leading provider of nearshore staff augmentation services. They have a team of 600+ highly skilled tech professionals based in Latin America, partnering with U.S. companies on impactful projects and valuing expertise and aspirations.

View details Similar jobs

Senior Site Reliability Engineer

Clarifai 17 hours ago

US

Ensure the smooth operation and high availability of Clarifai's core services
Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency
Design and implement scalable, secure, and cost-effective infrastructure solutions

Clarifai is a leading AI platform specializing in computer vision and generative AI, empowering organizations to transform unstructured data into actionable insights. Founded in 2013, they have a diverse, globally distributed team with $100M in funding and are committed to building a diverse and inclusive team.

View details Similar jobs

Head of Reliability and Operations

RWS 27 days ago

Europe

Lead the Reliability & Operations function within the Developer & Production Enablement (DPE) division of RWS’s Product & Technology organization. Take ownership of global production operations and lead the transition from manual, ticket-based workflows to platform-integrated automation. Ensure stability today, while designing for scalability and autonomy in the future.

RWS's purpose is to unlock global understanding, valuing every language and culture, and celebrating diversity and inclusion to make the company strong.

View details Similar jobs

Intermediate Site Reliability Engineer, Tenant Scale

GitLab 5 days ago

Americas EMEA Unlimited PTO

Design and implement highly scalable infrastructure for GitLab.com to support current and future growth.
Collaborate with cross-functional teams across the Infrastructure organization to plan and deliver projects that shape GitLab’s platform direction.
Operate and improve edge services and Kubernetes workloads, acting as a subject matter expert within the infrastructure department.

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. They aim to enable everyone to contribute to and co-create the software that powers our world.

View details Similar jobs

Site Reliability Engineer

Miris 5 days ago

$89,155–$287,488/yr

Global

Configure and maintain cloud infrastructure automation using Terraform, focusing on CDN optimization and content delivery performance
Develop capacity planning strategies and performance optimization initiatives for high-volume spatial content delivery.
Instrument services to understand system health.

Miris is a cutting-edge technology company building the future of 3D content delivery at global scale. Our mission is to empower creators and developers to deliver high-fidelity, photorealistic 3D experiences to billions of users instantly, seamlessly, and across all major platforms and devices.

View details Similar jobs

Senior Manager, Infrastructure

Wealthsimple 10 days ago

North America

Own the strategy and execution for Runtime Platform.
Set the technical direction, build and develop the team, and are accountable for outcomes.
Translate product needs into platform capabilities and building trust through consistent delivery.

Wealthsimple aims to help everyone achieve financial freedom by reimagining how people manage their money. As the largest fintech company in Canada, it has over 3+ million users and manages more than $100 billion in assets, fostering inclusive and high-performing teams.

View details Similar jobs

Senior Platform Engineer

Fanvue 19 days ago

Unlimited PTO

Build and evolve the infrastructure foundations that support Fanvue’s move toward a service-oriented architecture
Enable stream teams to deploy and operate services independently using platform-provided tooling and patterns
Design and maintain AWS infrastructure using AWS CDK (TypeScript), with a strong focus on safety, reuse, and automation

Fanvue is the fastest-growing creator monetisation platform in the creator economy. We are the leading AI-powered creator-first platform, designed to empower creators worldwide to directly monetise their audience.

View details Similar jobs

Staff DevOps Infrastructure Engineer

NMI 17 days ago

$155,000–$165,000/yr

US Unlimited PTO

Lead maintenance and operations for production and development environments.
Architect and implement complex solutions spanning OS, virtualization, network, and cloud layers.
Lead automation initiatives for infrastructure provisioning and operational tasks.

NMI enables partners with choice in payments, challenging the one-size-fits-all approach. They power innovative tech for SMBs, entrepreneurs, and fintech startups, fostering a diverse and welcoming workplace with a dedicated Diversity, Equity & Inclusion action group.

View details Similar jobs

DevOps Engineer

True 24 days ago

US Unlimited PTO

Implement and maintain observability tools and dashboards using [e.g., AWS CloudWatch, Datadog, Sentry, OpenTelemetry].
Assist with cloud cost visibility and optimization, analyze infrastructure usage patterns to identify waste and implement aggressive tagging strategies.
Manage the tooling and processes for deploying applications to AWS EKS / Kubernetes / ECS / Serverless and facilitate modern deployment strategies.

True is a global platform of companies that optimizes value creation by placing executive talent, developing business leaders, creating diverse and inclusive networks, and using innovative technology to advance executive talent priorities. True was founded on the belief that doing good is the pathway to doing well and their growth and success are a by-product of their values treating people right, listening to new ideas and keeping culture at the heart of their business.

View details Similar jobs

Senior DevOps Engineer/SRE

Cision 14 days ago

India

Oversee the reliability, scalability, performance, and security of key production services.
Collaborate with cross-functional teams to develop and maintain resilient infrastructure.
Provide expert mentorship and guidance on best practices to engineers throughout the organization.

Cision is a global leader in PR, marketing and social media management technology and intelligence, helping brands and organizations connect with customers and stakeholders to drive business results. The company has offices in 24 countries throughout the Americas, EMEA and APAC.

View details Similar jobs

Associate Director, Cloud Engineering

Model N 17 days ago

$160,000–$182,000/yr

US

Lead and mentor multiple teams across SRE, cloud infrastructure, and platform engineering functions.
Drive multi-team initiatives to deliver scalable, secure, and cost-efficient infrastructure leveraging AWS-native and serverless technologies.
Drive adoption of FinOps practices and partner with finance and product teams on budgeting and forecasting.

Model N is the leader in revenue optimization and compliance for pharmaceutical, medtech, and high-tech innovators. Model N is trusted by over 150 of the world’s leading companies across more than 120 countries.

View details Similar jobs

Senior Infrastructure Engineer - Hosting

Webflow 27 days ago

$150,100–$188,100/yr

US Canada 2w PTO 12w maternity 12w paternity

Create and test reliable cloud infrastructure services that support Webflow’s range of products.
Balance reliability, scalability, and cost efficiency concerns while refactoring and modernizing existing services.
Collaborate with product engineering teams to deliver new solutions for services and ways of working that might not exist yet.

Webflow is the leading visual development platform for building powerful websites without writing code.

View details Similar jobs

Senior Site Reliability Engineer

NICE 29 days ago

UK

Run the production environment by monitoring availability and taking a holistic view of system health. Build software and systems to manage platform infrastructure and applications. Improve reliability, quality, and time-to-market of our suite of software solutions.

NICE software products are used by 25,000+ global businesses to deliver extraordinary customer experiences, fight financial crime and ensure public safety.

View details Similar jobs

Infrastructure Engineer

Glia 1 day ago

Europe

Maintaining and updating Glia’s core infrastructure.
Troubleshooting and resolving infrastructure-related issues.
Improving our security posture.

Glia provides an AI customer service solution for banks and credit unions, unifying AI and human agents across every voice and digital conversation through its ChannelLess® Architecture. Valued at over $1 billion, Glia powers over 700 financial institutions and is certified as a Great Place to Work, with 98% employee satisfaction.

View details Similar jobs

Director of Infrastructure US Security & IT

Jobgether 27 days ago

$145,470–$228,597/yr

US

Responsible for building, maintaining, and scaling secure, reliable, and compliant IT and Cloud infrastructure.
Lead cross-functional teams to optimize deployment velocity and enhance observability.
Balance operational support with strategic initiatives and drive innovation in infrastructure practices.

This position is posted by Jobgether on behalf of a partner company.

View details Similar jobs

Database Engineer

Ruby Labs 21 days ago

Europe Unlimited PTO

Own and operate AWS Aurora (PostgreSQL) in a high-load production environment.
Design and evolve schemas for large transactional domains.
Analyze and optimize slow queries and production metrics.

Ruby Labs is a leading tech company that creates and operates innovative consumer products, offering opportunities across health, education, and entertainment. Their innovative teams are driving the future of consumer-led products.

View details Similar jobs

(Senior) Cloud Site Reliability Engineer (Scalability) (m/f/x)

Scalable Capital 28 days ago

Germany

Shape the way Scalable runs microservices in a performant, secure, and cost-efficient way. Collaborate with cross-functional teams to understand scalability requirements. Develop and maintain internal tooling around Monitoring, Developer Portal, and Load Testing.

Scalable Capital is a leading digital investment and banking platform with a full banking licence, empowering people across Europe to shape their own finances.

View details Similar jobs