In this role, the Site Reliability Engineer (SRE) will be responsible for managing and resolving the most challenging issues for the ServiceNow SRE team, focusing on instance performance, reliability, and availability. This is a swing shift role (4 days a week) and the candidate must be located within the Republic of Ireland.
Job listings
As a Senior DevOps Engineer in 3Cloud's managed services team, you will be responsible for ongoing support, addressing escalations from the monitoring team, and performing proactive maintenance for our client's Azure platforms, utilizing the Managed Services team's processes, procedures, and tools. You will play a critical role in our core team, essential to the success of our Managed Services division.
Join Granicus as a Site Reliability Engineer! You will be pivotal in ensuring the reliability, scalability, and performance of our services, leading efforts in building and maintaining a robust infrastructure, automating processes, and guiding the team to implement best practices in site reliability. This role involves on-call production support, monitoring systems, incident management, and collaboration with software engineers.
We are seeking a Platform Engineer to build and maintain the infrastructure powering our edge computing environments. This role focuses on deploying, automating, and scaling distributed infrastructure at remote edge locations, ensuring reliable performance close to end users and devices. You will collaborate with software engineers, operations, and security teams to create resilient, low-latency platforms that support applications running at the edge.
Join our dynamic IT team as a Mid-Level Site Reliability Engineer (SRE 2). Ensure the reliability, availability, and performance of our services. Troubleshoot incidents, automate processes, and collaborate with software engineers to enhance system performance. Implement security best practices to protect our systems and data.
Pythian is building a next-generation Site Reliability Engineering team, and we're looking for talented, motivated engineers who thrive in fast-paced, problem-solving environments. As an SRE, you'll design, deploy, and operate large-scale distributed systems across compute, storage, networking, and AI/ML environments. You'll lead projects from architecture to automation to intelligent monitoring, collaborating with both clients and teammates to build resilient, high-performing infrastructure.
Play a vital role in managing and enhancing Kraken's engineering platform as part of the Core Infrastructure - Platforms Team. Responsible for managing and applying best practices to our orchestration platform while providing infrastructure support across Engineering, focusing on a secure, reliable, and scalable infrastructure platform.
Maintain and develop the cloud infrastructure on AWS. Ensure observability practices using tools like Prometheus, Grafana, and CloudWatch. Maintain and develop CI/CD pipelines in Bitbucket for applications in containers. Support and improve test automation processes. Implement security strategies in the cloud. Support the engineering team in diagnosing and resolving critical incidents. Propose improvements to architecture and governance in the cloud. Work collaboratively with engineering and quality teams.
As a Technical Operations Engineer, ensure the stability, reliability, and performance of our production systems. Leverage deep technical expertise, particularly in Web3/blockchain technologies, to manage, optimize, and enhance our platform infrastructure. Drive operational excellence through proactive monitoring, meticulous incident management, innovative problem-solving, and collaborative cross-team initiatives.
Redhorse transforms how government uses data and technology and is seeking a skilled and motivated AWS Cloud DevOps Engineer to join its high-performing team. The role is critical to maintaining and enhancing the cloud infrastructure, CI/CD pipelines, and containerized environments that support government clients, directly impacting the reliability and security of mission-critical systems.