Design and implement reliable and scalable AWS architecture.
Support the CICD process with ArgoCD and GitOps, automating deployments with Terraform.
Optimize system performance and troubleshoot issues, collaborating with development teams.
Cloudbeds is transforming hospitality with its intelligently designed platform that powers properties across 150 countries. They are a completely remote team of 650+ employees across 40+ countries, focused on building AI-powered solutions for hotels.
Ensure the smooth operation and high availability of Clarifai's core services
Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency
Design and implement scalable, secure, and cost-effective infrastructure solutions
Clarifai is a leading AI platform specializing in computer vision and generative AI, empowering organizations to transform unstructured data into actionable insights. Founded in 2013, they have a diverse, globally distributed team with $100M in funding and are committed to building a diverse and inclusive team.
Design, build, and maintain automated CI/CD pipelines to enable fast, secure, and reliable deployments.
Provision, manage, and optimize core AWS services to support scalable, highly available applications.
Implement and maintain IaC frameworks to ensure infrastructure is version-controlled, repeatable, and auditable.
Arine is a healthcare technology and clinical services company dedicated to ensuring individuals receive the safest and most effective treatment. They are backed by leading healthcare investors and collaborate with top healthcare organizations, managing more than 18 million lives across prominent health plans.
Automate the provisioning of all of Juniper Square’s infrastructure in code.
Partner with our Platform Engineering team on building developer tooling / improving developer experiences via joint initiatives and enhancements.
Partner with our Data Engineering team on improving our data posture and driving operational excellence.
Juniper Square's mission is to unlock the full potential of private markets by digitizing them to bring efficiency, transparency, and access. They are a values-driven organization with a hybrid workplace strategy, allowing employees to collaborate effectively across multiple countries and offering physical offices in several major cities.
Contribute to high impact AWS cloud infrastructure initiatives.
Participate in operability and production readiness reviews.
Advocate and implement Site Reliability Engineering practices.
Patreon is a media and community platform where creators give fans access to exclusive work. They have generated over $10 billion for creators and have 25 million+ paid memberships, with a hybrid work model and offices in New York and San Francisco.
Lead incident response as Incident Commander, coordinating teams, communications, and service restoration
Produce executive-level incident reports, run RCAs, and drive continuous improvement
Enforce change management and risk assessment for production changes
Truelogic is a leading provider of nearshore staff augmentation services headquartered in New York, delivering top-tier technology solutions to companies of all sizes. Their team of 600+ highly skilled tech professionals, based in Latin America, drives digital disruption by partnering with U.S. companies on their most impactful projects.
Be a key contributor on an Agile development team, collaboratively realizing business value through iterative software development lifecycle.
Build and execute the monitoring strategy for ScienceLogic SaaS infrastructure.
Define, deploy, and maintain system and service monitors.
ScienceLogic is a leader in IT Operations Management, giving modern IT operations actionable insights for faster problem resolution and prediction. They see everything across cloud and distributed architectures, contextualizing data through relationship mapping, and acting on this insight through integration and automation.
Ensure near-zero downtime with monitoring and alerting, self-healing automation, and continuous improvement
Create highly automated, available and scalable systems by applying software and infrastructure principles
Employ and advise clients on DevOps and SRE principles and practices, covering deployment pipelines, HA, service reliability, technical debt, and operational toil for live services running at scale
66degrees is an AI transformation partner. They guide enterprises from business challenges to quantifiable outcomes, helping businesses reach their inflection point where chaotic data becomes a strategic asset, complexity becomes clarity, and AI becomes an engine for growth. They believe in thriving through challenges and winning together.
Design, implement, and manage cloud infrastructure using Infrastructure as Code (IaC) tools.
Design, build, and maintain scalable CI/CD pipelines using tools like CircleCI or GitHub Actions.
Implement and maintain observability tooling (Prometheus, Grafana, Datadog), and lead incident response to ensure system reliability.
Engine is transforming business travel into something personalized, rewarding, and simple. More than 20,000 companies already rely on Engine to support over 1 million travelers and billions in annual bookings each year.
Design, implement, and manage CI/CD pipelines to automate the software development lifecycle and perform platform, application deployments using cloud and on-prem services.
Collaborate with agile development teams to ensure code quality and reliability.
Implement observability using Dynatrace, AWS cloud watch and related tools and monitor and maintain system performance, availability, and security.
Experian is a global data and technology company, powering opportunities for people and businesses around the world. As a FTSE 100 Index company listed on the London Stock Exchange (EXPN), they have a team of 22,500 people across 32 countries.
Maintain tooling, libraries, and infrastructure leveraged by core service teams
Develop and maintain infrastructure services that enable engineers to manage, deploy, and scale systems
Act as a technical leader, guiding core service teams to design robust and reliable software
StackAdapt is a technology company that empowers marketers to reach, engage, and convert audiences with precision. They are an AI-powered platform connecting brand and performance marketing, recognized for their diverse workplace and high-performing campaigns.
Lead platform engineering initiatives using Kubernetes (EKS), Helm, and Infrastructure as Code.
Design and operate CI/CD platforms and deployment strategies to enable safe, low-risk releases.
Build and maintain strong observability foundations, including metrics, logging, alerting, and dashboards tied to service health.
Patriot Software is a remote-first, product-led tech company with a mission to make accounting and payroll fast, simple, and affordable for millions of American businesses. With 175+ team members across the U.S. and a collaborative office hub in Canton, Ohio, we’re building software that empowers the backbone of the American economy.
Own the reliability, scalability, and performance of Peec AI’s core systems and infrastructure
Design, build, and maintain the tooling, automation, and monitoring that keep our services fast, secure, and highly available
Partner closely with product and engineering teams to ensure new features are reliable, observable, and easy to operate from day one
Peec AI is one of Europe’s fastest-growing Series A startups (no employee count/culture details given). They provide exciting and challenging work in the AI space.
Own developer operations and platform reliability across Introzy’s product stack.
Lead how we run infrastructure on Render, design and evolve our observability and alerting, shape our CI/CD and release practices.
Continuously improve internal developer experience so the engineering team can ship quickly and safely.
Introzy is a multi-app platform designed to unify networking, workflow, and productivity. As a subsidiary of Sanguine Technology Solutions, they are an early-stage company moving fast to deliver value, with a lean engineering team and a culture that embraces AI.
Design, build, and maintain secure, scalable cloud infrastructure.
Own CI/CD pipelines and deployment workflows across services and environments.
Improve reliability, availability, and performance through monitoring, alerting, and incident response practices.
Jobgether is a company that uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates and share this short list directly with the hiring company.
Own and operate core platform systems across AWS, GCP, Vercel, Github, and Cloudflare.
Improve reliability, scalability, and security of production and non-production environments.
Improve local development environments and onboarding experience for engineers.
Moxie empowers ambitious aesthetic entrepreneurs to build profitable, independent practices. A global, remote-first team of more than 140 people supports hundreds of practices nationwide as they unlock sustainable success for aesthetic entrepreneurs.
Architect, maintain, and scale critical infrastructure.
Ensure system reliability and optimize performance.
Implement modern deployment strategies.
Scribe's Workflow AI platform automatically captures and optimizes workflows so teams work smarter, faster, and more consistently. They are a fast-growing company founded in 2019 with over 5 million users across 600,000 businesses, and they are backed by leading investors.
Design, develop, and implement platform solutions that enhance the reliability, security, and scalability of the Database Platform infrastructure.
Provide technical leadership in AWS cloud infrastructure, networking, CI/CD, and security for cloud infrastructure solutions.
Mentor and coach team members, fostering a culture of knowledge sharing, technical excellence, and continuous improvement.
SYSTABUILD is building a shared cloud and platform foundation for a group of leading software companies in the construction, CAD and ERP domain. They are looking for a Lead Cloud Infrastructure Engineer to take a key role in designing, operating, and evolving their central cloud infrastructure and platform services.
Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
Metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.
Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices.