Design, implement, and manage cloud infrastructure using Infrastructure as Code (IaC) tools.
Design, build, and maintain scalable CI/CD pipelines using tools like CircleCI or GitHub Actions.
Implement and maintain observability tooling (Prometheus, Grafana, Datadog), and lead incident response to ensure system reliability.
Engine is transforming business travel into something personalized, rewarding, and simple. More than 20,000 companies already rely on Engine to support over 1 million travelers and billions in annual bookings each year.
Designing, building, and maintaining infrastructure that enables fast, reliable, and secure product delivery.
Improving and maintaining CI/CD pipelines to streamline deployments and increase reliability.
Contributing to infrastructure reliability and ensuring systems are designed for resilience and growth.
Incident.io is the leading AI incident response platform, built to help teams dramatically reduce incident response time and improve reliability. They have raised $100M from Index Ventures, Insight Partners, and Point Nine, alongside founders and executives from world-class technology companies.
Architect and deploy secure, scalable infrastructure using Terraform, CloudFormation, or similar tools.
Ensure the platform meets strict SLA requirements for enterprise clients, minimizing downtime.
Implement comprehensive monitoring, logging, and alerting to provide deep visibility into system health.
Filevine provides cloud-based workflow tools for legal professionals, helping them manage organizations and serve clients. They are recognized as a fast-growing and innovative technology company with a team of passionate professionals.
Design, build, and maintain secure, scalable cloud infrastructure.
Own CI/CD pipelines and deployment workflows across services and environments.
Improve reliability, availability, and performance through monitoring, alerting, and incident response practices.
Jobgether is a company that uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates and share this short list directly with the hiring company.
Ensure the smooth operation and high availability of Clarifai's core services
Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency
Design and implement scalable, secure, and cost-effective infrastructure solutions
Clarifai is a leading AI platform specializing in computer vision and generative AI, empowering organizations to transform unstructured data into actionable insights. Founded in 2013, they have a diverse, globally distributed team with $100M in funding and are committed to building a diverse and inclusive team.
Design, build, and maintain secure, scalable cloud infrastructure
Own CI/CD pipelines and deployment workflows across services and environments
Improve reliability, availability, and performance through monitoring, alerting, and incident response practices
Wizard is revolutionizing the shopping experience using the power of generative AI and rich messaging technologies to build a personalized shopping assistant for every consumer. We scour the entire internet of products and ratings across brands and retailers to find the best products for every consumer’s personalized needs.
Own the operational stability and performance of Juul’s hybrid cloud infrastructure.
Lead automation efforts and architect for reliability.
Act as the final escalation point for critical incidents.
Juul Labs aims to transition the world’s billion adult smokers away from combustible cigarettes and eliminate their use, while also combating underage usage of their products. They are backed by leading technology investors and are committed to hiring great talent and building a diverse team.
Own developer operations and platform reliability across Introzy’s product stack.
Lead how we run infrastructure on Render, design and evolve our observability and alerting, shape our CI/CD and release practices.
Continuously improve internal developer experience so the engineering team can ship quickly and safely.
Introzy is a multi-app platform designed to unify networking, workflow, and productivity. As a subsidiary of Sanguine Technology Solutions, they are an early-stage company moving fast to deliver value, with a lean engineering team and a culture that embraces AI.
Design, implement, and maintain scalable and reliable infrastructure solutions.
Automate deployments and maintain a resilient, secure SaaS application platform.
Develop comprehensive monitoring and alerting solutions, and respond to incidents.
Veeam is the #1 global market leader in data resilience, believing businesses should control all their data whenever and wherever they need it, providing data resilience through data backup, data recovery, data portability, data security, and data intelligence. Based in Seattle, Veeam protects over 550,000 customers worldwide who trust Veeam to keep their businesses running.
Own and operate core platform systems across AWS, GCP, Vercel, Github, and Cloudflare.
Improve reliability, scalability, and security of production and non-production environments.
Improve local development environments and onboarding experience for engineers.
Moxie empowers ambitious aesthetic entrepreneurs to build profitable, independent practices. A global, remote-first team of more than 140 people supports hundreds of practices nationwide as they unlock sustainable success for aesthetic entrepreneurs.
Be a key contributor on an Agile development team, collaboratively realizing business value through iterative software development lifecycle.
Build and execute the monitoring strategy for ScienceLogic SaaS infrastructure.
Define, deploy, and maintain system and service monitors.
ScienceLogic is a leader in IT Operations Management, giving modern IT operations actionable insights for faster problem resolution and prediction. They see everything across cloud and distributed architectures, contextualizing data through relationship mapping, and acting on this insight through integration and automation.
Building monitoring, alerting, logging, and observability from the ground up.
Improving our security posture across auth, IAM, policies, and data access.
Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, embracing openness, acting with respect, showing grit & guts and combining employment with enjoyment.
Apply experience of IaC to develop infrastructure as code practice.
Automate software operations for re-usability and consistency across private and public clouds.
Collaborate with development teams to design service architecture, documentation, playbooks, policies and operational procedures.
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. With 1200+ colleagues in 75+ countries, it's a pioneer of global distributed collaboration with very few office-based roles and a founder-led, profitable, and growing company.
Ensure near-zero downtime with monitoring and alerting, self-healing automation, and continuous improvement
Create highly automated, available and scalable systems by applying software and infrastructure principles
Employ and advise clients on DevOps and SRE principles and practices, covering deployment pipelines, HA, service reliability, technical debt, and operational toil for live services running at scale
66degrees is an AI transformation partner. They guide enterprises from business challenges to quantifiable outcomes, helping businesses reach their inflection point where chaotic data becomes a strategic asset, complexity becomes clarity, and AI becomes an engine for growth. They believe in thriving through challenges and winning together.
Make deployments boring (in the best way possible)
Own CI/CD pipelines: optimize build times, improve caching, reduce flakiness
Evolve our Kubernetes (EKS) deployment strategy for reliability and speed
Obvious is building an AI-native workspace, an operating system for work that puts co-intelligence at the center. They are a small and talent-dense team with world-class builders, former founders, and leaders from companies like Netflix, Google, and Meta.
Design, implement, and manage multi-cloud infrastructure.
Implement container orchestration strategies for microservices architectures.
Develop and maintain infrastructure as code using Terraform.
Moniepoint is an all-in-one financial services platform for emerging markets, offering personal and business banking, payment, credit, and business management tools. It is also the second-fastest-growing company in Africa with over 3 million users.
Architect, maintain, and scale critical infrastructure.
Ensure system reliability and optimize performance.
Implement modern deployment strategies.
Scribe's Workflow AI platform automatically captures and optimizes workflows so teams work smarter, faster, and more consistently. They are a fast-growing company founded in 2019 with over 5 million users across 600,000 businesses, and they are backed by leading investors.
Contribute to high impact AWS cloud infrastructure initiatives.
Participate in operability and production readiness reviews.
Advocate and implement Site Reliability Engineering practices.
Patreon is a media and community platform where creators give fans access to exclusive work. They have generated over $10 billion for creators and have 25 million+ paid memberships, with a hybrid work model and offices in New York and San Francisco.
Design and implement highly scalable infrastructure for GitLab.com to support current and future growth.
Collaborate with cross-functional teams across the Infrastructure organization to plan and deliver projects that shape GitLab’s platform direction.
Operate and improve edge services and Kubernetes workloads, acting as a subject matter expert within the infrastructure department.
GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. They aim to enable everyone to contribute to and co-create the software that powers our world.
Design, build, and maintain shared platform services that support secure and scalable infrastructure across client and internal environments.
Develop and maintain infrastructure-as-code (IaC) using tools such as Terraform, ARM/Bicep, or similar frameworks.
Build automation for system provisioning, configuration management, patching, and lifecycle operations.
Sentinel Blue is bringing enterprise-class cybersecurity to small and medium sized businesses. They are pushing the envelope of how things are done and constantly seeking innovative ways to meet that mission.