Design, deploy, and manage AWS cloud infrastructure using Pulumi (Python). Manage CI/CD pipelines using CircleCI for Next.js, Django, and React Native/Expo services.
Implement monitoring, alerting, and logging solutions using Datadog and Sentry. Develop internal tooling and automation, primarily using Python.
Collaborate with product and development teams to architect scalable services and ensure high availability. Embed security best practices in infrastructure.
TrustedHousesitters operates a direct-to-consumer marketplace connecting pet owners and sitters for pet care and travel solutions. They are a fast-growing global community with members in over 140 countries and a team of more than 100 employees.
Lead efforts to scale and improve our infrastructure.
Develop and support internal team tooling.
Troubleshoot, debug and resolve issues as part of a shared on-call rotation.
Lillio, formerly HiMama, empowers early childhood educators through innovative tools. They are a Series B, private-equity backed company recognized as an industry leader and selected in 2025 by Time Magazine as one of the world's top EdTech companies.
Take technical ownership of our cloud infrastructure and DevOps practices.
Help us design resilient systems, scale infrastructure, mentor engineers, and collaborate across teams.
Own and improve CI/CD workflows, enabling fast and reliable deployments across teams.
Halcyon is the industry’s first dedicated, adaptive security platform that combines multiple proprietary advanced prevention engines along with AI models focused specifically on stopping ransomware. Formed in 2021 by a team of cyber industry veterans after battling ransomware for years, Halcyon is focused on solutions for mid-market and enterprise customers.
Lead two high-performing teams focused on CI/CD pipelines, developer experience, and system health, performance, and incident response.
Improve processes for deployment, monitoring, incident management, and release engineering, ensuring speed and reliability.
Drive the adoption of newer GitOps practices, with an emphasis on MLOps and AI-related infrastructure and tooling.
Bluesight creates groundbreaking solutions that increase efficiency, safety and visibility for health systems, hospital pharmacy, and pharmaceutical manufacturers. They are a high growth healthcare information technology company with a start-up vibe and over 3,000 customers using their proven solutions.
Partner with product and platform engineering teams to improve system reliability, scalability, and developer experience
Build, maintain, and evolve CI/CD pipelines to support safe, fast, and reliable deployments
Improve observability through better monitoring, alerting, logging, and telemetry
Zipline is a SaaS company transforming how frontline teams work. They empower leading brands across retail, healthcare, logistics, and beyond. Zipline is a fully remote company with employees across the U.S., Canada, and around the globe.
Design, deploy, and manage scalable and highly available cloud infrastructure on AWS.
Design reusable Terraform/OpenTofu modules following DRY principles and organizational standards.
Implement AIOps practices, leveraging AI tools to enhance monitoring, incident response, and predictive alerting.
DistroKid is the world’s largest distributor of music to Spotify, Apple Music, YouTube, and beyond, empowering millions of independent artists to get their music into streaming services and keep 100% of their earnings. They move fast, stay curious, and build tools that directly impact how artists share their music with the world.
Collaborate with application engineering teams on platform infrastructure.
Enhance observability and spearhead the adoption of SRE best practices.
Build and maintain reliable CI/CD pipelines, tooling, and infrastructure.
Rula strives to provide quality, evidence-based, compassionate mental healthcare and aims to create a world where mental health is no longer stigmatized. They are a remote-first company operating in most U.S. states, and are dedicated to having a culture of inclusion that supports their employees.
Drive adoption of team deliverables, championing the tools, standards, and practices.
Gather and translate stakeholder requirements, understanding infrastructure pain points.
Lead the team’s delivery, owning the backlog and prioritizing across competing demands.
Tem is rebuilding the energy transaction, making it transparent and fair. They are putting power back where it belongs, in the hands of customers and taking on one of the most critical problems of our century, access to low cost electricity. Tem closed a $75 million Series B and is scaling internationally.
Lead the push toward a modern, cloud-native organization by designing and managing scalable, resilient systems on AWS.
Own the Infrastructure as Code (IaC) strategy using Terraform, ensuring environments are repeatable, versioned, and stable.
Build and optimize high-velocity deployment pipelines using GitHub Actions, ArgoCD, and Helm to get code from "commit" to "production" seamlessly.
TrueML is undergoing a major platform rearchitecture, moving toward a fully cloud-native, modernized infrastructure. They seem to be a medium-sized company with a focus on innovation and providing engineers with the tools and data they need to make smart, impactful choices.
Design and maintain AWS infrastructure using best practices.
Develop, operate, and improve CI/CD pipelines using GitHub Actions.
Lead initiatives around infrastructure security and compliance.
Bluefish is building the platform that helps brands engage consumers on the new AI channel, with enterprise tools to manage AI brand safety and engage consumers with personalized AI marketing experiences. The Bluefish team is a tight-knit group of mar-tech industry veterans.
Spearhead the evolution of our scalable, secure, and high-performing platform, driving the infrastructure that fuels our startup.
Conduct gap analyses to strengthen infrastructure, maintain exceptional uptime, and enhance monitoring systems for rapid incident detection.
Mentor and develop your team, recruit top talent, and foster a culture of collaboration, technical excellence, and continuous improvement.
Ethena Labs is actively building and deploying groundbreaking digital dollar products, aiming to upgrade money into the internet era. They have scaled to $15b in 18 months and continue to develop new product lines and foundational infrastructure for a more open, efficient, and interconnected global financial system.
Design and implement the AWS platform foundations used by product and service teams across RWS.
Support engineering and IT teams with guidance as migration of application workloads from on-premise environments into AWS is completed.
Build and maintain infrastructure using Infrastructure as Code to ensure consistent, repeatable cloud deployments.
RWS unlocks global understanding by growing the value of ideas, data, and content. They have over 500 staff within Product & Technology and support over 7500 end users worldwide.
Build Self-Service Infrastructure: Design and scale highly available Infrastructure as Code (IaC) modules using Terraform. Empower development teams to provision resources autonomously and securely.
Champion Platform Reliability: Partner closely with engineering teams to define, measure, and operationalize SRE metrics. Balance feature velocity with system stability.
Elevate Developer Experience (DevEx): Architect frictionless, GitOps-driven CI/CD pipelines utilizing GitHub Actions and ArgoCD. Facilitate automated, secure, and progressive deployments.
KTO Group drives excitement in iGaming through innovation, focusing on transparency and player satisfaction. Founded in 2018, KTO blends sports betting with online casino entertainment on a proprietary platform, and is a rising leader in LATAM, ranked among Brazil’s top 10 iGaming brands.
Design, build, and optimize cloud platform capabilities.
Tackle complex infrastructure challenges and raise engineering quality.
Apply AI and AIOps to make the platform smarter and more resilient.
PerfectServe offers Best in KLAS clinical communication and physician scheduling solutions and is a Leader in the Gartner Magic Quadrant for Clinical Communication and Collaboration. We focus on optimizing provider schedules and dynamically routing messages to advance patient care and clinical workflows, valuing growth, transparency, and innovation.
Lead the Infrastructure Engineering team, taking full ownership of cloud infrastructure, Kubernetes platforms, DevOps tooling, and CI/CD pipelines.
Drive reliability, scalability, and security across the production environment while maintaining a sharp focus on developer velocity and business impact.
Mentor and guide engineers across SRE, DevOps, and Database Reliability functions, fostering a culture of operational excellence and pragmatic problem-solving.
Finom is a European tech startup headquartered in Amsterdam, revolutionizing financial services for entrepreneurs with an all-in-one B2B platform. They have raised $346 million, are expanding across key EU markets, and foster innovation, prioritizing research and solutions that benefit users, employees, partners, and the business.
Improve and maintain CI/CD, deployment workflows, and environment management across backend, web, and internal services.
Build, maintain and scale infrastructure across AWS and container based services.
Improve monitoring, alerting, logging, dashboards, tracing, and runbooks.
Newton aims to change how Canadians trade crypto and make financial freedom achievable for everyone by providing tools and knowledge. They foster a dynamic and collaborative remote team across Canada that values continuous improvement and creativity.
Learn platform infrastructure, developer tooling, and deployment patterns.
Own and drive at least one architecture decision that improves platform reliability.
Ship infrastructure improvements that measurably improve developer experience or platform stability.
Homebot is a homeownership platform for lenders and real estate, title & insurance agents that drives client retention and partner referrals. They maintain a clear focus on culture, engagement, and creating an environment where people are valued and can thrive.
Design and build reusable platform solutions empowering engineering and SRE teams across AWS, Azure, and GCP.
Spearhead the evolution of our Packer-driven VM image pipelines, establishing standardized, maintainable processes.
Lead application migrations into GCP while rapidly mastering our complex, multi-cloud infrastructure footprint.
TELUS Agriculture and Consumer Goods (TAC) is committed to disrupting the status quo with state-of-the-art applications that leverage data to reimagine the way we approach food. TAC is composed of inspired individuals united in passion and purpose, working collaboratively to bring extraordinary opportunities to life.
Design, migrate, and operate cloud-native platforms across AWS, GCP, and OCI.
Build and maintain Infrastructure as Code using Terraform across multiple cloud providers.
Apply SRE best practices to improve availability, performance, and reliability (SLIs/SLOs, monitoring, alerting).
Apriorit is a software engineering company established in 2002, with experience in system programming, cybersecurity, reverse engineering, SaaS/Web, blockchain-based solutions, and AI. With over 400 specialists, they help tech companies around the world turn their challenging ideas into secure and viable products.
Own and evolve the uptime monitoring platform to enhance customer capabilities.
Deploy a Clickhouse instance to capture check run logs and design APIs for reporting.
Collaborate with customers to resolve bugs affecting their infrastructure.
Jobgether is a platform posting jobs on behalf of partner companies. We use AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements.