Own the technical direction of Remote's SRE/Platform domain.
Define and drive the reliability strategy across the platform.
Identify and lead AI enablement initiatives across the engineering organisation.
Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.
Maintain and optimize AWS EC2 and EKS clusters to ensure high availability and performance.
Lead troubleshooting of production outages, providing timely resolution and root cause analysis.
Implement and improve CI/CD pipelines using tools like Jenkins and GitHub Actions to streamline deployment processes.
CI&T are tech transformation specialists uniting human expertise with AI to create scalable tech solutions. With over 8,000 CI&Ters globally, they have built partnerships with more than 1,000 clients over 30 years, and Artificial Intelligence is deeply embedded in their work reality.
Construct infrastructure as code, developing and enforcing best practice across configurations while preventing drift between Terraform configurations and infrastructure deployments.
SentiLink provides innovative identity and risk solutions, empowering institutions and individuals to transact with confidence. They are building the future of identity verification in the United States, replacing a clunky, ineffective, and expensive status quo with solutions that are 10x faster, smarter, and more accurate.
Lead the design, implementation, and continuous improvement of our cloud infrastructure and DevOps practices.
Ensure that our systems are scalable, reliable, and secure, enabling seamless software delivery across environments.
Improve development velocity while increasing system reliability.
Cadence is building a remote care delivery system that keeps older people healthy, out of the hospital, and at home. They support tens of thousands of active patients nationwide with their AI‑powered system and scalable clinical model enabling proactive, population‑level care.
Design, build, and maintain scalable, reliable systems on GCP.
Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager.
Manage incident response, conduct postmortems, and implement improvements to reduce recurrence.
SupplyHouse.com is an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004. They value every individual team member and cultivate a community where people come first with Generosity, Respect, Innovation, Teamwork, and GRIT.
Designing and managing cloud-based infrastructure on AWS.
Creating and maintaining deployment architectures and continuous delivery pipelines.
Automating infrastructure provisioning and management using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
Nearform is an independent team of data & AI experts, engineers, and designers who build intelligent digital solutions and capability at pace. Our team of 500 experts in 20+ countries is trusted by leading enterprises.
Lead the design, implementation, management, support, and operation of cloud-native infrastructure and container orchestration platforms.
Drive platform reliability, scalability, automation, and operational excellence across critical SaaS and cloud-based workloads.
Contribute to architectural decisions, mentor engineers, and ensure alignment with security, compliance, and operational standards.
Availity delivers revenue cycle and related business solutions for health care professionals who want to build healthy, thriving organizations. They are a global team with headquarters in Jacksonville, FL, and an office in Bangalore, India, united by a mission to bring the focus back to patient care.
Own and evolve the cloud substrate including compute, EKS fleet, networking, and cloud operations across AWS and GCP.
Design and maintain the networking fabric connecting Webflow's services, ensuring reliability, security, and scalability.
Build and enforce guardrails around IAM and permissions to keep infrastructure secure and auditable while driving FinOps and cost optimization.
Webflow is building the world's leading AI-native Digital Experience Platform. As a remote-first company built on trust and creativity, it empowers over 2 million users globally to design, launch, and optimize for the web without barriers.
Design, build, and operate core cloud infrastructure across compute, storage, databases, and networking layers.
Own and improve the reliability, scalability, and security of Valon’s production systems as we scale to support major enterprise deployments.
Evaluate, adopt, and operationalize new infrastructure technologies (e.g., Vitess, ClickHouse, Redis) to meet evolving product and scale requirements.
Valon is building the AI-native operating system for regulated finance, starting with mortgage servicing. They're a Series C company backed by a16z, transforming industries that others have written off as too complex to innovate.
Lead the design, implementation, and ongoing improvement of reliable, scalable, performant, and secure production platforms and services.
Work closely with cross-functional teams to build and maintain resilient infrastructure and deployment patterns.
Provide technical leadership and mentorship to engineers across the organisation, promoting strong engineering standards and operational best practice.
Cision empowers individuals to make an impact and values diverse perspectives. They foster curiosity, collaboration, and innovation while driving meaningful contributions to brands; they have offices in 24 countries throughout the Americas, EMEA and APAC.
Drive the stability and reliability of Epic's GCP infrastructure.
Manage and harden our Docker and GKE container platform.
Maintain and improve CI/CD pipelines.
Epic is the leading digital reading platform for kids ages 12 and under, used by millions of children, families, and educators around the world. As Epic continues to grow, we are reimagining what reading can be through thoughtful technology, data, and global collaboration to make learning more engaging, accessible, and impactful.
Extend and enhance an in-house platform for managing and automating SaaS deployments into AWS environments.
Design, implement, and maintain AWS infrastructure components for SaaS deployments using Java and AWS-native technologies.
Troubleshoot deployment and infrastructure issues while following SDLC processes and participating in code reviews.
Sapio Sciences builds a unified lab informatics platform that helps organizations accelerate scientific discovery and clinical diagnostics. It is a lean, growth-focused team guided by EMBRACE values, emphasizing ownership, collaboration, and real-world impact.
Design, build, and maintain scalable cloud infrastructure services in AWS and GCP.
Contribute production-quality Go and Python code to existing cloud services.
Develop and own automation and software deployment pipelines with maximum efficiency.
Dragos is dedicated to arming customers with best-in-class technology, threat intelligence, and services to protect their systems. They embody core values of authenticity, transparency, and trust and are a remote-first culture with operations in North America, Europe, the Middle East, and APAC.
Build and maintain CI/CD pipelines and deployment infrastructure.
Leverage AI to automate analysis and resolution of production issues.
Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production.
Architect and maintain infrastructure as code with Terraform.
Set up monitoring, alerting, and incident response.
We're a UK fintech building high-throughput digital infrastructure for the mortgage and property space. We recently acquired Trussle and are taking our platform to the next level. The company values innovation and building high-quality products.
Deploy and maintain infrastructure using Terraform on AWS.
Operate and govern production-grade platforms running on Kubernetes / EKS.
Build and maintain CI/CD pipelines using GitHub Actions.
Muttdata is a dynamic startup committed to crafting innovative systems using cutting-edge Big Data and Machine Learning technologies. They are looking for a hands-on DevOps engineer to join a strategic initiative focused on deploying and operating Data & AI platforms.
Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture.
Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.
Launch Potato is a digital media company that connects consumers with leading brands through data-driven content and technology. They are headquartered in South Florida, with a remote-first team spanning over 15 countries and a high-growth, high-performance culture.
Build small to medium-sized infrastructure components using Terraform and AWS.
Ensure reliable build-and-deploy cycles by maintaining and debugging CI/CD workflows, including GitHub Actions and ArgoCD.
Learn to troubleshoot and resolve issues in containerized environments, including Kubernetes pods and EKS networking bottlenecks.
TrueML is a mission-driven financial software company that aims to create better customer experiences for distressed borrowers. The TrueML team includes inspired data scientists, financial services industry experts and customer experience fanatics building technology.
Build and maintain end-to-end observability with ELK, Prometheus, and Grafana.
Own and improve CI/CD pipelines (CircleCI, GitLab CI, GitHub Actions, ArgoCD).
Lead incident response and postmortems in a blameless culture.
Redcare Pharmacy is Europe’s No.1 e-pharmacy, powered by passionate teams and cutting-edge innovation. They strive to create a healthy, collaborative work environment where every employee feels valued and inspired to contribute to their vision “Until every human has their health”.
Support the Platform Infrastructure by managing container environments on EKS, implementing GitOps workflows, and maintaining CI/CD pipelines.
Build for Reliability by defining SLIs/SLOs, leading incident response, and contributing to disaster recovery planning.
Drive Observability by designing and maintaining monitoring and logging stacks with Datadog, Sentry, and CloudWatch.
Turquoise Health is a Series C price transparency platform for finance leaders across healthcare, building the infrastructure for a more open, efficient healthcare marketplace. The company is a remote-first, US-based team that works with over 300 enterprise organizations and values transparency, empathy, inclusivity, creativity, and ownership.