Source Job

Global Unlimited PTO 16w maternity 16w paternity

  • Design, implement, and operate core services that power Docker’s Cloud Sandboxes platform.
  • Build scalable systems for microVM orchestration, workload scheduling, and lifecycle management.
  • Ensure system reliability, observability, and performance across Docker’s Cloud Sandbox infrastructure.

Go Java Kubernetes AWS Azure

20 jobs similar to Staff Software Engineer

Jobs ranked by similarity.

Global Unlimited PTO

  • Design and build resilient, scalable platform services like authentication and rate limiting.
  • Collaborate with engineers across teams to deliver infrastructure solutions.
  • Optimize systems for security, performance, and always-on availability.

Constructor is an AI-first ecommerce search and discovery platform that helps shoppers find products and enables brands to drive revenue. The company is fully remote and fosters a culture of growth, offering training budgets and regular team offsites.

Global 16w maternity 16w paternity

  • Lead the design and implementation of self-service platform infrastructure for provisioning, deployment, and observability across engineering teams.
  • Evolve multi-tenant EKS foundations toward better reliability, security, scale, and multi-region connectivity.
  • Set delivery standards using Terraform, GitOps, and progressive rollout, while improving SLOs and alerting on Grafana Cloud.

Docker is a developer tooling company trusted by over 20 million monthly users and 20 billion container image pulls. They are a globally distributed, remote-first team building tools that define how software gets built and delivered.

US Unlimited PTO

  • Provide frontline technical expertise to help developers deploy and scale Temporal in cloud-native environments.
  • Troubleshoot complex infrastructure issues, optimize performance, and develop automation solutions.
  • Collaborate with engineering and product teams to influence platform improvements and enhance developer experience.

Temporal provides an open source programming model that simplifies code and makes applications more reliable. The company is a growing team driven by values of curiosity, collaboration, and humility, focused on improving developer experience.

  • Design and develop scalable, high-performance, low-latency backend services using Java and AWS technologies.
  • Collaborate with product, frontend, and QA teams to define technical requirements and ensure smooth integration with other platform components.
  • Optimize existing services for maximum performance, reliability, and maintainability while implementing CI/CD best practices.

Oscilar builds an advanced AI Risk Decisioning Platform that helps banks, fintechs, and digital organizations manage fraud, credit, and compliance risk. The team includes industry veterans from Meta, Uber, Citi, and Confluent, and operates with a mission-driven culture emphasizing ownership and innovation.

US Unlimited PTO 12w maternity 12w paternity

  • Break down complex problems into understandable and iterative solutions.
  • Develop and operate Infrastructure-as-code on Kubernetes environments using Docker and Helm.
  • Plan and implement migrations of systems between hosts with minimal downtime.

CivicActions helps government agencies deliver better services through modern software practices. They work in cross-functional, agile teams and encourage a balanced, autonomous lifestyle.

Germany Unlimited PTO

  • Design and maintain scalable infrastructure-as-code solutions using Terraform and Kubernetes.
  • Build and operate observability systems while leading incident response and reliability improvements.
  • Embed security and compliance practices into infrastructure and optimize system performance and cloud costs.

This partner company builds a next-generation platform enabling AI-driven services across global employment infrastructure. It is a highly distributed, async-first organization where engineers thrive in ownership and autonomy.

US

  • Design, implement, and maintain CI/CD pipelines for Java-based microservices and enterprise healthcare applications.
  • Automate infrastructure provisioning and configuration using Terraform or CloudFormation.
  • Support containerization and orchestration using Docker and Kubernetes with a focus on security and HIPAA compliance.

HealthEdge provides healthcare technology platforms. The company is a full-time employer seeking a DevOps Engineer to join their remote team and focuses on automation and reliability.

US 4w PTO 14w maternity 14w paternity

  • Own core compute infrastructure across multiple cloud providers and regions.
  • Design capabilities for greater performance and flexibility in service deployment.
  • Investigate and resolve challenging cloud and compute issues across the stack.

Render is a cloud platform for developers building AI-native, full-stack, multi-service applications. Trusted by over 6 million developers, the company has raised $257M in funding and values craft, velocity, and user experience.

US Unlimited PTO

  • Design, build, and operate core platform services that replicate and recover complex cloud environments with speed and precision.
  • Develop and extend Arpio's application-aware orchestration engine, mapping dependencies and recovering entire application stacks.
  • Work directly against cloud provider APIs, building robust abstractions that absorb their differences and edge cases.

Arpio builds a next-generation disaster recovery SaaS platform for complex cloud-native environments, recovering entire application stacks across AWS and Azure. The company is a small, YC21 and Series A-backed team that values tackling hard problems and building an innovative product.

United States Canada UK Unlimited PTO 18w maternity 12w paternity

  • Build and maintain core components of the clearing house in Go on GCP, including customer onboarding flows and data ingestion pipelines.
  • Take ownership of ambiguous problems and drive features from design through production with appropriate testing and observability.
  • Participate in on-call rotation, contribute to incident response, and become a go-to engineer for core subsystems.

Chainguard is the trusted source for secure open source software, delivering hardened builds for enterprise customers. The company is venture-backed by leading investors and serves Fortune 500 enterprises.

US Unlimited PTO

  • Own the US-only production environment end-to-end, including infrastructure deployment, maintenance, scaling, and reliability.
  • Lead and grow the US-based DevOps team, design scalable AWS infrastructure, and build CI/CD pipelines for safe, fast shipping.
  • Partner with engineering on application error investigations, improve monitoring and alerting, and coordinate with the Tel Aviv team on shared platform standards.

Zafran de-risks 90% of critical vulnerabilities overnight across hybrid environments using existing security tools. Backed by Sequoia Capital and Cyberstarts, it is one of the fastest-growing companies in cybersecurity, scaling to meet demand from advanced organizations.

US Unlimited PTO

  • Develop internal tools and automate infrastructure using AWS, Kubernetes, and programming languages.
  • Research and design solutions to increase website robustness, availability, and cost efficiency.
  • Collaborate on documentation, code reviews, and rollout of new processes.

Angi powers the future of the home services industry, connecting homeowners with skilled pros. With 9 brands in 8 countries and employees worldwide, Angi has helped homeowners with over 300 million home projects.

US

  • Independently troubleshoot enterprise CI/CD and infrastructure issues for top tech companies.
  • Design and implement proactive tools, processes, and open source contributions.
  • Provide support via Slack, Zoom, and Community Forum with no on-call duties.

Buildkite is rethinking software delivery, building a fast, reliable, and secure CI/CD platform for high-growth tech companies like Airbnb and Canva. They are a remote-first company with a culture of kindness, autonomy, and collaboration.

Global

  • Design and implement AI inference and training cloud products optimized for Kubernetes, including autoscaling and distributed jobs across GPU fleets.
  • Write clean, efficient Go code for Kubernetes controllers, operators, and custom resources supporting AI workloads.
  • Build APIs, CLIs, and developer tools to simplify deployment, lifecycle management, and monitoring of AI applications.

Gcore is a global provider of infrastructure and software solutions for AI, cloud, network, and security, powering digital experiences worldwide. With 550+ professionals and 210+ edge locations, the company collaborates with partners like Intel, NVIDIA, and Equinix to build the foundation for an AI-driven world.

Europe

  • Build Cloud-native, AWS-based greenfield software and invent solutions to various user, business, and technical challenges.
  • Operate a 'build it and own it' culture with a DevOps and individual contributor mindset.
  • Be a key member of a fast-growing global team, shaping the back-end building squads for scale.

Santander Auto Software is building a brand-new global software platform and product suite for international customers, focusing on creative technology solutions. The company is a 100% tech-focused, cross-border team with a leadership team of experienced tech professionals, promoting equal opportunities regardless of gender, culture, or disability.

Serbia

  • Lead the design and development of scalable backend APIs and services integrating with cloud services, focusing on IBM Cloud.
  • Collaborate with product, infrastructure, and partner teams to design intuitive and maintainable interfaces.
  • Influence long-term architecture and technical direction for Sysdig's integrations with IBM Cloud.

Sysdig creates cloud security tools, including the open-source Falco project for threat detection. It is trusted by over 60% of the Fortune 500 and recognized as a Best Place to Work and one of Deloitte's fastest-growing companies for the past 5 years.

US Canada UK Unlimited PTO 18w maternity 12w paternity

  • Own end-to-end domain within the clearing house: customer onboarding, entitlements, or data validation.
  • Drive architecture and implementation of backend systems in Go on GCP, ensuring production readiness.
  • Establish engineering best practices and collaborate with principal engineer on technical planning.

Chainguard secures the open source software supply chain by providing hardened, secure builds of open source software. It is a venture-backed startup with a remote-first culture, trusted by Fortune 500 enterprises.

US

  • Design and implement scalable systems for Trust & Safety engineering.
  • Lead the architecture of digital armor to mitigate fraud and billing anomalies.
  • Mentor senior engineers and establish engineering best practices.

Rula is a remote-first mental healthcare company dedicated to evidence-based and compassionate care. They focus on building trust with payers, patients, and providers through scalable technology.

Global Unlimited PTO

  • Design, implement, and maintain CI/CD tools and processes for seamless software delivery.
  • Automate and optimize systems using IaC, monitoring, and scripting with Python and Bash.
  • Architect and manage service components across GCP, Oracle, AWS, and Azure environments.

Teramind is a global leader in user behavior analytics, insider risk management, and workforce intelligence, using a predictive AI-driven approach to safeguard organizations. As a fully-remote team since 2014, it fosters a diverse, global culture with a forward-thinking and collaborative environment.

UK

  • Design, develop, and optimize scalable shared backend services using Java and serverless technologies (AWS Lambda).
  • Design and implement RESTful APIs and event-driven systems, contributing to frontend components as needed.
  • Collaborate with cross-functional teams, mentor junior developers, and maintain CI/CD pipelines for high-quality software delivery.

Turnitin is a recognized innovator in global education, partnering with educators and institutions for over 25 years to develop learning integrity solutions. With over 16,000 academic institutions and team members in 35+ countries, the company fosters a remote-first culture focused on purpose, accountability, and well-being.