Source Job

$190,800–$267,100/yr
United States

  • Design and deliver software solutions in Go to improve the availability, scalability, and latency of Reddit's compute infrastructure.
  • Develop Kubernetes controllers and operators to automate cluster management, workload scheduling, and the reconciliation of complex system states.
  • Build core tooling and SDKs that codify network configurations, managed services, and compute capacity tracking across a multi-region fleet.

Go Kubernetes Distributed Systems Linux System Design

17 jobs similar to Senior Software Engineer, Compute Platform

Jobs ranked by similarity.

US 4w PTO 14w maternity 14w paternity

  • Own Render's core network infrastructure across multiple data centers and cloud providers, shaping how networking evolves as Render rapidly scales.
  • Design and build customer-facing networking capabilities that give users greater flexibility in how their services connect and communicate, and how traffic is routed.
  • Investigate complex networking issues across the stack, from the kernel and data plane to distributed systems and edge networking.

Render is building a modern cloud platform for developers creating AI-native, full-stack, multi-service applications, eliminating the tradeoff between hyperscaler power and developer-friendliness. They are a diverse and talented team that values craft, velocity, and user experience.

$190,800–$267,100/yr
US

  • Design and build backend systems, APIs, infrastructure, and platform capabilities that improve developer workflows across Reddit.
  • Build scalable and reliable systems across both AI-powered developer workflows and the core non-AI systems engineers rely on every day.
  • Lead high-impact projects across Reddit’s developer tooling ecosystem by writing and reviewing code and design docs, aligning stakeholders, and making pragmatic technical tradeoffs.

Reddit is a community-based platform built on shared interests, passion, and trust, facilitating open and authentic conversations. With over 100,000 active communities and approximately 126 million daily active unique visitors, it serves as one of the internet’s largest sources of information.

$115,000–$130,000/yr
US Unlimited PTO

  • Develop and maintain scalable automation and integrations across cloud platforms and services.
  • Design, implement, and operate CI/CD pipelines using Jenkins, Dagger, Terraform, and Docker.
  • Build, operate, and troubleshoot workloads on Kubernetes, using Kustomize and Helm.

People Inc. is America’s largest digital and print publisher. Our brands harness the best intent-driven content, the fastest sites, and the fewest ads to help nearly 200 million people every month make decisions.

US

  • Work cross-functionally to build novel products and features.
  • Contribute to the full development cycle.
  • Contribute standards that improve developer workflows.

Reddit is a community-driven platform built on shared interests and trust, hosting open conversations. With over 100,000 active communities and 126 million daily active users, it's a major source of information.

North America

  • Design, develop, and deliver high-quality backend services and APIs primarily using Go (Golang) and deploy them in Kubernetes environments.
  • Build and maintain automated tests for backend services, participate in code reviews, and monitor service performance in production to debug issues.
  • Provide technical guidance on backend architecture and integration challenges, sharing knowledge and supporting continuous improvement of processes and documentation.

Applied Systems provides innovative software and services for the insurance industry. They are an established insurtech company with 40+ years of experience and focus on creating a collaborative, value-driven culture for their team.

US 6w PTO

  • Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
  • Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
  • Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.

US Canada Unlimited PTO

  • Build scalable backend services and APIs that power our digital merchandising platform.
  • Work with other senior engineers to contribute to high level decisions about the architecture and design.
  • Work with Product Managers to make Jane’s advertising product offerings sound, robust and easy to use.

Jane Technologies is an MIT-founded eCommerce company in the cannabis industry experiencing rapid growth. Their mission is to bring confidence to the online cannabis shopping experience by connecting consumers with local dispensaries and brands. They are a small close-knit team of highly technical engineers with diverse backgrounds and a strong engineering culture.

$165,000–$165,000/yr
US

  • Design, build, and maintain scalable cloud infrastructure services in AWS and GCP.
  • Contribute production-quality Go and Python code to existing cloud services.
  • Develop and own automation and software deployment pipelines with maximum efficiency.

Dragos is dedicated to arming customers with best-in-class technology, threat intelligence, and services to protect their systems. They embody core values of authenticity, transparency, and trust and are a remote-first culture with operations in North America, Europe, the Middle East, and APAC.

$205,000–$231,000/yr
United States Unlimited PTO 18w maternity 12w paternity

  • Own architecture and direction for .NET ecosystem infrastructure, enabling secure, reproducible build, test, and distribution workflows for .NET libraries and SDKs.
  • Design and maintain automation for building, updating, validating, and publishing .NET artifacts, including vulnerability scanning, remediation, SBOMs, and provenance.
  • Build internal developer tools and integrate deeply with dotnet projects, NuGet, and artifact repositories to solve complex dependency and version-resolution issues in large codebases.

Chainguard is the trusted source for open source, delivering hardened, secure, and production-ready builds of the open source software engineers and AI agents rely on to help organizations build faster, stay compliant, and eliminate risk. It is a venture-backed company with customers including Fortune 500 enterprises and global industry leaders like OpenAI and Snowflake, embodying a remote-first culture with values focused on customer obsession, intentional action, and trust.

$140,000–$230,000/yr
US

  • Collaborate with Engineering, Product, and Operations to manage a global fleet of tens of thousands of media players and smart speakers.
  • Build tools in Bash and Golang for fleet management and investigate network issues, collaborating with customer Network Engineers.
  • Refine observability pipelines and processes to ensure efficient monitoring and support for distributed device management.

QSIC is a technology company that reinvents in-store audio by using audio, AI, and creativity to drive growth for retailers and brands. With team members in Australia, the US, and Mexico, they power thousands of stores across three continents, reaching over 100 million shoppers monthly, and received Series B funding in 2025.

Europe Unlimited PTO

  • Participate in the development and maintenance of high-performance backend services and applications using Golang.
  • Architect, implement, and optimize microservices-based applications, ensuring scalability, reliability, and maintainability.
  • Collaborate with the DevOps team to deploy and manage Golang applications in Kubernetes clusters using Helm.

Ruby Labs creates and operates innovative consumer products across health, education, and entertainment. They foster innovation and look for passionate individuals to join their fast-growing teams.

$29,000–$36,000/yr
India

  • Design, build, and maintain scalable, reliable systems on GCP.
  • Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager.
  • Manage incident response, conduct postmortems, and implement improvements to reduce recurrence.

SupplyHouse.com is an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004. They value every individual team member and cultivate a community where people come first with Generosity, Respect, Innovation, Teamwork, and GRIT.

Unlimited PTO 16w maternity 16w paternity

  • Scale and mature Vesta’s infrastructure to support the entire mortgage market reliably, securely, and efficiently.
  • Build the foundational systems that power engineering velocity and platform reliability.
  • Focus on cloud architecture, deployment systems, observability, incident response, and internal developer tooling.

Vesta is building the next-generation system of record to power the multi-trillion mortgage market. They value humility, empathy, self-awareness, and an orientation towards action and have raised $45M from top tier investors.

US

  • Lead the Platform (Core Services) team to build and maintain shared libraries and services for product engineering teams.
  • Define the team charter, establish ownership boundaries, and drive the mission of providing scalable and reliable infrastructure.
  • Manage and mentor a high-performing team of platform engineers, focusing on people development and building high-trust environments.

Rula is a mental healthcare company dedicated to providing evidence-based and compassionate care to treat the whole person. It is a remote-first organization focused on creating a stigma-free environment and fostering a culture of diversity, equity, and inclusion for its employees.

Europe 6w PTO

  • Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
  • Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
  • Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.

Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo).

US Unlimited PTO

  • Build and maintain integrations for Microsoft platforms including SharePoint, OneDrive, Teams, Azure Blob Storage, and Azure DevOps
  • Design and implement robust authentication and authorization handling using Microsoft Graph APIs, Entra ID (Azure AD), OAuth2, and enterprise access patterns
  • Improve performance, scalability, and reliability of large-scale content scanning systems

Truffle Security is a cybersecurity company that aims to make handling secrets easier. Built on the TruffleHog secrets scanning platform, their enterprise solution aids security and engineering teams in finding exposed credentials, understanding their activity, and acting confidently.

Canada

  • Design, develop, and maintain core infrastructure supporting large-scale optimization engines and planning workflows to improve scalability and performance.
  • Analyze and optimize performance bottlenecks in optimization pipelines, focusing on compute, memory usage, and data flow for complex planning problems.
  • Contribute to evolving platform architecture, designing systems for large datasets and parallel execution while ensuring enterprise-grade reliability and maintainability.

Kinaxis is a global leader in modern supply chain orchestration, providing an AI-powered platform for end-to-end supply chain transparency and faster decision-making. The company has over 2000 employees globally, is a multi-time Top Employer award winner, and fosters a culture of innovation with a serious focus on technology, customers, and a collaborative, not-too-serious internal environment.