The Opportunity:
- Gain a front-row seat to the biggest infrastructure shift in decades by building platforms that enable durable execution for AI applications and enterprise workflows.
- Work with state-of-the-art tech built from first principles, including a lightweight Rust binary with a custom storage layer and low latency orchestration.
- Partner with world-class engineers from companies like Apache Flink and Meta, focusing on reliability, correctness, and operational simplicity.
What You'll Do:
- Build and operate Restate Cloud across its infrastructure, control plane, networking, storage, and observability.
- Evolve BYOC products and assist customers with on-prem installations by designing infrastructure for their cloud accounts.
- Focus on fleet reliability and observability through SLOs, metrics, alerting, and automation to scale deployment methods.
What We're Looking For:
- Senior to Staff profile with experience operating production SaaS or platform infrastructure and navigating compliance-sensitive environments.
- Must-haves include strong cloud infrastructure background, IaC expertise with Kubernetes, software engineering skills in Rust/Go/C++, and end-to-end ownership.
- Nice-to-haves include prior experience with Restate, enterprise procurement navigation, Kubernetes operator development, and IaC systems like Cluster API or Terraform.
Restate
Restate provides a lightweight runtime that transforms AI agents, workflows, and backend services into durable processes, allowing teams to focus on logic instead of failure mechanics. It's an early-stage startup with a team of world-class engineers who have built foundational systems at scale and emphasize deep technical craft.