Designing and implementing SLI/SLO frameworks with error budgets to guide reliability and performance decisions.
Building and maintaining AWS-based production infrastructure using Infrastructure as Code (Terraform, CloudFormation), including ECS, EKS/Kubernetes, and microservices orchestration.
Developing internal tools, automation frameworks, and reliability services in TypeScript, Python, or similar languages to enhance operational efficiency.
Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates, and this shortlist is then shared directly with the hiring company.
Implement SLI/SLO frameworks with error budgets to drive reliability decisions
Design release strategies including blue/green deployments and version tracking
Lead incident response and develop automated runbooks to reduce MTTR
Jobgether is a company that helps connect individuals with jobs through an AI-powered matching process. They ensure applications are reviewed quickly, objectively, and fairly against roles' core requirements.
Evolve progressive delivery with Argo Rollouts and GitOps to improve automated health checks and rollback triggers.
Optimize CI/CD infrastructure at scale to improve CI workflows and optimize build times.
Build deployment gates and guardrails that prevent production incidents by designing and implementing automated quality checks.
Wealthsimple is on a mission to help everyone achieve financial freedom by reimagining what it means to manage your money. They are the largest fintech company in Canada, with 3+ million users who trust them with more than $100 billion in assets.
Apply SRE principles to Customer Success and enable monitoring for key customers.
Detect and prioritize critical issues affecting the platform's reliability.
Proactively identify and implement improvements that enhance platform performance.
Jobgether is a platform that matches job seekers with companies using AI. They aim to ensure applications are reviewed quickly and fairly, connecting top candidates with hiring companies.
Optimize and scale our PostgreSQL (Supabase) infrastructure
Design indexing, partitioning, and query strategies for large-scale media datasets
Improve performance across ingestion, enrichment, and retrieval pipelines
Kled is building the largest opt-in human data network in the world. Since launching their mobile app in 2026 and scaling to 200,000+ active data contributors, they have raised $5M+ from investors behind SpaceX, Airbnb, Coinbase, xAI, OpenAI, Anthropic, Spotify, Lyft, and Uber.
Design and develop the platform architecture to enable developers to self service in building out infrastructure.
Collaborate with development teams to ensure their applications are optimized for deployment in an IAC environment, and meet security and compliance needs.
Develop and maintain automation and deployment processes to enable and improve developer experience and efficiency.
Quanata aims to ensure a better world through context-based insurance solutions. They are a customer-centered team creating innovative technologies and digital products, backed by State Farm, blending Silicon Valley talent with long-term backing of a leading insurer.
Design, implement, and maintain core components of the Internal Developer Platform, covering CI/CD, self-service, delivery pipelines, service templates, golden paths, developer tooling, internal APIs, observability integrations, and cloud/runtime abstractions.
Build and evolve opinionated golden paths that cover the majority of use cases, making explicit trade-offs between flexibility, standardization, and long-term maintainability.
Continuously improve developer workflows by reducing lead time, failure rates, onboarding effort, and cognitive load, guided by DevEx and flow metrics.
Redcare Pharmacy is Europe’s No.1 e-pharmacy, powered by passionate teams and cutting-edge innovation. They strive to create a healthy, collaborative work environment where every employee feels valued and inspired to contribute to their vision “Until every human has their health”.
Own the reliability, scalability, and performance of Peec AI’s core systems and infrastructure
Design, build, and maintain the tooling, automation, and monitoring that keep our services fast, secure, and highly available
Partner closely with product and engineering teams to ensure new features are reliable, observable, and easy to operate from day one
Peec AI is one of Europe’s fastest-growing Series A startups (no employee count/culture details given). They provide exciting and challenging work in the AI space.
Optimizing how the team produces code and collaborates to build WorkOS.
Identifying pain points and recommendations to improve how the company builds software internally.
Serving as a bridge between infrastructure, product, and leadership to ensure the tools and systems are maturing alongside the product.
WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. They are a fully distributed team with employees across North American time zones and are well-funded by top investors.