Drive high availability and disaster recovery strategies, ensuring RPO/RTO targets are met for our core banking systems.
Define and execute the scalability roadmap, including partitioning, sharding strategies, and capacity forecasting.
Perform deep-dive query optimization, execution plan analysis, and proactive bottleneck remediation.
Finom is a European tech startup headquartered in Amsterdam that aims to revolutionize the financial landscape for entrepreneurs. They offer an all-in-one financial B2B solution that integrates banking functions, accounting, financial management, and invoicing. They recently closed a €115 million Series C equity round.
Incident Management: Respond to and resolve incidents in a timely manner, conducting post-incident reviews to identify and implement improvements.
Alpaca is a self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. They are a dynamic team of 380+ globally distributed members.
Own database reliability across Aurora, OpenSearch, Redis, and CDC pipeline, including schema design reviews, migration safety, and incident response.
Make the Django ORM a strength at scale by catching N+1 patterns, extending QuerySet conventions, and building CI checks that encode standards.
Build self-service tooling and dashboards giving teams visibility into their query footprint, and contribute to onboarding and knowledge-sharing as the engineering org grows.
Scribe provides a Workflow AI platform that automatically captures and optimizes how work gets done, used by 94% of the Fortune 500. The company has grown to over 5 million daily active users across 600,000 businesses, achieved $100M ARR in May 2026, is Series C valued at $1.3 billion, and fosters a builder culture with a high bar and fast pace.
Design and build large-scale distributed systems and high-throughput data pipelines using Go and cloud-native technologies.
Lead system-wide architectural decisions focusing on data flow, performance, and resilience.
Champion best engineering practices, code quality, testing, and maintainability while mentoring junior engineers.
DoiT is a global technology company that helps organizations leverage the cloud for business growth, combining data, technology, and human expertise. With thousands of customers worldwide, DoiT fosters a remote-first culture that values entrepreneurship, knowledge pursuit, and fun.
Design, build, and operate distributed systems powering observability across ClickHouse Cloud.
Own reliability, performance, and cost-efficiency of the telemetry pipeline and storage systems.
Take part in on-call rotation and drive root-cause resolution and long-term fixes.
ClickHouse is a real-time analytics and data warehousing company recognized on the 2025 Forbes Cloud 100 list. With over 3,000 customers and rapid growth, the company fosters an innovative and fast-paced culture.
Act as a first responder for system incidents and outages, ensuring high availability and performance.
Own and evolve monitoring, alerting, and log management systems while optimizing database infrastructure.
Collaborate with engineering teams to build scalable, resilient systems and contribute to SRE tooling and automation.
Circle is building the world's leading all-in-one platform for online communities. We're a fully remote company of around 200 team members from 30+ countries, with a culture that values autonomy, async collaboration, and high expectations.
Architect future iterations of core systems, addressing scaling requirements.
Design and implement developer tools to enhance deployment safety and reproducibility.
Drive excellence in monitoring and guide incident response for quick issue resolution.
Found provides tools for self-employed individuals, offering a business bank account that automates taxes and expense tracking. They aim to give self-employed people the security and peace of mind historically available only at large corporations and are looking for kind, resourceful, and passionate people.
Earning the trust of our large-scale operator customers to further Grafana's "big tent" philosophy of data accessibility and to meet clear business objectives.
Designing and leading the development of backend services, distributed systems, and enterprise features at scale.
Driving continuous improvement of our engineering culture through words and actions.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana, the open source visualization tool, around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack. The Grafana team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Implement and optimize data access patterns for efficient interaction with large-scale data.
Monitor, troubleshoot, and tune existing database instances to ensure latencies and operational stability.
Develop and maintain reusable frameworks, SDKs, and platform services in programming languages.
HighLevel is an AI-powered business operating system. They give agencies, entrepreneurs and SMBs the infrastructure to build, automate and scale, supporting SMBs across 150+ countries and fostering community-driven growth with over 2,000 team members across 10+ countries.
Scale and mature Vesta’s infrastructure to support the entire mortgage market reliably, securely, and efficiently.
Build the foundational systems that power engineering velocity and platform reliability.
Focus on cloud architecture, deployment systems, observability, incident response, and internal developer tooling.
Vesta is building the next-generation system of record to power the multi-trillion mortgage market. They value humility, empathy, self-awareness, and an orientation towards action and have raised $45M from top tier investors.
Build and maintain the platform that runs all Close systems.
Automate database lifecycles and eliminate static credentials.
Improve our multi-region disaster recovery system and reduce downtime.
Close is a bootstrapped, profitable, and fully remote company with a team of thoughtful individuals. They focus on building a CRM that prioritizes better communication for small scaling businesses and have about 100 employees.
Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo).