Work with IaC tools like Terraform to ensure configurations are steady and change-managed.
Design and deploy endpoint security measures aligned with industry standards.
Ensure a strong security posture for corporate SaaS applications by configuring vendor capabilities.
OnePay is a consumer fintech company trusted by millions of Americans to make money better, providing an all-in-one financial services platform. Backed by Walmart and Ribbit Capital, they offer banking, savings, credit cards, lending, investing, and crypto services.
Build internal tooling to help other engineers and the rest of the company understand and operate our system.
Design and implement security best practices for our team and infrastructure.
Reduce toil through automation, including building and maintaining CI/CD infrastructure.
Openly is rebuilding insurance from the ground up by re-envisioning and enhancing every aspect of the customer experience. They are a rapidly growing team of exceptional, curious, empathetic people with a wide range of skill sets, spanning many departments.
Build scalable backend services and APIs that power our digital merchandising platform.
Work with other senior engineers to contribute to high level decisions about the architecture and design.
Work with Product Managers to make Jane’s advertising product offerings sound, robust and easy to use.
Jane Technologies is an MIT-founded eCommerce company in the cannabis industry experiencing rapid growth. Their mission is to bring confidence to the online cannabis shopping experience by connecting consumers with local dispensaries and brands. They are a small close-knit team of highly technical engineers with diverse backgrounds and a strong engineering culture.
Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo).
Develop and maintain scalable automation and integrations across cloud platforms and services.
Design, implement, and operate CI/CD pipelines using Jenkins, Dagger, Terraform, and Docker.
Build, operate, and troubleshoot workloads on Kubernetes, using Kustomize and Helm.
People Inc. is America’s largest digital and print publisher. Our brands harness the best intent-driven content, the fastest sites, and the fewest ads to help nearly 200 million people every month make decisions.
Design, build, and maintain secure CI/CD pipelines supporting cloud-native applications and services.
Implement Infrastructure as Code using tools such as Terraform to provision and manage cloud resources.
Integrate security controls and best practices into the software development lifecycle (DevSecOps).
540 is a forward-thinking company that the government turns to in order to #getshitdone. They break down barriers, build impactful technology, and solve mission-critical problems.
Lead software engineering teams providing infrastructure-as-code to manage cloud infrastructure.
Hire experienced site reliability staff, and a line manager to grow and oversee the SRE team.
Establish design-before-build discipline; facilitate lightweight design documents, architectural decision records, and working group reviews.
Horizon3.ai is a cybersecurity company dedicated to enabling organizations to proactively find, fix, and verify exploitable attack vectors. They are a fast-growing company with a culture of respect, collaboration, ownership, and results.
Own and operate end-to-end infrastructure for backend services, frontend systems and databases.
Build and maintain reliable deployment workflows including CI/CD pipelines and rollback procedures.
Improve system-wide observability through metrics, logging, alerting, and monitoring to ensure uptime.
Jito Labs builds a high-performance trading terminal on Solana. They are a lean, high-output team building something that sits at the intersection of execution quality, user experience, and on-chain infrastructure.
Scale and mature Vesta’s infrastructure to support the entire mortgage market reliably, securely, and efficiently.
Build the foundational systems that power engineering velocity and platform reliability.
Focus on cloud architecture, deployment systems, observability, incident response, and internal developer tooling.
Vesta is building the next-generation system of record to power the multi-trillion mortgage market. They value humility, empathy, self-awareness, and an orientation towards action and have raised $45M from top tier investors.
Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Design, build, and maintain scalable, reliable systems on GCP.
Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager.
Manage incident response, conduct postmortems, and implement improvements to reduce recurrence.
SupplyHouse.com is an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004. They value every individual team member and cultivate a community where people come first with Generosity, Respect, Innovation, Teamwork, and GRIT.
Design, build, and maintain scalable cloud infrastructure services in AWS and GCP.
Contribute production-quality Go and Python code to existing cloud services.
Develop and own automation and software deployment pipelines with maximum efficiency.
Dragos is dedicated to arming customers with best-in-class technology, threat intelligence, and services to protect their systems. They embody core values of authenticity, transparency, and trust and are a remote-first culture with operations in North America, Europe, the Middle East, and APAC.
Oversee a specialized SRE team focused on the design, deployment, and maintenance of automation toolsets.
Establish and enforce standards for IaC to ensure consistent, repeatable, and secure deployments.
Drive the automated lifecycle of both physical and virtual assets, from initial template creation/deployment to automated patching, scaling, and decommissioning.
Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress in finance and artificial intelligence. Led by CEO and Founder Michael Novogratz, their team blends deep crypto expertise with institutional experience and a shared commitment to shaping the future of Web3 and AI.
Build and maintain the platform that runs all Close systems.
Automate database lifecycles and eliminate static credentials.
Improve our multi-region disaster recovery system and reduce downtime.
Close is a bootstrapped, profitable, and fully remote company with a team of thoughtful individuals. They focus on building a CRM that prioritizes better communication for small scaling businesses and have about 100 employees.
Own the technical direction of Remote's SRE/Platform domain.
Define and drive the reliability strategy across the platform.
Identify and lead AI enablement initiatives across the engineering organisation.
Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.
Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture.
Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.
Launch Potato is a digital media company that connects consumers with leading brands through data-driven content and technology. They are headquartered in South Florida with a remote-first team spanning over 15 countries, with a high-growth, high-performance culture.
Architect future iterations of core systems, addressing scaling requirements.
Design and implement developer tools to enhance deployment safety and reproducibility.
Drive excellence in monitoring and guide incident response for quick issue resolution.
Found provides tools for self-employed individuals, offering a business bank account that automates taxes and expense tracking. They aim to give self-employed people the security and peace of mind historically available only at large corporations and are looking for kind, resourceful, and passionate people.
Ensure the protection of patient data and all of the technology behind our platform.
Work helps ensure the best outcomes for patients as we strive to make mental healthcare work for everyone.
Rula strives to create a world where mental health is embraced as part of overall well-being. They are dedicated to providing quality, evidence-based care and making a positive impact on the lives of individuals struggling with mental health issues.
Design, build, and maintain infrastructure using Infrastructure as Code tools such as Terraform.
Improve system reliability, scalability, resilience, and performance across the Mast platform.
Build systems and tooling that automate infrastructure management and operational workflows wherever possible.
Mast is on a mission to make complex lending simple by building modern, cloud-native lending technology purpose-built for specialist lenders. It is a high-performance team of engineers and lending experts that values radical honesty, transparency, and speed.
Build and develop the Core PM team, managing PMs across various products.
Set the product strategy across Core, deciding on bets and product reinforcement.
Make cross-product calls, prioritizing between products and shaping roadmaps.
Supabase provides the default data layer for AI-native applications. They have over 280 team members in 55+ countries and are committed to building tools developers love.