Lead the design, implementation, and continuous improvement of our cloud-native platform infrastructure.
Create and maintain tooling and automation that improves efficiency and developer experience.
Drive platform optimization initiatives focused on performance, cost efficiency, and reliability.
Intelerad's medical imaging solutions streamline the flow of information, simplifying complex processes, maximizing efficiencies, and shining a light on the unknown.
Design and implement the "Golden Paths"—standardized, automated templates for microservices and infrastructure.
Develop the CLI tools, portals, or API interfaces that abstract the complexity of our cloud infrastructure.
Develop and maintain a library of modular, testable, and versioned Terraform modules.
SEON is a command center for fraud prevention and AML compliance, helping companies stop fraud, reduce risk and protect revenue. They are powered by real-time, first-party data signals, enriches customer profiles, flags suspicious behavior and streamlines compliance workflows.
Design and manage AWS infrastructure for AI services.
Implement Infrastructure as Code using Terraform.
Collaborate with cross-functional teams to enhance performance.
Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly against the role's core requirements. Their system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.
Build Enterprise-Scale Infrastructure leveraging infrastructure-as-code to manage complex cloud environments.
Sustain Platform Health and Performance owning critical systems in production, including reliability and security.
Enable Teams and Customers to Move Faster creating abstractions and tooling that deploy, run, and scale AI/ML workloads.
Cake is on a mission to make cutting-edge AI accessible to enterprise teams. Backed by top investors, Cake is seeing strong adoption and is positioned for rapid growth in the next 12 months, emphasizing ownership, clear communication, and collaboration.
Architect, operate, improve and secure the platform the Garner Health app runs on
Boost development velocity and productivity
Build systems to a high engineering standard and hold others to the same high standard
Garner has developed a revolutionary approach to evaluating doctor performance and a unique incentive model that's reshaping the healthcare economy to ensure everyone can afford high quality care. They have more than doubled their revenue annually over the last 5 years. Garner's award winning culture is designed to cultivate teamwork, trust, autonomy, exceptional results, and individual growth.
Architect and deploy secure, scalable infrastructure using Terraform, CloudFormation, or similar tools.
Ensure the platform meets strict SLA requirements for enterprise clients, minimizing downtime.
Implement comprehensive monitoring, logging, and alerting to provide deep visibility into system health.
Filevine provides cloud-based workflow tools for legal professionals, helping them manage organizations and serve clients. They are recognized as a fast-growing and innovative technology company with a team of passionate professionals.
Heavily contribute to the architecture and migration of our CI/CD platform. Act as a pragmatic driver and senior contributor, responsible for designing and implementing solutions. Design and build the paved path as a product, ensuring they are reliable, secure, and well-documented.
Glia is the leading AI customer service solution for banks and credit unions offering AI and human agents across every voice and digital conversation.
Ensure the smooth operation and high availability of Clarifai's core services
Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency
Design and implement scalable, secure, and cost-effective infrastructure solutions
Clarifai is a leading AI platform specializing in computer vision and generative AI, empowering organizations to transform unstructured data into actionable insights. Founded in 2013, they have a diverse, globally distributed team with $100M in funding and are committed to building a diverse and inclusive team.
Designing, building, and maintaining infrastructure that enables fast, reliable, and secure product delivery.
Improving and maintaining CI/CD pipelines to streamline deployments and increase reliability.
Contributing to infrastructure reliability and ensuring systems are designed for resilience and growth.
Incident.io is the leading AI incident response platform, built to help teams dramatically reduce incident response time and improve reliability. They have raised $100M from Index Ventures, Insight Partners, and Point Nine, alongside founders and executives from world-class technology companies.
Design and manage infrastructure-as-code with Terraform and GitOps.
Build and maintain secure CI/CD pipelines with integrated security automation.
Deploy and operate Kubernetes/K3s clusters in AWS GovCloud (IL5/IL6).
Rackner is a cloud-native software consultancy delivering solutions for startups, enterprises, and the public sector. They enable digital transformation through DevSecOps, AI/ML, and cloud-first innovation, solving high-impact problems and delivering secure, scalable solutions for the Department of Defense and federal health programs.
Design, operate, and scale storage based infrastructure systems across multiple tenancy models and public clouds.
Deepen our team’s expertise in relational databases, search, caching, queuing, and streaming.
Partner with Architecture, Release Engineering, Network, Compute, and Security teams.
Dbt Labs is the pioneer of analytics engineering, helping data teams transform raw data into reliable, actionable insights. They have grown from an open source project into the leading analytics engineering platform, and believe in empowering data practitioners.
Work with research teams to design and build our training infrastructure
Prototype new training frameworks and production-ize solutions at scale
Design, optimize and test model integration infrastructure
Clarifai is a leading AI platform specializing in computer vision, NLP, LLMs, and audio recognition, helping organizations transform unstructured data into structured data. Founded in 2013, they remotely operate across multiple countries with backing from industry leaders, fostering a diverse and equal opportunity workplace.
Shape the way Scalable runs microservices in a performant, secure, and cost-efficient way. Collaborate with cross-functional teams to understand scalability requirements. Develop and maintain internal tooling around Monitoring, Developer Portal, and Load Testing.
Scalable Capital is a leading digital investment and banking platform with a full banking licence, empowering people across Europe to shape their own finances.
Design, develop, and maintain Python-based services and data integrations supporting IAM and access management platforms.
Deploy, optimize, and manage cloud infrastructure using Infrastructure as Code (Terraform / Terraform Enterprise).
Collaborate with application and product teams to onboard internally hosted, SaaS, and homegrown applications into centralized data and access frameworks.
Blend is an AI services provider that co-creates meaningful impact for its clients through data science, AI, technology, and people. They are dedicated to unlocking value and fostering innovation for its clients by harnessing world-class people and data-driven strategy.
Enable teams to build features at scale by providing a foundation of reusable software components and infrastructure.
Motive empowers the people who run physical operations with tools to make their work safer, more productive, and more profitable. Motive serves nearly 100,000 customers – from Fortune 500 enterprises to small businesses – across a wide range of industries.
Architect and maintain scalable, reliable infrastructure: Design and optimize infrastructure for high availability, fault tolerance, and performance across distributed systems.
Lead incident management and root cause analysis: Own incident response processes, ensure swift resolution of issues, and drive post-incident improvements to prevent recurrences.
Service monitoring and automation: Build and maintain automated monitoring, alerting, and healing systems that improve system health, reduce manual intervention, and minimize downtime.
VGS is the world's leader in payment tokenization, empowering clients and partners by tokenizing sensitive payment data and limiting compliance scope. They embed a universal token vault into their technology stack to manage the complexities of payment data tokenization across processors and networks and more. While the job posting doesn't specify size, they appear to have a culture that values transparency, collaboration, grit, and humility.
Take ownership of an ML deployment system spanning multiple production environments and continue to research efficient and effective strategies.
Improve, expand, and streamline our existing deployment pipelines to support faster deployments and automated model retraining.
Collaborate with Data Scientists to understand model requirements and provide guidance to ensure seamless integration with production environments.
Best Egg is a market-leading, tech-enabled financial platform helping people build financial confidence through lending solutions and financial health tools. They foster an inclusive, flexible, and fun workplace with top-tier benefits and growth opportunities.