Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure.
Diagnosing and eliminating cross-layer failure modes.
Designing safe upgrade and rollout strategies at scale.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana, its open source visualization tool. Grafana Labs helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and its team thrives in an innovation-driven environment.
Own and drive the design, deployment, and operation of OpenStack and Kubernetes clusters optimised for GPU workloads
Lead and develop a team of 4–5 infrastructure engineers, setting clear direction and standards
Build and improve infrastructure through automation (IaC, GitOps, CI/CD pipelines)
NexGen Cloud is a fast-growing company building next-generation GPU cloud infrastructure. At the core of NexGen Cloud is a team of curious, driven people who care deeply about quality, ownership and collaboration.
Support the availability and durability of critical services across production environments.
Develop automation for common operational tasks, reducing manual intervention and toil.
Partner with engineering, product, and operations teams to support resilient system design and operations.
Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets and unleash innovators. Founded in 2007, they scaled the business with less than $3 million in outside funding until 2021, and they generate over $100 million in revenue managing over three billion gigabytes of data storage for 500K+ customers in 175+ countries.
Remotely provision and configure edge compute nodes immediately following physical installation.
Activate and integrate newly deployed systems into a distributed global network.
Monitor system health and serve as the escalation point for diagnosing and resolving technical issues.
Sitreps is building a next-generation maritime intelligence platform that transforms global fleets into a persistent sensing network. They combine edge computing, advanced sensors, and satellite connectivity, operating in remote environments to deliver critical insights.
Build scalable edge infrastructure, designing and maintaining delivery systems for model deployment.
Work with cross-functional teams to integrate complex features, translating research into hardware realities.
Drive automation and reliability by implementing infrastructure to test models and monitor performance.
Hudl builds great teams and hires the best to ensure employees are working with people they can constantly learn from. They provide a culture where everyone feels supported and have become one of Newsweek's Top 100 Global Most Loved Workplaces.
Serve as the technical owner for Connectivity projects, responsible for translating product goals into technical strategy.
Lead the design of secure, scalable Connectivity architecture patterns spanning facility-edge systems and Linux-based platforms.
Define and evolve the reusable automation patterns and configuration models that allow Connectivity solutions to be delivered repeatedly.
Simplesense builds, deploys, and sustains the Installation Resilience Platform that enables mission operators to rapidly adapt and respond. Their team combines over 100 years of direct mission experience solving hard problems with 50 years of technical expertise deploying DevSecOps, cybersecurity, and cloud infrastructure.
Design, implement, and maintain robust and secure device management strategies for remote devices using Unified Endpoint Management (UEM), MDM solutions, and orchestration tools.
Develop and manage observability pipelines to track device health, connectivity, and performance metrics across diverse warehouse environments.
Own the end-to-end lifecycle of device software, including secure Over-the-Air (OTA) firmware updates, rollback strategies, and OS hardening.
Locus Robotics is a global leader in warehouse automation, delivering unmatched flexibility, unlimited throughput, and actionable intelligence to optimize operations. They are trusted by industry-leading retail, healthcare, 3PL, and industrial brands.
Own the design, deployment and operation of OpenStack and Kubernetes environments.
Build and improve infrastructure using infrastructure-as-code and GitOps practices.
Optimise GPU workload scheduling using Kubernetes and NVIDIA tooling.
NexGen Cloud is building next-generation GPU cloud infrastructure, and is the company behind Hyperstack, a high-performance cloud platform designed for compute-intensive workloads. We're a scale-up by design, solving complex infrastructure challenges at pace, with real-world impact.
Ensure system reliability and performance across production environments by proactively monitoring infrastructure health, identifying performance bottlenecks, and implementing solutions that maintain 99.9%+ uptime for mission-critical healthcare imaging systems
Optimize SQL database performance and maintenance by analyzing query execution plans, implementing indexing strategies, performing database tuning, and executing maintenance routines that ensure fast, reliable access to imaging data across multiple SQL Server-based applications
Perform software deployments, including planning, testing, and executing application and system updates, patches, and new releases across on-premises and cloud environments, while minimizing downtime and ensuring compatibility with existing healthcare imaging workflows
Intelerad provides medical imaging solutions that streamline information flow, simplify processes, and maximize efficiencies to improve patient outcomes. They have nearly 800 employees across four countries, empowering nearly 2,000 healthcare organizations worldwide.
Provide production support in shifts according to the team's on-call roster.
Work on tickets raised by customers and by internal engineering and implementation teams while not on call for production support.
Continuously monitor the health and performance of our services, systems, and infrastructure.
Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and their constituents together. They have served 5,500 federal, state, and local government agencies and more than 300 million citizen subscribers.