Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure.
Diagnosing and eliminating cross-layer failure modes.
Designing safe upgrade and rollout strategies at scale.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana, its open source visualization tool. Grafana Labs helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and its team thrives in an innovation-driven environment.
Operate and evolve multi-cloud streaming clusters and related database infrastructure, diagnosing and eliminating cross-layer failure modes.
Define and evolve the technical direction for operating shared database systems at scale, leading complex initiatives and reliability investments.
Mentor and support engineers, improve systems toil with automation, and partner with database and platform teams to align on strategy.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, featuring scalable metrics, logs, and traces and thrive in an innovation-driven environment where transparency, autonomy, and trust fuel everything.
Own and drive the design, deployment, and operation of OpenStack and Kubernetes clusters optimised for GPU workloads
Lead and develop a team of 4–5 infrastructure engineers, setting clear direction and standards
Build and improve infrastructure through automation (IaC, GitOps, CI/CD pipelines)
NexGen Cloud is a fast-growing company building next-generation GPU cloud infrastructure. At the core of NexGen Cloud is a team of curious, driven people who care deeply about quality, ownership and collaboration.
Automate EC2 provisioning and configuration using Ansible
Deutsche Telekom IT Solutions Slovakia entered the life of Košice region in 2006. They have managed to grow from scratch to the second largest employer in the eastern part of the country with more than 3900 employees.
Design and maintain robust ML deployment pipelines to ensure seamless model delivery.
Automate model training, deployment, and monitoring workflows to increase operational efficiency.
Collaborate closely with Data Scientists and Engineering teams to integrate models into production environments.
Truelogic is a leading provider of nearshore staff augmentation services, headquartered in New York. With over 600+ highly skilled tech professionals based in Latin America, they drive digital disruption by partnering with U.S. companies on their most impactful projects.
Design, deploy, and manage AWS cloud infrastructure using Pulumi (Python). Manage CI/CD pipelines using CircleCI for Next.js, Django, and React Native/Expo services.
Implement monitoring, alerting, and logging solutions using Datadog and Sentry. Develop internal tooling and automation, primarily using Python.
Collaborate with product and development teams to architect scalable services and ensure high availability. Embed security best practices in infrastructure.
TrustedHousesitters operates a direct-to-consumer marketplace connecting pet owners and sitters for pet care and travel solutions. They are a fast-growing global community with members in over 140 countries and a team of more than 100 employees.
Design, implement, and operate cloud-native infrastructure across GCP, AWS, or Azure using Terraform.
Take full ownership of MongoDB Atlas in production, including cluster architecture, scaling, and security.
Build and maintain CI/CD pipelines with a strong focus on automation, reliability, and scalability.
Smart Working Solutions believes jobs should feel right every day and connect skilled professionals with global teams for full-time, long-term roles. They offer a supportive community and value growth in a remote-first environment, priding themselves as a top-rated workplace.
Design, implement, and maintain cloud-based infrastructure and services at the intersection of agentic AI and biomedical data.
Collaborate with software engineers, data engineers, researchers and data scientists to understand their needs and implement solutions that enhance their productivity.
Build and lead a high-performing platform engineering team, setting a high bar for technical excellence, ownership, and accountability in the organization.
Owkin is an AI company on a mission to solve the complexity of biology. They are building the first Biology Super Intelligence (BASI) by combining powerful biological large language models, multimodal patient data, and agentic software.
Learn platform infrastructure, developer tooling, and deployment patterns.
Own and drive at least one architecture decision that improves platform reliability.
Ship infrastructure improvements that measurably improve developer experience or platform stability.
Homebot is a homeownership platform for lenders and real estate, title & insurance agents that drives client retention and partner referrals. They maintain a clear focus on culture, engagement, and creating an environment where people are valued and can thrive.
Partner with Sales and Field Engineering to design and architect complex, enterprise-grade solutions tailored to customer needs.
Lead the implementation of custom solutions within customer environments across multi-cloud and hybrid architectures.
Optimize solutions for performance, scalability, and reliability in production environments.
Striim is a unified data integration and streaming platform that connects clouds, data, and applications. We believe and expect all of our employees to operate as one with unlimited potential and dignity.