Source Job

UK 6w PTO

  • Act as a trusted technical partner, guiding organizations through onboarding, implementation, and expansion with white-glove support and best practices.
  • Deliver high-impact training, jumpstart engagements, and provide tailored technical consulting to help customers succeed.
  • Identify recurring issues, monitor support needs, and advocate for product improvements in close collaboration with internal teams.

Kubernetes Prometheus Grafana Cloud Platforms Observability

20 jobs similar to Senior Solutions Architect

Jobs ranked by similarity.

United States 6w PTO

  • Build and operate the internal engineering platform that provides application engineers with the tools, systems, and Kubernetes clusters they need to deploy and run their workloads.
  • Focus on cloud infrastructure, capacity management, security, engineering productivity, monitoring, and US Federal compliance across squads.
  • Participate in on-call rotations to ensure the health of the system and understand how people use our products.

Grafana Labs, the company behind the open observability cloud, is founded on the principles of open source, open standards, open ecosystems, and open culture. We are a 100% remote company with 1,600+ team members across 40+ countries, backed by leading investors including Lightspeed Venture Partners, Sequoia Capital, GIC, Coatue, J.P. Morgan, CapitalG, and Lead Edge Capital.

US Canada 6w PTO

  • Earning the trust of our large-scale operator customers to further Grafana's "big tent" philosophy of data accessibility and to meet clear business objectives.
  • Designing and leading the development of backend services, distributed systems, and enterprise features at scale.
  • Driving continuous improvement of our engineering culture through words and actions.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana, the open source visualization tool, around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack. The Grafana team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.

Global

  • Defining and driving the vision and strategy for Infrastructure Observability.
  • Identifying gaps in the end-to-end experience, and defining and owning the roadmap to fill those gaps.
  • Working closely across teams and across orgs, collaborating with Engineering, UX, Design, and other teams to deliver on your roadmap.

Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter.

Europe 6w PTO

  • Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
  • Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
  • Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.

Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo).

US 6w PTO

  • Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
  • Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
  • Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.

Spain 6w PTO

  • Take an active role in influencing our roadmap and your own career objectives.
  • Design, build, operate, and maintain critical systems, owning the reliability, performance, and availability.
  • Mentor and support other team members, participate in design discussions and collaborate with the team.

Grafana Labs is a remote-first, open-source powerhouse that provides visualization tools and helps companies manage their observability strategies. It has a global collaborative culture and a passion for meaningful work.

Europe 6w PTO

  • Develop and maintain features as part of Observability solutions in Grafana Cloud.
  • Contribute to the design and implementation of high-quality, scalable integrations for various infrastructure components, databases, and applications.
  • Build prototypes and present your ideas as part of a cross-functional team.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and thrive in an innovation-driven environment with a global collaborative culture.

Germany 6w PTO

  • Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.
  • Drive mature DevOps/SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
  • Guide teams in the design, development, evolution, and operation of large-scale, distributed cloud systems.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and their team thrives in an innovation-driven environment.

Canada Unlimited PTO

  • Design, build, and operate distributed systems powering observability across ClickHouse Cloud.
  • Own reliability, performance, and cost-efficiency of the telemetry pipeline and storage systems.
  • Take part in on-call rotation and drive root-cause resolution and long-term fixes.

ClickHouse is a real-time analytics and data warehousing company recognized on the 2025 Forbes Cloud 100 list. With over 3,000 customers and rapid growth, the company fosters an innovative and fast-paced culture.

Germany Spain Ireland UK 6w PTO

  • Lead a team covering corporate systems, employee device lifecycles, helpdesk queues, and internal tooling development.
  • Handle corporate security initiatives and compliance checks in an employee-enabling way.
  • Help define the wider internal AI rollout, enablement, and security strategy across teams.

Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack and thrive in an innovation-driven environment, scaling fast while staying true to their open-source legacy and a global collaborative culture.

$180,000–$200,000/yr
US

  • Own and evolve a scalable observability platform spanning metrics, logs, traces, and events.
  • Design telemetry pipelines ingesting data from GPUs, CPUs, networking, containers, APIs, and BMC/Redfish.
  • Design and implement noise-resistant alerting systems to improve signal quality and reduce operational load.

Lightning AI builds an end-to-end platform for developing, training, and deploying AI systems, designed to take ideas from research to production with less friction. They combine developer-first software with cost-efficient, large-scale compute, serving solo researchers, startups, and large enterprises.

Europe

  • Design and operate our Kubernetes ecosystem with a focus on high availability and zero-downtime operations.
  • Own and evolve our PaaS strategy, using GitOps and CI/CD to empower domain teams to deploy independently.
  • Define and implement our observability strategy across metrics, logs, and tracing.

Finom is a European tech startup headquartered in Amsterdam, revolutionizing financial services for entrepreneurs. They offer an all-in-one financial B2B solution integrating banking, accounting, financial management, and invoicing into a mobile-first platform, with about 346 million in funding.

Europe

  • Act as the primary technical partner for our top logos, seamlessly orchestrating everything from initial onboarding and technical enablement to customized workshops and advanced troubleshooting.
  • Develop, execute, and continuously iterate on tailored customer strategies that maximize ROI, deepen platform adoption, and cement long-term brand loyalty.
  • Foster deep, lasting relationships with key technical stakeholders, acting as their ultimate problem-solver and guiding them step-by-step through complex architectural implementations.

Logz.io is building the future of how engineering teams manage cloud complexity. Their Open 360 platform delivers unified, full-stack observability and security as a fully managed SaaS built on best-of-breed open source.

$124,845–$146,205/yr
Europe 6w PTO

  • Manage and grow a distributed team of engineers, providing feedback and supporting career development.
  • Partner with product management to shape the Usage squad's roadmap, ensuring alignment with company mission and customer impact.
  • Guide the team through the full project lifecycle, ensuring high-quality and timely outcomes within the Usage domain.

Grafana Labs is a remote-first, open-source powerhouse with over 20M users globally. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.

$104,414–$125,381/yr
Europe 6w PTO

  • Build delightful interactive learning inside Grafana and ship features that make learning experiences feel obvious, smooth, and scalable.
  • Enable contribution and authoring by creating workflows and product features that let many contributors safely create, iterate on, and improve learning content.
  • Build fast feedback loops (metrics/logs/traces + user journey visibility) so issues stay shallow by making it easy to understand what’s happening in production and in real user experiences.

Grafana Labs is a remote-first, open-source powerhouse with over 20M users worldwide. They help over 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.

$116,449–$139,531/yr
Europe 6w PTO

  • Take an active role in influencing our roadmap and your own career objectives.
  • Drive projects from initial ideation all the way to operations once they are in the hands of customers.
  • Design, build, operate, and maintain critical systems, owning the reliability, performance, and availability.

Grafana Labs is behind the open observability cloud, and is founded on the principles of open source, open standards, open ecosystems, and open culture. They are a 100% remote company with 1,600+ team members across 40+ countries.

$130,000–$180,000/yr
US

  • Partner with Account Executives to uncover customer pain points and compliance requirements.
  • Deliver technical presentations on RapidFort’s DevTime Protection Tools and Runtime Protection capabilities.
  • Advise customers on vulnerability management strategies and compliance frameworks.

RapidFort's Software Supply Chain Security platform helps organizations reduce vulnerabilities, harden containerized workloads, and minimize attack surface without requiring code changes. They are a fast-paced startup where adaptability, ownership, and execution are critical.

Europe

  • Lead the investigation and resolution of complex infrastructure, networking, and platform-related incidents.
  • Provide technical leadership for Kubernetes platform operations and supporting infrastructure services.
  • Mentor and support AI Infrastructure & Platform Operations Engineers, sharing technical knowledge through documentation and training.

Mirantis helps organizations ship code faster on public and private clouds, providing a public cloud experience on any infrastructure from the data center to the edge. The company serves many of the world's leading enterprises, including Adobe, DocuSign, Liberty Mutual, and PayPal, and is a leader in container management.

Europe 6w PTO

  • Develop and own the product vision and strategy for Data Collection, Transformation, and Ingestion as a core platform capability for Grafana Cloud.
  • Partner closely with senior R&D leaders to align product and technical strategy across teams, make clear tradeoffs, and ensure the roadmap balances customer value, platform leverage, operational excellence, and business impact.
  • Use customer research, product analytics, competitive insight, and business context to identify the highest-impact problems to solve.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and their team thrives in an innovation-driven environment.

$188,550–$212,150/yr
Global Unlimited PTO

  • Own the technical direction of Remote's SRE/Platform domain.
  • Define and drive the reliability strategy across the platform.
  • Identify and lead AI enablement initiatives across the engineering organisation.

Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.