Source Job

$200,000–$285,000/yr
US

  • Spearhead development and implementation of observability tools.
  • Drive performance and ensure resilient systems.
  • Provide technical guidance and improve operational efficiencies.

Go Python Typescript Cloud Infrastructure Distributed Systems

20 jobs similar to Engineering Manager

Jobs ranked by similarity.

$200,000–$285,000/yr
Global

  • Manage and grow a team of engineers, conducting performance reviews and providing coaching.
  • Define and execute the technical vision for the observability platform.
  • Provide architectural oversight on instrumentation, logging, metrics, and tracing.

Jobgether uses an AI-powered matching process to ensure candidate applications are reviewed quickly, objectively, and fairly against a role's core requirements. They identify the top-fitting candidates and share this shortlist directly with the hiring company.

US 6w PTO

  • Build & Lead a High-Performing Team.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. It helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack. They thrive in an innovation-driven environment where transparency, autonomy, and trust fuel everything.

North America 4w PTO

  • Guide and support a team of developers through coaching, career development, and regular feedback conversations
  • Partner with product teams to identify reliability challenges and create solutions that improve the client experience
  • Promote best practices for production engineering and help establish patterns that scale across the organization

Wealthsimple aims to provide financial freedom to everyone by making money management transparent and low-cost through smart technology. As the largest fintech company in Canada, they serve 3+ million users and manage over $100 billion in assets, fostering a collaborative, humble culture focused on quality.

US 6w PTO

  • Design, implement, and maintain scalable integrations for metrics, logs, and traces across cloud and Kubernetes environments.
  • Build middleware, libraries, and services to simplify development and observability workflows.
  • Lead technical direction and strategic planning for observability projects.

They are currently looking for a Staff Software Engineer - Grafana Cloud Observability, Kubernetes Monitoring in United States. This role offers a unique opportunity to shape and advance cloud observability solutions for large-scale systems, focusing on metrics, logs, and traces.

Europe 6w PTO

  • Design and implement high-quality, scalable integrations for various infrastructure components, applications, and data ingestion pipelines.
  • Create middleware components and libraries that simplify development and maintenance of observability solutions.
  • Lead the technical direction and vision of the team, contributing to strategic discussions and future development of observability solutions.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, featuring scalable metrics, logs, and traces, and thrive in an innovation-driven environment.

$72,000–$111,000/yr
US Unlimited PTO

  • Enabling faster incident response by improving monitoring coverage, alert accuracy, and root cause visibility
  • Helping teams shift from reactive to proactive operations by applying telemetry data and AI-driven insights
  • Empowering service owners with clear dashboards and actionable insights that guide performance improvements

HealthEquity's mission is to save and improve lives by empowering healthcare consumers. They envision making HSAs as widespread and popular as retirement accounts by 2030, valuing individuals more than their positions and passionate about connecting health and wealth for American families.

$220,000–$220,000/yr
US

  • Manage and mentor Systems Engineers across multiple product initiatives.
  • Provide clear performance expectations, coaching, and career development.
  • Foster accountability, ownership, and technical rigor while building a resilient, high-trust engineering culture.

Dragos is dedicated to arming customers with technology, threat intelligence, and services to protect their systems. They are a remote-first culture with operations in North America, Europe, the Middle East, and APAC, looking for mission-oriented teammates.

$200,000–$230,000/yr
US Canada

  • Manage, mentor, and hire for a high-performing team of engineers.
  • Instill best practices for monitoring, alerting, and incident response.
  • Streamline how we build and maintain integrations with external systems.

Inspiren offers a complete and connected ecosystem in senior living, founded by Michael Wang. Their integrated platform connects smart hardware, embedded software, and cloud infrastructure to deliver real-time insights that improve safety, operational efficiency, and care outcomes.

  • Architect observability platform: Design, implement, and maintain the LGTM stack as the primary observability platform across all engineering teams.
  • Build internal observability products: Design and develop production-grade internal platform products with React/TypeScript frontends and Python/Rust backends.
  • Develop custom log indexing systems: Architect and build high-performance log indexing solutions using Rust that process logs and provide sub-second search across billions of log lines.

Judi Health is an enterprise health technology company providing a comprehensive suite of solutions for employers and health plans. They have a mission of rebuilding trust in healthcare in the U.S. and deploying the infrastructure we need for the care we deserve.

US

  • Define and execute the Customer Experience and Success coverage model for the US region.
  • Recruit, onboard, and develop Customer Experience and Success professionals within the region.
  • Personally manage complex enterprise accounts through the full engagement cycle.

Dash0 is building an AI-centric OpenTelemetry-native platform that eleminates vendor lock-in and toil. Dash0 is growing rapidly, with the United States as a primary strategic market.

US Unlimited PTO

  • Build, lead, and develop a remote engineering team of ~5 engineers.
  • Act as a hands-on technical leader—participating in code reviews, architecture decisions, and occasional implementation work.
  • Improve engineering processes, quality, and productivity as the company scales.

EdSights builds technology that helps colleges and universities better understand and support their students at scale. They partner with 250+ colleges and universities and are backed by an $80M investment, in a phase of rapid execution and expansion.

$230,000–$250,000/yr
US Unlimited PTO 12w paternity

  • Define and evolve reliability standards for the SmarterDx platform.
  • Enhance observability systems (metrics, logs, traces, alerting) to provide actionable insights and reduce mean time to detect (MTTD) and resolve (MTTR).
  • Reduce operational toil through automation, self-healing systems, and improved deployment and rollback mechanisms.

SmarterDx, a Smarter Technologies company, builds clinical AI that is transforming how hospitals translate care into payment. Founded by physicians in 2020, their platform connects clinical context with revenue intelligence, helping health systems recover millions in missed revenue, improve quality scores, and appeal every denial.

UK

  • Manage and support a team of 6 Data Engineers, helping them focus on impactful technical initiatives as the platform scales.
  • Drive execution excellence, ensuring the team delivers with high velocity, quality, and reliability.
  • Foster a healthy and sustainable team environment by helping the team manage workload and focus on meaningful engineering work.

Bluefish believes that AI represents the next major chapter of the internet and that consumers will increasingly use AI to consume information and media online. Bluefish is building the platform that helps brands engage consumers on this new AI channel, with powerful enterprise tools to manage AI brand safety and engage consumers with thoughtful and personalized AI marketing experiences.

$110,000–$125,000/yr
US

  • Monitor cloud infrastructure and application health using observability tools; respond to alerts.
  • Perform Tier 1 incident triage, document findings, and escalate appropriately to Development or SRE teams.
  • Monitor and support CI/CD pipelines to ensure successful builds and deployments.

Lumin Digital empowers credit unions and banks by creating cutting-edge digital experiences. They are a trailblazer in digital banking solutions with a culture that fosters trust, respect, and boldness, encouraging team members to explore and experiment with new ideas.

Australia

  • Support and implement monitoring and alerting strategy across Kraken’s customer business.
  • Define and uphold observability best practices across multiple products and platforms.
  • Partner with product teams to implement observability tooling and improve reliability across the organisation.

Kraken is a technology company focused on creating a smart, sustainable energy system. Their operating system for energy is transforming the industry around the world in a way that benefits everyone. They are a Great Place to Work with genuinely decent, honest, and empathetic people.

US Unlimited PTO

  • Own the roadmap for workflow debugging and diagnostic tooling.
  • Drive distributed tracing strategy, enabling customers to correlate Temporal workflows with their broader application traces.
  • Collaborate with customers running mission-critical workloads to understand their incident response and debugging workflows.

Temporal provides an open source programming model simplifying code, making applications more reliable, and helping developers deliver features faster. They value curiosity, drive, collaboration, authenticity, and humility, and are looking for people who share these values.

$200,000–$300,000/yr
US

  • Build, mentor, and lead high-performing engineering teams.
  • Champion technical excellence by setting high standards.
  • Foster a culture focused on customer satisfaction.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. The system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

US Europe

  • Build and lead the team responsible for the reliability, security, and scalability of Gensyn’s production infrastructure and developer platform.
  • Own the availability, scalability, and security posture of production systems: SLOs/SLIs, incident response, postmortems, reliability improvements, and hardening.
  • Drive delivery across ambiguous, high-stakes initiatives: roadmap planning, prioritization, and execution against tight timelines.

Gensyn is building a protocol that networks together the core resources required for machine intelligence to flourish alongside human intelligence. They value autonomy, independence, direct feedback and an extreme learning rate, and strive to reject mediocrity and waste.

$120,000–$140,000/yr
US Unlimited PTO

  • Architect and manage scalable cloud infrastructure within AWS.
  • Implement and maintain infrastructure using Terraform.
  • Develop automation scripts to improve operational efficiency.

Attune empowers insurance agents with their technology solutions. We foster a remote-first culture and value employee development.

$182,000–$217,000/yr
US 6w PTO

  • Serve as the primary technical point of contact for a portfolio of Grafana customers.
  • Design the observability maturity journey of customers and assist them on that path.
  • Provide expert-level troubleshooting and guidance to drive adoption.

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.