Latest Remote Evaluation Frameworks Software engineering Jobs (1+)

Staff Software Engineer, Agentic Applications - Grafana Ops, AI/ML

Grafana Labs 📊🧪🏢 25 days ago

$112,253–$134,708

USD/year

We are looking for an experienced engineer with expertise in evaluating Generative AI systems, particularly Large Language Models (LLMs), to help us build and evolve our internal evaluation frameworks, and/or integrate existing best-of-breed tools. This role involves designing and scaling automated evaluation pipelines, integrating them into CI/CD workflows, and defining metrics that reflect both product goals and model behavior.

Remote Software engineering Jobs • Evaluation Frameworks

Job listings

Staff Software Engineer, Agentic Applications - Grafana Ops, AI/ML