Job Description
Mercor is looking for a data-driven analyst to conduct comprehensive failure analysis on AI agent performance across finance-sector tasks. The role involves identifying patterns, root causes, and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions, such as task types, file types, and criteria. Key responsibilities include statistical failure analysis to identify patterns in AI agent failures, root cause analysis to determine the sources of failures, dimension analysis to analyze performance variations, reporting & visualization to highlight failure clusters, quality framework enhancements, and stakeholder communication to present insights to data labeling experts and technical teams.
About Mercor
Mercor uses RippleMatch to find top talent.