Data Scientist
Conduct comprehensive failure analysis on AI agent performance across finance-sector tasks to identify patterns, root causes, and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions (task types, file types, criteria, etc.). Key responsibilities include statistical failure analysis, root cause analysis, dimension analysis, reporting & visualization, quality framework, and stakeholder communication.