Job Description
Lead the productization of ML/LLM-powered decision support systems with strict quality, safety, and latency benchmarks, building and scaling engineering teams with a strong observability and reliability culture. Design and launch inference pipelines with fallbacks, caching, A/B evaluation, and rollback capabilities. Collaborate with clinicians, compliance experts, and cross-functional leaders to ensure interpretability, auditability, and clinical trust in deployed systems. Drive innovation in agentic workflows that balance automation with safety. Solve scalability challenges in a rapidly growing company.
About Sully.ai
Sully.ai's model outperforms Claude, Gemini, and GPT-4.5 on clinical benchmarks. The company has signed 400+ healthcare organizations in 16 months and has raised $25M.