Job Description
Set architecture & strategy for multi‑tenant, AI‑powered communications services (voice, SMS, email). Design and build RAG pipelines, vector‑store–backed retrieval layers, and fine‑tuning workflows that deliver low‑latency, context‑aware experiences. Lead end‑to‑end platform initiatives including data ingestion, event processing, model hosting, continuous evaluation, and cost/latency optimization. Champion best practices for LLM safety, prompt management, experiment tracking, observability, and auto‑scaling. Influence roadmaps across Product, Design, and multiple engineering squads. Mentor & level‑up senior and staff engineers and foster a culture of ownership, experimentation, and inclusive collaboration.
About Weave
Weave is an equal opportunity employer that is committed to fostering an inclusive workplace where all individuals are valued and supported.