Principal Performance Engineer

Writer 📝✍️⌨️

Benefits

2w paternity

Job Description

Writer is seeking a highly skilled and motivated Principal performance engineer to lead the performance optimization of our cutting-edge Generative AI technology stack. This role is critical in ensuring the scalability, efficiency, and reliability of our Large Language Models (LLMs) and Retrieval Augmented Generation (RAG) systems. You will be a key driver in identifying and resolving performance bottlenecks, optimizing resource utilization, and ensuring a seamless user experience. Your responsibilities include performance leadership, LLM capacity and tuning, and RAG performance optimization. You would design and implement performance tests which include retrieval, ranking and generation components. You'll also collaborate with infrastructure teams to optimize hardware and software configurations for AI workloads. You would work with AI researchers, software engineers, and product managers to ensure performance requirements are met.

About Writer

Writer is an equal-opportunity employer and is committed to diversity.

Apply for This Position