Job Description

The Senior Researcher will lead research efforts on generative video and audio models (e.g., text-to-speech, speech-to-speech, audio-to-expression and other speech and multimodal AI topics). They will work with the Applied ML team to help productionize our research and stay relevant with the latest advancements (and help us create the latest advancements!). Requirements include proven experience with flow matching, diffusion models, auto regressive networks in the audio domain and experience training deep learning models. They should have strong foundations in audio modeling and demonstrated ability to innovate rapidly through prototyping. This position is preferably hybrid in San Francisco and Tavus offers relocation, however they are open to remote candidates as well.

About Tavus

Tavus is building the human layer of AI to make human-AI interaction as natural as face-to-face interaction, enabling the human touch where it has been previously unscalable.

Apply for This Position

Benefits

Job Description

About Tavus