The Senior Researcher will lead research efforts on generative video and audio models (e.g., text-to-speech, speech-to-speech, audio-to-expression and other speech and multimodal AI topics). They will work with the Applied ML team to help productionize our research and stay relevant with the latest advancements (and help us create the latest advancements!). Requirements include proven experience with flow matching, diffusion models, auto regressive networks in the audio domain and experience training deep learning models. They should have strong foundations in audio modeling and demonstrated ability to innovate rapidly through prototyping. This position is preferably hybrid in San Francisco and Tavus offers relocation, however they are open to remote candidates as well.