As a Senior Data Engineer, you will be at the forefront of Tavus's data initiatives, playing a pivotal role in shaping the future of human-AI interaction. You will anticipate the data needs and curate diverse, high-quality datasets to ensure AI models reach their full potential. Your work will directly impact AI model performance, efficiency, and inference accuracy, collaborating closely with ML engineers to optimize datasets for maximum AI effectiveness.
You will own, build and scale the data pipeline, being highly involved in data sourcing, and expanding and owning the curation, filtering and preprocessing pipelines across a variety of data modalities. You'll find, collect, and curate the best multimodal data (text, video, images) to power our models. You will also manage large-scale data procurement to ensure our models train on the highest quality information. Optimize the data labeling process and build automated workflows to make cleaning, labeling, and structuring data as efficient as possible.