Overview:
- This role is responsible for all aspects of data collection to support our model training operations.
- We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work.
What You’ll Do:
- Collaborate closely with our Scientists to shift the cost/throughput/quality frontier.
- Craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products.
An Ideal Candidate Should Have:
- BS/MS/PhD in Computer Science or a related field.
- 5+ years of industry experience in software development.
- Proficiency with bash/Python scripting in Linux environments
Speechify
Speechify's mission is to make sure that reading is never a barrier to learning by offering text-to-speech products. They are a fully distributed company with nearly 200 employees around the globe, including engineers and scientists from top companies and programs.