Produce verbatim transcripts from single-speaker and multi-speaker audio recordings with a target accuracy rate of 98% or higher.
Timestamp transcript segments with precise alignment to audio, maintaining offsets of less than 500 milliseconds.
Identify and label speakers in conversations involving two to three participants.
They are a leading provider of high-quality, diverse datasets for AI model development. The company's size and culture are not specified in the posting.
Transcribe audio recordings accurately into text, capturing speech including pauses, filler words, and speaker labels.
Add timestamps and mark non-speech events such as laughter or background noise.
Review and refine transcripts to ensure high-quality data for AI speech recognition models.
Appen provides high-quality training data for AI and machine learning systems. It operates a global platform with a large community of independent contractors, offering flexible remote work opportunities.
Review audio files and transcribe spoken content with high accuracy, including pauses, filler words, and speaker labels.
Add timestamps and mark non-speech events such as laughter or background noise.
Ensure transcripts meet quality standards for training advanced language models.
Appen, through its CrowdGen platform, provides data for AI and machine learning projects. It operates as a large community of independent contractors working on project-based tasks globally.
Transcribe 5-second audio clips of English dialects accurately according to style guide.
Flag audio issues such as background noise or cut-off speech using simple codes.
Work flexible hours with an average completion time of ~125 seconds per task.
Welo Data, part of Welocalize, is a global AI data company that delivers high-quality, ethical data to train the world’s most advanced AI systems. With over 500,000 contributors in 100+ countries, the company values diversity and community.