Source Job

Europe 5w PTO

  • Design and deliver features across the script preview and voice orchestration stack, integrating multiple TTS providers and building recommendation systems.
  • Take ownership of features from idea to production, working with loosely defined requirements to scope, prototype, and ship solutions.
  • Build backend systems for TTS provider orchestration and frontend experiences for voice selection and pronunciation control.

Backend Engineering

20 jobs similar to Software Engineer (Speech & Voice Generation)

Jobs ranked by similarity.

Europe 5w PTO

  • Lead the delivery of complex engineering projects in AI-powered video, avatar rendering, and generative media systems.
  • Challenge and support senior engineers working across backend systems, frontend rendering, and AI integrations to achieve technical excellence.
  • Monitor team performance proactively, addressing issues in delivery, system reliability, or quality.

Synthesia is the world’s leading AI video platform for business, used by over 90% of the Fortune 100. Following their recent Series E funding round, where they raised $200 million, their valuation stands at $4 billion, and their culture is passionate about building, not talking, planning or politicising.

India

  • Assess the ElevenLabs environment and identify inefficiencies and scalability issues.
  • Define/implement architecture and operational standards for call agent configuration and reporting.
  • Optimise AI voice workflows and integrations, and improve system reliability and maintainability.

Smart Working connects skilled professionals with global teams for full-time, long-term roles. They value growth and well-being, fostering a remote-first culture with a supportive community.

Europe 5w PTO

  • Drive execution and promote DS partnerships, including baseline audits and CI automation.
  • Ship technical fixes and lead cross-team remediation efforts to embed accessibility.
  • Partner with Design, Product, Legal, and External Audit to integrate accessibility into workflows.

Synthesia is an AI video platform for business used by over 90% of the Fortune 100. Its valuation is $4 billion with over $530 million in funding, and it has offices across Europe and the US.

US

  • Ship product and talk to users, joining customer calls to turn feedback into improvements.
  • Own features end-to-end across the platform: console, APIs, fine-tuning, and billing.
  • Sprint on 0→1 initiatives from concept to shipped product in weeks.

Fireworks builds generative AI infrastructure, delivering high-quality models with the fastest inference. It's a $4B Series C company backed by top investors and staffed by a collaborative team of builders.

Europe South Africa

  • Design, build, and iterate on AI systems across computer vision, video generation, and agentic data analysis.
  • Own the full pipeline from R&D of AI models to deploying scalable web applications.
  • Work directly with the founder using AI-assisted development tools to build prototypes in days.

Foxelli builds D2C e-commerce brands and proprietary AI infrastructure, generating over $20M annually. The team is a fast-moving, remote-first tribe of AI enthusiasts and ambitious builders with a strong culture and focus on meaningful work.

US

  • Directly manage a pod of engineers responsible for Canary's Voice AI product.
  • Set an example as an IC in terms of both the quality and velocity of code that you ship.
  • Own the architecture and its evolution across telephony integrations, real-time AI components and integration points.

Canary Technologies is changing the game for hotels with modern software powered by their hospitality-specific AI platform. They are used by 20,000+ hoteliers in 100+ countries and are backed by top Silicon Valley investors like Y Combinator, F-Prime, Brighton Park Capital and Insight Partners.

Europe 5w PTO

  • Lead complex engineering projects focused on user acquisition and activation.
  • Partner with product and data teams to prioritize high-impact work.
  • Scale the team and fill skill gaps in experimentation and optimization.

Synthesia is the world’s leading AI video platform for business, used by over 90% of the Fortune 100. Founded in 2017, the company is headquartered in London, with offices and teams across Europe and the US. They strive to hire the smartest, kindest and most unrelenting people and let them do their best work without distractions.

US

  • Build scalable community and ambassador programs that grow the creator ecosystem beyond your own reach.
  • Develop lasting relationships with creators, studios, agencies, and community leaders.
  • Represent ElevenLabs at IRL creative meetups, AI events, industry conferences, and community gatherings across the US.

ElevenLabs is an AI research and product company transforming how we interact with technology. We serve millions of users and thousands of businesses, and our investors are some of the world's most prominent, including Andreessen Horowitz and Sequoia.

$125,000–$175,000/yr
Canada

  • Break down larger projects into individual tasks and deliver them in multiple phases.
  • Collaborate with product management, design & analytics by participating in ideation.
  • Contribute to a sense of community on your team by engaging in growth and development activities.

Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. Affirm is a remote-first company.

Global

  • Negotiate billboard inventory in SFO airport.
  • Craft talking points for a keynote speech.
  • Build a report to analyze influencer marketing efficacy.

ElevenLabs is an AI research and product company transforming how we interact with technology. They serve millions of users and thousands of businesses, ranging from fast-growing startups to large enterprises and are backed by investors like Andreessen Horowitz and Sequoia.

US

  • Write behavioral specs, architectural constraints, and feature requirements that agents implement against.
  • Build and maintain harness infrastructure including structural tests, linting rules, and CI gates.
  • Design validation systems where agents write the tests and you verify features work from the user's perspective.

Bolo.ai builds generative AI systems for the energy industry, making daily work faster, safer, and better for heavy industry workers. We have Fortune 500 contracts, production deployments, and growing enterprise demand, and we're scaling with a small, senior-leaning engineering team.

$105,000–$140,000/yr
US

  • Own the full sales cycle from prospecting through close for mid-market and enterprise accounts.
  • Run technically deep discovery calls to understand customer use cases, tech stack, and integration requirements.
  • Build and demo custom voice agent prototypes during the sales process to accelerate buy-in.

Speechify's mission is to make sure that reading is never a barrier to learning. Nearly 200 people around the globe work on Speechify in a 100% distributed setting that includes frontend and backend engineers, AI research scientists, and others.

Global

  • Complete short voice recordings or conversation tasks for AI training purposes.
  • Follow clear project instructions to ensure natural, accurate, and usable speech data.
  • Submit recordings through the designated online workflow or platform.

Wing Data supports AI training projects through short voice-based tasks. They offer flexible, project-based opportunities rather than full-time positions.

  • Design, build, ship, and maintain core capabilities for North's Agents & Automations platform.
  • Build product and platform features for creating, running, debugging, evaluating, and improving agents and automations.
  • Own features end-to-end from design to launch, working across the full stack and collaborating with cross-functional teams.

Cohere is a security-first enterprise AI company that builds cutting-edge foundation models and end-to-end AI products for businesses. It is a global team of researchers, engineers, and designers with offices in Toronto, San Francisco, London, New York, and more, fostering a collaborative and innovative culture.

US Unlimited PTO 18w maternity 12w paternity

  • Build and ship fullstack features across multiple AI-first products, including agentic music production and audio ML experiences.
  • Work directly with PMs and designers to take rough ideas to shipped product quickly, owning the full arc from ticket to production.
  • Use agentic coding tools (Claude Code, Cursor, Codex) as a core part of your daily development workflow and own features end to end.

Splice is a creative platform for people who make music, offering a subscription service with an industry-leading catalog of sounds and samples and an expanding AI stack. The company has a growing global community of chart-topping producers, students, and DIY creators, and embraces a culture of remote work with regular communication and team get-togethers.

US 16w maternity 12w paternity

  • Orchestrate High-Velocity Workflows: Leverage advanced agentic coding tools (e.g., Cursor, multi-agent environments) to dramatically accelerate feature prototyping, code generation, and test coverage.
  • Own the Guardrails & Quality: Act as the ultimate reviewer and architect; define the specifications, establish repo-context guardrails, and review AI-accelerated output for hidden security risks, scale bottlenecks, and architectural alignment.
  • Build Scalable Application and Data Layers: Design, build, and maintain our data pipelines and application to service our hundreds of users.

EvolutionIQ provides technology to improve insurance claims handling. The company is experiencing massive growth and has been named a top workplace, prioritizing its team.

$135,000–$150,000/yr
US

  • Talk to people, then build things, working directly with business and engineering teams to understand what's slowing them down.
  • Own the whole thing by prototyping, hardening, deploying, and monitoring internal tools that need to work reliably.
  • Write code other people can maintain, building clean systems and establishing practical patterns for secure AI usage.

Promenade empowers local businesses with products and services that allow them to thrive online and offline. They build vertically-focused software catered to each industry, leveling the playing field between small businesses and large aggregators, and are backed by industry investors.

US

  • Produce, record, and edit technical product walkthroughs, screencasts, and explainer videos using AI video generation tools.
  • Turn instructional blueprints into high-quality multimedia assets that power our new hire training programs.
  • Rigorously test all visual assets for UX and branding compliance, keeping pace with our rapid product release cycle.

Wiz is reinventing cloud security, helping organizations thrive securely in the cloud. As the fastest-growing startup ever, we have hundreds of customers, including over 50% of the Fortune 100, and a culture that values world-class talent.

India

  • Architect and ship production-grade agentic AI applications including multi-agent orchestration, retrieval systems, and evaluation pipelines.
  • Design and build learner-facing AI experiences and operator tools end-to-end using React and TypeScript.
  • Own production reliability for AI systems including model failover, rate limiting, cost monitoring, and incident response.

Chegg Skills builds applications that help motivated career switchers transition into high-growth roles. The company serves thousands of learners and educators each year through a high-ownership engineering team rethinking modern education.

Global

  • Work in an AI-powered platform to review and refine synthetic speech.
  • Rate dubbed media content for Portuguese and German audiences.
  • Evaluate voice profiles and ensure high-quality dubbing output.

RWS is a technology-enabled language services company specializing in localization and content management. They are a large global organization with a focus on innovation and diversity, offering freelance opportunities.