Source Job

EMEA

  • Design and build intuitive SDKs, libraries, APIs, and tooling.
  • Translate feedback into tooling improvements, error messaging, onboarding flows, and reference examples.
  • Collaborate with Product to influence roadmap decisions with real usage and pain-point insights.

Python

20 jobs similar to ML Developer Experience Engineer

Jobs ranked by similarity.

$145,500–$235,400/yr
US

  • Contribute to development for SDKs in supported platforms.
  • Collaborate with our engineering and product teams to drive the implementation and release of major features.
  • Actively maintain our open-source repositories.

The LaunchDarkly platform helps developers innovate on new features faster while protecting them with a safety valve to instantly rewind when things go wrong.

  • Build AI Developer Tool Features: Implement features for AI-powered developer tools such as code review assistants, test generators, deployment diagnostics, and on-call assistance tools.
  • Implement LLM Integrations: Build integrations with LLM APIs (OpenAI, Anthropic, etc.) such as prompt engineering, response handling, error management, and performance optimization.
  • Contribute to Platform Infrastructure: Help build self-service platform capabilities such as deployment pipelines, observability integration, security controls, and operational tooling that enable teams to rapidly deploy AI developer tools.

At Docker, we make app development easier so developers can focus on what matters.

$315,000–$340,000/yr
US

  • Design and build infrastructure that enables researchers to rapidly iterate on reward signals.
  • Develop systems for automated quality assessment of rewards, including detection of reward hacks and other pathologies.
  • Collaborate with researchers to translate science requirements into platform capabilities.

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems to be safe and beneficial for users and society.

  • Design and implement interfaces across the platform for compute orchestration and RL training.
  • Translate complex backend systems into intuitive, production-ready product experiences.
  • Build for technical audiences, including AI and general software engineers.

Prime Intellect makes frontier AI accessible to everyone and enables individuals/organizations to train models using their agentic training infrastructure.

5w PTO

  • Build and maintain an internal LLM gateway that handles routing, fallbacks, and rate limiting
  • Create reusable components for common AI patterns (RAG, function calling, streaming responses)
  • Develop SDKs or libraries that simplify AI integration for application developers

ButterflyMX empowers people to open and manage doors & gates from a smartphone and their products are installed in multifamily, commercial, and gated communities. As a distributed workforce, they're looking for intelligent, collaborative, and down-to-earth individuals to join their growing team.

  • Build AI agents and tools that transform how developers write code and debug issues.
  • Architect and implement AI-powered tools such as code review assistants and automated test generators.
  • Collaborate with the Principal Engineer and product/design teams in a remote-first environment.

Docker makes app development easier so developers can focus on what matters.

Europe

  • Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
  • Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.
  • Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.

At JetBrains, code is their passion and they strive to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in their IDEs.

$145,831–$218,747/yr
Canada

  • Build, maintain and improve Torc ML frameworks.
  • Use Terraform, AWS Managed Services, EKS, Ray.
  • Focus on data ops, ML development pipeline, logging & aggregation.

Torc has been a leader in autonomous driving since 2007. Now a part of the Daimler family, they are focused solely on developing software for automated trucks to transform how the world moves freight. Their culture is collaborative, energetic, and team focused.

North America

  • Optimize how the team produces code and collaborates to build WorkOS.
  • Build trust and credibility across the engineering team, while identifying pain points and recommendations to improve how software is built internally.
  • Serve as a bridge between infrastructure, product, and leadership to ensure tools and systems are maturing alongside the product.

WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness.

  • Design, build, and optimize high-performance systems in Python supporting AI data pipelines and evaluation workflows
  • Develop full-stack tooling and backend services for large-scale data annotation , validation, and quality control
  • Improve reliability, performance, and safety across existing Python codebases

Alignerr connects top technical experts with leading AI labs to build, evaluate, and improve next-generation models. They work on real production systems and high-impact research workflows across data, tooling, and infrastructure.

  • Design, build, and optimize high-performance systems in Python supporting AI data pipelines and evaluation workflows.
  • Develop full-stack tooling and backend services for large-scale data annotation, validation, and quality control.
  • Improve reliability, performance, and safety across existing Python codebases.

Alignerr connects top technical experts with leading AI labs to build, evaluate, and improve next-generation models. They work on real production systems and high-impact research workflows across data, tooling, and infrastructure.

US

  • Draft detailed natural-language plans and code implementations for machine learning tasks.
  • Convert novel machine learning problems into agent-executable tasks for reinforcement learning environments.
  • Identify failure modes and apply golden patches to LLM-generated trajectories for machine learning tasks.

At Mercor, we’re building the talent engine that helps leading labs and research orgs move AI forward.

$104,000–$156,000/hr

  • Design, build, and optimize high-performance systems in Python supporting AI data pipelines and evaluation workflows
  • Develop full-stack tooling and backend services for large-scale data annotation , validation, and quality control
  • Improve reliability, performance, and safety across existing Python codebases

Alignerr connects top technical experts with leading AI labs to build, evaluate, and improve next-generation models. They work on real production systems and high-impact research workflows across data, tooling, and infrastructure.

Global Unlimited PTO

  • Define the technical vision and architecture for AI-powered developer tools.
  • Design and build the foundational platform that empowers product and platform teams.
  • Partner with product and engineering leadership to evaluate productization opportunities.

Docker makes app development easier, allowing developers to focus on their work. They are a remote-first team and the #1 tool for building, sharing, and running apps, trusted by startups and Fortune 100s alike.

US

Draft detailed natural-language plans and code implementations for machine learning tasks. Convert novel machine learning problems into agent-executable tasks for reinforcement learning environments. Identify failure modes and apply golden patches to LLM-generated trajectories for machine learning tasks.

Mercor is building the talent engine that helps leading labs and research orgs move AI forward.

Europe Unlimited PTO

  • Contribute to designing, building, evaluating, shipping, and improving Sword’s products by hands-on AI/ML development.
  • Develop and maintain data processing pipelines for getting the data required to build and evaluate models.
  • Work alongside the Product, Data & Engineering Teams to define and implement AI/ML-powered features for internal and external users.

Sword Health is shifting healthcare from human-first to AI-first through its AI Care platform, making world-class healthcare available anytime, anywhere.

$126,279–$184,068/yr
US Canada UK Unlimited PTO

  • Successfully lead high-stakes consulting engagements from discovery to production.
  • Build high-quality, maintainable software collaboratively, incrementally, and through an approach tailored towards the unique needs of the clients.
  • Write production-quality code in Python, Java, Javascript, and React, while leveraging AI-assistive development tools.

8th Light is a technology solutions provider that partners with organizations to solve meaningful challenges and drive sustainable growth. Founded in 2006 and headquartered in Chicago, they foster continuous education, fueling growth through mentorship and hands-on work in an open, collaborative culture grounded in honesty that builds trust.

Build resilient AI Agents using LangGraph and microservices. Develop complex automation workflows in n8n. Collaborate with Internal Business Analysts to focus on coding, not guessing requirements.

At Gcore, you’ll help design and deliver that foundation for an AI-driven world, being a global provider of infrastructure and software solutions for AI, cloud, network, and security.

US

  • Fix bugs without writing code or waiting for engineering.
  • Build AI agents that handle entire support scenarios end-to-end.
  • Consult with users on achieving real outcomes, building infrastructure that scales, and collaborating with product and engineering.

We're building an AI‑native workspace—an operating system for work that puts co‑intelligence at the center.

US Europe Israel

  • Own product strategy for developer platform—spanning PR integrations, IDE plugins, CLI tools, SDKs, APIs, and workflow automation.
  • Define product roadmap based on how developers work today and where security friction exists in their daily flow.
  • Drive key metrics around developer adoption, PR merge rates, integration usage, and developer NPS.

Aisle is redefining how enterprises secure their software with an AI agent for autonomous vulnerability remediation.