Source Job

Global 4w PTO

  • Act as a Player/Coach: Architect the system and mentor the team, but spend significant time hands-on in the codebase (Python/PyTorch).
  • Drive our strategy for SFT (Supervised Fine-Tuning) and RLHF/DPO (Preference Optimization).
  • Build the immune system of the platform. You will design and train custom classifiers to detect and filter non-consensual or illegal content within an explicit environment.

Python PyTorch ML LLM AI

20 jobs similar to Tech Lead, LLM

Jobs ranked by similarity.

Global

  • Architect and build agentic workflows that combine large language models, reasoning components, and data pipelines to create adaptive, goal-driven conversational systems
  • Lead the design and development of advanced ML/NLP products, from ideation to production - including model training, evaluation, optimization, and deployment
  • Drive experimentation with new approaches for agentic reasoning, coordination, and autonomous system design

SmartRecruiters is the Recruiting AI Company that transforms hiring for the world’s leading enterprises. Built for global scale, SmartRecruiters, an SAP company, delivers an AI-powered hiring platform that automates and optimizes the entire talent acquisition process, ensuring faster and smarter hiring decisions. They are a values-driven, globally focused tech company with strong financial backing and a bold vision for the future of work.

$104,000–$166,000/yr
US

  • Provides program-level leadership for AI/ML strategy.
  • Directs architecture and integration of AI/ML capabilities.
  • Establishes enterprise model governance, ethical AI principles, and production standards.

Peraton is a next-generation national security company that drives missions of consequence spanning the globe. As the world’s leading mission capability integrator and transformative enterprise IT provider, they deliver trusted, highly differentiated solutions and technologies to protect our nation and allies.

$190,000–$240,000/yr
US

  • Scope and lead ML initiatives end-to-end from identifying opportunities through production deployment.
  • Design, develop, and optimize ML models and AI systems for document processing and automation.
  • Build and maintain production ML pipelines that are robust, observable, and scalable.

Medallion is a healthcare technology company building a provider operations platform to eliminate administrative bottlenecks. They are one of the fastest-growing healthcare technology companies, with a mission to transform healthcare at scale and are backed by $130M in funding.

  • Design, implement, and evaluate machine learning models and AI algorithms.
  • Develop and optimize prompts for LLMs to improve model outputs.
  • Collaborate with software engineers, data scientists, and product teams.

Cadre AI is focused on building and optimizing AI-powered platforms, bringing together cutting-edge technologies and expertise in machine learning and large language models. The team is dedicated to advancing AI capabilities and applying them to real-world challenges through scalable, high-impact solutions.

Global 6w PTO

  • Conduct experiments with LLMs and evaluate different architectures and techniques to improve conversational AI quality.
  • Develop and maintain robust evaluation frameworks to assess model performance, accuracy, and user satisfaction using offline and online metrics.
  • Optimize models for inference, improving speed, efficiency, and scalability for production environments.

Social Discovery Group (SDG) unites millions of users on dozens of products, solving loneliness, isolation, and disconnection by transforming virtual intimacy into the new normal. Their international team of 1000+ professionals works remotely from various locations, and they've been recognized as a "Great Place to Work".

Australia

  • Research, develop and deploy AI-based solutions to automate content moderation and review of Canva’s content
  • Work with diverse stakeholders to guide the technical vision while being responsible for break-down and delivery of large projects
  • Design, develop and deploy solutions and hands-on software development – working closely with leads, designers, and product managers

Canva is a design platform that empowers users to create and share visual content. They have campuses in Sydney and Melbourne, with co-working spaces in other Australian cities, and aim for a culture of flexibility and empowerment.

$104,000–$166,000/yr
US

  • Responsible for architecting, developing, and integrating AI and machine learning capabilities.
  • Leads design and development of AI/ML-enabled application features.
  • Applies machine learning, natural language processing, or automation techniques.

Peraton is a next-generation national security company that drives missions of consequence spanning the globe. As the world’s leading mission capability integrator, they deliver trusted, highly differentiated solutions and technologies to protect our nation and allies.

Europe

  • Design and run post-training experiments on frontier and open-weight LLMs (SFT, preference-based methods, rubric-driven training)
  • Translate raw annotation artifacts (multi-step solutions, evaluations, adversarial prompts) into training-ready datasets.
  • Prototype new reward signals beyond pairwise preferences (rubrics, constraints, structured critics).

Vetto is a global talent platform connecting top-tier professionals to high-impact AI projects around the world. Their mission is to build trust, quality, and long-term value in the AI ecosystem - for both exceptional talents and companies operating at the frontier of technology.

US

  • Develop, test, and deploy LLM-powered extraction pipelines for clinical text at scale.
  • Automate prompt execution, result validation, and error handling to enhance reliability.
  • Monitor and maintain production AI models, ensuring uptime, accuracy, and compliance.

iCIMS is a software company. The job posting mentions thriving in a start-up environment.

Canada Unlimited PTO

  • Lead the design, development, and deployment of production, multi-turn LLM-powered features.
  • Own backend services in Python that integrate LLM agents with Fullscript’s platform and support reliable production use.
  • Partner with medical, product, and engineering teams to identify high-value opportunities for AI and turn them into practical, scalable product capabilities.

Fullscript is a health technology company committed to helping people get better by creating a platform that powers every part of care. More than 125,000 practitioners use Fullscript for clinical insights, lab interpretations, patient analytics, education, and access to high-quality supplements.

$179,000–$199,000/yr
US

  • Set the technical vision and reference architecture for agentic AI across applications.
  • Build and govern reusable platform components to accelerate adoption across teams.
  • Drive cross-functional roadmaps and integration standards across OCIO and business teams.

PointClickCare helps providers deliver exceptional care. They are a leading health tech company that’s founder-led and privately held, empowering their employees to push boundaries, innovate, and shape the future of healthcare.

$150,000–$200,000/yr
US Europe

  • Own and maintain Runway's content policies, balancing user safety, creative expression, and operational feasibility
  • Translate policies into LLM prompts and continuously iterate to drive accuracy improvements
  • Track shifts in cultural and market norms and new use cases, and continuously evaluate and update policies accordingly

Runway is building AI to simulate the world through merging art and science. They believe that world models are at the frontier of progress in artificial intelligence. Their team consists of creative, open minded, caring and ambitious people who are determined to change the world.

US

  • Own agent quality end-to-end: diagnosis, improvement, and validation across SmartAssist's orchestrator and subagents
  • Drive quality improvements through prompt engineering, context engineering, and RAG retrieval tuning
  • Extend and mature our evaluation framework: scorers, golden datasets, regression gates, and online evaluation for production traffic

Smartsheet has been helping people and teams achieve for over 20 years. They are building tools that empower teams to automate the manual, uncover insights, and scale smarter.

Global

  • Design and develop an AI-powered productivity analytics platform.
  • Build scalable LLM pipelines and create a meta-workflow system.
  • Develop system-level prompt engineering and build an evaluation framework for AI output quality control.

Appflame is a Ukrainian product-driven tech company committed to building world-class products. They have 500+ team members and offices in Kyiv, London, Limassol, and a co-working hub in Warsaw; they value bold, driven people who are passionate about building real products.

Europe

  • Completing AI training tasks such as analyzing, editing, and writing Python
  • Judging the performance of AI in performing Python-related prompts
  • Improving cutting-edge AI models

Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.

$80,000–$400,000/yr
US

  • Shape best practices and mentor team members.
  • Work on varied projects and influence technologies and solutions.
  • Identify and experiment with new approaches, technologies, or tools.

Qvest US is a global leader in technology and business consulting for the Media & Entertainment and Consumer Packaged Goods & Retail industries. They strategize, advise, design, develop, and implement future-forward business & technology solutions. Qvest US is currently 300+ people strong and they’ve been recognized as a “Best Place to Work,” a “Great Place to Work,” “Fastest Growing,” and “A Jewel.”

Europe Unlimited PTO

  • Contribute to designing, evaluating, and shipping our mental health AI Agent and its supporting infrastructure.
  • Develop and maintain robust data pipelines to power model training and evaluation.
  • Partner with AI Research, Product, and Engineering teams to define new features.

Sword Health is shifting healthcare from human-first to AI-first through its AI Care platform. They aim to make world-class healthcare available anytime, anywhere, while significantly reducing costs. Backed by clinical studies and patents, Sword Health has raised more than $500 million from leading investors.

Europe

  • Partner with pharmaceutical and biotech companies to solve high-impact problems by designing, building, and deploying solutions on top of Owkin’s agentic AI platform, Owkin K.
  • Design, implement, and deploy production-grade AI systems on top of Owkin K by writing code, building integrations, designing data pipelines, and turning ambiguous problems into reliable software.
  • Integrate solutions into large, complex, and regulated organizations and work with diverse datasets, existing systems, and multiple stakeholders to ensure that the tools you build are reliable, scalable, and actively used in production.

Owkin is an AI company with a mission to solve the complexity of biology by building the first Biology Super Intelligence (BASI). They combine biological large language models, multimodal patient data, and agentic software, with Owkin K being an AI copilot that researchers, clinicians, and drug developers use.

Australia

  • Design and optimise AI-ready tools and APIs that enable LLM platforms to reliably interact with Canva's design capabilities.
  • Build and maintain evaluation frameworks to systematically measure tool-use accuracy across platforms.
  • Experiment with LLM orchestration and agent architectures, developing Canva agents that any third-party provider can call to design quickly, efficiently, and at scale.

Canva is a platform redefining how the world experiences design. They have a flagship campus in Sydney, with a second campus in Melbourne and co-working spaces in Brisbane, Perth, Adelaide, and Auckland, NZ.

US

  • Lead Rula’s applied AI investments as they scale.
  • Own technical direction for high-impact AI products and work across teams to turn big ideas into shipped systems.
  • Help raise the bar for how they build, evaluate, and operate AI in production.

Rula strives to create a world where mental health is no longer stigmatized and provides quality, evidence-based care. They are a remote-first company that is dedicated to treating the whole person, not just the symptoms, and making a positive impact in the field of mental healthcare.