Source Job

Global

  • Architect and build agentic workflows that combine large language models, reasoning components, and data pipelines to create adaptive, goal-driven conversational systems
  • Lead the design and development of advanced ML/NLP products, from ideation to production - including model training, evaluation, optimization, and deployment
  • Drive experimentation with new approaches for agentic reasoning, coordination, and autonomous system design

NLP LLM Python Machine Learning Deep Learning

20 jobs similar to Senior AI Engineer

Jobs ranked by similarity.

$190,000–$240,000/yr
US

  • Scope and lead ML initiatives end-to-end from identifying opportunities through production deployment.
  • Design, develop, and optimize ML models and AI systems for document processing and automation.
  • Build and maintain production ML pipelines that are robust, observable, and scalable.

Medallion is a healthcare technology company building a provider operations platform to eliminate administrative bottlenecks. They are one of the fastest-growing healthcare technology companies, with a mission to transform healthcare at scale and are backed by $130M in funding.

  • Design, implement, and evaluate machine learning models and AI algorithms.
  • Develop and optimize prompts for LLMs to improve model outputs.
  • Collaborate with software engineers, data scientists, and product teams.

Cadre AI is focused on building and optimizing AI-powered platforms, bringing together cutting-edge technologies and expertise in machine learning and large language models. The team is dedicated to advancing AI capabilities and applying them to real-world challenges through scalable, high-impact solutions.

Europe Unlimited PTO

  • Contribute to designing, evaluating, and shipping our mental health AI Agent and its supporting infrastructure.
  • Develop and maintain robust data pipelines to power model training and evaluation.
  • Partner with AI Research, Product, and Engineering teams to define new features.

Sword Health is shifting healthcare from human-first to AI-first through its AI Care platform. They aim to make world-class healthcare available anytime, anywhere, while significantly reducing costs. Backed by clinical studies and patents, Sword Health has raised more than $500 million from leading investors.

Europe

  • Build and ship AI-powered product and internal solutions using LLMs, RAG, tool calling, workflows, and agentic patterns
  • Design quality and evaluation frameworks for AI systems, including offline evals, online signals, failure analysis, and continuous improvement loops
  • Contribute to AI platform and tooling decisions that improve reuse, speed, and consistency across teams

Finom is a European tech startup headquartered in Amsterdam, revolutionizing financial landscape for entrepreneurs. They develop an all-in-one financial B2B solution integrating banking, accounting, financial management, and invoicing into a mobile-first platform and nurture innovation in an inspiring work environment.

US

  • Develop, test, and deploy LLM-powered extraction pipelines for clinical text at scale.
  • Automate prompt execution, result validation, and error handling to enhance reliability.
  • Monitor and maintain production AI models, ensuring uptime, accuracy, and compliance.

iCIMS is a software company. The job posting mentions thriving in a start-up environment.

Global 6w PTO

  • Conduct experiments with LLMs and evaluate different architectures and techniques to improve conversational AI quality.
  • Develop and maintain robust evaluation frameworks to assess model performance, accuracy, and user satisfaction using offline and online metrics.
  • Optimize models for inference, improving speed, efficiency, and scalability for production environments.

Social Discovery Group (SDG) unites millions of users on dozens of products, solving loneliness, isolation, and disconnection by transforming virtual intimacy into the new normal. Their international team of 1000+ professionals works remotely from various locations, and they've been recognized as a "Great Place to Work".

$90,000–$160,000/yr
US Unlimited PTO

  • Design, develop, and refine large language model workflows to steer and improve model behaviors.
  • Build language processing components for intent detection, summarization and conversational response quality.
  • Drive R&D-style exploration on cutting-edge speech and language systems, rapidly prototyping novel approaches.

Cresta's platform combines AI and human intelligence to help contact centers discover customer insights and behavioral best practices, automate conversations, and empower team members. They are led by founders with experience at Google, Waymo, and Open AI, and are on a mission to revolutionize the workforce with AI.

$179,000–$199,000/yr
US

  • Set the technical vision and reference architecture for agentic AI across applications.
  • Build and govern reusable platform components to accelerate adoption across teams.
  • Drive cross-functional roadmaps and integration standards across OCIO and business teams.

PointClickCare helps providers deliver exceptional care. They are a leading health tech company that’s founder-led and privately held, empowering their employees to push boundaries, innovate, and shape the future of healthcare.

Canada Unlimited PTO

  • Lead the design, development, and deployment of production, multi-turn LLM-powered features.
  • Own backend services in Python that integrate LLM agents with Fullscript’s platform and support reliable production use.
  • Partner with medical, product, and engineering teams to identify high-value opportunities for AI and turn them into practical, scalable product capabilities.

Fullscript is a health technology company committed to helping people get better by creating a platform that powers every part of care. More than 125,000 practitioners use Fullscript for clinical insights, lab interpretations, patient analytics, education, and access to high-quality supplements.

Europe

  • Design and deploy state-of-the-art models to extract structured knowledge.
  • Fine-tune Large Language Models (LLMs) and Small Language Models (SLMs) with domain-specific context.
  • Collaborate with data infrastructure engineers to architect a scalable platform.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

$160,000–$190,000/yr
US Unlimited PTO

  • Design, build, and deploy production AI agents and multi-agent orchestration systems.
  • Architect RAG pipelines with vector search and knowledge base management for AI-driven support.
  • Build production microservices and APIs serving as orchestration layers for AI agent systems.

Greenlight is a family fintech company helping parents raise financially smart kids. They serve over 6 million parents and kids with their banking app, aiming to ensure every child has the opportunity to become financially healthy and happy.

US

  • Own agent quality end-to-end: diagnosis, improvement, and validation across SmartAssist's orchestrator and subagents
  • Drive quality improvements through prompt engineering, context engineering, and RAG retrieval tuning
  • Extend and mature our evaluation framework: scorers, golden datasets, regression gates, and online evaluation for production traffic

Smartsheet has been helping people and teams achieve for over 20 years. They are building tools that empower teams to automate the manual, uncover insights, and scale smarter.

$150,000–$200,000/yr
US Unlimited PTO

  • Design, implement and validate high-reliability, distributed platforms for machine learning, natural language processing, and LLMs.
  • Create, debug, interpret and improve production machine learning and natural language processing models.
  • Build the tools and validation processes that help Counterpart translate insights into action at scale.

Counterpart Health transforms healthcare and improves patient care with its innovative primary care tool, Counterpart Assistant. They are a subsidiary of Clover Health, with an exceptional team of value-based care and technology experts, driving value-based care at the speed of software.

$174,000–$233,000/yr
US 4w PTO

  • Design and implement evaluation systems and tooling to validate Oura’s custom AI models and Advisor
  • Develop novel evaluation methods to measure grounding, reliability, and actionability of LLM and agentic systems
  • Build and optimize custom AI models through fine-tuning, knowledge distillation, and quantization

Oura's mission is to empower every person to own their inner potential. Their award-winning products help their global community gain a deeper knowledge of their readiness, activity, and sleep quality by using their Oura Ring and its connected app. They are focused on helping people live healthier and happier lives, and ensure that their team members have what they need to do their best work — both in and out of the office.

$170,000–$200,000/yr
US Unlimited PTO

  • Contribute to the design and evolution of agentic systems that participate directly in care delivery.
  • Define and build architectural patterns for agent reasoning, tool use, memory, and human-in-the-loop collaboration.
  • Own complex problem spaces end-to-end — from system design and implementation through observability, evaluation, and continuous improvement in production.

Pair Team is building a new kind of healthcare system across Medicaid, Medicare, and public assistance programs. As a public benefit corporation and AI-enabled medical group, they partner with shelters, food pantries, and community organizations to deliver “whole-person” care to the 115 million Americans who rely on the safety net, employing over 500 people while expanding nationally.

$88,911–$117,952/yr
Canada

  • Build AI-Powered Features: Design, develop, and deploy production-grade AI applications from concept to deployment.
  • Architect Scalable Systems: Create robust backend architectures that support AI workloads, ensuring low latency and high reliability.
  • Drive AI Innovation: Implement and optimize agentic AI systems, RAG pipelines, and multi-agent workflows using modern LLM frameworks.

Procurify is the AI-enhanced procurement and AP automation platform for mid-market organizations, making it easy for organizations to take control of spending and save money. It is a remote-first company with a big heart and a strong ambition to modernize how organizations manage business spend.

Australia

  • Developing ranking and recommendation models that identify high-performing team designs.
  • Building brandification pipelines to conform to an organisation's brand guidelines.
  • Building layout extraction and understanding systems that parse Canva's design format.

Canva is a design platform that makes it easy for anyone to create professional-looking designs. They have a flagship campus in Sydney, a second campus in Melbourne, and co-working spaces in Brisbane, Perth, & Adelaide, and provides flexibility in how and where you work.

Europe

  • Own the architecture and delivery of production-grade LLM systems and classical ML solutions.
  • Design, evaluate, and optimize RAG pipelines (retrieval strategy, chunking, indexing, monitoring).
  • Build scalable, production-grade LLM services and agentic workflows, alongside traditional ML systems where appropriate.

Hiflylabs is a team of 250+ data and tech enthusiasts based in Budapest. They focus on data engineering, data science, artificial intelligence and application development, working on a wide range of projects around the world. Hiflylabs values its people and is committed to nurturing their personal and professional development through a mentoring system.

US 3w PTO

  • Design, build, and deploy generative AI applications using Google Gemini, PaLM 2, and other Google-hosted foundation models via Vertex AI.
  • Implement Retrieval-Augmented Generation (RAG) architectures using Vertex AI Search, Vector Search, and document embedding pipelines for enterprise knowledge retrieval.
  • Develop end-to-end ML pipelines from data ingestion and feature engineering through model training, evaluation, and production deployment on Vertex AI Pipelines / Kubeflow.

Roboyo is a category shaper in Agentic Automation that helps leading brands embed autonomous, AI‑powered agents into their workflows, processes, products and services so they can scale faster and operate smarter. They are a global team of builders, consultants and engineers that are top practitioners of taking solutions to the next level for clients in pursuit of excellence.

$115,000–$150,000/yr

  • You will personally own the Intelligence Engine -- Scoring & Activation and the behavioral scoring algorithms that power user-facing activation across B2C and B2B.
  • You will own the entire ML Pipeline Architecture, from data ingestion and feature stores to model training, evaluation, deployment, monitoring, and retraining triggers.
  • You will be responsible for LLM Integration & Optimization, integrating, fine-tuning, and deploying large language models for contextual inference, personalization, and behavioral pattern recognition.

Gesture is a fast-growing tech company using AI, machine learning, and intelligent logistics to power a unique platform that connects people and brands through real-world, tangible experiences. Inside their NYC headquarters, you'll find an environment that moves with the pace and precision of Silicon Valley but with the heart of something far greater.