Source Job

LATAM

  • Design and implement scalable ML infrastructure to support model development and deployment
  • Develop and maintain evaluation frameworks for Large Language Models (LLMs), including RAG-based systems
  • Evaluate model performance using tools such as RAGAS, DeepEval, or similar frameworks

Python Machine Learning LLMs RAG AWS

20 jobs similar to Machine Learning Engineer

Jobs ranked by similarity.

$150,000–$180,000/yr
US

  • Design and implementation of reliable, maintainable, and scalable GenAI systems.
  • Serve as a subject matter expert for machine learning systems owned by the team.
  • Mentor junior and mid level engineers through code reviews and design collaboration.

Trajector specializes in medical evidence services, guiding clients through disability benefits complexities. They are a global team of over 1,800 dedicated individuals, streamlining the path to benefits and ensuring access to rightful compensation for those with disabilities.

US

  • Design and Develop machine learning infrastructure, tooling, and models to help teams deliver world class experiences.
  • Help product and development teams understand the data lifecycle and the inherent experimental nature of machine learning.
  • Build internal products and platforms to enable teams to incorporate AI into their features and customer facing products.

Weave provides an all-in-one platform for small businesses to streamline communications, and patient experiences. The company has a phenomenal culture, and Weave's teams are cross-functional agile teams composed of a product owner, backend and frontend devs and devops.

$130,000–$170,000/yr
US

  • Design AI integration patterns and architecture standards across the SaaS platform
  • Integrate LLM APIs (OpenAI, Anthropic, AWS Bedrock) into production features
  • Establish model evaluation, benchmarking, and observability processes

PerfectServe offers Best in KLAS clinical communication and physician scheduling solutions and is a Leader in the Gartner Magic Quadrant for Clinical Communication and Collaboration. They have seen an 88% growth rate over the past three years.

US

  • Lead Rula’s applied AI investments as they scale.
  • Own technical direction for high-impact AI products and work across teams to turn big ideas into shipped systems.
  • Help raise the bar for how they build, evaluate, and operate AI in production.

Rula strives to create a world where mental health is no longer stigmatized and provides quality, evidence-based care. They are a remote-first company that is dedicated to treating the whole person, not just the symptoms, and making a positive impact in the field of mental healthcare.

$88,911–$117,952/yr
Canada

  • Build AI-Powered Features: Design, develop, and deploy production-grade AI applications from concept to deployment.
  • Architect Scalable Systems: Create robust backend architectures that support AI workloads, ensuring low latency and high reliability.
  • Drive AI Innovation: Implement and optimize agentic AI systems, RAG pipelines, and multi-agent workflows using modern LLM frameworks.

Procurify is the AI-enhanced procurement and AP automation platform for mid-market organizations, making it easy for organizations to take control of spending and save money. It is a remote-first company with a big heart and a strong ambition to modernize how organizations manage business spend.

Latin America

  • Design, build, and deploy AI-powered features and applications rapidly.
  • Integrate LLMs and AI models into existing products, handling the full stack.
  • Build robust APIs and backend services to power AI features.

Clara is the fastest-growing company in Latin America, providing solutions for businesses to manage payments. They have over 20,000 customers and are backed by top investors, fostering a fast-paced, supportive environment across the Americas.

Canada 5w PTO

  • Build, ship, and own product features end-to-end
  • Work with designers and product managers to create high-performing product features.
  • Apply ML techniques to LLM-based approaches with a strong focus on reliability, performance, and maintainability.

Optro is the leading audit, risk, ESG, and InfoSec platform on the market and has surpassed $300M ARR. They inspire each other to innovate and assist each other to create the most loved platform, which has allowed them to become one of the 500 fastest-growing tech companies in North America.

Canada Unlimited PTO

  • Lead the design, development, and deployment of production, multi-turn LLM-powered features.
  • Own backend services in Python that integrate LLM agents with Fullscript’s platform and support reliable production use.
  • Partner with medical, product, and engineering teams to identify high-value opportunities for AI and turn them into practical, scalable product capabilities.

Fullscript is a health technology company committed to helping people get better by creating a platform that powers every part of care. More than 125,000 practitioners use Fullscript for clinical insights, lab interpretations, patient analytics, education, and access to high-quality supplements.

Europe Unlimited PTO

  • Contribute to designing, evaluating, and shipping our mental health AI Agent and its supporting infrastructure.
  • Develop and maintain robust data pipelines to power model training and evaluation.
  • Partner with AI Research, Product, and Engineering teams to define new features.

Sword Health is shifting healthcare from human-first to AI-first through its AI Care platform. They aim to make world-class healthcare available anytime, anywhere, while significantly reducing costs. Backed by clinical studies and patents, Sword Health has raised more than $500 million from leading investors.

Europe

  • Own the architecture and delivery of production-grade LLM systems and classical ML solutions.
  • Design, evaluate, and optimize RAG pipelines (retrieval strategy, chunking, indexing, monitoring).
  • Build scalable, production-grade LLM services and agentic workflows, alongside traditional ML systems where appropriate.

Hiflylabs is a team of 250+ data and tech enthusiasts based in Budapest. They focus on data engineering, data science, artificial intelligence and application development, working on a wide range of projects around the world. Hiflylabs values its people and is committed to nurturing their personal and professional development through a mentoring system.

LATAM

  • Design and implement complex AI agents and multi-agent workflows using frameworks like LangChain or LangGraph.
  • Build and maintain scalable, high-performance backend services using Python to support AI-driven features.
  • Develop and optimize Retrieval-Augmented Generation (RAG) pipelines to provide agents with accurate, context-aware information.

We are a Managed Nearshore Teams provider headquartered in Austin, specializing in building and embedding high-performing software development teams. Our model allows you to work on international challenges, collaborate with diverse teams, and grow your career while being part of a company that values expertise, creativity, and impact.

$35–$50/hr
Global

  • Design and implement LLM-powered application workflows
  • Architect retrieval-augmented generation pipelines
  • Collaborate with backend architects to integrate AI services into APIs

They are seeking a hands-on AI Engineer with deep expertise in Large Language Model integration and production AI systems. The company's culture sounds innovative and collaborative, focusing on building scalable and secure AI applications.

US

  • Own agent quality end-to-end: diagnosis, improvement, and validation across SmartAssist's orchestrator and subagents
  • Drive quality improvements through prompt engineering, context engineering, and RAG retrieval tuning
  • Extend and mature our evaluation framework: scorers, golden datasets, regression gates, and online evaluation for production traffic

Smartsheet has been helping people and teams achieve for over 20 years. They are building tools that empower teams to automate the manual, uncover insights, and scale smarter.

$160,000–$190,000/yr
US Unlimited PTO

  • Design, build, and deploy production AI agents and multi-agent orchestration systems.
  • Architect RAG pipelines with vector search and knowledge base management for AI-driven support.
  • Build production microservices and APIs serving as orchestration layers for AI agent systems.

Greenlight is a family fintech company helping parents raise financially smart kids. They serve over 6 million parents and kids with their banking app, aiming to ensure every child has the opportunity to become financially healthy and happy.

Australia

  • Drive the design and evolution of AI-ready tools and APIs for LLM platforms.
  • Own and evolve evaluation frameworks that measure tool-use accuracy across platforms.
  • Shape Canva's agent architecture, making strategic technical decisions about intelligence location.

Canva is a design platform that enables users to create various visual content. They have offices in multiple locations in Australia and New Zealand, and they offer a flexible work environment.

$190,000–$240,000/yr
US

  • Scope and lead ML initiatives end-to-end from identifying opportunities through production deployment.
  • Design, develop, and optimize ML models and AI systems for document processing and automation.
  • Build and maintain production ML pipelines that are robust, observable, and scalable.

Medallion is a healthcare technology company building a provider operations platform to eliminate administrative bottlenecks. They are one of the fastest-growing healthcare technology companies, with a mission to transform healthcare at scale and are backed by $130M in funding.

Europe

  • Lead Agent Development: Drive the development of Owkin’s Data Transformation Agent (DTA).
  • Orchestrate Data Workflows: Design, implement, and maintain complex data transformation workflows.
  • Ensure Code Excellence: Define and enforce robust engineering practices.

Owkin is an AI company on a mission to solve the complexity of biology. They are building the first Biology Super Intelligence (BASI) by combining powerful biological large language models, multimodal patient data, and agentic software.

$179,000–$199,000/yr
US

  • Set the technical vision and reference architecture for agentic AI across applications.
  • Build and govern reusable platform components to accelerate adoption across teams.
  • Drive cross-functional roadmaps and integration standards across OCIO and business teams.

PointClickCare helps providers deliver exceptional care. They are a leading health tech company that’s founder-led and privately held, empowering their employees to push boundaries, innovate, and shape the future of healthcare.

  • Design, implement, and evaluate machine learning models and AI algorithms.
  • Develop and optimize prompts for LLMs to improve model outputs.
  • Collaborate with software engineers, data scientists, and product teams.

Cadre AI is focused on building and optimizing AI-powered platforms, bringing together cutting-edge technologies and expertise in machine learning and large language models. The team is dedicated to advancing AI capabilities and applying them to real-world challenges through scalable, high-impact solutions.

Europe

  • Design, develop, and deploy intelligent AI Agents using Python-based frameworks.
  • Architect and implement robust Retrieval-Augmented Generation (RAG) pipelines from scratch.
  • Integrate LLMs into public sector and healthcare applications while ensuring high accuracy and reliability.

Deutsche Telekom IT Solutions, a subsidiary of the Deutsche Telekom Group, is Hungary’s most attractive employer in 2025. The company provides a wide portfolio of IT and telecommunications services with more than 5300 employees, serving hundreds of large customers, corporations in Germany and in other European countries.