Source Job

Europe

  • Design, optimize, and version prompts for production voice and chat LLM applications.
  • Architect and orchestrate multi-agent systems for complex conversations.
  • Build automated testing and validation frameworks for LLM outputs.

LLM Prompt Engineering Python RAG

20 jobs similar to AI/LLM Engineer

Jobs ranked by similarity.

North America Canada

  • Lead domain-specific model optimization using PEFT (LoRA/QLoRA) and knowledge distillation to balance cost, latency, and reasoning capability.
  • Build next-gen Retrieval-Augmented Generation pipelines using hybrid search, cross-encoders, and self-correcting retrieval loops.
  • Design and deploy multi-agent systems using frameworks like LangGraph or CrewAI, enabling autonomous task planning and tool-use (Function Calling).

ServiceNow is a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Their intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work.

$175,000–$220,000/yr
US

  • Design agentic systems & ship AI to production: Turn prototypes into resilient, observable services with clear SLAs, rollback/fallback strategies, and cost/latency budgets.
  • Build tool‑using LLM “agents” (task planning, function/tool calling, multi‑step workflows, guardrails) for tasks like grant discovery, application drafting, and research assistance.
  • Own RAG end‑to-end: Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re‑ranking, citations, and grounding.

Instrumentl is a hyper-growth YC-backed startup that provides a SaaS platform to help nonprofits discover, track, and manage grants efficiently. They have over 4,000 nonprofit clients and are cash flow positive, doubling year-over-year, with customers who love them.

$230,000–$300,000/yr
US

  • Design, develop, and deploy agentic AI solutions for clients.
  • Build multi-agent systems and integrate models with enterprise systems.
  • Collaborate with clients and engineers to create scalable solutions.

AHEAD builds platforms for digital business, weaving together advances in cloud infrastructure, automation, analytics, and software delivery to help enterprises deliver on digital transformation. They prioritize creating a culture of belonging where all perspectives are valued and heard.

$80,000–$150,000/yr

  • Research, Document, Test, and Ideate: Explore the best ways to achieve our customers’ goals using LLMs and other AI tools.
  • Master Our Dialogue Platform: Become an expert, answer questions, and train others on prompting both within and outside of our platform.
  • Train Our AIs: Utilize prompting, knowledge-base creation, and fine-tuning to enhance our AI capabilities.

1mind is a platform that deploys multimodal Superhumans for revenue teams, combining a face, a voice, and a GTM brain. The company has a remote-first, fast-moving culture with ownership, autonomy, and impact from day one.

US

  • Designing, developing, and deploying generative AI models.
  • Architecting and building agentic systems with autonomous decision-making capabilities.
  • Integrating generative AI and agentic solutions into existing products and services.

Jobgether is a partner company that focuses on connecting talent with the right job opportunities. Their AI-powered matching process ensures applications are reviewed quickly, objectively, and fairly against the role's core requirements.

North America

  • Design, refine, and evaluate prompts, context, and system instructions for various product use cases
  • Conduct experiments to assess model behavior, accuracy, and cost impact with new or existing prompts
  • Continuously improve prompt engineering processes by adopting new techniques and technologies

Applied Systems transforms the insurance industry. They have 40+ years of experience and are building a team ready to learn and deliver innovative software and services.

5w PTO

  • Build and maintain an internal LLM gateway that handles routing, fallbacks, and rate limiting
  • Create reusable components for common AI patterns (RAG, function calling, streaming responses)
  • Develop SDKs or libraries that simplify AI integration for application developers

ButterflyMX empowers people to open and manage doors & gates from a smartphone and their products are installed in multifamily, commercial, and gated communities. As a distributed workforce, they're looking for intelligent, collaborative, and down-to-earth individuals to join their growing team.

  • Drive Prompt’s mission to improve healthcare through modern technology including AI
  • Develop and deploy real-time AI systems involving speech and natural language understanding (e.g., transcription, summarization, classification)
  • Partner with Product and Engineering to identify high-leverage AI opportunities and deliver measurable outcomes

Prompt delivers highly automated and modern B2B enterprise software to rehab therapy businesses, their teams, and most importantly the patients they serve. They’ve established themselves as the go-to platform in the space and are rapidly growing their market share by delivering software people love.

Canada Unlimited PTO

  • Development and deployment of LLM-powered features, including summarization tools.
  • Build backend services in Python that integrate ML/LLM models with Fullscript’s platform.
  • Collaborate with medical and product teams to deliver AI features for practitioners and patients.

Fullscript is a health technology company committed to helping people get better by connecting practitioners to products and patients to care plans. They empower over 125,000 practitioners and 10 million patients through their comprehensive platform.

US

  • Build and maintain gen AI prompts aligned with ad formats and community dynamics.
  • Improve the quality and brand safety of model outputs across text, images, and video.
  • Partner with Product and Engineering to prioritize improvements and accelerate feature development.

Reddit is a community built on shared interests and trust, home to open conversations and one of the internet’s largest sources of information.

Global

  • Integrate AI platform APIs across our product line - both internally and externally
  • Develop and refine LLM prompt chains and agents to meet the customer’s needs and expectations
  • Create eval systems to fine tune our agents to better suit the needs of our customers

Hone is revolutionizing the way companies develop and support their managers and teams with its AI-powered people development platform. They are funded by leading VCs and have raised over $50M to support their mission, with a remote-first and fully-distributed organization.

US

  • Design, develop, and maintain a robust platform to enable users to create and manage AI agents and their interactions.
  • Integrate and work with multiple LLMs, ensuring seamless orchestration and scalability for both individual and coordinated agent operations.
  • Leverage orchestration frameworks like LangGraph and others to build complex workflows and pipelines that support diverse agent functionalities, including frameworks for multi-agent coordination.

ClickUp is building the future of work by creating a converged AI workspace that unifies tasks, docs, chat, calendar, and enterprise search. Their AI-powered platform helps teams break free from silos and unlock new levels of productivity.

Europe

  • Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
  • Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.
  • Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.

At JetBrains, code is their passion and they strive to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in their IDEs.

  • Drive Prompt’s mission to improve healthcare through modern technology including AI
  • Lead AI projects from ideation → architecture → production → iteration until tools are widely adopted and loved!
  • Design, build, and deploy end-to-end AI systems across both traditional ML and LLM-based workflows

Prompt delivers highly automated and modern B2B enterprise software to rehab therapy businesses, their teams, and most importantly the patients they serve. They’ve established themselves as the go-to platform in the space and are rapidly growing their market share by delivering software people love.

Europe

  • Apply bleeding edge AI theory to the design and implementation of large-scale data systems that feed AI agents and autonomous workflows.
  • Use data science techniques to fine-tune, evaluate, and optimize LLMs for marketing-specific tasks.
  • Build end-to-end automations using LLMs, internal data, and external signals to eliminate repetitive human tasks.

Rockerbox is building the next generation of marketing intelligence. They are looking for someone to help them build the AI systems everyone else just theorizes about.

US North America

  • Design complex LLM prompts that accurately represent real customer journeys and service interactions.
  • Partner with Field Engineers to transform raw data into structured, high-quality tasks for model training.
  • Annotate and review tasks to ensure strict quality standards and alignment with expected customer outcomes.

Welo Data works with technology companies to provide datasets that are high-quality, ethically sourced, relevant, diverse, and scalable to supercharge their AI models.

$150,000–$220,000/yr
US Unlimited PTO

  • Incorporating the best research work on agents and code generation into the OpenHands framework
  • Performing novel improvements in areas of interest to improve agent performance and efficiency
  • Running and implementing evaluations to ensure agent quality

OpenHands is building an open-source AI platform that empowers engineering teams to accelerate development, automate workflows, and integrate intelligent coding assistance into real-world software delivery. The company fosters a culture built on kindness, candor, autonomy, and learning.

Design and implement agentic architecture, defining context management, data flow, and action orchestration. Build AI variables capable of autonomous action loops to enrich leads and trigger actions. Deliver Copilot v1, initially semi-agentic, with potential for autonomous workflows, while implementing monitoring of all output.

lemlist is a global B2B SaaS business with $43M ARR, fully bootstrapped, profitable, and growing fast, shipping one of the most loved Sales Engagement Platforms worldwide.

North America 4w PTO

  • Establish the technical vision for our AI product infrastructure.
  • Develop frameworks that make LLM integration seamless and reliable; building APIs and SDKs that allow LLMs to interface with Wealthsimple data and functionality.
  • Build new AI-powered product capabilities from 0 → 1; collaborating directly with product teams to bring AI features to life.

Wealthsimple is on a mission to help everyone achieve financial freedom by reimagining what it means to manage your money.