Source Job

Europe US

  • Architect and implement AI-assisted workflow building capabilities.
  • Design systems that provide LLMs with the right context.
  • Develop evaluation benchmarks and automated testing for AI output quality.

Typescript Node.js API Design AI LLM

20 jobs similar to Staff LLM Interaction Engineer

Jobs ranked by similarity.

$63,500–$88,900/yr
Europe 5w PTO

  • Design and maintain the "Reasoning Layer" for Engines, building workflows that "think" and route tasks based on complex business logic.
  • Partner with Engine Owners to translate manual processes into automation and iterate quickly based on real-world operational feedback.
  • Connect internal platforms with third-party services using APIs, webhooks, and workflow automation tools.

Tem is fixing the energy market by enabling businesses to buy energy directly from renewable generators. They have saved UK businesses and generators over £25 million since launching in 2021 and are backed by top-tier VCs including Atomico and Albion.

US Unlimited PTO

  • Design and implement AI-powered features, integrating LLMs with existing products.
  • Improve AI systems through evaluations, guardrails, monitoring, and customer usage.
  • Collaborate with AI Platform engineers to shape foundational AI systems and tooling.

Vanta helps businesses earn and prove trust by empowering companies to practice better security. They have a kind and talented team of employees determined to make security easier for companies to manage and prove.

$200,000–$300,000/yr
US Canada

  • Build Claude Skills and internal AI tooling.
  • Ship autonomous AI agents.
  • Own the AI development environment.

TLDR is the largest network of tech newsletters in the world, with over 7M subscribers, covering topics from startups to AI. Their 24-person full time team includes alumni of top media brands, and they doubled revenue from 2024 to 2025.

Global

  • Design and implement AI-powered features end to end, including prompts, agents, tools, retrieval, evaluation, and feedback loops.
  • Build agent systems that interact safely with infrastructure, codebases, and deployment pipelines.
  • Integrate LLMs deeply into product workflows as core platform primitives.

SuperPlane is an AI-native DevOps control plane with a mission to build the platform teams use to ship and manage software in the AI era. They are a fast-moving company aiming high, rethinking DevOps from first principles for the AI era to create a single control layer for engineers and agents to collaborate safely.

US

  • Design, build, and deploy AI Agents including custom tools, prompt engineering, orchestration workflows, and agent design patterns.
  • Contribute to the backend infrastructure powering Candidly's AI capabilities, including API development, data integrations, and data pipelines.
  • Work closely with stakeholders across product, design, engineering, and leadership to translate complex AI concepts into actionable strategies and features.

Candidly, founded in 2016, is the category leader with the market’s most comprehensive AI-driven student debt and savings optimization platform. They partner with hundreds of top employers, financial institutions, and retirement record keepers, positioning Candidly to serve more than 35 million Americans. Candidly is a high-growth, Series B startup, funded by leading investors with an international team of 70 (and counting).

Europe Unlimited PTO

  • Designing complex, dynamic prompt templates with conditional logic.
  • Implementing various response schemes to ensure AI outputs are predictable.
  • Building robust evaluation pipelines and using Langfuse to collect feedback.

Ruby Labs is a leading tech company that creates and operates innovative consumer products. We offer a diverse range of opportunities across the health, education, and entertainment industries, and our innovative teams are driving the future of consumer-led products.

5w PTO

  • Build and maintain an internal LLM gateway that handles routing, fallbacks, and rate limiting
  • Create reusable components for common AI patterns (RAG, function calling, streaming responses)
  • Develop SDKs or libraries that simplify AI integration for application developers

ButterflyMX empowers people to open and manage doors & gates from a smartphone and their products are installed in multifamily, commercial, and gated communities. As a distributed workforce, they're looking for intelligent, collaborative, and down-to-earth individuals to join their growing team.

US

  • Design, develop, and maintain a robust platform to enable users to create and manage AI agents and their interactions.
  • Integrate and work with multiple LLMs, ensuring seamless orchestration and scalability for both individual and coordinated agent operations.
  • Leverage orchestration frameworks like LangGraph and others to build complex workflows and pipelines that support diverse agent functionalities, including frameworks for multi-agent coordination.

ClickUp is building the future of work by creating a converged AI workspace that unifies tasks, docs, chat, calendar, and enterprise search. Their AI-powered platform helps teams break free from silos and unlock new levels of productivity.

$61,900–$105,300/yr
US

  • Implement features for AI applications such as conversational assistants and copilots and text generation, summarization, and content classification.
  • Design and optimize prompts and system instructions to improve task completion, reliability, and latency, minimize hallucinations and toxic/unsafe outputs and implement structured outputs.
  • Write unit, integration, and regression tests for AI features, run evaluation scripts and log results for model quality metrics, and work with AI observability tools under guidance.

RealPage is at the forefront of the Generative AI revolution, dedicated to shaping the future of artificial intelligence within the Property Tech domain. Our Agentic AI team is focused on driving innovation by building next generation AI applications and enhancing existing systems with Generative AI capabilities.

Europe

  • Design, optimize, and version prompts for production voice and chat LLM applications.
  • Architect and orchestrate multi-agent systems for complex conversations.
  • Build automated testing and validation frameworks for LLM outputs.

Tuotempo transforms healthcare experiences through intelligent digital solutions and is a trusted patient engagement platform powering some of Europe and Latin America's leading healthcare institutions. They have a remote-first culture with vibrant hubs in Bologna or Barcelona.

Global

  • Build AI-Powered Features.
  • Utilize Existing Frameworks.
  • Enhance System Performance.

Rollstack is revolutionizing how businesses share data and insights by fully automating the creation of slide decks and documents. They are a remote-friendly workplace backed by Insight Partners and Y Combinator, with a diverse team that values intelligence and kindness.

$175,000–$220,000/yr
US

  • Design agentic systems & ship AI to production: Turn prototypes into resilient, observable services with clear SLAs, rollback/fallback strategies, and cost/latency budgets.
  • Build tool‑using LLM “agents” (task planning, function/tool calling, multi‑step workflows, guardrails) for tasks like grant discovery, application drafting, and research assistance.
  • Own RAG end‑to-end: Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re‑ranking, citations, and grounding.

Instrumentl is a hyper-growth YC-backed startup that provides a SaaS platform to help nonprofits discover, track, and manage grants efficiently. They have over 4,000 nonprofit clients and are cash flow positive, doubling year-over-year, with customers who love them.

$125,000–$156,300/yr
US

  • Design, build, and operate LLM-powered systems used in production.
  • Build scalable agentic AI automation solutions, selecting appropriate patterns based on business requirements.
  • Make system-level tradeoffs across model choice, latency, cost, accuracy, and operational complexity.

Natera is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing and diagnostics part of the standard of care. The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions.

$100,000–$150,000/yr
US

  • Partner with title, escrow, and corporate teams to identify high-value workflows.
  • Design, build, test, and deploy Copilot Studio agents, including conversation flows.
  • Implement structured QA processes, including accuracy validation, guardrail behavior testing.

FNF streamlines work across title, escrow, and back-end corporate functions. They are an equal opportunity employer.

$220,000–$235,000/yr
US Canada Unlimited PTO

  • Own the ideation and execution of high-impact projects that directly influence the user experience and business outcomes
  • Flesh out and evolve the Conversational AI Engineering roadmap alongside product and technical leadership
  • Incept projects: identify opportunities, design solutions, and lead implementation

Trellis is rewriting the insurance experience from the inside out. With powerful tools and a customer-first mindset, they're making insurance shopping refreshingly effortless, and they are a profitable, fast-growing Series A startup.

$150,000–$220,000/yr
US Unlimited PTO

  • Incorporating the best research work on agents and code generation into the OpenHands framework
  • Performing novel improvements in areas of interest to improve agent performance and efficiency
  • Running and implementing evaluations to ensure agent quality

OpenHands is building an open-source AI platform that empowers engineering teams to accelerate development, automate workflows, and integrate intelligent coding assistance into real-world software delivery. The company fosters a culture built on kindness, candor, autonomy, and learning.

AI Engineer

Quora
$107,360–$152,900/yr
Global

  • Work with other engineers on a wide variety of AI engineering tasks to improve our existing applied AI systems
  • Identify new opportunities to apply emerging AI capabilities to different parts of the Poe product
  • Take end-to-end ownership of applied AI systems - from prototyping, data pipelines, model optimization/evaluation to reliable deployment at scale

Quora's mission is to grow the world's collective intelligence. They have two platforms: Quora, a global knowledge sharing platform, and Poe, a platform to chat, explore and build with AI language models. They have a culture rooted in transparency, idea-sharing, and experimentation.

Europe

  • Apply bleeding edge AI theory to the design and implementation of large-scale data systems that feed AI agents and autonomous workflows.
  • Use data science techniques to fine-tune, evaluate, and optimize LLMs for marketing-specific tasks.
  • Build end-to-end automations using LLMs, internal data, and external signals to eliminate repetitive human tasks.

Rockerbox is building the next generation of marketing intelligence. They are looking for someone to help them build the AI systems everyone else just theorizes about.

Global

  • Design and implement agentic workflows for requirements clarification, planning, PR generation, and refinement, with human-in-the-loop and safety gates by default.
  • Develop internal tools across IDEs, GitHub, Slack, and web apps to accelerate the SDLC and create consistent “golden paths.”
  • Build and maintain integrations with internal systems (CI, docs, metrics, task trackers) to enable context-rich agent behavior.

Polygon Labs is a software development company building and developing a network of aggregated blockchains via the Agglayer, secured by Ethereum. As public infrastructure, the Agglayer will bring together user bases and liquidity for any connected chain, and leverage Ethereum as a settlement layer.