Source Job

US North America

  • Design complex LLM prompts that accurately represent real customer journeys and service interactions.
  • Partner with Field Engineers to transform raw data into structured, high-quality tasks for model training.
  • Annotate and review tasks to ensure strict quality standards and alignment with expected customer outcomes.

SQL Python LLM AI/ML Prompt Engineering

20 jobs similar to Data Engineer (LLM Data & Prompt Engineering)

Jobs ranked by similarity.

$80,000–$150,000/yr

  • Research, Document, Test, and Ideate: Explore the best ways to achieve our customers’ goals using LLMs and other AI tools.
  • Master Our Dialogue Platform: Become an expert, answer questions, and train others on prompting both within and outside of our platform.
  • Train Our AIs: Utilize prompting, knowledge-base creation, and fine-tuning to enhance our AI capabilities.

1mind is a platform that deploys multimodal Superhumans for revenue teams, combining a face, a voice, and a GTM brain. The company has a remote-first, fast-moving culture with ownership, autonomy, and impact from day one.

Europe

  • Apply bleeding-edge AI theory to the design and implementation of large-scale data systems that feed AI agents and autonomous workflows.
  • Use data science techniques to fine-tune, evaluate, and optimize LLMs for marketing-specific tasks.
  • Build end-to-end automations using LLMs, internal data, and external signals to eliminate repetitive human tasks.

Rockerbox is building the next generation of marketing intelligence. They are looking for someone to help them build the AI systems everyone else just theorizes about.

$85,000–$225,000/yr
US Canada

This role validates Veeva AI Agents through evaluation. You will define strategies for new AI Agents. The role involves analysis of model behaviors to identify defects.

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster.

$259,300–$305,000/yr
US Unlimited PTO

  • Own end-to-end implementation of AI-powered product features, from prototypes to production.
  • Mentor other engineers on the team, leveling up the team as a whole.
  • Collaborate across the organization to support shipping these features to production.

Honeycomb is a service for the near and present future, defining observability and raising expectations of what developer tools can do!

$160,000–$190,000/yr

  • Design, implement, and deploy AI-powered features, including model training, fine-tuning, and prompt engineering workflows.
  • Translate product requirements into robust, production-ready AI solutions, working with Product Managers, Software Engineers, and Data Scientists.
  • Optimize models and infrastructure for scalability, latency, and cost efficiency, partnering with DevOps and MLOps to ensure reliable and maintainable AI pipelines.

Paper is reimagining how schools support students so that every learner can reach their full potential.

US

  • Engage with leading LLM labs to advance LLMs across STEM domains
  • Define and understand the data quality rubric
  • Ship proactive data packs

Turing, based in San Francisco, is a research accelerator for frontier AI labs and a partner for global enterprises deploying advanced AI systems. The leadership team includes AI technologists from Meta, Google, Microsoft, Apple, Amazon, McKinsey, Bain, Stanford, Caltech, and MIT and is recognized by Forbes, The Information, and Fast Company among the world’s top innovators.

US Europe

  • Design, develop, and deploy AI-driven applications to make our software more accessible.
  • Own the software from requirements development through deployment and maintenance.
  • Design, build, test, and deploy a scalable system architecture.

Epistemix empowers organizations to make smarter decisions by simulating real-world outcomes using synthetic populations.

  • Drive the development of intelligent, user-facing features using AI and LLMs.
  • Collaborate with product, design, and engineering teams to deliver AI-powered experiences.
  • Integrate multiple LLMs and AI models to deliver context-aware and personalized user experiences.

ClickUp is not just building software; it is architecting the future of work with a converged AI workspace unifying tasks, docs, chat, calendar, and enterprise search.

North America Canada

  • Lead domain-specific model optimization using PEFT (LoRA/QLoRA) and knowledge distillation to balance cost, latency, and reasoning capability.
  • Build next-gen Retrieval-Augmented Generation pipelines using hybrid search, cross-encoders, and self-correcting retrieval loops.
  • Design and deploy multi-agent systems using frameworks like LangGraph or CrewAI, enabling autonomous task planning and tool-use (Function Calling).

ServiceNow is a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Their intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work.

  • Design, build, and optimize high-performance systems in Python supporting AI data pipelines and evaluation workflows.
  • Develop full-stack tooling and backend services for large-scale data annotation, validation, and quality control.
  • Improve reliability, performance, and safety across existing Python codebases.

Alignerr connects top technical experts with leading AI labs to build, evaluate, and improve next-generation models. They work on real production systems and high-impact research workflows across data, tooling, and infrastructure.

Mexico

  • Set client QA strategies and adapt to scope/volume changes.
  • Run root-cause analyses; drive CAPA plans with owners, timelines, and effectiveness checks.
  • Plan training & certification for raters/annotators and coordinators; track completion and impact.

Welo Data provides high-quality, ethically sourced, relevant, diverse, and scalable datasets to technology companies to supercharge their AI models. As a Welocalize brand, Welo Data leverages over 25 years of experience and brings together a curated global community of over 500,000 AI training and domain experts.

  • Design, develop, and maintain a robust platform to enable users to create and manage AI agents.
  • Integrate and work with multiple LLMs, ensuring seamless orchestration and scalability.
  • Develop and implement evaluation frameworks for testing AI agents in challenging and complex scenarios.

ClickUp is building the first truly converged AI workspace, unifying tasks, docs, chat, calendar, and enterprise search, all supercharged by context-driven AI.

$175,000–$200,000/yr
US

Lead AI and ML initiatives to design and implement production-grade machine learning systems and pipelines. Develop scalable infrastructure for model training, evaluation, and deployment, ensuring reliability and observability. Collaborate with cross-functional teams to drive innovation and efficiency.

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

$230,000–$300,000/yr
US

This role involves hands-on design, development, and deployment of enterprise-grade agentic AI solutions. You will work on multi-agent systems, workflow automation, and AI integration across business processes. The position offers exposure to cutting-edge AI technologies and collaboration with cross-functional teams.

This position is posted by Jobgether on behalf of a partner company.

US

  • Transform raw customer data into structured, high-fidelity datasets.
  • Create data pipelines, labeling workflows, and reference models.
  • Partner with engineering and research teams to understand model data requirements.

Foundation EGI is an MIT-born, venture-backed Silicon Valley startup building Engineering General Intelligence (EGI)—an AI Copilot for design and manufacturing.

US

  • Build and maintain gen AI prompts aligned with ad formats and community dynamics.
  • Improve the quality and brand safety of model outputs across text, images, and video.
  • Partner with Product and Engineering to prioritize improvements and accelerate feature development.

Reddit is a community built on shared interests and trust, home to open conversations and one of the internet’s largest sources of information.

$120,000–$150,000/yr

  • Architect, build, and deploy LLM-powered applications that augment and automate key workflows.
  • Design autonomous AI systems that can execute technical analysis, testing, troubleshooting, and decision-making at scale.
  • Develop AI-driven tools that create measurable business impact — improving efficiency, accelerating innovation, and driving revenue growth.

Sierra Studio connects talented Brazilian professionals with exciting career opportunities in a highly vetted small community of growing companies in the US. They specialize in enabling merchants, consumers, and partners to operate with flexibility, intelligence, and trust with over 250 people.

$180,000–$230,000/yr
US

The AI Engineer develops and deploys agentic AI solutions for clients, implements components for document processing, workflow automation, data retrieval, and structured output generation, and contributes to monitoring, logging, metrics, and guardrail configurations for agentic systems.

AHEAD builds platforms for digital business by weaving together advances in cloud infrastructure, automation and analytics, and software delivery.

Canada Unlimited PTO

  • Develop and deploy LLM-powered features, including summarization tools.
  • Build backend services in Python that integrate ML/LLM models with Fullscript’s platform.
  • Collaborate with medical and product teams to deliver AI features for practitioners and patients.

Fullscript is a health technology company committed to helping people get better by connecting practitioners to products and patients to care plans. They empower over 125,000 practitioners and 10 million patients through their comprehensive platform.