Source Job

US

  • Utilize Automatic Prompt Generation (APG) tools to create baseline prompts.
  • Run and supervise Automated Prompt Optimization (APO) tool.
  • Manually draft, test, and refine prompts to navigate complex template architectures.

Prompt Engineering LLMs SQL Data Analysis Communication

18 jobs similar to Prompt Engineer

Jobs ranked by similarity.

$36–$36/hr
US

  • Creatively writing prompts and responses to a variety of diverse topics.
  • Leading labeling initiatives with third party firms and internal customers.
  • Creating and updating detailed guidelines and specifications for stakeholders.

Welo Data provides AI services, specifically data annotation. They enable brands and companies to reach, engage, and grow international audiences, delivering multilingual content transformation services in translation, localization, and adaptation.

  • Design, implement, and evaluate machine learning models and AI algorithms.
  • Develop and optimize prompts for LLMs to improve model outputs.
  • Collaborate with software engineers, data scientists, and product teams.

Cadre AI is focused on building and optimizing AI-powered platforms, bringing together cutting-edge technologies and expertise in machine learning and large language models. The team is dedicated to advancing AI capabilities and applying them to real-world challenges through scalable, high-impact solutions.

$27–$27/hr
US

  • Creatively writing prompts and responses to a variety of diverse topics
  • Perform LLM annotation and evaluation tasks (ranking, scoring, labeling, tagging)
  • Evaluate model outputs for accuracy, relevance, and instruction-following

Welo Data is an AI services company that specializes in data annotation. They deliver high-quality training data transformation solutions for NLP-enabled machine learning by blending technology and human intelligence to collect, annotate, and evaluate all content types.

North America Canada

  • Design & Develop AI-Powered Implementation Agents.
  • Lead Prompt Engineering & AI Quality Optimization.
  • Integrate AI Solutions with Professional Services GTM & Delivery.

ServiceNow is a global market leader in AI-enhanced technology with over 8,100 customers. They connect people, systems, and processes to empower organizations to find smarter, faster, and better ways to work.

$58,872–$62,551/yr
Canada

  • Design and implement prompts for data labeling and localization processes within software applications.
  • Analyze model performance using key performance indicators (KPIs) and metrics, ensuring that AI models meet customer acceptance criteria and deliver high-quality outputs.
  • Create guidelines and training materials for prompt usage in data labeling and localization projects.

Innodata is a global data engineering company enabling the responsible advancement of artificial intelligence. They have a 36+ year legacy delivering the highest quality data and outstanding outcomes for their customers.

US

  • Own agent quality end-to-end: diagnosis, improvement, and validation across SmartAssist's orchestrator and subagents
  • Drive quality improvements through prompt engineering, context engineering, and RAG retrieval tuning
  • Extend and mature our evaluation framework: scorers, golden datasets, regression gates, and online evaluation for production traffic

Smartsheet has been helping people and teams achieve for over 20 years. They are building tools that empower teams to automate the manual, uncover insights, and scale smarter.

Global

  • Design and develop an AI-powered productivity analytics platform.
  • Build scalable LLM pipelines and create a meta-workflow system.
  • Develop system-level prompt engineering and build an evaluation framework for AI output quality control.

Appflame is a Ukrainian product-driven tech company committed to building world-class products. They have 500+ team members and offices in Kyiv, London, Limassol, and a co-working hub in Warsaw; they value bold, driven people who are passionate about building real products.

3w PTO

  • Play a defining role in shaping the quality, reliability, and evolution of MEDFAR's generative AI features.
  • Work closely with product designers, developers, QA, and client-facing teams to ensure that every AI-generated output meets the high standards required in a healthcare setting.
  • Mentor and lead a prompt engineering practice as MEDFAR's AI product surface expands.

MEDFAR Clinical Solutions, founded in 2010 by aeronautical engineers, focuses on leveraging technology to improve the healthcare system. They offer a unique healthcare management solution for clinics, replacing inefficient processes with technological alternatives, and are supported by a community of medical experts.

Global

  • Challenge advanced language models on topics like sentence structure and idiomatic expressions.
  • Verify factual accuracy and logical soundness of the AI's responses.
  • Suggest improvements to the model's prompt engineering and evaluation metrics.

Invisible Technologies makes AI work by structuring messy data, automating digital workflows, deploying agentic solutions, and integrating human expertise. They reached $134M in revenue and ranked as the number two fastest growing AI company on the 2024 Inc. 5000.

$120,000–$170,000/yr

  • Design, develop, and maintain LLM-powered applications and autonomous agents
  • Build agentic workflows and orchestration layers using LangGraph or similar frameworks
  • Implement and refine prompt engineering strategies for reliability, safety, and performance

NBCUniversal is a world-leading media and entertainment company that creates and distributes content across film, television, and streaming. We own leading entertainment and news brands and operate industry-leading theme parks around the world and champion an inclusive culture.

$171,000–$196,500/yr
US

  • Design, prototype, and deploy Generative AI solutions across client-facing and internal platforms.
  • Build and optimize applications using large language models (LLMs), vector databases, prompt engineering, and RAG pipelines.
  • Lead development of AI agents for both digital and voice channels, supporting real-time interactions with clients and internal users.

National Debt Relief, founded in 2009, aims to help consumers deal with overwhelming debt. They are a debt settlement organization that has helped over 450,000 people settle over $10 billion of debt, striving to empower them to lead a healthier financial lifestyle.

  • Build and maintain context infrastructure for AI tools.
  • Design and run evaluation frameworks for AI-generated insights.
  • Build and orchestrate AI agent systems for analytics tools.

Airtable is a no-code app platform empowering people to accelerate critical business processes. More than 500,000 organizations rely on Airtable to transform how work gets done, suggesting a large company size and a culture of innovation.

Global

  • Develop and iterate realistic prompts to test the relevance and quality of AI-generated insights.
  • Evaluate divergence between professional advisory judgment and AI outputs.
  • Translate practitioner approaches into problems that push the limits of AI reasoning.

Mentis AI operates at the intersection of institutional investment expertise and frontier AI systems, collaborating with leading AI labs to improve how models reason and make decisions in high-stakes financial contexts. Their team combines asset management experience with machine learning and applied AI research, operating across London and San Francisco.

$175,000–$275,000/yr
US Unlimited PTO

  • Define quality metrics, build evaluation datasets, and design rubrics for LLM-generated technical documentation across different content types and languages.
  • Build benchmarking and experimentation infrastructure, including automated evaluation pipelines and CI-integrated tooling for A/B comparisons and regression detection.
  • Develop automated quality signals at scale, monitor trends, and run experiments to quantify tradeoffs and inform decisions on model selection and pipeline architecture.

Driver builds the context layer for employees and AI agents to use in developing software, turning source code into human language. It is an early-stage, fast-growing startup backed by Y Combinator and Google Ventures, with a culture that values delivery speed, flexibility, and working within a small close-knit team.

Global

  • Develop and iterate realistic prompts to test the relevance and quality of AI-generated insights.
  • Systematically evaluate divergence between professional real estate judgment and AI outputs across asset classes/risk profiles.
  • Translate how REPE professionals evaluate acquisitions into problems that push the limits of AI reasoning.

Mentis AI operates at the intersection of institutional investment expertise and frontier AI systems. Their team combines asset management experience with machine learning and applied AI research, collaborating with leading AI labs to improve how models reason and make decisions in financial contexts.

Global

  • Build a team of AI-native forward-deployed engineers and designers.
  • Define architecture patterns for AI agents and client products.
  • Ensure the team ships excellent work.

GrowthX is building the modern growth engine for marketing teams. Since launching in 2024, they've grown to eight-figure annual revenue, raised a $12M Series A, and partner with 60+ companies.

Europe

  • Design and run post-training experiments on frontier and open-weight LLMs (SFT, preference-based methods, rubric-driven training)
  • Translate raw annotation artifacts (multi-step solutions, evaluations, adversarial prompts) into training-ready datasets.
  • Prototype new reward signals beyond pairwise preferences (rubrics, constraints, structured critics).

Vetto is a global talent platform connecting top-tier professionals to high-impact AI projects around the world. Their mission is to build trust, quality, and long-term value in the AI ecosystem - for both exceptional talents and companies operating at the frontier of technology.

Canada Unlimited PTO

  • Lead the design, development, and deployment of production, multi-turn LLM-powered features.
  • Own backend services in Python that integrate LLM agents with Fullscript’s platform and support reliable production use.
  • Partner with medical, product, and engineering teams to identify high-value opportunities for AI and turn them into practical, scalable product capabilities.

Fullscript is a health technology company committed to helping people get better by creating a platform that powers every part of care. More than 125,000 practitioners use Fullscript for clinical insights, lab interpretations, patient analytics, education, and access to high-quality supplements.