Source Job

Global Unlimited PTO

  • Architect and optimize how MagicSchool's AI agents reason, remember, and operate within complex educational workflows.
  • Design context management systems that determine what information our agents see and how they maintain state across multi-turn interactions.
  • Implement the technical foundation of how AI agents manage their "mental workspace" and ensure agentic capabilities remain accurate and focused.

Python TypeScript Node.js LLM APIs

20 jobs similar to Senior Context Engineer for AI Systems

Jobs ranked by similarity.

Europe

  • Design, optimize, and version prompts for production voice and chat LLM applications.
  • Architect and orchestrate multi-agent systems for complex conversations.
  • Build automated testing and validation frameworks for LLM outputs.

Tuotempo transforms healthcare experiences through intelligent digital solutions and is a trusted patient engagement platform powering some of Europe and Latin America's leading healthcare institutions. They have a remote-first culture with vibrant hubs in Bologna or Barcelona.

$175,000–$195,000/yr
US

  • Design and deliver AI-powered advisors, assistants, and analytic agents.
  • Build and maintain high-quality, production-ready Python services.
  • Apply, adapt, and fine-tune foundation models to deliver reliable AI experiences.

Energage helps organizations turn employee feedback into useful business intelligence and credible employer recognition through Top Workplaces. Built on culture research and the results from 23 million employees surveyed across more than 70,000 organizations, Energage delivers the most accurate competitive benchmark available.

North America Canada

  • Lead domain-specific model optimization using PEFT (LoRA/QLoRA) and knowledge distillation to balance cost, latency, and reasoning capability.
  • Build next-gen Retrieval-Augmented Generation pipelines using hybrid search, cross-encoders, and self-correcting retrieval loops.
  • Design and deploy multi-agent systems using frameworks like LangGraph or CrewAI, enabling autonomous task planning and tool-use (Function Calling).

ServiceNow is a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Their intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work.

US

  • Design, build, and deploy AI Agents including custom tools, prompt engineering, orchestration workflows, and agent design patterns.
  • Contribute to the backend infrastructure powering Candidly's AI capabilities, including API development, data integrations, and data pipelines.
  • Work closely with stakeholders across product, design, engineering, and leadership to translate complex AI concepts into actionable strategies and features.

Candidly, founded in 2016, is the category leader with the market’s most comprehensive AI-driven student debt and savings optimization platform. They partner with hundreds of top employers, financial institutions, and retirement record keepers, positioning Candidly to serve more than 35 million Americans. Candidly is a high-growth, Series B startup, funded by leading investors with an international team of 70 (and counting).

$175,000–$220,000/yr
US

  • Design agentic systems & ship AI to production: Turn prototypes into resilient, observable services with clear SLAs, rollback/fallback strategies, and cost/latency budgets.
  • Build tool‑using LLM “agents” (task planning, function/tool calling, multi‑step workflows, guardrails) for tasks like grant discovery, application drafting, and research assistance.
  • Own RAG end‑to-end: Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re‑ranking, citations, and grounding.

Instrumentl is a hyper-growth YC-backed startup that provides a SaaS platform to help nonprofits discover, track, and manage grants efficiently. They have over 4,000 nonprofit clients and are cash flow positive, doubling year-over-year, with customers who love them.

$230,000–$300,000/yr
US

  • Design, develop, and deploy agentic AI solutions for clients.
  • Build multi-agent systems and integrate models with enterprise systems.
  • Collaborate with clients and engineers to create scalable solutions.

AHEAD builds platforms for digital business, weaving together advances in cloud infrastructure, automation, analytics, and software delivery to help enterprises deliver on digital transformation. They prioritize creating a culture of belonging where all perspectives are valued and heard.

US

  • Build and maintain gen AI prompts aligned with ad formats and community dynamics.
  • Improve the quality and brand safety of model outputs across text, images, and video.
  • Partner with Product and Engineering to prioritize improvements and accelerate feature development.

Reddit is a community built on shared interests and trust, home to open conversations and one of the internet’s largest sources of information.

$125,600–$157,000/yr
US

  • Design, build, and scale enterprise-grade AI/ML systems that power internal workflows and external-facing AI/ML platforms.
  • Develop a production-ready Generative AI and MLOps platform with reusable components used to deploy multiple AI solutions across Natera’s business units.
  • Implement cloud-native infrastructure for large-scale model training and serving using Kubernetes, MLflow, Terraform, and AWS-native services

Natera is a global leader in cell-free DNA (cfDNA) testing. They are dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing and diagnostics part of the standard of care. The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions.

US

  • Design, develop, and maintain a robust platform to enable users to create and manage AI agents and their interactions.
  • Integrate and work with multiple LLMs, ensuring seamless orchestration and scalability for both individual and coordinated agent operations.
  • Leverage orchestration frameworks like LangGraph and others to build complex workflows and pipelines that support diverse agent functionalities, including frameworks for multi-agent coordination.

ClickUp is building the future of work by creating a converged AI workspace that unifies tasks, docs, chat, calendar, and enterprise search. Their AI-powered platform helps teams break free from silos and unlock new levels of productivity.

North America

  • Design, refine, and evaluate prompts, context, and system instructions for various product use cases
  • Conduct experiments to assess model behavior, accuracy, and cost impact with new or existing prompts
  • Continuously improve prompt engineering processes by adopting new techniques and technologies

Applied Systems transforms the insurance industry. They have 40+ years of experience and are building a team ready to learn and deliver innovative software and services.

Global

  • Evaluate what AI models produce in your field.
  • Assess content related to your field of work.
  • Deliver clear, structured feedback that strengthens the model’s understanding.

Handshake is connecting students to early talent opportunities. The company is focused on helping students find the right job and employers connect with the right candidate.

$85,000–$225,000/yr
US Canada

This role validates Veeva AI Agents through evaluation. You will define strategies for new AI Agents. The role involves analysis of model behaviors to identify defects.

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster.

Europe Unlimited PTO

  • Designing complex, dynamic prompt templates with conditional logic.
  • Implementing various response schemes to ensure AI outputs are predictable.
  • Building robust evaluation pipelines and using Langfuse to collect feedback.

Ruby Labs is a leading tech company that creates and operates innovative consumer products. We offer a diverse range of opportunities across the health, education, and entertainment industries, and our innovative teams are driving the future of consumer-led products.

$120,000–$150,000/yr

  • Architect, build, and deploy LLM-powered applications that augment and automate key workflows.
  • Design autonomous AI systems that can execute technical analysis, testing, troubleshooting, and decision-making at scale.
  • Develop AI-driven tools that create measurable business impact — improving efficiency, accelerating innovation, and driving revenue growth.

Sierra Studio connects talented Brazilian professionals with exciting career opportunities in a highly-vetted small community of growing companies in the US. They specialize in enabling merchants, consumers, and partners to operate with flexibility, intelligence, and trust with over 250 people.

Global

  • Evaluate AI model outputs related to your field.
  • Assess content relevant to your area of expertise.
  • Deliver clear feedback to improve the model's comprehension.

Handshake is recruiting College Career/Technical Education Professors to contribute to an hourly, temporary AI research project. In this program, you’ll leverage your professional experience to evaluate what AI models produce in your field.

$187,000–$250,000/yr
US

  • Own and execute a strategic roadmap for AI research, messaging, and context capabilities.
  • Enhance Apollo's AI research agents to surface actionable insights from the web.
  • Define how AI understands each user's business, transforming generic AI outputs into relevant recommendations.

Apollo.io is the leading go-to-market solution for revenue teams, trusted by over 500,000 companies and millions of users globally.

US

  • Own the product lifecycle for AI-based decision-support tools, including roadmap planning, feature prioritization, and technical configuration
  • Serve as lead system prompt engineer, creating and maintaining instructions for LLM-based products
  • Collaborate with clients, internal teams, and stakeholders to conceptualize, test, and implement new features

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. The system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

US

  • Designing, developing, and deploying generative AI models.
  • Architecting and building agentic systems with autonomous decision-making capabilities.
  • Integrating generative AI and agentic solutions into existing products and services.

Jobgether is a partner company that focuses on connecting talent with the right job opportunities. Their AI-powered matching process ensures applications are reviewed quickly, objectively, and fairly against the role's core requirements.

5w PTO

  • Build and maintain an internal LLM gateway that handles routing, fallbacks, and rate limiting
  • Create reusable components for common AI patterns (RAG, function calling, streaming responses)
  • Develop SDKs or libraries that simplify AI integration for application developers

ButterflyMX empowers people to open and manage doors & gates from a smartphone and their products are installed in multifamily, commercial, and gated communities. As a distributed workforce, they're looking for intelligent, collaborative, and down-to-earth individuals to join their growing team.