Source Job

US

  • Own the agent layer of the platform, including architecture, prompts, tool surfaces, and multi-agent orchestration.
  • Drive translation and dependency-mapping accuracy across unfamiliar legacy paradigms.
  • Write production agent code daily, using subagents and multi-agent workflows as the normal way of working.

TypeScript Python MCP RAG

20 jobs similar to Senior Agent Engineer

Jobs ranked by similarity.

India

  • Build and ship specialized agents including parsers, extractors, and synthesizers for the Aedeon agent-native modernization platform.
  • Own the full delivery of assigned agents from prototype through deployment and post-release validation, practicing test-driven development.
  • Write clear Python, document agent contracts and decision logic, and promote a culture of release discipline and quality across the team.

Mactores is a trusted leader in providing modern data platform solutions, enabling businesses to accelerate value through automation with end-to-end data solutions that are automated, agile, and secure. Since 2008, they have collaborated with customers to strategize and navigate digital transformation via assessments, migration, or modernization, fostering a culture driven by 10 core leadership principles.

$154,384–$198,893/yr
Europe

  • Design, build, and own core components of the agent platform, from the orchestration layer to the tool integrations connecting it to internal systems.
  • Build and evolve the capabilities layer: APIs, data access patterns, and service integrations for agents to execute operational workflows.
  • Architect the knowledge and memory infrastructure, allowing agents to retrieve the data and act across our systems.

Justworks helps businesses get off the ground by handling HR issues and enabling them to focus on running their business. The company embraces a supportive, entrepreneurial environment where employees are encouraged to build something meaningful and have fun.

North America Canada

  • Design, build, and operate production-grade agentic AI systems embedded across ServiceNow's platform.
  • Build agents that leverage ServiceNow's data layer to make decisions with context no frontier model has on its own.
  • Own the guardrails: observability, human-in-the-loop controls, and compliance infrastructure that make autonomous systems safe to deploy at scale.

ServiceNow is the AI control tower for business reinvention. Our AI platform brings together any AI, any data, and any workflow, helping 85% of the Fortune 500® work smarter, faster, and better. We're building an AI-native culture where technology and talent are unstoppable together.

APAC

  • Design, build, and operate production-grade agentic AI systems embedded across ServiceNow's platform.
  • Build agents that leverage ServiceNow's data layer to make decisions with context no frontier model has on its own.
  • Own the guardrails: observability, human-in-the-loop controls, and compliance infrastructure that make autonomous systems safe to deploy at scale.

ServiceNow is an AI control tower for business reinvention. Their AI platform brings together any AI, any data, and any workflow, helping 85% of the Fortune 500® work smarter, faster, and better. They're building an AI-native culture where technology and talent are unstoppable together.

US

  • Shape technical direction and architecture: Define the foundational architecture for enterprise agentic AI at Benchling.
  • Build and ship the early portfolio yourself: Write production code at least half your time, particularly during the team's first year.
  • Design for enterprise from day one: Build for multi-tenant isolation, secrets management, audit logging, payload encryption, role-based access controls, and human-in-the-loop controls calibrated to risk.

Benchling is the AI platform for biotech R&D. Scientists use Benchling to design experiments, capture structured data, and run AI agents and models directly in their workflows. It serves over 200,000 scientists around the world, from academic labs to Sanofi and Moderna.

United States Unlimited PTO

  • Design and build scalable backend systems powering AI agents in real-time enterprise environments.
  • Develop agent orchestration frameworks and low-latency inference pipelines integrating LLMs and SLMs.
  • Build robust APIs and work with cross-functional teams to productionize agentic AI at scale.

Level AI is an AI-native platform that helps enterprises transform contact centers into engines of customer intelligence and operational efficiency. The company is a Series C startup backed by Battery Ventures and ENIAC, based in Mountain View, California, with a globally distributed team.

Global Unlimited PTO

  • Build agents that investigate incidents and surface anomalies.
  • Write reusable skills that capture debugging and incident response playbooks.
  • Own the agent stack end-to-end, including context engineering and evals.

Recognized on the 2025 Forbes Cloud 100 list, ClickHouse is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads.

Global

  • Design and write high-quality prompts for LLM-based agents and build agentic tools and workflows using Netomi's no-code platform.
  • Integrate external and internal APIs, implement unit tests, and ensure reliability of agent workflows.
  • Optimize agents for performance, cost, and fault tolerance while collaborating with Product, QA, and Delivery teams.

Netomi is the leading agentic AI platform for enterprise customer experience, working with global brands like Delta Airlines and MetLife. Backed by WndrCo, Y Combinator, and Index Ventures, the company helps enterprises drive efficiency and lower costs.

US Canada

  • Design, build, and ship agentic workflows across multiple domains.
  • Build multi-step agents capable of autonomous planning, context tracking, memory, tool use, and API orchestration.
  • Drive technical and architectural decisions to meet product requirements while also anticipating and designing for future needs.

Cority helps customers see and prevent risks across their operations in real time. Our EHS+ platform converges people, data, and AI agents to provide a clear view of information people can trust. For 40 years, Cority has been the market leader in EHS+, recognized by top analysts and trusted by more than 1,500 of the most complex organizations worldwide.

India

  • Own end-to-end execution of AI agent deployments from discovery and scoping through launch and optimization.
  • Configure agent workflows, decision logic, and automation behaviors to maximize accuracy, reliability, and business outcomes.
  • Implement guardrails and validation frameworks to ensure safe, compliant, and predictable agent performance.

Level AI is transforming how enterprises understand and engage with their customers. Their AI-native CX platform combines conversation intelligence, real-time agent guidance, and AI Virtual Agents to help brands deliver exceptional customer experiences at scale. At Level AI, they operate with urgency, ownership, and a deep customer-first mindset.

Canada

  • Design and implement multi-agent AI systems using frameworks like LangChain and CrewAI, building agent-to-agent orchestration pipelines.
  • Fine-tune foundation models, integrate retrieval-augmented generation, and develop APIs and backend services for production deployment.
  • Containerize and deploy agents with Docker and Kubernetes, while collaborating with QA and product teams to benchmark accuracy and safety.

Innodata is a global data engineering company focused on enabling the responsible advancement of artificial intelligence by providing data, evaluation frameworks, and human expertise. With over 36 years of experience, the company delivers high-quality data solutions and services for Generative AI builders and adopters.

US

  • Write behavioral specs, architectural constraints, and feature requirements that agents implement against.
  • Build and maintain harness infrastructure including structural tests, linting rules, and CI gates.
  • Design validation systems where agents write the tests and you verify features work from the user's perspective.

Bolo.ai builds generative AI systems for the energy industry, making daily work faster, safer, and better for heavy industry workers. We have Fortune 500 contracts, production deployments, and growing enterprise demand, and we're scaling with a small, senior-leaning engineering team.

Global 4w PTO 16w maternity 16w paternity

  • Work cross-functionally to design and implement AI-powered features that integrate LLMs with Vanta's existing products and systems.
  • Instrument evaluations, guardrails, and monitoring to continually improve quality based on customer usage.
  • Mentor engineers and champion a collaborative, high-ownership engineering culture.

Vanta helps businesses earn and prove trust by automating security monitoring and compliance. The company has a kind and talented team, with offices in SF, NYC, London, Dublin, Tel Aviv, and Sydney.

US

  • Operate daily with Claude Code using structured, repeatable workflows: plan, decompose, implement, test, ship.
  • Serve as a technical resource for AI-augmented workflows across the engineering team, sharing playbooks, patterns, and tooling that help other developers level up.
  • Continuously research and evaluate new AI tools, models, and techniques with a framework for deciding what to adopt.

Mango Languages is an industry leader in providing engaging language-learning experiences to millions of users around the globe. Their software uses real-world conversations and cultural insights to share world languages with public library patrons, students, corporate employees, government officials, and learners of all ages.

Global

  • Design and ship production AI features end-to-end across LangGraph / LiteLLM / pgvector / Langfuse.
  • Drive technical architecture for the AI product line: LLM orchestration, evals, observability, latency / cost / reliability tradeoffs.
  • Own AI initiatives technically — from spec through production, including rollout and post-launch eval improvements.

airSlate is a global SaaS technology company that develops no-code workflow automation, electronic signature, and document management solutions. They have hundreds of millions of users and more than one million customers worldwide, helping organizations of every size digitize processes, improve efficiency, and transform how they work.

$194,000–$228,000/yr
US

  • Design, build, and ship LLM-powered features and agentic workflows for Gametime users.
  • Build and maintain evaluation frameworks and prompt testing pipelines for AI-powered experiences.
  • Contribute to orchestration layer, including agent routing, tool use, and multi-step workflow coordination.

Gametime helps people connect through shared live experiences. They operate platforms on iOS, Android, mobile web, and desktop, supporting over 60,000 events across the US and Canada, fostering a collaborative and inclusive environment where diverse perspectives are valued.

US

  • Design, build, and maintain production-grade AI systems and customer-facing AI features.
  • Develop agentic workflows using LLMs, retrieval systems, tools, APIs, and backend services.
  • Design and implement retrieval-augmented generation (RAG) systems, including ingestion pipelines, embeddings, semantic retrieval, and context assembly.

Givzey is a fast-growing and innovative technology company serving the nonprofit sector, on a mission to unlock more generosity through AI-powered donor engagement. In just three years, Givzey’s platform has already helped organizations raise $10M+ through autonomous engagement.

$90,000–$150,000/yr
US

  • Design and deliver production AI and agentic systems across document intelligence, workflow automation, and copilots.
  • Define architecture decisions for LLM-based systems, including retrieval, tool use, orchestration, memory, and evaluation.
  • Own evals and observability for production AI and manage cost and latency at production volume.

Maxwell is a mortgage technology and fulfillment company with a mission to make lending simpler, faster, and more accessible. They power hundreds of lending institutions with their mortgage Point of Sale and related capabilities and are a remote-first team that takes craft seriously.

US UK

  • Architect and pilot a client-facing advisory business for agentic strategy, designing multi-agent systems that drive ROI for elite investment teams.
  • Develop proprietary pattern libraries and sophisticated system prompts for complex fund strategies like Long/Short Equity and Macro.
  • Lead high-stakes design sprints for C-suite stakeholders and enable internal teams with agent fluency playbooks.

Atlas Technica is a premier Managed Service Provider for the alternative investment industry, powering technology for hedge funds, private equity firms, and family offices. They are evolving into a Managed Intelligence Provider, architecting secure agentic frameworks for institutional alpha generation.

$145,000–$193,750/yr
US

  • Architect AI Agents: Take a leading role in designing Agentic Workflows using AWS Step Functions and Bedrock Agents that reason through multi-step business problems.
  • Standardize Context via MCP: Drive the implementation of the Model Context Protocol (MCP) across the org to unify how AI interacts with Smartsheet, CRM, and ERP data.
  • Design Human-in-the-Loop: Own the end-to-end design of high-stakes automations where UiPath Action Center acts as the safety gate for AI-driven decisions.

Smartsheet helps people and teams achieve their goals by providing seamless work management and smart, scalable solutions. They empower teams to automate manual tasks, uncover insights, and scale smarter. The company has over 20 years in the industry and fosters a culture where ideas are heard and contributions have a real impact.