Source Job

  • Designing and developing the core platform that enables the efficient deployment, scaling, and management of LLMs and multi-agent systems.
  • Building specialized infrastructure to support long-running agentic workflows, including state management, tool-calling interfaces, and complex reasoning loops.
  • Scaling inference for LLMs to handle global demand while optimizing for latency, throughput, and cost.

Python Kubernetes Docker MLOps LangChain

20 jobs similar to Senior Staff GenAI Platform Engineer

Jobs ranked by similarity.

$125,000–$156,300/yr
US

  • Design, build, and operate LLM-powered systems used in production.
  • Build scalable agentic AI automation solutions, selecting appropriate patterns based on business requirements.
  • Make system-level tradeoffs across model choice, latency, cost, accuracy, and operational complexity.

Natera is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing and diagnostics part of the standard of care. The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions.

$89,769–$122,862/yr
Canada Unlimited PTO

  • Build AI-Powered Features: Design, develop, and deploy production-grade AI applications that solve real customer problems.
  • Architect Scalable Systems: Create robust backend architectures that support AI workloads, ensuring low latency and high reliability.
  • Drive AI Innovation: Implement and optimize agentic AI systems, RAG pipelines, and multi-agent workflows using modern LLM frameworks.

Procurify is the AI-enhanced procurement and AP automation platform for mid-market organizations. They help organizations take control of spend and save money as a remote-first company with a big heart and a strong ambition to modernize the way organizations manage business spend.

  • Design and deploy high-performance agentic systems that leverage Fastino’s optimized model architectures.
  • Collaborate with engineering teams to turn novel architectural breakthroughs into scalable solutions for enterprise customers.
  • Drive rapid, iterative prototyping of AI functionalities, refining model performance and task-accuracy based on real-world telemetry.

Fastino is building the next generation of LLMs with a team of alumni from Google Research, Apple, Stanford, and Cambridge and has developed the GLiNER family of open source models. Fastino has raised $25M through seed round and is backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.

US

  • Build and scale the AI Agent platform.
  • Design and implement APIs, services, and infrastructure.
  • Prototype rapidly and continuously improve system performance.

Podium brings AI Employees to local businesses that turn every conversation into revenue. Trusted by 60,000+ businesses, they have crossed $100M in AI Agent ARR, scaling 300% year-over-year and empowering real business outcomes for their customers.

Global

  • Define and evolve the corporate GenAI architecture, ensuring strategic alignment and scalability.
  • Design and implement intelligent agent platforms, including orchestration and multi-agent workflows.
  • Define and standardize communication protocols between agents, adopting the Model Context Protocol (MCP).

Getnet is a global technology company specializing in payment solutions for commerce. As part of PagoNxt, the global fintech of Grupo Santander, we operate as an acquiring hub with a strong presence in Spain, Portugal, Brazil, Mexico, Chile, Argentina, and Uruguay.

$135,000–$216,000/yr
US

  • Design, develop, and optimize generative AI solutions, including prompt engineering and RAG implementations.
  • Implement agentic LLM architectures and multi-agent orchestration to generate reliable outputs.
  • Develop prompt libraries and evaluation frameworks to ensure accurate and secure AI-generated content.

Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy. As the world’s leading mission capability integrator and transformative enterprise IT provider, they deliver trusted, highly differentiated solutions and technologies to protect our nation and allies.

$170,000–$210,000/yr

  • Develop, implement, and validate machine learning and agentic flows.
  • Drive innovation in modeling approaches while balancing accuracy, efficiency, and interpretability.
  • Lead the design of benchmarking frameworks, evaluation tools, and metrics.

Federato is on a mission to defend the right to efficient, equitable insurance for all. They are AI-native platform that spans the full policy lifecycle. They move fast, are eager to listen to our users, take a first principles approach to solving problems, and value learning.

Global 5w PTO

  • Design, develop, and deploy robust ML systems and multi-model AI agents that solve real-world retail challenges.
  • Lead the entire lifecycle, including prototyping, deployment, monitoring, and maintenance using modern CI/CD and containerisation practices.
  • Build high-performance data pipelines (ETL/ELT) for both training and real-time inference, ensuring our systems are scalable and reliable.

EDITED is the world’s leading AI-driven retail intelligence platform. They empower the world’s most successful brands and retailers with real-time decision making power. Their environment is dynamic and supportive, encouraging team members to take initiative, innovate, and continuously grow.

India

  • Build the technical roadmap given a business requirement and own the delivery of the same.
  • Develop and optimize LLM-based solutions : Lead the design, training, fine-tuning, and deployment of large language models, leveraging techniques like prompt engineering, retrieval-augmented generation (RAG), and agent-based architectures.
  • Codebase ownership : Maintain high-quality, efficient code in Python (using frameworks like LangChain/LangGraph) and SQL, focusing on reusable components, scalability, and performance best practices.

Turing, based in San Francisco, is a research accelerator for frontier AI labs, partnering with global enterprises to deploy advanced AI systems. They accelerate research with data, talent, and training pipelines and build proprietary intelligence systems, recognized among the world's top innovators.

AI Engineer

Ethos
$146,000–$236,000/yr
US

  • Own the LLM + retrieval + context layer that makes copilots accurate and fast.
  • Design and ship the end-to-end pipeline, improving quality and trust via evaluation.
  • Reduce cost/latency with a concrete inference optimization plan shipped to production.

Ethos is built to make it faster and easier to get life insurance. They blend industry expertise, technology, and the human touch to find the right policy to protect loved ones and have been named on CB Insights' Global Insurtech 50 list and BuiltIn's Top 100 Midsize Companies in San Francisco.

Europe

  • Designing, developing, and deploying generative AI models.
  • Architecting and building agentic systems with autonomous decision-making capabilities.
  • Integrating generative AI and agentic solutions into existing products and services.

Jobgether leverages AI to match job seekers with roles. They use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements.

$107,000–$145,000/yr
Canada

  • Support the full operational lifecycle of both traditional machine learning systems and emerging generative AI driven applications.
  • Enable scalable training, evaluation, deployment, and monitoring for a wide range of ML and GenAI workloads.
  • Manage model upgrades, framework versions, regression testing, maintenance tasks and maintaining performance across systems and solutions.

Achievers' employee recognition and rewards platform empowers organizations to build cultures where people feel seen and valued, everyday. They're a team of passionate, thoughtful builders with more than 4.3 million users across 190 countries, who care deeply about their product, their customers, and each other.

$110,000–$140,000/yr
US Canada

  • Design, build, and ship agentic workflows across multiple domains.
  • Build multi-step agents capable of autonomous planning, context tracking, memory, tool use, and API orchestration.
  • Drive technical and architectural decisions to meet product requirements while also anticipating and designing for future needs

Cority helps customers see and prevent risks across their operations in real time. They provide a platform that converges people, data, and AI agents and is trusted by more than 1,500 of the most complex organizations worldwide.

South America

  • Design, develop, and deploy production-ready applications using Large Language Models (LLMs).
  • Develop robust backend services and APIs for AI applications using Python and modern frameworks.
  • Design and optimize high-quality prompts and templates that guide LLM behavior and responses.

CI&T is a tech transformation specialist, uniting human expertise with AI to create scalable tech solutions. With over 8,000 CI&Ters around the world, they’ve built partnerships with more than 1,000 clients during our 30 years of history.

Global

  • Design and ship production-grade agentic AI systems that meaningfully improve customer workflows and internal operations.
  • Establish a clear technical architecture for AI at Moxie, including agent orchestration, tool/function calling and observability.
  • Integrate AI deeply into the Moxie platform, ensuring AI systems are secure, resilient, cost-aware, and aligned with a regulated environment.

Moxie empowers ambitious aesthetic entrepreneurs to build profitable, independent practices. They are a global, remote-first team of more than 140 people, supporting hundreds of practices nationwide, aiming to unlock sustainable success for aesthetic entrepreneurs.

North America Europe Asia

  • Build and productionize LLM and NLP models across retrieval, summarization, classification, and generative tasks.
  • Design and implement scalable ML services and inference pipelines in Python using modern ML frameworks.
  • Translate complex NLP and LLM product requirements into structured engineering plans with clear milestones.

Loopio provides a workplace that recognizes the advantages of working flexibly, operating as a remote-first company. They have established hub regions around the world and foster a supportive culture with opportunities for connection.

  • Build, optimize, and evolve RAG pipelines.
  • Develop prompts and guardrails for domain-specific LLM applications.
  • Implement hallucination detection, mitigation, and fact-checking mechanisms.

Robots & Pencils builds meaningful, scalable digital products by blending strategy, design, and engineering. They are a small, senior team with direct access to enterprise clients.

Europe

  • Work side by side with clients, PMs, and Architects to scope and deploy AI systems.
  • Build and integrate systems using LLMs, RAG pipelines, agent frameworks, vector databases and related tools.
  • Debug relentlessly and optimize for reliability in production, not just elegance in code.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Their system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

Global

  • Design, implement, and maintain high-performance ML training and inference platforms.
  • Ship tools that allow any ML engineer to deploy a model in minutes, not days.
  • Improve scalability, reliability, and cost efficiency of model training and serving systems.

Speechify's mission is to make sure that reading is never a barrier to learning. With nearly 200 people around the globe working in a 100% distributed setting, Speechify's team includes frontend and backend engineers, AI research scientists, and others.

India

  • Build sophisticated multi-agent systems that can reason, plan, and execute complex sales workflows.
  • Develop systems that maintain conversational context across complex multi-turn interactions.
  • Build scalable large language model and agentic platforms that enable widespread adoption and viability of agent development within the Apollo ecosystem.

Apollo.io provides sales and marketing teams with easy access to verified contact data and tools to engage and convert contacts in one unified platform. They are a fast-growing SaaS company with over 500,000 companies and millions of users globally, valued at $1.6 billion.