Source Job

Europe

  • Define and evolve the architecture and roadmap for enterprise‑scale Data and AI platforms.
  • Design and build multi‑tenant, multi‑region, highly available AI platforms with governance.
  • Lead capacity planning and cost optimization strategies for GPU and CPU workloads.

AWS Python Terraform MLOps Kubernetes

20 jobs similar to AI Platform Engineer

Jobs ranked by similarity.

$138,700–$173,350/yr
US

  • Lead the architecture of a high-scale AWS environment optimized for AI workloads.
  • Manage and mentor a high-performing team of 8 engineers, providing technical leadership and career coaching.
  • Conduct user research with internal Natera developers to identify friction points.

Natera is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health. The Natera team consists of statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers, and many other professionals from world-class institutions.

Global

  • Design, build, and implement an end-to-end AI-powered tender response platform.
  • Develop and maintain backend services and AI workflows using AWS-native technologies.
  • Build and manage infrastructure using Terraform and cloud-native best practices.

Smart Working believes that your job should not only look right on paper but also feel right every day. They break down geographic barriers and connect skilled professionals with outstanding global teams and products for full-time, long-term roles.

$124,800–$156,000/yr
US

  • Architect, implement, and maintain AI agent orchestration platforms for multi-agent delegation, session management, and streaming interfaces.
  • Design and build Model Context Protocol (MCP) servers to expose domain-specific capabilities as composable tools with rigorous security and isolation.
  • Deploy and operate containerized services on AWS using Terraform, optimizing for cost, reliability, and observability while partnering with ML and data engineers.

Natera is a global leader in cell-free DNA testing dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing part of standard care. The company consists of a dedicated team of professionals from world-class institutions who are stretched and challenged while working to change genetic disease management.

$428–$500/mo
Global

  • Own the complete lifecycle of your assigned workstream.
  • Build AI systems designed to perform consistently and efficiently at scale.
  • Collaborate with our team to translate requirements into maintainable AI solutions.

They are looking for a contract AI Engineer to join them for an active, ongoing project and contribute immediately. Contract performance can lead to repeat engagements.

US Global

  • Define end-to-end architecture for AI/ML and Gen AI systems.
  • Serve as a strategic advisor to clients, leading solution design discussions.
  • Architect scalable solutions using cloud-native AI tools.

3Pillar Global provides a flexible work environment with a remote-first approach, offering opportunities for global teamwork and leveraging diverse resources. They focus on well-being, career growth, and diversity.

US Unlimited PTO

  • Maintain, improve, and extend an AI platform already running in production.
  • Handle a mix of backend development, data pipelines, DevOps, and infrastructure work.
  • Translate business and product requirements into technical decisions independently.

Provectus is an AI consultancy and solutions provider. We help businesses adopt AI technologies, offering development and integration services. While the job posting doesn't mention company size information, they seem to foster a flexible, autonomous, and tech-forward culture.

$188,550–$212,150/yr
Global Unlimited PTO

  • Own the technical direction of Remote's SRE/Platform domain.
  • Define and drive the reliability strategy across the platform.
  • Identify and lead AI enablement initiatives across the engineering organisation.

Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.

$165,000–$165,000/yr
North America Europe Middle East APAC

  • Implement and manage AI-powered tools, copilots, and workflow automations from POC to production, owning the full technical lifecycle.
  • Design, deploy, and maintain cloud infrastructure on AWS and Azure, including IAM, VPCs, security groups, multi-account strategies, and cost optimization.
  • Own reliability, observability, and security controls across all AI and cloud services, including incident response, debugging complex multi-service environments, and driving continuous improvement.

Dragos is dedicated to arming customers with best-in-class technology, threat intelligence, and services to protect their systems. They're a remote-first culture with operations in North America, Europe, the Middle East, and APAC, looking for mission-oriented teammates who embody their core values of authenticity, transparency, and trust.

Colombia

  • Design, implement, and deploy ML/AI models end-to-end, from concept through production, including data pipelines, training workflows, and optimization.
  • Maintain and evolve AI systems in production, monitoring for drift, debugging issues, and driving ongoing improvements to reliability and scalability.
  • Partner closely with product, engineering, and data teams to align AI work with broader product and business goals.

Robots & Pencils is an applied AI engineering firm that designs and ships AI co-workers integrating into operations and delivering results for clients. Founded in 2009, they have delivery centers in Canada, the United States, Eastern Europe, and Latin America, with teams averaging 15+ years of experience.

Global

  • Design and implement multi-agent AI architectures using AWS Bedrock.
  • Develop agent orchestration logic and collaborative agent workflows.
  • Build Retrieval-Augmented Generation (RAG) pipelines using vector databases and embeddings.

We unite human expertise with AI to create scalable tech solutions. With over 8,000 CI&Ters around the world, CI&T has built partnerships with more than 1,000 clients during our 30 years of history.

US 8w paternity

  • Build and deploy AI-enabled tools, workflows, and internal automations that improve productivity.
  • Connect AI agents, tools, and automations with enterprise systems, ensuring reliable interoperability.
  • Build, maintain, and improve MCP connections and supporting infrastructure.

Openly is rebuilding insurance from the ground up, re-envisioning and enhancing every aspect of the customer experience. They are a rapidly growing team of exceptional, curious, empathetic people with a wide range of skill sets.

$229,000–$240,000/yr
US

  • Lead AI strategy and execution, integrating AI into core products and accelerating development workflows.
  • Architect and build complex, scalable systems and AI-driven features using technologies like Python, Java, AWS, and GCP.
  • Provide technical leadership, mentorship, and cross-functional collaboration to solve complex problems and drive successful outcomes.

Tebra provides an all-in-one EHR+ platform designed exclusively for independent healthcare practices to connect EHR software, billing, automation, telehealth, and marketing. The company serves over 42,000 private practices and fosters a culture of entrepreneurship, customer focus, simplicity, teamwork, and celebration.

$125,000–$175,000/yr
US Unlimited PTO

  • Act as a trusted advisor to customers, building relationships with technical and business stakeholders.
  • Advise on GenAI and ML best practices, giving product demos to technical and business stakeholders.
  • Partner with product and engineering teams to drive the product roadmap and spearhead new opportunities within existing accounts.

Arize AI is transforming the world by helping teams monitor, troubleshoot, and optimize their AI systems with its AI & Agent Engineering observability and evaluation platform. They are a Series C company backed by top-tier investors, with over $135M in funding and a rapidly growing customer base.

Poland

  • Design and deploy GPU cluster architectures using tools like Ansible, Terraform, Kubernetes, and Slurm.
  • Lead technical deep-dives, workshops, and present solutions to stakeholders, translating complex concepts.
  • Automate provisioning and monitoring with Infrastructure as Code, and produce documentation and training materials.

Gcore is a global provider of infrastructure and software solutions for AI, cloud, network, and security, powering digital experiences worldwide. The company collaborates with leading technology partners and employs over 550 professionals building foundational technologies.

Canada

  • Define, drive, design, and build/ship end-to-end solutions that solve real customer problems.
  • Contribute to the end-to-end AI/ML software development lifecycle, ensuring reproducible research.
  • Drive architecture, design, and delivery of advanced ML systems in the Product R&D team.

Kinaxis is a global leader in modern supply chain orchestration. Known for its AI-infused platform and transparency across end-to-end supply chains, Kinaxis helps customers make faster, better decisions. The company has over 2000 employees worldwide and is recognized with Top Employer awards.

$160,000–$180,000/yr
US Unlimited PTO

  • Identify systemic engineering challenges across our platforms and drive their resolution.
  • Write code, review PRs, debug production issues, and optimize system performance.
  • Partner with engineering teams as a technical point of contact on complex projects.

Zeta Global is an AI-Powered Marketing Cloud that leverages advanced artificial intelligence (AI) and trillions of consumer signals to help marketers acquire, grow, and retain customers more efficiently. They were founded in 2007 and are headquartered in New York City with offices around the world.

US

  • Lead the design and evolution of Fieldguide's core platform services.
  • Build platform capabilities that compound the leverage of product and AI engineers.
  • Define the architecture for how new product capabilities get delivered across environments.

Fieldguide is automating and streamlining the work of assurance and audit practitioners. They are based in San Francisco, CA, remote-first and backed by Goldman Sachs Alternatives, Bessemer Venture Partners, 8VC, Floodgate, Y Combinator, and more.

Europe

  • Design, build, and maintain scalable services that support the AI lifecycle.
  • Develop tools for pre/post-processing data for AI and other usage.
  • Design scalable pipelines for data collection, processing, and transformation.

Planner 5D is a global hub for home design, uniting over 100+ million users. They simplify the home renovation process with their cutting-edge software, fostering a vibrant community of enthusiastic and product-oriented professionals.

Global

  • Own and operate GPU and accelerator clusters for AI training, inference, and experimentation, ensuring reliability and cost-efficiency.
  • Build and optimize scheduling, orchestration, and serving systems using frameworks like vLLM and Triton to improve latency, throughput, and memory efficiency.
  • Partner with ML engineers to remove workflow bottlenecks and build observability for GPU utilization, capacity, and incident response.

Kraken is a crypto exchange platform building premium financial products for traders and institutions, accelerating global crypto adoption. It is a mission-driven, fully remote company with a world-class team of crypto experts spread across more than 70 countries.

India

  • Collaborate with engineering and cross-functional teams to translate business problems into an ML product roadmap.
  • Contribute hands-on technical expertise as a player-coach, providing strategic direction and mentorship to the team.
  • Establish an engineering setup enabling rapid iteration, experimentation, and deployment of models, fostering operational excellence.

Twilio is shaping the future of communications by delivering innovative solutions to hundreds of thousands of businesses. They empower millions of developers worldwide to craft personalized customer experiences, emphasizing a remote-first culture with a vibrant and globally inclusive team.