Source Job

  • Implement production AI / ML workloads using Ray and Anyscale.
  • Advise customers on ML system architecture.
  • Partner with customer MLE and MLOps teams to integrate Ray into existing platforms and workflows.

Python MLOps MLflow Airflow Kubeflow

20 jobs similar to AI / ML Solutions Engineer

Jobs ranked by similarity.

$125,000–$175,000/yr
Unlimited PTO

  • Act as a trusted advisor to customers, building relationships with technical and business stakeholders.
  • Advise on GenAI and ML best practices and give product demos to stakeholders.
  • Partner with product and engineering teams to drive the product roadmap and spearhead new opportunities within existing accounts.

Arize AI is the leading AI & Agent Engineering observability and evaluation platform, empowering AI engineers to ship high-performing, reliable agents and applications.

Brazil

Combine Software Engineering and Data Science disciplines to create production-ready Machine Learning models. Develop frameworks and platforms to build, deploy, serve, and monitor ML-based services. Contribute to the vision and architecture for scaling ML solutions across QuintoAndar's business.

We are Grupo QuintoAndar, the largest real estate ecosystem in Latin America, guided by a shared purpose of helping people love the place they live.

Australia New Zealand

  • Act as a solution expert across ML domains including evaluations, training, inference, data pipelines, quality, and optimisation.
  • Work directly alongside product teams as a trusted partner, helping them navigate technical challenges and arrive at effective solutions.
  • Develop blueprints, patterns, and paved roads that allow other teams to follow proven approaches and accelerate their own implementations.

Canva is a design platform that enables users to create professional designs. They have a flagship campus in Sydney, a second campus in Melbourne, and co-working spaces in other locations, with a flexible work environment.

$145,831–$218,747/yr
Canada

  • Build, maintain and improve Torc ML frameworks.
  • Use Terraform, AWS Managed Services, EKS, Ray.
  • Focus on data ops, ML development pipeline, logging & aggregation.

Torc has been a leader in autonomous driving since 2007. Now a part of the Daimler family, they are focused solely on developing software for automated trucks to transform how the world moves freight. Their culture is collaborative, energetic, and team-focused.

  • Help define the direction for the team.
  • Define and prioritize ML Platform initiatives.
  • Enable teams to build features at scale by providing a foundation of reusable software components and infrastructure.

Motive empowers the people who run physical operations with tools to make their work safer, more productive, and more profitable. Motive serves nearly 100,000 customers – from Fortune 500 enterprises to small businesses – across a wide range of industries.

Colombia

  • Lead technical discovery sessions and translate business problems into ML solutions.
  • Design end-to-end ML architectures and present technical solutions to clients.
  • Collaborate with delivery teams and provide technical guidance during project execution.

Provectus is a company that focuses on AI enablement, helping businesses become AI-first through AI strategy, data engineering, and AI solutions. They foster a collaborative environment where team members share learnings and best practices.

US 4w PTO

  • Architect, design, and oversee delivery of end-to-end AI/ML solutions.
  • Lead cross-functional teams to implement robust ML platforms, pipelines, and applications.
  • Communicate the business value and ROI of AI/ML solutions to stakeholders.

Jobgether is using an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. The system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

$125,600–$157,000/yr
US

  • Design, build, and scale enterprise-grade AI/ML systems that power internal workflows and external-facing AI/ML platforms.
  • Develop a production-ready Generative AI and MLOps platform with reusable components used to deploy multiple AI solutions across Natera’s business units.
  • Implement cloud-native infrastructure for large-scale model training and serving using Kubernetes, MLflow, Terraform, and AWS-native services.

Natera is a global leader in cell-free DNA (cfDNA) testing. They are dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing and diagnostics part of the standard of care. The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions.

US Canada

  • Deliver a project that helps generalize model configuration and enables no-code model deploys.
  • Own the design and implementation of significant scalability improvements and additions to the ML platform.
  • Power replenishment decisions on more than 15% of all produce sold in the United States.

Afresh is on a mission to eliminate food waste and increase access to fresh, nutritious food with the first fresh technology platform built to solve the biggest challenges in the fresh supply chain. They are partnered with some of the largest grocers in the US and have raised over $148 million in funding from investors.

US UK

As a Principal Decision Scientist, you will define high-level business objectives directly with clients, then develop and execute the project plan to meet those objectives. You will provide technical leadership to guide development work across teams while also owning and delivering specific technical components yourself. You will design and develop feature engineering pipelines, build ML & AI infrastructure, deploy models, and orchestrate advanced analytical insights.

Aimpoint Digital is a premier analytics consulting firm with a mission to drive business value for clients through expertise in data strategy, data analytics, decision sciences

  • Own the end-to-end lifecycle of ML model deployment—from training artifacts to production inference services.
  • Design, build, and maintain scalable inference pipelines using modern orchestration frameworks (e.g., Kubeflow, Airflow, Ray, MLflow).
  • Implement and optimize model serving infrastructure for latency, throughput, and cost efficiency across GPU and CPU clusters.

MARA is building a modular platform that unifies IaaS, PaaS, and SaaS which will enable governments, enterprises, and AI innovators to deploy, scale, and govern workloads across data centers, edge environments, and sovereign clouds. They are redefining the future of sovereign, energy-aware AI infrastructure.

Europe

  • Drive technical success of Cerebras customers.
  • Serve as the technical lead throughout the customer journey ensuring smooth deployment and long-term value realization.
  • Partner with customers to uncover new opportunities where Cerebras products can drive value.

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Their novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. The team at Cerebras enjoys a simple, non-corporate work culture that respects individual beliefs and celebrates different backgrounds, perspectives, and skills.

$250,000–$325,000/yr
US

  • Design cloud-native architectures for agentic AI workloads using Kubernetes/EKS, Terraform, Docker, serverless APIs, AWS Batch, and async orchestration frameworks.
  • Define agentic system patterns using LangChain, LangGraph, Autogen, LlamaIndex, Pinecone, and other multi-agent frameworks; ensure consistency of prompt/tool design.
  • Architect vector database, RAG, embeddings pipelines, and model-serving endpoints (LLM/SLM) with strong emphasis on scalability and latency management.

AHEAD builds platforms for digital business by weaving together advances in cloud infrastructure, automation and analytics, and software delivery, helping enterprises deliver on the promise of digital transformation. They prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard.

6w PTO 26w maternity

  • Develop and deliver cutting-edge agentic AI solutions utilizing Cohere’s foundation models and Agentic AI Foundry - North.
  • Architect scalable, secure, and customizable NLP and generative AI solutions tailored to enterprise customer needs.
  • Collaborate with customers to understand complex workflows, design pilots, and translate business requirements into technical solutions.

Cohere is training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences.

$82,300–$140,580/yr
US

  • Deploy and optimize ML/LLM models on platforms like NVIDIA Triton and vLLM within Kubernetes clusters.
  • Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing.
  • Configure telemetry for GPU utilization, request tracing, and error monitoring.

We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions.

US Canada Argentina India

  • Work with research teams to design and build our training infrastructure.
  • Prototype new training frameworks and production-ize solutions at scale.
  • Design, optimize and test model integration infrastructure.

Clarifai is a leading AI platform specializing in computer vision, NLP, LLMs, and audio recognition, helping organizations transform unstructured data into structured data. Founded in 2013, they operate remotely across multiple countries with backing from industry leaders, fostering a diverse and equal opportunity workplace.

$315,000–$340,000/yr
US

  • Design and build infrastructure that enables researchers to rapidly iterate on reward signals.
  • Develop systems for automated quality assessment of rewards, including detection of reward hacks and other pathologies.
  • Collaborate with researchers to translate science requirements into platform capabilities.

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society.

US

  • Design and deploy production-ready ML solutions, leveraging cutting-edge platforms.
  • Conduct causal analysis to guide high-stakes business strategy.
  • Integrate models into production systems and monitor their performance using advanced observability tools.

CSC Generation is the AI-native holding company re-engineering omni-channel retail by acquiring iconic brands and transforming them with their operating platform.

Canada

  • Contribute to our core ML infrastructure.
  • Prototype new training frameworks and production-ize solutions at scale.
  • Design, optimize and test model integration infrastructure.

Clarifai is a leading, full-lifecycle deep learning AI platform for computer vision, natural language processing, LLMs, and audio recognition. Clarifai was founded in 2013 and has employees based remotely throughout the United States, Canada, Argentina, India, and Estonia.

$146,625–$241,950/yr
North America Canada

  • Lead architectural design sessions focused on AI Experience and Platform solutions.
  • Showcase platform capabilities through technical deep-dive demos.
  • Develop proof of concepts and technical validation frameworks for customers.

ServiceNow, founded in 2004, provides AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Their cloud-based platform connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work.