Source Job

Poland

  • Design and deploy GPU cluster architectures using tools like Ansible, Terraform, Kubernetes, and Slurm.
  • Lead technical deep-dives, workshops, and present solutions to stakeholders, translating complex concepts.
  • Automate provisioning and monitoring with Infrastructure as Code, and produce documentation and training materials.

Python Kubernetes Terraform Ansible

11 jobs similar to AI Solution Architect

Jobs ranked by similarity.

Global

  • Own and operate GPU and accelerator clusters for AI training, inference, and experimentation, ensuring reliability and cost-efficiency.
  • Build and optimize scheduling, orchestration, and serving systems using frameworks like vLLM and Triton to improve latency, throughput, and memory efficiency.
  • Partner with ML engineers to remove workflow bottlenecks and build observability for GPU utilization, capacity, and incident response.

Kraken is a crypto exchange platform building premium financial products for traders and institutions, accelerating global crypto adoption. It is a mission-driven, fully remote company with a world-class team of crypto experts spread across more than 70 countries.

$180,000–$300,000/yr
US, 20 weeks maternity leave, 12 weeks paternity leave

  • Act as a trusted advisor to clients, providing technical expertise and guidance throughout engagements
  • Conduct PoCs, workshops, presentations, and training sessions on GPU cloud technologies and best practices
  • Collaborate with clients to understand their business requirements and develop solution architectures

Lavendo partners with startups and high‑growth companies to help them hire top‑tier sales, go‑to-market, and technical talent. They are an equal opportunity workplace and consider all qualified applicants without regard to race, color, religion, national origin, age, sex, marital status, ancestry, disability, genetic information, veteran or military status, gender identity or expression, sexual orientation, or any other characteristic protected by law.

Canada

  • Define, drive, design, build, and ship end-to-end solutions that solve real customer problems.
  • Contribute to the end-to-end AI/ML software development lifecycle, ensuring reproducible research.
  • Drive architecture, design, and delivery of advanced ML systems in the Product R&D team.

Kinaxis is a global leader in modern supply chain orchestration. Known for its AI-infused platform and transparency across end-to-end supply chains, Kinaxis helps customers make faster, better decisions. The company has over 2000 employees worldwide and is recognized with Top Employer awards.

SRE

Fal
$180,000–$250,000/yr
US

  • Own and operate our Kubernetes infrastructure.
  • Build and maintain CI/CD pipelines and deployment infrastructure.
  • Leverage AI to automate analysis and resolution of production issues.

Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production.

US Global

  • Define end-to-end architecture for AI/ML and Gen AI systems.
  • Serve as a strategic advisor to clients, leading solution design discussions.
  • Architect scalable solutions using cloud-native AI tools.

3Pillar Global provides a flexible work environment with a remote-first approach, offering opportunities for global teamwork and leveraging diverse resources. They focus on well-being, career growth, and diversity.

Europe

  • Define and evolve the architecture and roadmap for enterprise‑scale Data and AI platforms.
  • Design and build multi‑tenant, multi‑region, highly available AI platforms with governance.
  • Lead capacity planning and cost optimization strategies for GPU and CPU workloads.

NEORIS accelerates growth in Ibero‑America, combining global engineering with regional expertise. With over 60,000 professionals across 55+ countries, they offer technical specialization career paths and value responsibility, collaboration, creativity, and commitment.

Europe

  • Design, build, and maintain scalable services that support the AI lifecycle.
  • Develop tools for pre- and post-processing data for AI and other uses.
  • Design scalable pipelines for data collection, processing, and transformation.

Planner 5D is a global hub for home design, uniting over 100 million users. They simplify the home renovation process with their cutting-edge software, fostering a vibrant community of enthusiastic and product-oriented professionals.

Colombia

  • Design, implement, and deploy ML/AI models end-to-end, from concept through production, including data pipelines, training workflows, and optimization.
  • Maintain and evolve AI systems in production, monitoring for drift, debugging issues, and driving ongoing improvements to reliability and scalability.
  • Partner closely with product, engineering, and data teams to align AI work with broader product and business goals.

Robots & Pencils is an applied AI engineering firm that designs and ships AI co-workers integrating into operations and delivering results for clients. Founded in 2009, they have delivery centers in Canada, the United States, Eastern Europe, and Latin America, with teams averaging 15+ years of experience.

US

  • Design, deploy, and maintain scalable ML infrastructure supporting model training, batch inference, and real-time inference workloads.

National Debt Relief was founded in 2009 with the goal of helping consumers deal with overwhelming debt. They are one of the most-trusted and best-rated consumer debt relief providers in the United States, having helped over 450,000 people settle over $10 billion of debt.

$81,112–$92,025/yr
Europe

  • Empower ML Engineers with the tools, infrastructure, and frameworks they need to iterate quickly and autonomously.
  • Accelerate time-to-market for production-ready ML products with seamless integration and access to data and resources.
  • Own ML CI/CD in close collaboration with the DevExp team, adapting existing frameworks to ML-specific needs.

Dailymotion is a video platform designed to broaden users' horizons with a unique algorithm. They foster inclusivity and aim to build a better and safer Internet with cutting-edge solutions for video hosting and advertising. With 400 employees in France, New York, and Singapore, Dailymotion is shaking up the global video platform ecosystem.

Global

  • Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
  • Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
  • Optimize serverless container workflows for AI workloads, ensuring performance, scalability, and seamless autoscaling.

Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They have 550+ professionals globally and power everything from real-time communication and streaming to enterprise AI and secure web applications.