Source Job

Europe

  • Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
  • Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
  • Focus on packaging and integrating new ML models into the platform, using Python and common ML frameworks.

Python Kubernetes AI/ML Docker Helm

20 jobs similar to Software Engineer (Python, Kubernetes, AI/ML)

Jobs ranked by similarity.

Global

  • Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
  • Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
  • Optimize serverless container workflows for AI workloads, ensuring performance, scalability, and seamless autoscaling.

Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They have 550+ professionals globally and collaborate with technology partners such as Intel, NVIDIA, Dell, and Equinix.

Europe

  • Design, develop, and maintain high-quality software solutions using Python.
  • Contribute to the design and evolution of scalable and maintainable software architectures.
  • Deploy, operate, and monitor applications in cloud environments (AWS, Azure, or GCP).

Lynx Analytics works on real-world AI and advanced analytics solutions with measurable business impact. They have a collaborative culture that values real outcomes, offering high ownership and rapid learning opportunities.

EMEA

  • Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production
  • Design, build, and maintain high-performance, cost-efficient inference pipelines, making architectural decisions about scaling, reliability, and cost trade-offs
  • Proactively identify and resolve infrastructure bottlenecks, proposing and scoping improvements to iteration speed and production reliability

AssemblyAI builds best-in-class Speech AI models that power the next generation of voice applications. They are a remote team building one of the next great AI companies where teammates define and build their company culture.

Canada 3w PTO

  • Own model serving: Design, build, and maintain low-latency, highly-available serving stacks for in-house ML model serving and integrating with LLM serving partners.
  • Automate training pipelines: Orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices.
  • Optimize at scale: Profile and tune throughput, memory, and cost; introduce caching, sharding, batching, and GPU/CPU autoscaling where it pays off.

Cresta aims to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines AI and human intelligence to help contact centers discover customer insights and automate conversations.

Europe

  • Design and implement AI inference and model training cloud products optimized for Kubernetes.
  • Write clean, efficient, and maintainable Go code to power Kubernetes controllers, operators, and custom resources supporting AI workloads.
  • Build APIs, CLIs, and developer tools that simplify the deployment, lifecycle management, and monitoring of AI applications.

Gcore provides infrastructure and software solutions for AI, cloud, network, and security, powering everything from real-time communication and streaming to enterprise AI and secure web applications. With 210+ edge locations, 50+ cloud regions, and thousands of GPUs, Gcore has a global team of 550+ professionals.

US

  • Develop and maintain backend systems and services for generative AI and agentic workflows.
  • Integrate AI-driven capabilities across the Seismic platform, working with data scientists and AI engineers.
  • Monitor and optimize agentic workflows’ performance, ensuring low-latency query responses.

Seismic provides sales enablement solutions, leveraging AI to enhance sales and marketing organizations. They focus on improving productivity and sales outcomes through their AI engine, Seismic Aura, integrated into their enablement cloud.

North America 4w PTO

  • Partner with stakeholders to tackle technical problems at scale, building framework agnostic services.
  • Establish roadmap and architecture for Wealthsimple’s Machine Learning platform.
  • Build highly performant scalable systems, contributing to our ML platform on Kubernetes, Bedrock and Sagemaker.

Wealthsimple aims to provide financial freedom by making financial services transparent and low-cost. As the largest fintech company in Canada, with over 1,500 employees, they manage over $100 billion in assets and foster a collaborative and quality-focused culture.

Europe 5w PTO

  • Help build, implement, and deploy agentic platforms within client environments.
  • Focus on building and deploying scalable AI-driven solutions.
  • Work closely with client stakeholders, technical leaders, and delivery teams to build and roll out agentic capabilities.

Nearform is an independent team of data & AI experts, engineers, and designers who build intelligent digital solutions and capability at pace. With a team of 500 experts in 20+ countries, they are trusted by leading enterprises including Lululemon, Puma, and Starbucks.

$135,000–$175,000/yr
US

  • Architect and scale the core intelligence behind our platform.
  • Design, build, and optimize the pipelines and agent systems that drive live customer interactions.
  • Build real-time and batch pipelines for ingestion, training, and inference.

Raynmaker is building RaynBrain, an agentic AI platform for complex conversations grounded in machine learning, neuroscience, and forensic linguistics. They empower autonomous systems that interpret, adapt, and act in real time, turning raw leads into revenue without scripts or human handoffs. Raynmaker is a small team helping other small teams move faster and convert more leads.

US

  • Responsible for building clean, scalable, and reliable solutions that support data-driven and AI‑enabled environments.
  • Developing well‑tested Python applications and working within cloud-native ecosystems.
  • Collaborating effectively across engineering, data, and scientific teams, turning complex requirements into production-ready solutions.

Onebridge, a Marlabs Company, is a global AI and Data Analytics Consulting Firm that empowers organizations worldwide to drive better outcomes through data and technology. Since 2005, we have partnered with some of the largest healthcare, life sciences, financial services, and government entities across the globe.

US

  • Work with customers to develop requirements and scope for new AI/ML projects.
  • Develop computer vision and machine learning based solutions for inspection platforms.
  • Analyze large datasets to extract meaningful insights and drive business decisions.

Loram provides advanced insights into inspection data collected for customers worldwide. The company has a small, collaborative team managing the entire project lifecycle, offering employees an outsized impact on inspections and maintenance recommendations.

Global

  • Develop, train, fine-tune, and deploy machine learning models.
  • Work with Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) Systems, NLP, and computer vision.
  • Design and develop RESTful APIs for AI model deployment using Flask or FastAPI.

Jobgether is a platform that partners with companies to connect them with talent. They use an AI-powered matching process to ensure applications are reviewed quickly and fairly.

$137,000–$180,000/yr
US

  • Design, develop, and maintain high-performance, scalable, and secure backend services, primarily using Python and frameworks like FastAPI
  • Translate ambiguous business and technical requirements into concrete software designs and actionable tasks for cross-functional teams
  • Operate and maintain production applications at scale, ensuring high availability, performance, and reliability

SmartAsset is an online destination for consumer-focused financial information and advice, whose mission is helping people make smart financial decisions, reaching over an estimated 59 million people each month. Valued at over $1 billion, SmartAsset has earned recognition on the Inc. 5000 and Deloitte Technology Fast 500 lists.

$170,000–$240,000/yr
US Unlimited PTO

  • Own SentiLink’s real-time ML model monitoring domain.
  • Own our ML experimentation, model tracking, and versioning infrastructure.
  • Drive improvements to the model development process.

SentiLink provides identity and risk solutions for secure transactions. They are backed by investors like Craft Ventures and Andreessen Horowitz, recognized by Forbes Fintech 50, and have offices across the U.S. and India.

Latin America

  • 3+ years of coding experience with Python.
  • Advanced knowledge of AWS services including ML services (AWS SageMaker and AWS Step Functions).
  • Experience with ML monitoring and automation tools (MLflow, SagaMaker Pipelines).

Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion.

Europe

  • Design and build AI solutions for real-world business problems.
  • Own the end-to-end lifecycle of AI features.
  • Develop and optimize LLM-based applications.

Remote People is building the infrastructure to power borderless teams. Their technology handles global payroll, benefits, taxes, and compliance, enabling businesses to hire anyone anywhere compliantly at the push of a button. They are committed to building a global, diverse team representing different and varied backgrounds, perspectives, and experiences.

US

  • Architect and lead end-to-end ML/AI pipelines.
  • Design and own CI/CD pipelines for ML/AI workflows.
  • Build and maintain scalable ML/AI infrastructure on cloud platforms.

Equip is a virtual, evidence-based eating disorder treatment program. They aim to ensure everyone can access effective treatment, operating in all 50 states and partnering with major health insurance plans, recognized for its engaged culture and sustainable treatment.

$225,000–$315,000/yr
US 20w maternity 12w paternity

  • Architect and optimize distributed training and inference systems for large-scale AI models
  • Design and deliver customer-focused solutions that maximize performance and business value
  • Lead the transition of ML pipelines from POC to scalable production systems

The company offers an AI-centric cloud platform reshaping the landscape of artificial intelligence. They provide infrastructure, tools, and services for developers to service the explosive growth of the global AI industry, catering to Fortune 1000 companies, startups, and AI researchers.

US

  • Develop and optimize AI-driven solutions for our end-to-end platform.
  • Integrate AI capabilities into production systems to enhance functionality and performance.
  • Fine-tune and optimize existing solutions for specific product requirements, ensuring efficiency and reliability.

Smarter Technologies is an automation and insight platform for healthcare efficiency, providing an AI-powered revenue cycle management (RCM) platform. They combine proprietary agentic agents, human-in-the-loop AI agents, clinical ontology, and global financial and administrative services to empower healthcare organizations.

Global

  • Build repeatable systems for creating AI-powered digital products.
  • Leverage existing platforms and APIs to automate various processes.
  • Integrate AI capabilities with our existing tech stack.

SnappyCX builds and scales monetizable digital products using AI. The company values practical implementation and repeatable systems.