Source Job

Canada

  • Design and develop backend systems using Python or Kotlin for the ML Feature Platform.
  • Build and maintain a self-serve platform for feature creation, exploration, and serving for machine learning and decisioning.
  • Own end-to-end flows including data storage, availability, backfilling infrastructure, and platform improvements.

Python Kotlin AWS MySQL Kubernetes

14 jobs similar to Software Engineer II, Machine Learning (Feature Platform)

Jobs ranked by similarity.

US

  • Break down larger projects into individual tasks and deliver them in multiple phases with collaboration.
  • Support the product development lifecycle by partnering with product management, design, and analytics on risks and trade-offs.
  • Support operations and availability by creating and monitoring metrics and participating in on-call efforts.

Affirm is reinventing credit to make it more honest and friendly, offering buy now, pay later solutions. The company has a remote-first culture and employs thousands, focusing on transparency and inclusivity.

UK

  • Build and maintain backend services, Python libraries, and model lifecycle tooling for internal ML teams.
  • Design and operate distributed systems for model serving, evaluation, and feature engineering.
  • Focus on developer experience and reliability to help teams train, deploy, and serve ML models safely.

Monzo is on a mission to make money work for everyone, offering personal and business bank accounts, savings, investments, and more through a modern digital banking platform. With around 600 engineers out of roughly 5,000 employees, we value flexibility, collaboration, and open source contributions.

US

  • Design and build a next-generation reliability platform for Affirm's production systems, blending distributed systems engineering with AI-assisted development.
  • Create AI agents and a centralized command center to assist with incident triage, root-cause analysis, and unified system health visualization.
  • Own projects end-to-end, from requirements to rollout, collaborating with partner teams to build powerful, simple solutions for developers.

Affirm is reinventing credit to make it more honest and friendly, offering consumers the flexibility to buy now and pay later without hidden fees. The company is a remote-first organization with a strong focus on people-first values and inclusive benefits.

Canada

  • Lead and manage a team of engineering managers and software engineers, supporting their growth and performance through regular feedback and coaching.
  • Create a long-term technical roadmap, establish OKRs, and drive continuous improvement in engineering processes.
  • Collaborate across teams to ensure technical sustainability, manage priorities, and build a strong engineering culture.

Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. They are a large remote-first financial technology company with a focus on innovation and people.

Brazil

  • Evolve and maintain our Kubeflow, Feast, and Spark-on-Kubernetes ML infrastructure.
  • Design tools and APIs empowering teams to transition from centralized bottlenecks to self-service excellence.
  • Collaborate with Data Science teams to apply software engineering best practices to ML workflows.

Wellhub revolutionizes workplace wellness by connecting employees to partners for fitness, mindfulness, therapy, nutrition, and sleep in one subscription. Headquartered in NYC with team members across the globe, we value wellbeing, collaboration, and different perspectives.

US

  • Build and lead a high-performance product engineering team focused on innovation, accountability, and reliability.
  • Develop scalable reliability, risk management, and operational governance capabilities for production systems.
  • Drive alignment across Platform Engineering, SRE, Infrastructure, and product teams to deliver long-term technical roadmap outcomes.

Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without hidden fees or compounding interest. It is a publicly traded, remote-first company with competitive benefits and a culture focused on innovation and people.

Canada

  • Design and operate core AI platform components for training, deploying, and serving ML models at scale.
  • Own model serving and inference workflows end-to-end, optimizing for reliability, latency, throughput, and cost.
  • Collaborate with product, infrastructure, and security teams to build scalable platform capabilities for AI-powered features.

Mozilla Corporation is the non-profit-backed technology company behind Firefox and Pocket, with over 225 million monthly users. A wholly-owned subsidiary of the Mozilla Foundation, the company is mission-driven, employee-owned, and focused on privacy and open standards.

United States Canada

  • Build and operate the real-time inference service for the risk decision engine with low latency and high availability.
  • Own model deployment infrastructure including CI/CD, shadow mode, and staged rollouts.
  • Build model observability and partner with Risk Data Science for production operation.

Mercury is a fintech company that provides banking services for startups via partner banks. The company is committed to creating a safe environment and values diversity, with a growing team focused on innovation.

Global Unlimited PTO

  • Build and scale high-throughput ingestion and trace-query systems for LangSmith, a purpose-built observability platform.
  • Set API, SDK, and CLI standards across Python, TypeScript, Go, and Java for consistent developer experiences.
  • Own integrations with AI frameworks and tools, ensuring LangSmith remains framework-agnostic and easy to adopt.

LangChain builds the foundation for agent engineering, helping developers go from prototypes to production-ready AI agents with platforms like LangSmith and open-source frameworks. With $125M raised from top VCs and 100M+ monthly open source downloads, the team is small but impactful, shaping how AI agents operate in the real world.

United States

  • Build and improve scalable, fault-tolerant, self-serve data infrastructure technologies to support ML and analytics workflows.
  • Own the Data Movement Platform for batch and stream data processing, and invest in building new infrastructure for Spark, Flink, and Airflow.
  • Collaborate with teammates on on-call responsibilities and monitoring/alerting to improve reliability, scalability, latency, and efficiency.

Reddit is a community of communities built on shared interests, passion, and trust, hosting the most open and authentic conversations on the internet. With over 100,000 active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet's largest sources of information.

US Unlimited PTO

  • Drive end-to-end ML development for customer-facing SaaS products, from pipelines to production deployment and monitoring.
  • Design evaluation strategies and A/B tests to prove ML features improve customer outcomes and business impact.
  • Influence product roadmap by communicating ML capabilities and trade-offs to cross-functional teams.

WorkWave provides field service and logistics software solutions that help businesses manage their operations and serve their customers. They are a global company with a remote-first culture, recognized as a Best Place to Work and named among the top software companies worldwide.

US Unlimited PTO 8w maternity 7w paternity

  • Build, deploy, and monitor ChowNow's backend applications using Python and cloud infrastructure.
  • Collaborate with cross-functional teams to design and deliver high-quality features with emphasis on observability, security, and scalability.
  • Drive continuous improvement in development processes, CI/CD, and system architecture to support a growing platform.

ChowNow is a restaurant technology platform that helps independent restaurants manage their digital dining experience, including online ordering, branded websites, and marketing. The company supports over 20,000 restaurants across North America and has been recognized as a 'Best Place to Work' with a strong focus on employee experience.

Taiwan

  • Design, build, and maintain scalable backend services and APIs for AI-powered products.
  • Collaborate with cross-functional teams including ML engineers and product managers to deliver end-to-end solutions.
  • Ensure reliability, performance, and scalability of production systems through proactive monitoring and optimization.

Cresta is an AI platform that transforms customer experience through conversational AI agents and real-time augmentation. The company has raised over $270 million from leading investors and is backed by a team of AI experts from Stanford and Google.

US Unlimited PTO

  • Own and scale AI compute and deployment platforms including Kubernetes and GitOps pipelines.
  • Build inference infrastructure and observability stacks for LLM-powered workflows.
  • Drive security, compliance, and governance at the systems level in a regulated healthcare environment.

Hims & Hers is a leading health and wellness platform focused on making healthcare accessible and personal. As a publicly traded company on the NYSE (HIMS), it offers flexible/remote work and a culture centered on innovation and employee well-being.