Build backend and pipeline systems that turn models into real search experiences for 110M+ daily users, owning data flows, ranking and retrieval services, and low-latency model-serving APIs. Integrate models into production through robust interfaces and DAGs, enabling fast iteration and powering discovery across the internet’s largest community platform. Ensure pipelines and systems support high scale, low latency, and operational excellence.
Source Job
20 jobs similar to Staff Software Engineer, ML Search
Jobs ranked by similarity.
- Deliver on technical initiatives that have significant company-wide impact.
- Set technical direction for the broader ML and Search teams at Reddit, able to identify opportunities and influence strategy for all of Reddit.
- Mentor and grow Staff and Senior Staff engineers and create a strong healthy engineering culture.
Reddit is the place where people come together to have the most authentic and interesting conversations on the internet—Whether you’re into video games, world news, or skincare, there’s a community on Reddit that’s perfect for you. Reddit has over 100,000 active communities and approximately 116 million daily active unique visitors.
Build and deploy ML models serving 100M+ predictions per day to personalize user experiences at scale. Design ranking algorithms that balance relevance, diversity, and revenue. Partner with product, engineering, and analytics to launch high-impact personalization features.
Launch Potato is a profitable digital media company that reaches over 30M+ monthly visitors through brands such as FinanceBuzz, All About Cookies, and OnlyInYourState.
- Build and deploy ML models serving 100M+ predictions per day to personalize user experiences at scale.
- Design ranking algorithms that balance relevance, diversity, and revenue.
- Run statistically rigorous A/B tests to measure true business impact.
Launch Potato is a profitable digital media company that reaches over 30M+ monthly visitors through brands such as FinanceBuzz, All About Cookies, and OnlyInYourState.
- Design and implement robust search solutions that scale with our rapidly growing user base.
- Improve search relevance, accuracy, and speed to deliver the most relevant results to users.
- Build and enhance vector search capabilities to power next-generation search experiences.
ClickUp is building the first truly converged AI workspace, unifying tasks, docs, chat, calendar, and enterprise search.
- Own the end-to-end lifecycle of ML model deployment—from training artifacts to production inference services.
- Design, build, and maintain scalable inference pipelines using modern orchestration frameworks (e.g., Kubeflow, Airflow, Ray, MLflow).
- Implement and optimize model serving infrastructure for latency, throughput, and cost efficiency across GPU and CPU clusters.
MARA is building a modular platform that unifies IaaS, PaaS, and SaaS which will enable governments, enterprises, and AI innovators to deploy, scale, and govern workloads across data centers, edge environments, and sovereign clouds. They are redefining the future of sovereign, energy-aware AI infrastructure.
- Design and maintain data models that organize rich content into canonical structures optimized for product features, search, and retrieval.
- Build high-reliability ETLs and streaming pipelines to process usage events, analytics data, behavioral signals, and application logs.
- Develop data services that expose unified content to the application, such as metadata access APIs, indexing workflows, and retrieval-ready representations.
Udio's success hinges on hiring great people and creating an environment where we can be happy, feel challenged, and do our best work.
- Optimize ad performance using both mature models and emerging ML technologies.
- Build scalable infrastructure to support real-time ad decisioning across millions of requests per day.
- Collaborate with global teams across product, data, and engineering to launch high-impact ad features.
Launch Potato is a profitable digital media company that reaches over 30M+ monthly visitors through brands such as FinanceBuzz, All About Cookies, and OnlyInYourState.
- Architect scalable, low-latency backend systems and data pipelines while writing code as necessary to support the team
- Provide technical leadership in the design of scalable solutions and the establishment of best practices
- Contribute positively to the team's productivity and growth
StackAdapt is the leading technology company that empowers marketers to reach, engage, and convert audiences with precision. The most forward-thinking marketers choose StackAdapt to orchestrate high-impact campaigns across programmatic advertising and marketing channels. StackAdapt is a diverse and inclusive team of collaborative, hardworking individuals trying to make a dent in the universe.
- Build the core Machine Learning foundations that power Nova’s agentic experiences.
- Design and implement the underlying components that support rich, intelligent interactions in the Iterable platform.
- Develop generalized evaluation frameworks for LLM- and agent-based features.
Iterable is the leading AI-powered customer engagement platform that helps leading brands create dynamic, individualized experiences at scale.
Combine Software Engineering and Data Science disciplines to create production-ready Machine Learning models. Develop frameworks and platform to build, deploy, serve and monitor ML-based services. Contribute to vision and architecture to scale ML solutions at QuintoAndar's business.
We are Grupo QuintoAndar, the largest real estate ecosystem in Latin America, guided by a shared purpose of helping people love the place they live.
Shape the future of AI-powered search across all OLX verticals. Lead the design and evolution of OLX’s Search AI Platform, developing LLM- and GenAI-based systems that power discovery and relevance. Mentor and inspire data scientists and ML engineers across OLX’s global hubs, sharing best practices and shaping the future of applied ML at scale.
At OLX, we work together to build a more sustainable world through trade.
- Design, build, and optimize high-performance systems in Python supporting AI data pipelines and evaluation workflows
- Develop full-stack tooling and backend services for large-scale data annotation , validation, and quality control
- Improve reliability, performance, and safety across existing Python codebases
Alignerr connects top technical experts with leading AI labs to build, evaluate, and improve next-generation models. They work on real production systems and high-impact research workflows across data, tooling, and infrastructure.
- Design scalable, future-proof data platforms optimized for AI research workloads.
- Build efficient self-serve data processing pipelines leveraging GCP's advanced services.
- Implement guardrails for cost, quality, and performance.
AssemblyAI is at the forefront of Speech AI, creating powerful models for speech-to-text and speech understanding via an API. They're a remote team of startup veterans and AI researchers looking to build one of the next great AI companies.
- Design, build, and optimize high-performance systems in Python supporting AI data pipelines and evaluation workflows
- Develop full-stack tooling and backend services for large-scale data annotation , validation, and quality control
- Improve reliability, performance, and safety across existing Python codebases
Alignerr connects top technical experts with leading AI labs to build, evaluate, and improve next-generation models. They work on real production systems and high-impact research workflows across data, tooling, and infrastructure.
- Explore and preprocess raw, messy datasets and design data strategies.
- Prototype model ideas and translate prototypes into production.
- Collaborate with cross-functional teams to turn ideas into impactful features.
Hostinger is shaping the future of online success powered by AI and driven by people with over 4 million clients in 150 countries.
- Design scalable systems using modern cloud technology.
- Build and run experiments to power the growth of Coinbase’s retail products.
- Articulate a long term vision for maintaining and scaling our backend systems and the teams running them.
Coinbase's mission is to increase economic freedom in the world by building the emerging onchain platform.
- Design and build infrastructure that enables researchers to rapidly iterate on reward signals.
- Develop systems for automated quality assessment of rewards, including detection of reward hacks and other pathologies.
- Collaborate with researchers to translate science requirements into platform capabilities.
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems to be safe and beneficial for users and society.
- Shape the developer experience, from code creation to built artifact.
- Ensure developer tooling is smooth and easy to use.
- Engage with engineering org to understand pain points.
Reddit is a community of communities built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet.
Help rethink how people navigate the web and transition publishers into a world of hyper-relevance. Design and implement search strategies that combine traditional lexical search with semantic vector search to improve result quality. Measure and improve search metrics and fix ranking logic for a better user experience.
TensorOps is a specialized consultancy dedicated to helping organizations accelerate their adoption of Artificial Intelligence and Machine Learning.
- Architect and implement scalable AI platform services for LLMs and other AI models.
- Apply LLMs and AI technologies to build and enhance intelligent product features.
- Develop robust APIs and backend systems for seamless integration of AI-powered features.
ClickUp is creating the first truly converged AI workspace, unifying tasks, docs, chat, calendar, and enterprise search, all supercharged by context-driven AI.