Design and deploy high-performance agentic systems that leverage Fastino’s optimized model architectures.
Collaborate with engineering teams to turn novel architectural breakthroughs into scalable solutions for enterprise customers.
Drive rapid, iterative prototyping of AI functionalities, refining model performance and task-accuracy based on real-world telemetry.
Fastino is building the next generation of LLMs with a team of alumni from Google Research, Apple, Stanford, and Cambridge and has developed the GLiNER family of open source models. Fastino has raised $25M through seed round and is backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.
Design and implement knowledge distillation pipelines.
Distill large foundation models into smaller, faster, and cheaper models for inference.
Run and analyze large-scale training experiments to evaluate quality, latency, and cost tradeoffs.
They are a small, senior team with a strong research and engineering culture, offering high ownership and direct impact on product and roadmap. The company values being remote-friendly and an async-first environment.
Build and productionize LLM and NLP models across retrieval, summarization, classification, and generative tasks.
Design and implement scalable ML services and inference pipelines in Python using modern ML frameworks.
Translate complex NLP and LLM product requirements into structured engineering plans with clear milestones.
Loopio provides a workplace that recognizes the advantages of working flexibly, operating as a remote-first company. They have established hub regions around the world and foster a supportive culture with opportunities for connection.
Develop, implement, and validate machine learning and agentic flows.
Drive innovation in modeling approaches while balancing accuracy, efficiency, and interpretability.
Lead the design of benchmarking frameworks, evaluation tools, and metrics.
Federato is on a mission to defend the right to efficient, equitable insurance for all. They are AI-native platform that spans the full policy lifecycle. They move fast, are eager to listen to our users, take a first principles approach to solving problems, and value learning.
Conduct cutting-edge machine learning research, building and training large language models.
Focus on research projects aimed at expanding the frontier of knowledge in language modelling and associate areas such as evaluation, multimodal models, optimisation etc.
Disseminate your research results through the production of publications, datasets, and code.
Cohere is dedicated to scaling intelligence to serve humanity by training and deploying frontier models for developers and enterprises, building AI systems for content generation, semantic search, and more! They foster a culture of hard work, valuing diverse perspectives and contributions to model capabilities and customer value.
Own the LLM + retrieval + context layer that makes copilots accurate and fast.
Design and ship the end-to-end pipeline, improving quality and trust via evaluation.
Reduce cost/latency with a concrete inference optimization plan shipped to production.
Ethos is built to make it faster and easier to get life insurance. They blend industry expertise, technology, and the human touch to find the right policy to protect loved ones and have been named on CB Insights' Global Insurtech 50 list and BuiltIn's Top 100 Midsize Companies in San Francisco.
Research and develop Machine Learning models and optimize them for scaled production usage.
Work with colleagues to explore ongoing product issues and recommend innovative ML/AI based solutions.
Work with subject matter experts to curate and generate optimal datasets following responsible data collection and model maintenance practices.
Turnitin is a recognized innovator in the global education space, partnering with educational institutions to promote honesty, consistency, and fairness across all subject areas and assessment types. They are a global organization with team members in over 35 countries, offering a remote-first culture which empowers team members to work with purpose and accountability.
Design, develop, and deploy robust ML systems and multi-model AI agents that solve real-world retail challenges.
Lead the entire lifecycle, including prototyping, deployment, monitoring, and maintenance using modern CI/CD and containerisation practices.
Build high-performance data pipelines (ETL/ELT) for both training and real-time inference, ensuring our systems are scalable and reliable.
EDITED is the world’s leading AI-driven retail intelligence platform. They empower the world’s most successful brands and retailers with real-time decision making power. Their environment is dynamic and supportive, encouraging team members to take initiative, innovate, and continuously grow.
Develop prompts and guardrails for domain-specific LLM applications.
Implement hallucination detection, mitigation, and fact-checking mechanisms.
Robots & Pencils builds meaningful, scalable digital products by blending strategy, design, and engineering. They are a small, senior team with direct access to enterprise clients.
Perform in-depth analysis of healthcare data to independently design, develop, and deliver clinical ML models.
Build reliable and scalable production machine learning systems
Work cross-functionally across diverse stakeholders, including product managers, statisticians, clinicians, and clinical analysts.
Cohere Health's clinical intelligence platform delivers AI-powered solutions that streamline access to quality care by improving payer-provider collaboration, cost containment, and healthcare economics. Cohere Health works with over 660,000 providers and handles over 12 million prior authorization requests annually.
Design, build, and operate LLM-powered systems used in production.
Build scalable agentic AI automation solutions, selecting appropriate patterns based on business requirements.
Make system-level tradeoffs across model choice, latency, cost, accuracy, and operational complexity.
Natera is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing and diagnostics part of the standard of care. The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions.
Build the technical roadmap given a business requirement and own the delivery of the same.
Develop and optimize LLM-based solutions : Lead the design, training, fine-tuning, and deployment of large language models, leveraging techniques like prompt engineering, retrieval-augmented generation (RAG), and agent-based architectures.
Codebase ownership : Maintain high-quality, efficient code in Python (using frameworks like LangChain/LangGraph) and SQL, focusing on reusable components, scalability, and performance best practices.
Turing, based in San Francisco, is a research accelerator for frontier AI labs, partnering with global enterprises to deploy advanced AI systems. They accelerate research with data, talent, and training pipelines and build proprietary intelligence systems, recognized among the world's top innovators.
Work side by side with clients, PMs, and Architects to scope and deploy AI systems.
Build and integrate systems using LLMs, RAG pipelines, agent frameworks, vector databases and related tools.
Debug relentlessly and optimize for reliability in production, not just elegance in code.
Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Their system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.
Work in a small, cross-functional team of 3-4 people focused on AI/ML systems.
Take ownership of projects from ideation to deployment with a high degree of autonomy.
Collaborate with product managers and stakeholders to understand customer pain points and deliver impactful solutions.
TriumphPay is building the transportation payments network for the future. Their software touches a combined $37.1B in annualized freight volume. They foster an environment that provides exceptional customer service, entrepreneurial spirit, and building successful partnerships with their clients.
Own and evolve ML Ops architecture, including CI/CD for models.
Serve as a player-coach, contributing directly to design reviews.
AvaSure is revolutionizing healthcare with cutting-edge virtual care solutions that protect patients and empower clinical teams. We're proud of our collaborative culture where innovation thrives and every team member is valued.
Engineer logic for serializing Reddit’s complex conversational trees into optimal training contexts.
Reddit is a community-driven platform where users submit, vote, and comment on what interests them. With over 100,000 active communities and 116 million daily active users, they foster open conversations and shared interests.
Design, adapt, and optimize deep learning architectures for scientific domains and data modalities.
Own and deliver on complex ML projects, including experiment design, implementation, and evaluation.
Write clean, well-tested code in PyTorch and NumPy enabling a high experimentation rate.
Matterworks builds AI tools to extract insights from biological data and unlock opportunities in therapeutic discovery, development, and manufacturing. They are building large-scale deep learning models of biological data to predict the phenotype and behavior of biological systems.
Shape the future of AI-powered search across all OLX verticals.
Lead the design and evolution of OLX’s Search AI Platform, developing LLM- and GenAI-based systems.
Prototype, evaluate, and productionize new ML models, including embedding-based retrieval, personalization, and relevance optimization.
OLX is building a more sustainable world through trade, making it safe and convenient to buy and sell cars, find housing, get jobs, and buy and sell household goods. They serve millions of people around the world every month through consumer brands including OLX, Otodom, AutoTrader, Property24.
Build, maintain, and scale document ingestion + processing pipelines.
Integrate and productionize LLM-powered workflows.
Improve accuracy, reliability, and cost/performance of models and pipelines.
They are building the AI-native operating system for litigation. Their platform turns chaos into knowledge graphs to provide a lasting edge in high-stakes litigation.