Own the LLM + retrieval + context layer that makes copilots accurate and fast.
Design and ship the end-to-end pipeline, improving quality and trust via evaluation.
Reduce cost/latency with a concrete inference optimization plan shipped to production.
Ethos is built to make it faster and easier to get life insurance. They blend industry expertise, technology, and the human touch to find the right policy to protect loved ones and have been named on CB Insights' Global Insurtech 50 list and BuiltIn's Top 100 Midsize Companies in San Francisco.
Design, optimize, and version prompts for production voice and chat LLM applications.
Architect and orchestrate multi-agent systems for complex conversations.
Build automated testing and validation frameworks for LLM outputs.
Tuotempo transforms healthcare experiences through intelligent digital solutions and is a trusted patient engagement platform powering some of Europe and Latin America's leading healthcare institutions. They have a remote-first culture with vibrant hubs in Bologna and Barcelona.
Design and deliver AI-powered advisors, assistants, and analytic agents.
Build and maintain high-quality, production-ready Python services.
Apply, adapt, and fine-tune foundation models to deliver reliable AI experiences.
Energage helps organizations turn employee feedback into useful business intelligence and credible employer recognition through Top Workplaces. Built on culture research and the results from 23 million employees surveyed across more than 70,000 organizations, Energage delivers the most accurate competitive benchmark available.
Design agentic systems & ship AI to production: Turn prototypes into resilient, observable services with clear SLAs, rollback/fallback strategies, and cost/latency budgets.
Build tool‑using LLM “agents” (task planning, function/tool calling, multi‑step workflows, guardrails) for tasks like grant discovery, application drafting, and research assistance.
Own RAG end‑to-end: Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re‑ranking, citations, and grounding.
Instrumentl is a hyper-growth YC-backed startup that provides a SaaS platform to help nonprofits discover, track, and manage grants efficiently. They have over 4,000 nonprofit clients and are cash flow positive, doubling year-over-year, with customers who love them.
Build and maintain an internal LLM gateway that handles routing, fallbacks, and rate limiting
Create reusable components for common AI patterns (RAG, function calling, streaming responses)
Develop SDKs or libraries that simplify AI integration for application developers
ButterflyMX empowers people to open and manage doors and gates from a smartphone. Their products are installed in multifamily, commercial, and gated communities. As a distributed workforce, they're looking for intelligent, collaborative, and down-to-earth individuals to join their growing team.
Implement features for AI applications such as conversational assistants and copilots, as well as text generation, summarization, and content classification.
Design and optimize prompts and system instructions to improve task completion, reliability, and latency; minimize hallucinations and toxic/unsafe outputs; and implement structured outputs.
Write unit, integration, and regression tests for AI features, run evaluation scripts and log results for model quality metrics, and work with AI observability tools under guidance.
RealPage is at the forefront of the Generative AI revolution, dedicated to shaping the future of artificial intelligence within the Property Tech domain. Our Agentic AI team is focused on driving innovation by building next generation AI applications and enhancing existing systems with Generative AI capabilities.
Design, implement, and evolve RAG pipelines combining structured data, embeddings, and LLMs.
Develop and maintain prompt strategies used across multi-step agent workflows.
Integrate LLMs into production systems with attention to reliability, cost, and latency.
FirmPilot builds AI-powered systems that automate and scale real-world business outcomes. They focus on applied AI, using best-in-class large language models and tooling to deliver reliable, production-grade automation.
Design and implement AI-powered features end to end, including prompts, agents, tools, retrieval, evaluation, and feedback loops.
Build agent systems that interact safely with infrastructure, codebases, and deployment pipelines.
Integrate LLMs deeply into product workflows as core platform primitives.
SuperPlane is an AI-native DevOps control plane with a mission to build the platform teams use to ship and manage software in the AI era. They are a fast-moving company aiming high, rethinking DevOps from first principles for the AI era to create a single control layer for engineers and agents to collaborate safely.
Design, build, and deploy AI Agents including custom tools, prompt engineering, orchestration workflows, and agent design patterns.
Contribute to the backend infrastructure powering Candidly's AI capabilities, including API development, data integrations, and data pipelines.
Work closely with stakeholders across product, design, engineering, and leadership to translate complex AI concepts into actionable strategies and features.
Candidly, founded in 2016, is the category leader with the market’s most comprehensive AI-driven student debt and savings optimization platform. They partner with hundreds of top employers, financial institutions, and retirement record keepers, positioning Candidly to serve more than 35 million Americans. Candidly is a high-growth, Series B startup, funded by leading investors with an international team of 70 (and counting).
Work with other engineers on a wide variety of AI engineering tasks to improve our existing applied AI systems
Identify new opportunities to apply emerging AI capabilities to different parts of the Poe product
Take end-to-end ownership of applied AI systems, from prototyping, data pipelines, and model optimization/evaluation to reliable deployment at scale
Quora's mission is to grow the world's collective intelligence. They have two platforms: Quora, a global knowledge sharing platform, and Poe, a platform to chat, explore and build with AI language models. They have a culture rooted in transparency, idea-sharing, and experimentation.
Design and implement AI-powered features, integrating LLMs with existing products.
Improve AI systems through evaluations, guardrails, monitoring, and customer usage.
Collaborate with AI Platform engineers to shape foundational AI systems and tooling.
Vanta helps businesses earn and prove trust by empowering companies to practice better security. They have a kind and talented team of employees determined to make security easier for companies to manage and prove.
Build scalable backend services and internal APIs for the AI platform.
Integrate LLMs and retrieval into reliable, production-ready workflows.
Build knowledge ingestion pipelines for LLMs (documents, APIs, semi-structured data).
MaintainX is the world's leading Asset and Work Intelligence platform for industrial and frontline environments. It powers operational excellence for 13,000+ businesses. They recently completed a $150 million Series D round, at a valuation of $2.5 billion.
Design, build, and release AI products/features that solve real user problems
Design and implement streaming/batch data pipelines to support training and inference
Continuously monitor and improve the quality and performance of the systems
Insider One is a platform that integrates marketing and customer engagement tools, enabling teams to maximize their impact. With over 1,500 employees across 30+ offices, Insider One values innovation, collaboration, and social responsibility, fostering a fast-paced and agile environment.
Design, build, and scale enterprise-grade AI/ML systems that power internal workflows and external-facing AI/ML platforms.
Develop a production-ready Generative AI and MLOps platform with reusable components used to deploy multiple AI solutions across Natera’s business units.
Implement cloud-native infrastructure for large-scale model training and serving using Kubernetes, MLflow, Terraform, and AWS-native services
Natera is a global leader in cell-free DNA (cfDNA) testing. They are dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing and diagnostics part of the standard of care. The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions.
Design, refine, and evaluate prompts, context, and system instructions for various product use cases
Conduct experiments to assess model behavior, accuracy, and cost impact with new or existing prompts
Continuously improve prompt engineering processes by adopting new techniques and technologies
Applied Systems transforms the insurance industry. They have 40+ years of experience and are building a team ready to learn and deliver innovative software and services.
Architect and optimize how MagicSchool's AI agents reason, remember, and operate within complex educational workflows.
Design context management systems that determine what information our agents see and how they maintain state across multi-turn interactions.
Implement the technical foundation of how AI agents manage their "mental workspace" and ensure agentic capabilities remain accurate and focused.
MagicSchool is a generative AI platform for teachers. They are a fast-growing company serving over 7 million teachers, working towards real social impact and fostering a unique culture built on relationships, trust, communication, and collaboration.
Build AI-Powered Features: Design, develop, and deploy production-grade AI applications that solve real customer problems.
Architect Scalable Systems: Create robust backend architectures that support AI workloads, ensuring low latency and high reliability.
Drive AI Innovation: Implement and optimize agentic AI systems, RAG pipelines, and multi-agent workflows using modern LLM frameworks.
Procurify is the AI-enhanced procurement and AP automation platform for mid-market organizations. They help organizations take control of spend and save money. They are a remote-first company with a big heart and a strong ambition to modernize the way organizations manage business spend.
Integrate AI platform APIs across our product line - both internally and externally
Develop and refine LLM prompt chains and agents to meet the customer’s needs and expectations
Create eval systems to fine-tune our agents to better suit the needs of our customers
Hone is revolutionizing the way companies develop and support their managers and teams with its AI-powered people development platform. They are funded by leading VCs, have raised over $50M to support their mission, and operate as a remote-first, fully distributed organization.
TLDR is the largest network of tech newsletters in the world, with over 7M subscribers, covering topics from startups to AI. Their 24-person full-time team includes alumni of top media brands, and they doubled revenue from 2024 to 2025.
Develop and deploy LLM-powered features, including summarization tools.
Build backend services in Python that integrate ML/LLM models with Fullscript’s platform.
Collaborate with medical and product teams to deliver AI features for practitioners and patients.
Fullscript is a health technology company committed to helping people get better by connecting practitioners to products and patients to care plans. They empower over 125,000 practitioners and 10 million patients through their comprehensive platform.