Advance the state of the art in agentic systems, including retrieval, grounding, memory, context, personalization, etc.
Foundational Model Research: Advance Workday’s proprietary capabilities in pre-training, post-training (RLHF, DPO), and domain-specific alignment for HR and Finance workflows.
Workday is a Fortune 500 company and a leading AI platform for managing people, money, and agents. Their culture is rooted in integrity, empathy, and shared enthusiasm.
Design, build, and ship LLM-powered features and agentic workflows for Gametime users.
Build and maintain evaluation frameworks and prompt testing pipelines for AI-powered experiences.
Contribute to orchestration layer, including agent routing, tool use, and multi-step workflow coordination.
Gametime helps people connect through shared live experiences. They operate platforms on iOS, Android, mobile web, and desktop, supporting over 60,000 events across the US and Canada, fostering a collaborative and inclusive environment where diverse perspectives are valued.
Evaluate and refine AI prototypes built by business units to enhance commercial ROI, security, and architecture.
Refactor high-value internal AI prototypes into secure, scalable, enterprise-grade applications.
Build and maintain secure LLM integrations with internal systems like data lakes and Salesforce, ensuring full-lifecycle maintenance of applications.
Impiricus is an AI-powered HCP Engagement Engine that ethically connects healthcare professionals to pharmaceutical resources to reduce go-to-market costs and accelerate patient access to treatments. It is a fast-growing company with a unique network of HCPs and advisors, fostering a collaborative and impactful culture where employees can work flexibly.
Develop AI systems that automate dispute and chargeback handling using structured evidence and business logic, creating a better experience for our customers.
Build models that automate refunds, getting money back to our customers faster.
Build and maintain evidence extraction pipelines that process unstructured data using LLM-powered workflows to produce structured, actionable outputs.
Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. They are a remote-first company with competitive benefits and focus on an inclusive interview experience.
Operate at the frontier of applied AI as a hands-on practitioner.
Design, implement, and deploy advanced predictive models that drive core business outcomes.
Analyze large-scale datasets to identify patterns, trends, and optimization opportunities.
Sovrn is a software and data business that helps open web businesses be and remain independent. Through software products and data solutions they help their customers understand their business better, operate more efficiently, and make & keep more money.
Fellows will use external infrastructure to work on an empirical project aligned with research priorities.
Projects aim to produce a public output, such as a paper submission.
Fellows receive mentorship and can access a shared workspace in Berkeley or London.
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. Their team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
Design, build, and maintain AI-powered workflows and internal tools that support teams.
Develop and manage LLM-based solutions tailored to specific operational needs.
Collaborate with leads, analysts, and managers to translate business problems into AI solutions.
Chime is a financial technology company that aims to make banking services helpful, easy, and free. They empower members to take control of their finances through user-friendly tools. Chime is a team of problem solvers, dreamers, and builders passionate about helping millions unlock their financial potential.
Define end-to-end architecture for AI/ML and Gen AI systems.
Serve as a strategic advisor to clients, leading solution design discussions.
Architect scalable solutions using cloud-native AI tools.
3Pillar Global provides a flexible work environment with a remote-first approach, offering opportunities for global teamwork and leveraging diverse resources. They focus on well-being, career growth, and diversity.
Own technical direction for high-impact AI products.
Work across teams to turn big ideas into shipped systems.
Help raise the bar for how we build, evaluate, and operate AI in production.
Rula is dedicated to treating the whole person, not just the symptoms, and aims to create a world where mental health is no longer stigmatized. They are a remote-first company that hires in most U.S. states and are passionate about making a positive impact on mental healthcare.
Design, develop, test, and deploy AI/ML models and applications including NLP pipelines, predictive models, recommendation engines, and intelligent automation workflows.
Build and integrate large language model (LLM) powered features using APIs such as OpenAI, Azure OpenAI, or Anthropic; implement retrieval-augmented generation (RAG) patterns and AI agent workflows.
Develop and maintain data pipelines that support model training, fine-tuning, evaluation, and real-time or batch inference.
ExtensisHR is a Professional Employer Organization (PEO) in the U.S. with client employees in all fifty states. They deliver personalized HR services for HR, employee benefits, payroll and taxes, employer risk, compliance, and employee management.
Design and develop data pipelines, scoring algorithms, and API infrastructure to power AI-driven matching and recommendation capabilities.
Build and maintain integrations between the matching engine and an existing program management platform.
Collaborate with SMEs to build, test, and refine user-configurable matching logic.
LMI is dedicated to accelerating government impact with innovation and speed, bringing commercial-grade platforms and mission-ready AI to federal agencies. Headquartered in Tysons, Virginia, they are committed to delivering impactful results that strengthen missions and drive lasting value.
Design and ship agentic systems and multi-step LLM workflows using Claude, OpenAI, or equivalent - including tool use, memory, structured output extraction, and failure handling.
Build and maintain MCP integrations connecting internal tools, portco systems, and external data sources into reliable, observable pipelines.
Write production-grade Python for data pipelines, integration scripts, and scheduled jobs running via BullMQ-backed queues on the Node/TypeScript stack.
Emergence is a PE holdco backed by the Pritzker Organization focused on acquiring and scaling B2B SaaS businesses. It combines operational rigor with a growth equity mindset to drive ARR growth and profitability across its portfolio.
Design, develop, and deploy AI/ML models to automate and improve internal workflow.
Build and maintain ML pipelines within an AWS cloud environment.
Integrate ML capabilities into existing Java and React application workflows.
Oddball aims to improve daily lives by delivering quality software to the federal space. With a team of experienced engineering, product, and UX professionals, we value learning, growth, and making a big impact in a rapidly growing company.
Diagnose business problems before building solutions, mapping workflows and confirming AI is the right intervention.
Own AI initiatives end-to-end, from stakeholder discovery and technical design through implementation and iteration.
Improve organizational flow by building solutions that reduce bottlenecks and increase throughput, measuring success using flow metrics.
GitLab is the intelligent orchestration platform for DevSecOps, enabling organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation. GitLab has more than 50 million registered users and a high-performance culture driven by its values and continuous knowledge exchange.
Design and ship production AI features end-to-end across LangGraph / LiteLLM / pgvector / Langfuse.
Drive technical architecture for the AI product line: LLM orchestration, evals, observability, latency / cost / reliability tradeoffs.
Own AI initiatives technically — from spec through production, including rollout and post-launch eval improvements.
airSlate is a global SaaS technology company that develops no-code workflow automation, electronic signature, and document management solutions. They have over hundreds of millions of users and more than one million customers worldwide, helping organizations of every size digitize processes, improve efficiency, and transform how they work.
Design and deliver production AI and agentic systems across document intelligence, workflow automation, and copilots.
Define architecture decisions for LLM-based systems, including retrieval, tool use, orchestration, memory, and evaluation.
Own evals and observability for production AI and manage cost and latency at production volume.
Maxwell is a mortgage technology and fulfillment company with a mission to make lending simpler, faster, and more accessible. They power hundreds of lending institutions with their mortgage Point of Sale and related capabilities and are a remote-first team that takes craft seriously.
Lead AI strategy and execution, integrating AI into core products and accelerating development workflows.
Architect and build complex, scalable systems and AI-driven features using technologies like Python, Java, AWS, and GCP.
Provide technical leadership, mentorship, and cross-functional collaboration to solve complex problems and drive successful outcomes.
Tebra provides an all-in-one EHR+ platform designed exclusively for independent healthcare practices to connect EHR software, billing, automation, telehealth, and marketing. The company serves over 42,000 private practices and fosters a culture of entrepreneurship, customer focus, simplicity, teamwork, and celebration.
Ship zero-to-one AI products end-to-end — from customer discovery and prototyping through production deployment and iteration
Build agentic AI systems — design and implement autonomous and semi-autonomous workflows using LLMs, tool-use, memory, and orchestration
Develop AI tools that improve efficiency across clinical operations, data extraction, manual workflows, and more
Natera is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health. The Natera team consists of statisticians, geneticists, doctors, scientists, business professionals, software engineers and other professionals from world-class institutions.
Pick up live work across data ingestion, knowledge graph integration, and the application layer.
Contribute to the front-end and runtime layer that surfaces AI agent activity, recommendations, and human-in-the-loop governance to client users.
Move freely between Python backend, TypeScript frontend, and infrastructure work as the build demands.
Peach Pilot builds a platform that ingests everything about how a company operates and constructs a Company Brain: a living knowledge graph that connects people, decisions, and outcomes across the entire organization. They are co-founded by Mario Montag and JP James and have a working platform with live infrastructure and a proven data-to-insights methodology.
Design and build AI systems in production that solve real business problems, end-to-end: from discovery to operation.
Work with product and operations to translate ambiguous problems into measurable and maintainable solutions.
Build data pipelines that feed models and product surfaces.
Skydropx is innovating logistics with a team of visionary people who want to grow and change the world. They are integrating AI into their logistics platform for LATAM, working at a multi-tenant scale with hundreds of thousands of shipments per month.