Partner with full-stack and backend engineers on the features they are shipping, write tests that prove it works, and flag gaps early.
Help build and run evaluation pipelines for non-deterministic LLM outputs, prompt regression, model drift detection, and output quality scoring across the LiteLLM routing layer.
Test the Nango-based integration layer across connectors and the file ingestion pipeline including encryption, formatting edge cases, and audit trail continuity.
Peach Pilot transforms how businesses run with a platform that ingests everything about how a company operates and constructs a Company Brain. It is a funded early-stage AI startup headquartered in Atlanta, Georgia, with a working platform on live infrastructure.
Build agentic AI systems that change how Dataiku runs internally.
Turn real problems into working software.
See your solutions through from first conversation to production.
Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value.
Write, iterate, and maintain system prompts and instruction sets for Noodle’s AI agents across the student journey.
Build and maintain evaluation frameworks to measure agent accuracy, tone, hallucination rate, task completion, and alignment with rubric-based learning objectives.
Partner with Noodle teammates and university stakeholders to design, build, and test agents — translating learning objectives, operational flows, rubric assessments, and more into prompt-level agent instructions.
Noodle is higher education’s leading strategy, services, and technology partner that develops infrastructure, provides life-changing learning experiences, and grows the awareness of and the enrollment in some of the best academic institutions in the world. They empower universities to change the world by offering university partners various products and services.
Evaluate and refine AI prototypes built by business units to enhance commercial ROI, security, and architecture.
Refactor high-value internal AI prototypes into secure, scalable, enterprise-grade applications.
Build and maintain secure LLM integrations with internal systems like data lakes and Salesforce, ensuring full-lifecycle maintenance of applications.
Impiricus is an AI-powered HCP Engagement Engine that ethically connects healthcare professionals to pharmaceutical resources to reduce go-to-market costs and accelerate patient access to treatments. It is a fast-growing company with a unique network of HCPs and advisors, fostering a collaborative and impactful culture where employees can work flexibly.
Design and deliver production AI and agentic systems across document intelligence, workflow automation, and copilots.
Define architecture decisions for LLM-based systems, including retrieval, tool use, orchestration, memory, and evaluation.
Own evals and observability for production AI and manage cost and latency at production volume.
Maxwell is a mortgage technology and fulfillment company with a mission to make lending simpler, faster, and more accessible. They power hundreds of lending institutions with their mortgage Point of Sale and related capabilities and are a remote-first team that takes craft seriously.
Develop AI systems that automate dispute and chargeback handling using structured evidence and business logic, creating a better experience for our customers.
Build models that automate refunds, getting money back to our customers faster.
Build and maintain evidence extraction pipelines that process unstructured data using LLM-powered workflows to produce structured, actionable outputs.
Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. They are a remote-first company with competitive benefits and focus on an inclusive interview experience.
Design, build, and maintain AI-powered workflows and internal tools that support teams.
Develop and manage LLM-based solutions tailored to specific operational needs.
Collaborate with leads, analysts, and managers to translate business problems into AI solutions.
Chime is a financial technology company that aims to make banking services helpful, easy, and free. They empower members to take control of their finances through user-friendly tools. Chime is a team of problem solvers, dreamers, and builders passionate about helping millions unlock their financial potential.
Shape technical direction and architecture: Define the foundational architecture for enterprise agentic AI at Benchling.
Build and ship the early portfolio yourself: Write production code at least half your time, particularly during the team's first year.
Design for enterprise from day one: Build for multi-tenant isolation, secrets management, audit logging, payload encryption, role-based access controls, and human-in-the-loop controls calibrated to risk.
Benchling is the AI platform for biotech R&D. Scientists use Benchling to design experiments, capture structured data, and run AI agents and models directly in their workflows. They have over 200,000 scientists around the world, from academic labs to Sanofi and Moderna.
Be the AI engineering technical authority and set the technical standard for how AI is used in code generation, review, testing, and task automation.
Drive the architecture and implementation of automated PR Review, Android and API Test Automation and AI Agent Swarms.
Evangelize by shipping, not by presenting, measuring everything and reporting to the CTO monthly with quantified value delivered.
SpotOn provides independent restaurants with tools to compete and win, including point-of-sale systems and AI-powered profit tools. They are known for their innovative software and technology solutions and are a Great Places to Work recipient.
Pick up live work across data ingestion, knowledge graph integration, and the application layer.
Contribute to the front-end and runtime layer that surfaces AI agent activity, recommendations, and human-in-the-loop governance to client users.
Move freely between Python backend, TypeScript frontend, and infrastructure work as the build demands.
Peach Pilot builds a platform that ingests everything about how a company operates and constructs a Company Brain: a living knowledge graph that connects people, decisions, and outcomes across the entire organization. They are co-founded by Mario Montag and JP James and have a working platform with live infrastructure and a proven data-to-insights methodology.
Translate product vision into production-ready code, working closely with Product Managers to turn business goals into actionable plans.
Help drive the transition to a self-service model, ensuring infrastructure remains performant across new global regions as the team and technology scales.
Take end-to-end ownership of service health, including participating in design docs and implementing robust monitoring/alerting.
Addepar is a global data and AI platform that empowers investment professionals to turn complex financial information into actionable intelligence. More than 1,400 firms manage and advise on nearly $9 trillion in assets, and we strive to promote a welcoming environment, and inclusion and belonging are held as a shared responsibility.
Design and build Claude skills, MCP integrations, and automated pipelines that transform internal knowledge into publication-ready docs with minimal manual intervention.
Act as the final reviewer for content produced by AI-assisted workflows and engineers, maintaining a high bar for technical accuracy and polish.
Define content structures and metadata standards that ensure our documentation is agent-consumable and machine-parseable.
Upsun, formerly Platform.sh, is the cloud application platform humans and robots love. They give developers, DevOps engineers, and platform teams the ability to build, ship, and scale confidently without wrestling with backend infrastructure.
Design, build, and ship LLM-powered features and agentic workflows for Gametime users.
Build and maintain evaluation frameworks and prompt testing pipelines for AI-powered experiences.
Contribute to orchestration layer, including agent routing, tool use, and multi-step workflow coordination.
Gametime helps people connect through shared live experiences. They operate platforms on iOS, Android, mobile web, and desktop, supporting over 60,000 events across the US and Canada, fostering a collaborative and inclusive environment where diverse perspectives are valued.
Talk to people, then build things, working directly with business and engineering teams to understand what's slowing them down.
Own the whole thing by prototyping, hardening, deploying, and monitoring internal tools that need to work reliably.
Write code other people can maintain building clean systems and establishing practical patterns for secure AI usage.
Promenade empowers local businesses with products and services that allow them to thrive online and offline. They build vertically-focused software catered to each industry, leveling the playing field between small businesses and large aggregators; backed by industry investors.
Design and ship agentic systems and multi-step LLM workflows using Claude, OpenAI, or equivalent - including tool use, memory, structured output extraction, and failure handling.
Build and maintain MCP integrations connecting internal tools, portco systems, and external data sources into reliable, observable pipelines.
Write production-grade Python for data pipelines, integration scripts, and scheduled jobs running via BullMQ-backed queues on the Node/TypeScript stack.
Emergence is a PE holdco backed by the Pritzker Organization focused on acquiring and scaling B2B SaaS businesses. It combines operational rigor with a growth equity mindset to drive ARR growth and profitability across its portfolio.
Design, build, and ship agentic workflows across multiple domains.
Build multi-step agents capable of autonomous planning, context tracking, memory, tool use, and API orchestration.
Drive technical and architectural decisions to meet product requirements while also anticipating and designing for future needs
Cority helps customers see and prevent risks across their operations in real time. Our EHS+ platform converges people, data, and AI agents to provide a clear view of information people can trust. For 40 years, Cority has been the market leader in EHS+, recognized by top analysts and trusted by more than 1,500 of the most complex organizations worldwide.
Collaborate with engineering and design to optimize prompt engineering frameworks for open-ended generative AI features.
Research customer interaction models from LLMs to downstream features.
Evaluate the evolving AI ecosystem, including the ChatGPT store and third-party LLM integrations.
Acorns is a financial wellness app that helps everyday people and families save and invest money for the long term. Since 2014, Acorns has grown into a global company with multiple life-stage products serving the needs of kids, teens, adults, and parents.
Design and build AI systems in production that solve real business problems, end-to-end: from discovery to operation.
Work with product and operations to translate ambiguous problems into measurable and maintainable solutions.
Build data pipelines that feed models and product surfaces.
Skydropx is innovating logistics with a team of visionary people who want to grow and change the world. They are integrating AI into their logistics platform for LATAM, working at a multi-tenant scale with hundreds of thousands of shipments per month.
Design end-to-end AI integration architectures connecting LLM APIs, vector databases, and inference systems to existing backend infrastructure.
Build reusable ML infrastructure components like feature pipelines, model serving layers, and evaluation frameworks that multiple portfolio companies standardize on.
Establish AI system integration best practices and governance patterns that become repeatable playbooks across the holding company.
Emergence is a thematic holding company backed by the Pritzker Organization focused exclusively on acquiring and scaling category-defining software businesses. They invest in focused portfolios, specialized operating groups with deep domain expertise and proven playbooks.
Benchmark FP8 quantization across GPU families and ship a production config to achieve speedup.
Evaluate serving frameworks with speculative decoding to improve performance.
Build a fine-tuning pipeline to enable faster model training and deployment.
Fathom eliminates the needless overhead of meetings with an AI assistant that captures, summarizes, and organizes key moments. They are a small company that creates magical experiences through focused builders and values a supportive environment.