Write, iterate, and maintain system prompts and instruction sets for Noodle’s AI agents across the student journey.
Build and maintain evaluation frameworks to measure agent accuracy, tone, hallucination rate, task completion, and alignment with rubric-based learning objectives.
Partner with Noodle teammates and university stakeholders to design, build, and test agents — translating learning objectives, operational flows, rubric assessments, and more into prompt-level agent instructions.
Build agentic AI systems that change how Dataiku runs internally.
Turn real problems into working software.
See your solutions through from first conversation to production.
Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value.
Interact with generative AI models and project guidelines.
Create prompts to test model behavior across safety categories.
Document model breakability and effort level.
Welo Data provides AI services and specializes in data annotation. We foster a collaborative and innovative culture where employees contribute to cutting-edge AI safety evaluation.
Design pragmatic solutions for real problems, assessing each use case and selecting the right approach.
Rapid prototyping and iterative delivery, shipping functional prototypes within days and validating value with real users.
Build agentic AI systems where justified, designing and implementing multi-agent architectures and LLM-based tooling.
Zinier empowers frontline workers to achieve greater things. They are a remote-first, global team headquartered in Silicon Valley with a hybrid workforce across the United States, Canada, Europe, Latin America, Singapore, and Bangalore, India.
Interact with generative AI models using project-provided guidelines, safety taxonomies, and attack-vector guidance.
Create and evaluate prompts designed to test model behavior across safety-related categories.
Identify where model responses become unsafe, noncompliant, inconsistent, or otherwise problematic.
Welo Data is an AI services company that specializes in data annotation. They deliver multilingual content transformation services in translation, localization, and adaptation for over 250 languages with a growing network of over 400,000 in-country linguistic resources.
Consult clients during presales to assess AI readiness and translate visions into actionable requirements.
Architect multi-agent frameworks, design AI systems with defined roles, and implement learning & feedback loops.
Develop RAG pipelines, design custom models, and ensure governance, security, and cost-efficiency.
Sigma Software is seeking a Senior/Principal AI Engineer to join their Stellar AdTech Business Unit. They deliver innovative systems to global AdTech leaders and startups since 2008, with a strong AdTech competence center of 300+ employees.
Collaborate with engineering and design to optimize prompt engineering frameworks for open-ended generative AI features.
Research customer interaction models from LLMs to downstream features.
Evaluate the evolving AI ecosystem, including the ChatGPT store and third-party LLM integrations.
Acorns is a financial wellness app that helps everyday people and families save and invest money for the long term. Since 2014, Acorns has grown into a global company with multiple life-stage products serving the needs of kids, teens, adults, and parents.
Shape technical direction and architecture: Define the foundational architecture for enterprise agentic AI at Benchling.
Build and ship the early portfolio yourself: Write production code at least half your time, particularly during the team's first year.
Design for enterprise from day one: Build for multi-tenant isolation, secrets management, audit logging, payload encryption, role-based access controls, and human-in-the-loop controls calibrated to risk.
Benchling is the AI platform for biotech R&D. Scientists use Benchling to design experiments, capture structured data, and run AI agents and models directly in their workflows. They have over 200,000 scientists around the world, from academic labs to Sanofi and Moderna.
Continuously explore emerging shifts in AI interfaces, orchestration, agents, and autonomy through hands-on experimentation and ecosystem research.
Rapidly prototype, validate, and launch new AI-native product ideas with minimal support and high autonomy.
Use structured thinking, research, and experimentation to evaluate what n8n should invest in over the next 1–3 years.
N8n is the open workflow orchestration platform built for the new era of AI. They give technical teams the freedom of code with the speed of no-code, so they can automate faster, smarter, and without limits. Since their founding in 2019, they’ve grown into a diverse team of over 260 working across Europe and the US.
Design, build, train, evaluate and improve advanced machine learning and LLM-based systems for patient and provider-facing products.
Own problems end-to-end: scope the problem with clinicians and product partners, build datasets and evaluations, iterate on modeling.
Develop robust evaluation frameworks that give us confidence our models are safe, accurate, and improving over time.
Curai aims to transform healthcare delivery using artificial intelligence and clinical expertise to make care more affordable, accessible, and effective. They focus on improving health outcomes, expanding access to care, and setting new standards for trustworthy, patient-centered healthcare.
Design, build, and operate production-grade agentic AI systems embedded across ServiceNow's platform.
Build agents that leverage ServiceNow's data layer to make decisions with context no frontier model has on its own.
Own the guardrails: observability, human-in-the-loop controls, and compliance infrastructure that make autonomous systems safe to deploy at scale.
ServiceNow is the AI control tower for business reinvention. Our AI platform brings together any AI, any data, and any workflow— helping 85% of the Fortune 500® work smarter, faster, and better. We're building an AI-native culture where technology and talent are unstoppable together.
Partner with full-stack and backend engineers on the features they are shipping, write tests that prove it works, and flag gaps early.
Help build and run evaluation pipelines for non-deterministic LLM outputs, prompt regression, model drift detection, and output quality scoring across the LiteLLM routing layer.
Test the Nango-based integration layer across connectors and the file ingestion pipeline including encryption, formatting edge cases, and audit trail continuity.
Peach Pilot transforms how businesses run with a platform that ingests everything about how a company operates and constructs a Company Brain. It is a funded early-stage AI startup headquartered in Atlanta, Georgia, with a working platform on live infrastructure.
Improve prompts, model selection, and tool usage so the system gets more decisions right over time.
Reduce latency, token usage, and cost while preserving decision quality and operational reliability.
Design validation, retries, and human review paths for ambiguous, adversarial, incomplete, or conflicting inputs.
Risk Labs is the core team behind UMA and Across, building infrastructure that pushes crypto forward. They value ownership, curiosity, thoughtful risk-taking, and direct communication.
Design and ship agentic systems and multi-step LLM workflows using Claude, OpenAI, or equivalent - including tool use, memory, structured output extraction, and failure handling.
Build and maintain MCP integrations connecting internal tools, portco systems, and external data sources into reliable, observable pipelines.
Write production-grade Python for data pipelines, integration scripts, and scheduled jobs running via BullMQ-backed queues on the Node/TypeScript stack.
Emergence is a PE holdco backed by the Pritzker Organization focused on acquiring and scaling B2B SaaS businesses. It combines operational rigor with a growth equity mindset to drive ARR growth and profitability across its portfolio.
Design, build, and operate production-grade agentic AI systems embedded across ServiceNow's platform.
Build agents that leverage ServiceNow's data layer to make decisions with context no frontier model has on its own.
Own the guardrails: observability, human-in-the-loop controls, and compliance infrastructure that make autonomous systems safe to deploy at scale.
ServiceNow is an AI control tower for business reinvention. Their AI platform brings together any AI, any data, and any workflow— helping 85% of the Fortune 500® work smarter, faster, and better. We're building an AI-native culture where technology and talent are unstoppable together.
Quickly iterate and develop proofs of concept to explore integrating AI into data and marketing workflows.
Make key decisions about the choice of AI architecture and frameworks.
Build production data agents to seamlessly answer analytics and data science questions.
Hightouch is an Agentic Marketing Platform that provides a composable CDP. They enable marketing teams to analyze performance, brainstorm ideas, and generate creative quickly. The team is ambitious and impact-driven, with a focus on humility, kindness, and compassion.
Design, build, and ship agentic workflows across multiple domains.
Build multi-step agents capable of autonomous planning, context tracking, memory, tool use, and API orchestration.
Drive technical and architectural decisions to meet product requirements while also anticipating and designing for future needs
Cority helps customers see and prevent risks across their operations in real time. Our EHS+ platform converges people, data, and AI agents to provide a clear view of information people can trust. For 40 years, Cority has been the market leader in EHS+, recognized by top analysts and trusted by more than 1,500 of the most complex organizations worldwide.
Own the end-to-end systems that generate and process restaurant imagery and video at scale.
Build a style system that creates brand-appropriate outputs across restaurant types.
Go deep with models and prompting to push quality, consistency, and creative range.
Owner provides an AI-native system local business owners use to succeed, starting with restaurants, replacing multiple tools with one. Their team is in the low hundreds, and they attract top talent from companies like Shopify, HubSpot, and Stripe, scaling rapidly to keep pace with customer growth.
Work directly with business and technical stakeholders to identify high-value AI use cases and translate business problems into executable technical solutions.
Design and build enterprise-grade Claude enabled applications, agentic workflows, workflow copilots, knowledge assistants, and decision-support systems.
Help enterprise clients rationalize Claude licensing types, evaluate usage models, and design an overall licensing strategy aligned to adoption, governance, cost management, and business value.
Aimpoint Digital is a market-leading data, AI, analytics, and operations research advisory and solution engineering firm. They help organizations design, build, and operationalize enterprise-grade data and AI platforms, decision intelligence solutions, optimization systems, and production AI applications.
Pick up live work across data ingestion, knowledge graph integration, and the application layer.
Contribute to the front-end and runtime layer that surfaces AI agent activity, recommendations, and human-in-the-loop governance to client users.
Move freely between Python backend, TypeScript frontend, and infrastructure work as the build demands.
Peach Pilot builds a platform that ingests everything about how a company operates and constructs a Company Brain: a living knowledge graph that connects people, decisions, and outcomes across the entire organization. They are co-founded by Mario Montag and JP James and have a working platform with live infrastructure and a proven data-to-insights methodology.