Design agentic systems & ship AI to production: Turn prototypes into resilient, observable services with clear SLAs, rollback/fallback strategies, and cost/latency budgets.
Build tool‑using LLM “agents” (task planning, function/tool calling, multi‑step workflows, guardrails) for tasks like grant discovery, application drafting, and research assistance.
Own RAG end‑to-end: Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re‑ranking, citations, and grounding.
Instrumentl is a hyper-growth YC-backed startup that provides a SaaS platform to help nonprofits discover, track, and manage grants efficiently. They have over 4,000 nonprofit clients and are cash flow positive, doubling year-over-year, with customers who love them.
Lead domain-specific model optimization using PEFT (LoRA/QLoRA) and knowledge distillation to balance cost, latency, and reasoning capability.
Build next-gen Retrieval-Augmented Generation pipelines using hybrid search, cross-encoders, and self-correcting retrieval loops.
Design and deploy multi-agent systems using frameworks like LangGraph or CrewAI, enabling autonomous task planning and tool-use (Function Calling).
ServiceNow is a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Their intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work.
Design, optimize, and version prompts for production voice and chat LLM applications.
Architect and orchestrate multi-agent systems for complex conversations.
Build automated testing and validation frameworks for LLM outputs.
Tuotempo transforms healthcare experiences through intelligent digital solutions and is a trusted patient engagement platform powering some of Europe and Latin America's leading healthcare institutions. They have a remote-first culture with vibrant hubs in Bologna or Barcelona.
Design and deliver AI-powered advisors, assistants, and analytic agents.
Build and maintain high-quality, production-ready Python services.
Apply, adapt, and fine-tune foundation models to deliver reliable AI experiences.
Energage helps organizations turn employee feedback into useful business intelligence and credible employer recognition through Top Workplaces. Built on culture research and the results from 23 million employees surveyed across more than 70,000 organizations, Energage delivers the most accurate competitive benchmark available.
Design, develop, and deploy agentic AI solutions for clients.
Build multi-agent systems and integrate models with enterprise systems.
Collaborate with clients and engineers to create scalable solutions.
AHEAD builds platforms for digital business, weaving together advances in cloud infrastructure, automation, analytics, and software delivery to help enterprises deliver on digital transformation. They prioritize creating a culture of belonging where all perspectives are valued and heard.
Build and maintain an internal LLM gateway that handles routing, fallbacks, and rate limiting
Create reusable components for common AI patterns (RAG, function calling, streaming responses)
Develop SDKs or libraries that simplify AI integration for application developers
ButterflyMX empowers people to open and manage doors & gates from a smartphone and their products are installed in multifamily, commercial, and gated communities. As a distributed workforce, they're looking for intelligent, collaborative, and down-to-earth individuals to join their growing team.
Integrate AI platform APIs across our product line - both internally and externally
Develop and refine LLM prompt chains and agents to meet the customer’s needs and expectations
Create eval systems to fine tune our agents to better suit the needs of our customers
Hone is revolutionizing the way companies develop and support their managers and teams with its AI-powered people development platform. They are funded by leading VCs and have raised over $50M to support their mission, with a remote-first and fully-distributed organization.
Design, build, and deploy AI Agents including custom tools, prompt engineering, orchestration workflows, and agent design patterns.
Contribute to the backend infrastructure powering Candidly's AI capabilities, including API development, data integrations, and data pipelines.
Work closely with stakeholders across product, design, engineering, and leadership to translate complex AI concepts into actionable strategies and features.
Candidly, founded in 2016, is the category leader with the market’s most comprehensive AI-driven student debt and savings optimization platform. They partner with hundreds of top employers, financial institutions, and retirement record keepers, positioning Candidly to serve more than 35 million Americans. Candidly is a high-growth, Series B startup, funded by leading investors with an international team of 70 (and counting).
Own complex, full-stack AI solutions end-to-end, from applied research to production deployment.
Set technical direction for ambiguous and high-impact use cases, while scaling the AI systems.
Mentor others, lead architectural decisions, and deepen Komodo’s AI-first culture.
Komodo Health is dedicated to reducing the global burden of disease by leveraging data. They have built the Healthcare Map, the industry’s largest view of the U.S. healthcare system. At Komodo, employees are ambitious, supportive, and passionate about delivering on its mission.
Building features related to our data pipelines, usage of LLMs, analytics APIs, etc.
Doing whatever is necessary to deliver value to our customers.
Creating notifications, alert emails, scheduled reports, connectors into third party systems, etc.
Scrunch helps marketing teams rethink how their products and services are discovered and surfaced on AI platforms like ChatGPT, Claude, Gemini. They have scaled rapidly since commercial launch and have more than 500 paying brands using the platform.
Design and implement automated testing frameworks for backend services and APIs.
Develop strategies and implement solutions for testing AI-driven workflows, including validating RAG outputs and agent behavior.
Develop integration, contract, and end-to-end tests for event-driven systems.
FirmPilot builds complex, AI-powered automation systems. Reliability and correctness are critical, especially where distributed systems and AI-driven workflows intersect. The company views testing and quality as engineering disciplines, not as a final checkpoint.
Design and deliver scalable AI systems that connect models, data, and products.
Turn research prototypes into secure, reliable, production-ready services.
Build pipelines and serving layers that power adaptive, real-time features.
KnowBe4 is a cybersecurity company that puts security first, offering an AI-driven Human Risk Management platform. They empower over 70,000 organizations worldwide to strengthen their security culture and transform their workforce into their strongest security asset.
Drive Prompt’s mission to improve healthcare through modern technology including AI
Lead AI projects from ideation → architecture → production → iteration until tools are widely adopted and loved!
Design, build, and deploy end-to-end AI systems across both traditional ML and LLM-based workflows
Prompt delivers highly automated and modern B2B enterprise software to rehab therapy businesses, their teams, and most importantly the patients they serve. They’ve established themselves as the go-to platform in the space and are rapidly growing their market share by delivering software people love.
Development and deployment of LLM-powered features, including summarization tools.
Build backend services in Python that integrate ML/LLM models with Fullscript’s platform.
Collaborate with medical and product teams to deliver AI features for practitioners and patients.
Fullscript is a health technology company committed to helping people get better by connecting practitioners to products and patients to care plans. They empower over 125,000 practitioners and 10 million patients through their comprehensive platform.
Work with other engineers on a wide variety of AI engineering tasks to improve our existing applied AI systems
Identify new opportunities to apply emerging AI capabilities to different parts of the Poe product
Take end-to-end ownership of applied AI systems - from prototyping, data pipelines, model optimization/evaluation to reliable deployment at scale
Quora's mission is to grow the world's collective intelligence. They have two platforms: Quora, a global knowledge sharing platform, and Poe, a platform to chat, explore and build with AI language models. They have a culture rooted in transparency, idea-sharing, and experimentation.
Designing and building AI-powered features and tooling used by customers and internal teams.
Owning fullstack solutions that include frontend, backend, and AI components.
Establishing patterns, guardrails, and examples that other engineers can safely build on.
FlowFuse is a company that builds real product features and internal tooling that apply artificial intelligence to practical user and engineering problems. They foster a remote, async-first environment across multiple time zones where pragmatic use of AI tools is encouraged to accelerate development and improve outcomes.
Design, build, and maintain backend services using modern dotnet.
Implement and evolve APIs (REST and GraphQL) for internal tools, AI agents, and client-facing features.
Build event-driven, cloud-native services on AWS.
FirmPilot builds AI-powered systems that automate and scale real-world business outcomes for law firms. We are a young, remote-first company focused on building durable, production-grade software.
Build cutting edge Generative AI models, using techniques like Supervised Finetuning (SFT), Reinforcement Learning (RL), prompt improvements and synthetic data generation
Collaborate closely with product managers and engineers to transform user feedback into requirements for AI systems.
Figma’s platform helps teams bring ideas to life—whether you're brainstorming, creating a prototype, translating designs into code, or iterating with AI.
Design, refine, and evaluate prompts, context, and system instructions for various product use cases
Conduct experiments to assess model behavior, accuracy, and cost impact with new or existing prompts
Continuously improve prompt engineering processes by adopting new techniques and technologies
Applied Systems transforms the insurance industry. They have 40+ years of experience and are building a team ready to learn and deliver innovative software and services.