Build and implement RAG pipelines, including document chunking, embedding generation and retrieval logic.
Integrate and maintain LLM APIs such as Claude, including streaming responses, tool calling.
Develop agent orchestration logic to route user requests to appropriate specialist agents.
Craftsman+ is a global team building tools that help brands and teams produce world-class creative faster and at scale. Recognized by Inc. 5000 and Deloitte Fast 500, the company operates at the intersection of creativity and technology, delivering creative services for iconic brands.
Design, implement, and evolve RAG pipelines combining structured data, embeddings, and LLMs.
Develop and maintain prompt strategies used across multi-step agent workflows.
Integrate LLMs into production systems with attention to reliability, cost, and latency.
FirmPilot builds AI-powered systems that automate and scale real-world business outcomes. They focus on applied AI, using best-in-class large language models and tooling to deliver reliable, production-grade automation.
Rollstack is revolutionizing how businesses share data and insights by fully automating the creation of slide decks and documents. They are a remote-friendly workplace backed by Insight Partners and Y Combinator, with a diverse team that values intelligence and kindness.
Analyze requirements and propose innovative AI-native solutions to technical problems
Write clean scalable code
Test and deploy features & services
WorkHero is building the AI-powered back office for the skilled trades, starting with the $50B+ HVAC industry. They have exciting traction and just closed a $5M seed round to expand their engineering and product organization, as well as add additional services.
Implement features for AI applications such as conversational assistants and copilots and text generation, summarization, and content classification.
Design and optimize prompts and system instructions to improve task completion, reliability, and latency, minimize hallucinations and toxic/unsafe outputs and implement structured outputs.
Write unit, integration, and regression tests for AI features, run evaluation scripts and log results for model quality metrics, and work with AI observability tools under guidance.
RealPage is at the forefront of the Generative AI revolution, dedicated to shaping the future of artificial intelligence within the Property Tech domain. Our Agentic AI team is focused on driving innovation by building next generation AI applications and enhancing existing systems with Generative AI capabilities.
Design and implement automated testing frameworks for backend services and APIs.
Develop strategies and implement solutions for testing AI-driven workflows, including validating RAG outputs and agent behavior.
Develop integration, contract, and end-to-end tests for event-driven systems.
FirmPilot builds complex, AI-powered automation systems. Reliability and correctness are critical, especially where distributed systems and AI-driven workflows intersect. The company views testing and quality as engineering disciplines, not as a final checkpoint.
You will be utilizing existing Large Language Models to build applied AI applications focused on producing high accuracy rates.
You will work with product, and engineering teams and build models/services that can ingest data, extract key information and surface insights.
You can build tooling to support model training, evaluation, inference serving, monitoring and alerting.
Vanilla is an AI-powered estate advisory platform, built by advisors, planners, and attorneys to transform how wealth is transferred across generations. Our team is distributed across the U.S., with a mix of fully remote and hybrid roles, and embraces flexibility while staying closely connected.
Designing complex, dynamic prompt templates with conditional logic.
Implementing various response schemes to ensure AI outputs are predictable.
Building robust evaluation pipelines and using Langfuse to collect feedback.
Ruby Labs is a leading tech company that creates and operates innovative consumer products. We offer a diverse range of opportunities across the health, education, and entertainment industries, and our innovative teams are driving the future of consumer-led products.
Design, optimize, and version prompts for production voice and chat LLM applications.
Architect and orchestrate multi-agent systems for complex conversations.
Build automated testing and validation frameworks for LLM outputs.
Tuotempo transforms healthcare experiences through intelligent digital solutions and is a trusted patient engagement platform powering some of Europe and Latin America's leading healthcare institutions. They have a remote-first culture with vibrant hubs in Bologna or Barcelona.
Design and implement AI-powered features, integrating LLMs with existing products.
Improve AI systems through evaluations, guardrails, monitoring, and customer usage.
Collaborate with AI Platform engineers to shape foundational AI systems and tooling.
Vanta helps businesses earn and prove trust by empowering companies to practice better security. They have a kind and talented team of employees determined to make security easier for companies to manage and prove.
Design agentic systems & ship AI to production: Turn prototypes into resilient, observable services with clear SLAs, rollback/fallback strategies, and cost/latency budgets.
Build tool‑using LLM “agents” (task planning, function/tool calling, multi‑step workflows, guardrails) for tasks like grant discovery, application drafting, and research assistance.
Own RAG end‑to-end: Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re‑ranking, citations, and grounding.
Instrumentl is a hyper-growth YC-backed startup that provides a SaaS platform to help nonprofits discover, track, and manage grants efficiently. They have over 4,000 nonprofit clients and are cash flow positive, doubling year-over-year, with customers who love them.
Architect and implement AI-assisted workflow building capabilities.
Design systems that provide LLMs with the right context.
Develop evaluation benchmarks and automated testing for AI output quality.
N8n is the open workflow orchestration platform built for the new era of AI, giving technical teams the freedom of code with the speed of no-code. They have a diverse team of over 160, working across Europe and the US, and are connected by a shared builder spirit with their center of gravity in Berlin.
Design, build, and deploy AI Agents including custom tools, prompt engineering, orchestration workflows, and agent design patterns.
Contribute to the backend infrastructure powering Candidly's AI capabilities, including API development, data integrations, and data pipelines.
Work closely with stakeholders across product, design, engineering, and leadership to translate complex AI concepts into actionable strategies and features.
Candidly, founded in 2016, is the category leader with the market’s most comprehensive AI-driven student debt and savings optimization platform. They partner with hundreds of top employers, financial institutions, and retirement record keepers, positioning Candidly to serve more than 35 million Americans. Candidly is a high-growth, Series B startup, funded by leading investors with an international team of 70 (and counting).
Build and maintain an internal LLM gateway that handles routing, fallbacks, and rate limiting
Create reusable components for common AI patterns (RAG, function calling, streaming responses)
Develop SDKs or libraries that simplify AI integration for application developers
ButterflyMX empowers people to open and manage doors & gates from a smartphone and their products are installed in multifamily, commercial, and gated communities. As a distributed workforce, they're looking for intelligent, collaborative, and down-to-earth individuals to join their growing team.
Build new features powering millions of daily conversations on Replicant’s core AI voice and chat products, focusing on frontend services and building industry-leading user experience
Collaborate closely with product managers, designers, and ML engineers in an agile, iterative environment
Thoughtfully review code and architecture to maintain quality and mentor peers
Replicant is the industry leader in voice AI for customer service since it was founded in 2017. The company is backed by top tier Silicon Valley investors and growing rapidly with offices in the SF Bay Area, Toronto, and New York, and remote employees throughout other parts of the US and Canada.
Design, build, and implement AI-driven solutions aligned with business use cases.
Work with frameworks like LangChain and LLMs to enable data interpretation functionalities.
Develop systems for data ingestion, transformation, and contextual response generation using Python.
Jobgether is a platform that connects job seekers with companies. They use AI to match candidates with the right roles and aim to provide a fast, objective, and fair application review process.
Lead discovery sessions to understand customer system landscapes, requirements, and constraints.
Build, review, and debug code for AI-driven workflows, integrations, and agent orchestration.
Deploy and configure Tessera’s AI agents in complex enterprise environments.
Tessera Labs is redefining how enterprises adopt and operationalize Artificial Intelligence. Backed by Foundation Capital and led by a world-class founding team, they build multi-agent AI systems that automate complex business workflows across platforms.
Building features related to our data pipelines, usage of LLMs, analytics APIs, etc.
Doing whatever is necessary to deliver value to our customers.
Creating notifications, alert emails, scheduled reports, connectors into third party systems, etc.
Scrunch helps marketing teams rethink how their products and services are discovered and surfaced on AI platforms like ChatGPT, Claude, Gemini. They have scaled rapidly since commercial launch and have more than 500 paying brands using the platform.
Design and build LLM-powered features—from initial prototyping through production deployment
Architect and implement agentic AI systems, including tool use, multi-step reasoning, and orchestration patterns
Collaborate closely with Engineering, Product, Data, and Design to identify high-impact opportunities for AI integration
Practice Better is an all-in-one platform helping health and wellness practitioners run their businesses, care for their clients, and scale their impact. They are a remote-first team headquartered in Toronto, made up of curious, driven, and empathetic people building tools that help practitioners create sustainable, independent practices.