Implement features for AI applications such as conversational assistants, copilots, text generation, summarization, and content classification.
Design and optimize prompts and system instructions to improve task completion, reliability, and latency; minimize hallucinations and toxic or unsafe outputs; and implement structured outputs.
Write unit, integration, and regression tests for AI features, run evaluation scripts and log results for model quality metrics, and work with AI observability tools under guidance.
Design, refine, and evaluate prompts, context, and system instructions for various product use cases
Conduct experiments to assess model behavior, accuracy, and cost impact with new or existing prompts
Continuously improve prompt engineering processes by adopting new techniques and technologies
Applied Systems transforms the insurance industry. They have 40+ years of experience and are building a team ready to learn and deliver innovative software and services.
Guide customers through their entire product journey.
Build custom demos and prototypes that showcase how AssemblyAI's models can solve specific customer use cases.
Serve as the voice of the customer to our Product and Research teams.
AssemblyAI builds best-in-class Speech AI models that power the next generation of voice applications. They are a remote team building one of the next great AI companies, composed of startup veterans and experienced AI researchers.
Work with other engineers on a wide variety of AI engineering tasks to improve our existing applied AI systems
Identify new opportunities to apply emerging AI capabilities to different parts of the Poe product
Take end-to-end ownership of applied AI systems, from prototyping, data pipelines, and model optimization/evaluation to reliable deployment at scale
Quora's mission is to grow the world's collective intelligence. They have two platforms: Quora, a global knowledge sharing platform, and Poe, a platform to chat, explore and build with AI language models. They have a culture rooted in transparency, idea-sharing, and experimentation.
Design, build, and deploy AI Agents including custom tools, prompt engineering, orchestration workflows, and agent design patterns.
Contribute to the backend infrastructure powering Candidly's AI capabilities, including API development, data integrations, and data pipelines.
Work closely with stakeholders across product, design, engineering, and leadership to translate complex AI concepts into actionable strategies and features.
Candidly, founded in 2016, is the category leader with the market’s most comprehensive AI-driven student debt and savings optimization platform. They partner with hundreds of top employers, financial institutions, and retirement record keepers, positioning Candidly to serve more than 35 million Americans. Candidly is a high-growth, Series B startup, funded by leading investors with an international team of 70 (and counting).
Build and maintain an internal LLM gateway that handles routing, fallbacks, and rate limiting
Create reusable components for common AI patterns (RAG, function calling, streaming responses)
Develop SDKs or libraries that simplify AI integration for application developers
ButterflyMX empowers people to open and manage doors and gates from a smartphone, and their products are installed in multifamily, commercial, and gated communities. As a distributed workforce, they're looking for intelligent, collaborative, and down-to-earth individuals to join their growing team.
Design and deliver AI-powered advisors, assistants, and analytic agents.
Build and maintain high-quality, production-ready Python services.
Apply, adapt, and fine-tune foundation models to deliver reliable AI experiences.
Energage helps organizations turn employee feedback into useful business intelligence and credible employer recognition through Top Workplaces. Built on culture research and the results from 23 million employees surveyed across more than 70,000 organizations, Energage delivers the most accurate competitive benchmark available.
Build AI-Powered GTM Tools: Use data and AI capabilities to design internal tools, such as AI-assisted lead scoring and intelligent churn prediction.
Build High-Impact Customer Journey Experiences: Develop self-serve interactive product demos and ROI calculators.
Own and Instrument Our GTM Data Stack: Ensure data is clean, connected, and actionable across product, website, and GTM systems like Salesforce and HubSpot.
Gather AI is pioneering a new era of warehouse intelligence with its vision-powered platform that uses autonomous drones and existing equipment to capture real-time data, digitizing manual workflows. They're leading the charge in the rapidly evolving robotics industry, reshaping the global supply chain.
Design, optimize, and version prompts for production voice and chat LLM applications.
Architect and orchestrate multi-agent systems for complex conversations.
Build automated testing and validation frameworks for LLM outputs.
Tuotempo transforms healthcare experiences through intelligent digital solutions and is a trusted patient engagement platform powering some of Europe and Latin America's leading healthcare institutions. They have a remote-first culture with vibrant hubs in Bologna and Barcelona.
Research, Document, Test, and Ideate: Explore the best ways to achieve our customers’ goals using LLMs and other AI tools.
Master Our Dialogue Platform: Become an expert, answer questions, and train others on prompting both within and outside of our platform.
Train Our AIs: Utilize prompting, knowledge-base creation, and fine-tuning to enhance our AI capabilities.
1mind is a platform that deploys multimodal Superhumans for revenue teams. These Superhumans combine a face, a voice, and a GTM brain — equipped with deep technical and product knowledge. They seem to have a remote-first, fast-moving culture with ownership, autonomy, and impact from day one.
Integrate AI platform APIs across our product line - both internally and externally
Develop and refine LLM prompt chains and agents to meet the customer’s needs and expectations
Create eval systems to fine-tune our agents to better suit the needs of our customers
Hone is revolutionizing the way companies develop and support their managers and teams with its AI-powered people development platform. They are funded by leading VCs and have raised over $50M to support their mission, with a remote-first and fully-distributed organization.
Provide technical leadership on a new team prototyping and experimenting with new AI features
Productionize and ship AI integrations into Modern Health’s core product
Collaborate with cross-functional teams to deliver product features on time
Modern Health is a mental health benefits platform for employers, offering access to various resources for emotional, professional, social, financial, and physical well-being. They are the fastest entirely female-founded company in the U.S. to reach Unicorn status, with an "It Takes a Village" culture centered around high empathy and accountability.
Lead domain-specific model optimization using PEFT (LoRA/QLoRA) and knowledge distillation to balance cost, latency, and reasoning capability.
Build next-gen Retrieval-Augmented Generation pipelines using hybrid search, cross-encoders, and self-correcting retrieval loops.
Design and deploy multi-agent systems using frameworks like LangGraph or CrewAI, enabling autonomous task planning and tool-use (Function Calling).
ServiceNow is a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Their intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work.
Design and implement AI-powered features end to end, including prompts, agents, tools, retrieval, evaluation, and feedback loops.
Build agent systems that interact safely with infrastructure, codebases, and deployment pipelines.
Integrate LLMs deeply into product workflows as core platform primitives.
SuperPlane is an AI-native DevOps control plane with a mission to build the platform teams use to ship and manage software in the AI era. They are a fast-moving company aiming high, rethinking DevOps from first principles for the AI era to create a single control layer for engineers and agents to collaborate safely.
TLDR is the largest network of tech newsletters in the world, with over 7M subscribers, covering topics from startups to AI. Their 24-person full-time team includes alumni of top media brands, and they doubled revenue from 2024 to 2025.
Design agentic systems & ship AI to production: Turn prototypes into resilient, observable services with clear SLAs, rollback/fallback strategies, and cost/latency budgets.
Build tool‑using LLM “agents” (task planning, function/tool calling, multi‑step workflows, guardrails) for tasks like grant discovery, application drafting, and research assistance.
Own RAG end-to-end: Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re-ranking, citations, and grounding.
Instrumentl is a hyper-growth YC-backed startup that provides a SaaS platform to help nonprofits discover, track, and manage grants efficiently. They have over 4,000 nonprofit clients and are cash flow positive, doubling year-over-year, with customers who love them.
Own the ideation and execution of high-impact projects that directly influence the user experience and business outcomes
Flesh out and evolve the Conversational AI Engineering roadmap alongside product and technical leadership
Incept projects: identify opportunities, design solutions, and lead implementation
Trellis is rewriting the insurance experience from the inside out. With powerful tools and a customer-first mindset, they're making insurance shopping refreshingly effortless, and they are a profitable, fast-growing Series A startup.
Design, develop, and test AI agents to support business objectives and improve operational outcomes.
Integrate agents with enterprise data sources, APIs, and workflows to ensure seamless functionality.
Translate evolving AI capabilities into actionable business and sales use cases.
Highstreet is developing next-generation agentic AI solutions that empower public sector and education (SLED) clients to achieve real-world business outcomes. The company seems to have a modern, flexible workplace culture built for collaboration and growth.
Designing complex, dynamic prompt templates with conditional logic.
Implementing various response schemes to ensure AI outputs are predictable.
Building robust evaluation pipelines and using Langfuse to collect feedback.
Ruby Labs is a leading tech company that creates and operates innovative consumer products. They offer a diverse range of opportunities across the health, education, and entertainment industries, and their teams are driving the future of consumer-led products.
You will be utilizing existing Large Language Models to build applied AI applications focused on producing high accuracy rates.
You will work with product and engineering teams to build models and services that can ingest data, extract key information, and surface insights.
You can build tooling to support model training, evaluation, inference serving, monitoring and alerting.
Vanilla is an AI-powered estate advisory platform, built by advisors, planners, and attorneys to transform how wealth is transferred across generations. Their team is distributed across the U.S., with a mix of fully remote and hybrid roles, and embraces flexibility while staying closely connected.