Build agent harnesses in Python using LangChain and LangGraph, including tool-calling and structured outputs.
Develop evaluation frameworks with golden datasets, LLM-as-judge, and regression suites, wiring them into CI.
Collaborate with data engineers on Snowflake-backed retrieval patterns like Cortex Analyst and Search Services.
Kobie is a loyalty solutions partner that helps brands build emotional connections with their consumers. Named a Top Workplace in the USA, they offer a flexible remote culture with a focus on collaboration and growth.
Write behavioral specs, architectural constraints, and feature requirements that agents implement against.
Build and maintain harness infrastructure including structural tests, linting rules, and CI gates.
Design validation systems where agents write the tests and you verify features work from the user's perspective.
Bolo.ai builds generative AI systems for the energy industry, making daily work faster, safer, and better for heavy industry workers. We have Fortune 500 contracts, production deployments, and growing enterprise demand, and we're scaling with a small, senior-leaning engineering team.
Design, build, and ship agentic workflows across multiple domains.
Build multi-step agents capable of autonomous planning, context tracking, memory, tool use, and API orchestration.
Drive technical and architectural decisions to meet product requirements while also anticipating and designing for future needs
Cority helps customers see and prevent risks across their operations in real time. Our EHS+ platform converges people, data, and AI agents to provide a clear view of information people can trust. For 40 years, Cority has been the market leader in EHS+, recognized by top analysts and trusted by more than 1,500 of the most complex organizations worldwide.
Own the agent layer of the platform, including architecture, prompts, tool surfaces, and multi-agent orchestration.
Drive translation and dependency-mapping accuracy across unfamiliar legacy paradigms.
Write production agent code daily, using subagents and multi-agent workflows as the normal way of working.
LTS applies frontier AI to modernize legacy systems in healthcare and government IT. It is a small, senior engineering team operating with high leverage and a culture of innovation and collaboration.
Design and implement guardrails for agentic AI systems, including tool access controls and step-level validation.
Build runtime security controls like interceptors, policy enforcement, and kill-switches for AI behavior.
Implement non-human identity access controls, observability, and threat modeling for AI-driven activity.
Backblaze is the object storage leader in the open cloud movement, offering cloud storage built to unlock budgets and unburden administrators. Founded in 2007, the company has over $100m in revenue and manages over three billion gigabytes of data for 500K+ customers across 175+ countries, with a culture of innovation and inclusion.
Design and build production-grade AI systems including RAG pipelines, agentic workflows, and LLM integrations.
Own backend services, APIs, and data pipelines end-to-end with Python as the primary language.
Translate ambiguous client requirements into scoped technical decisions and delivery plans.
Provectus is a consulting and professional services firm specializing in AI and ML solutions, serving as an AWS Premier Consulting Partner and strategic partner of Anthropic. They build custom, production-grade AI for global enterprises and operate with a remote-first culture emphasizing autonomy and ownership.
Architect and ship production-grade agentic AI applications including multi-agent orchestration, retrieval systems, and evaluation pipelines.
Design and build learner-facing AI experiences and operator tools end-to-end using React and TypeScript.
Own production reliability for AI systems including model failover, rate limiting, cost monitoring, and incident response.
Chegg Skills builds applications that help motivated career switchers transition into high-growth roles. The company serves thousands of learners and educators each year through a high-ownership engineering team rethinking modern education.
Design, build, and maintain production-grade AI systems and customer-facing AI features.
Develop agentic workflows using LLMs, retrieval systems, tools, APIs, and backend services.
Design and implement retrieval-augmented generation (RAG) systems, including ingestion pipelines, embeddings, semantic retrieval, and context assembly.
Givzey is a fast-growing and innovative technology company serving the nonprofit sector, on a mission to unlock more generosity through AI-powered donor engagement. In just three years, Givzey’s platform has already helped organizations raise $10M+ through autonomous engagement.
Build and ship specialized agents including parsers, extractors, and synthesizers for the Aedeon agent-native modernization platform.
Own the full delivery of assigned agents from prototype through deployment and post-release validation, practicing test-driven development.
Write clear Python, document agent contracts and decision logic, and promote a culture of release discipline and quality across the team.
Mactores is a trusted leader in providing modern data platform solutions, enabling businesses to accelerate value through automation with end-to-end data solutions that are automated, agile, and secure. Since 2008, they have collaborated with customers to strategize and navigate digital transformation via assessments, migration, or modernization, fostering a culture driven by 10 core leadership principles.
Build and ship features across the Champion platform, improving developer experience and maintaining agent infrastructure.
Support Applied AI engineers in building, deploying, and operating Champions, reviewing agent implementations and contributing to engineering guides.
Design and build Champion agents for business-unit use cases, including writing system prompts and configuring deployment setups.
Redzone is the #1 Connected Workforce Solution for manufacturers, improving plant efficiency and empowering front-line workers. The company combines strong leadership, manufacturing expertise, and an incredible technology team to create great products and customer outcomes.
Design and build a next-generation reliability platform for Affirm's production systems, blending distributed systems engineering with AI-assisted development.
Create AI agents and a centralized command center to assist with incident triage, root-cause analysis, and unified system health visualization.
Own projects end-to-end, from requirements to rollout, collaborating with partner teams to build powerful, simple solutions for developers.
Affirm is reinventing credit to make it more honest and friendly, offering consumers the flexibility to buy now and pay later without hidden fees. The company is a remote-first organization with a strong focus on people-first values and inclusive benefits.
Conduct offensive security research on agentic AI systems, identifying vulnerabilities like prompt injection and privilege escalation.
Build reusable security tooling and perform manual code reviews to strengthen product security across the SDL.
Represent Okta externally through research publications, conference talks, and mentor engineers on AI security.
Okta is The World's Identity Company, providing a neutral platform for secure access and identity management across any technology. With over 7,000 pre-built integrations and trusted by more than 19,300 organizations, Okta fosters a culture of innovation and inclusion with global teams across 20 offices.
Identify and prototype high-leverage AI opportunities across engineering, GTM, and operations to improve revenue, efficiency, and quality.
Partner with functional owners to build internal workflow automations, prompt systems, and AI-enabled tools using approved platforms like Claude Code and Anthropic API.
Document and share prototypes through demos and showcases, setting the standard for responsible, visible AI building.
Chainguard delivers hardened, secure builds of open source software for enterprises. Backed by leading investors, it serves Fortune 500 clients including OpenAI and Snap, and fosters a values-driven remote culture focused on security and innovation.
Design, build, and ship LLM-powered features and agentic workflows for Gametime users.
Build and maintain evaluation frameworks and prompt testing pipelines for AI-powered experiences.
Contribute to orchestration layer, including agent routing, tool use, and multi-step workflow coordination.
Gametime helps people connect through shared live experiences. They operate platforms on iOS, Android, mobile web, and desktop, supporting over 60,000 events across the US and Canada, fostering a collaborative and inclusive environment where diverse perspectives are valued.
Design and build scalable backend systems powering AI agents in real-time enterprise environments.
Develop agent orchestration frameworks and low-latency inference pipelines integrating LLMs and SLMs.
Build robust APIs and work with cross-functional teams to productionize agentic AI at scale.
Level AI is an AI-native platform that helps enterprises transform contact centers into engines of customer intelligence and operational efficiency. The company is a Series C startup backed by Battery Ventures and ENIAC, based in Mountain View, California, with a globally distributed team.
Orchestrate High-Velocity Workflows: Leverage advanced agentic coding tools (e.g., Cursor, multi-agent environments) to dramatically accelerate feature prototyping, code generation, and test coverage.
Own the Guardrails & Quality: Act as the ultimate reviewer and architect; define the specifications, establish repo-context guardrails, and review AI-accelerated output for hidden security risks, scale bottlenecks, and architectural alignment.
Build Scalable Application and Data Layers: Design, build, and maintain our data pipelines and application to service our hundreds of users.
EvolutionIQ provides technology to improve insurance claims handling. The company is experiencing massive growth and has been named a top workplace, prioritizing its team.
Build multi-agent AI systems and automation platforms for Marketing Operations at scale.
Design and implement LLM integrations, backend services, and agentic workflows.
Partner with cross-functional teams to identify high-impact automation problems and ship measurable solutions.
Grafana Labs is the company behind the open observability cloud, offering a fully managed platform with AI capabilities to help organizations monitor and optimize their systems. With over 1,600 team members across 40+ countries and backed by top investors, we maintain a 100% remote, collaborative culture rooted in open source and innovation.
Design, build, ship, and maintain core capabilities for North's Agents & Automations platform.
Build product and platform features for creating, running, debugging, evaluating, and improving agents and automations.
Own features end-to-end from design to launch, working across the full stack and collaborating with cross-functional teams.
Cohere is a security-first enterprise AI company that builds cutting-edge foundation models and end-to-end AI products for businesses. It is a global team of researchers, engineers, and designers with offices in Toronto, San Francisco, London, New York, and more, fostering a collaborative and innovative culture.
Design and develop data pipelines, scoring algorithms, and API infrastructure to power AI-driven matching and recommendation capabilities.
Build and maintain integrations between the matching engine and an existing program management platform.
Collaborate with SMEs to build, test, and refine user-configurable matching logic.
LMI is dedicated to accelerating government impact with innovation and speed, bringing commercial-grade platforms and mission-ready AI to federal agencies. Headquartered in Tysons, Virginia, they are committed to delivering impactful results that strengthen missions and drive lasting value.
Build and deploy AI agents and multi-agent systems on Azure using composable patterns.
Design orchestration patterns, tool integrations, and RAG pipelines for production.
Implement CI/CD, responsible AI controls, and production monitoring for AI solutions.
Beyondsoft is a leading mid-sized IT and consulting company that combines modern technologies and proven methodologies to tailor solutions. Our global team of diverse experts thrives on innovation and pushing the bounds of technology, with a presence spanning four continents and a customer-centric engagement model.