Source Job

US Unlimited PTO

  • Direct the agent array on production workstreams, decompose problems for agent execution, and integrate output into shipped software.
  • Review agent-generated pull requests at volume and depth, identifying correctness, security, and accessibility defects missed by automated tests.
  • Author evaluation suites that make quality measurable, using eval-driven development as standard practice.

Software Engineering Code Review Systems Design FedRAMP

20 jobs similar to Software Engineer 4, AI-Native

Jobs ranked by similarity.

US

  • Write behavioral specs, architectural constraints, and feature requirements that agents implement against.
  • Build and maintain harness infrastructure including structural tests, linting rules, and CI gates.
  • Design validation systems where agents write the tests and you verify features work from the user's perspective.

Bolo.ai builds generative AI systems for the energy industry, making daily work faster, safer, and better for heavy industry workers. We have Fortune 500 contracts, production deployments, and growing enterprise demand, and we're scaling with a small, senior-leaning engineering team.

APAC

  • Design, build, and operate production-grade agentic AI systems embedded across ServiceNow's platform.
  • Build agents that leverage ServiceNow's data layer to make decisions with context no frontier model has on its own.
  • Own the guardrails: observability, human-in-the-loop controls, and compliance infrastructure that make autonomous systems safe to deploy at scale.

ServiceNow is an AI control tower for business reinvention. Their AI platform brings together any AI, any data, and any workflow, helping 85% of the Fortune 500® work smarter, faster, and better. They're building an AI-native culture where technology and talent are unstoppable together.

US 16w maternity 12w paternity

  • Orchestrate High-Velocity Workflows: Leverage advanced agentic coding tools (e.g., Cursor, multi-agent environments) to dramatically accelerate feature prototyping, code generation, and test coverage.
  • Own the Guardrails & Quality: Act as the ultimate reviewer and architect; define the specifications, establish repo-context guardrails, and review AI-accelerated output for hidden security risks, scale bottlenecks, and architectural alignment.
  • Build Scalable Application and Data Layers: Design, build, and maintain our data pipelines and application to service our hundreds of users.

EvolutionIQ provides technology to improve insurance claims handling. The company is experiencing massive growth and has been named a top workplace, prioritizing its team.

India

  • Build and ship specialized agents including parsers, extractors, and synthesizers for the Aedeon agent-native modernization platform.
  • Own the full delivery of assigned agents from prototype through deployment and post-release validation, practicing test-driven development.
  • Write clear Python, document agent contracts and decision logic, and promote a culture of release discipline and quality across the team.

Mactores is a trusted leader in providing modern data platform solutions, enabling businesses to accelerate value through automation with end-to-end data solutions that are automated, agile, and secure. Since 2008, they have collaborated with customers to strategize and navigate digital transformation via assessments, migration, or modernization, fostering a culture driven by 10 core leadership principles.

North America Canada

  • Design, build, and operate production-grade agentic AI systems embedded across ServiceNow's platform.
  • Build agents that leverage ServiceNow's data layer to make decisions with context no frontier model has on its own.
  • Own the guardrails: observability, human-in-the-loop controls, and compliance infrastructure that make autonomous systems safe to deploy at scale.

ServiceNow is the AI control tower for business reinvention. Our AI platform brings together any AI, any data, and any workflow, helping 85% of the Fortune 500® work smarter, faster, and better. We're building an AI-native culture where technology and talent are unstoppable together.

North America

  • Accelerate engineering productivity by building AI coding tools, infrastructure, and documentation.
  • Own the agentic development environment and build MCP server integrations.
  • Collaborate with engineers to address root causes of struggles with AI agents.

Hightouch is an Agentic Marketing Platform powered by the Composable CDP. Founded in 2019 and headquartered in San Francisco, the team is ambitious, impact-driven, and efficient, with humility, kindness, and compassion essential to their success.

US

  • Own the agent layer of the platform, including architecture, prompts, tool surfaces, and multi-agent orchestration.
  • Drive translation and dependency-mapping accuracy across unfamiliar legacy paradigms.
  • Write production agent code daily, using subagents and multi-agent workflows as the normal way of working.

LTS applies frontier AI to modernize legacy systems in healthcare and government IT. It is a small, senior engineering team operating with high leverage and a culture of innovation and collaboration.

Global

  • Build, test, ship, and maintain high-quality software across Seso’s platform.
  • Use modern AI tools to improve your own engineering speed and quality across coding, debugging, testing, documentation, refactoring, and technical exploration.
  • Help design and build product features that use LLMs, agents, structured data extraction, summarization, recommendations, search, classification, document understanding, or other AI-enabled capabilities.

Seso is modernizing the back-office for farms by building the premier platform for agribusiness to hire and manage their workforce and improve the lives of agricultural workers. They have raised over $60M from Tier I investors and have been recognized with awards including Forbes Rising Stars and Andreessen Horowitz's American Dynamism 50.

$154,384–$198,893/yr
Europe

  • Design, build, and own core components of the agent platform, from the orchestration layer to the tool integrations connecting it to internal systems.
  • Build and evolve the capabilities layer: APIs, data access patterns, and service integrations for agents to execute operational workflows.
  • Architect the knowledge and memory infrastructure, allowing agents to retrieve data and act across our systems.

Justworks helps businesses get off the ground by solving HR issues, enabling them to focus on running their business. The company embraces a supportive, entrepreneurial environment where employees are encouraged to build something meaningful and have fun.

US Canada

  • Design, build, and ship agentic workflows across multiple domains.
  • Build multi-step agents capable of autonomous planning, context tracking, memory, tool use, and API orchestration.
  • Drive technical and architectural decisions to meet product requirements while also anticipating and designing for future needs.

Cority helps customers see and prevent risks across their operations in real time. Our EHS+ platform converges people, data, and AI agents to provide a clear view of information people can trust. For 40 years, Cority has been the market leader in EHS+, recognized by top analysts and trusted by more than 1,500 of the most complex organizations worldwide.

US

  • Shape technical direction and architecture: Define the foundational architecture for enterprise agentic AI at Benchling.
  • Build and ship the early portfolio yourself: Write production code at least half your time, particularly during the team's first year.
  • Design for enterprise from day one: Build for multi-tenant isolation, secrets management, audit logging, payload encryption, role-based access controls, and human-in-the-loop controls calibrated to risk.

Benchling is the AI platform for biotech R&D. Scientists use Benchling to design experiments, capture structured data, and run AI agents and models directly in their workflows. They have over 200,000 scientists around the world, from academic labs to Sanofi and Moderna.

US

  • Lead high-performing engineering teams focused on AI-native developer productivity.
  • Partner with leaders to translate strategy into scalable platforms and engineering roadmaps.
  • Drive alignment across various departments and build organizational processes for AI-assisted workflows.

Reddit is a community-based platform built on shared interests and open conversations. With over 100,000 active communities and millions of daily active users, it's a major source of information and discussion on the internet.

$115,000–$130,000/yr
US 4w PTO

  • Write, iterate, and maintain system prompts and instruction sets for Noodle’s AI agents across the student journey.
  • Build and maintain evaluation frameworks to measure agent accuracy, tone, hallucination rate, task completion, and alignment with rubric-based learning objectives.
  • Partner with Noodle teammates and university stakeholders to design, build, and test agents — translating learning objectives, operational flows, rubric assessments, and more into prompt-level agent instructions.

Noodle is higher education’s leading strategy, services, and technology partner. It develops infrastructure, provides life-changing learning experiences, and grows awareness of and enrollment in some of the best academic institutions in the world. They empower universities to change the world by offering university partners various products and services.

India

  • Own end-to-end execution of AI agent deployments from discovery and scoping through launch and optimization.
  • Configure agent workflows, decision logic, and automation behaviors to maximize accuracy, reliability, and business outcomes.
  • Implement guardrails and validation frameworks to ensure safe, compliant, and predictable agent performance.

Level AI is transforming how enterprises understand and engage with their customers. Their AI-native CX platform combines conversation intelligence, real-time agent guidance, and AI Virtual Agents to help brands deliver exceptional customer experiences at scale. At Level AI, they operate with urgency, ownership, and a deep customer-first mindset.

$160,000–$240,000/yr
US

  • Build agentic AI systems that change how Dataiku runs internally.
  • Turn real problems into working software.
  • See your solutions through from first conversation to production.

Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value.

Europe

  • Lead the adoption of AI across the engineering function by designing agentic workflows and building RAG systems.
  • Prototype and deploy AI-powered solutions, translating business problems into scalable architectures.
  • Pioneer AI within the product delivery team and provide AI thought leadership across the organization.

HSP Group provides global expansion services, helping companies simplify the complex challenges of operating internationally. As a Stage B startup, they focus on delivering value through relentless innovation, execution, and customer delight.

Europe US

  • Lead a team of 6-10 Automation Quality Engineers and drive a transition toward agentic quality engineering.
  • Define quality architecture and co-design deterministic agent workflows and non-deterministic approaches.
  • Build tools that make the broader engineering org faster and more quality-conscious.

airSlate is a global SaaS technology company that develops no-code workflow automation, electronic signature, and document management solutions. They have teammates in more than 20 countries across three continents, with main hubs in the United States, Poland, Romania, Ukraine, and the Philippines, and are in an exciting phase of growth and transformation.

$180,000–$320,000/yr
North America

  • Quickly iterate and develop proofs of concept to explore integrating AI into data and marketing workflows.
  • Make key decisions about the choice of AI architecture and frameworks.
  • Build production data agents to seamlessly answer analytics and data science questions.

Hightouch is an Agentic Marketing Platform that provides a composable CDP. They enable marketing teams to analyze performance, brainstorm ideas, and generate creative quickly. The team is ambitious and impact-driven, with a focus on humility, kindness, and compassion.

Global 4w PTO

  • Lead through code, spending approximately 75% of time hands-on coding with AI tools to solve patient engagement challenges.
  • Drive transformation and technical leadership by pioneering AI-enabled development practices and scaling them across the engineering organization.
  • Partner closely with Product and Design to deliver measurable improvements in engagement, retention, and continuity of care.

Docplanner is a healthcare technology company that empowers patients by giving them access to doctor reviews and online booking. They have over 2,500 employees across 13 countries and maintain a startup mindset while being backed by major venture capital funds.

India

  • Architect and ship production-grade agentic AI applications including multi-agent orchestration, retrieval systems, and evaluation pipelines.
  • Design and build learner-facing AI experiences and operator tools end-to-end using React and TypeScript.
  • Own production reliability for AI systems including model failover, rate limiting, cost monitoring, and incident response.

Chegg Skills builds applications that help motivated career switchers transition into high-growth roles. The company serves thousands of learners and educators each year through a high-ownership engineering team rethinking modern education.