Partner With Engineers:

  • Pair with full-stack and backend engineers on the features they are shipping.
  • Reproduce and triage bugs with enough detail that an engineer can fix them without a round-trip.
  • Contribute to and help evolve our automated test suites alongside the QA Lead.

AI & Agent Testing:

  • Help build and run evaluation pipelines for non-deterministic LLM outputs, prompt regression, model drift detection, and output quality scoring across the LiteLLM routing layer.
  • Build and run automated tests for the agent orchestration layer, covering governance audit trail integrity, human-in-the-loop override behavior, and cross-agent handoffs.

Platform & Integration Testing:

  • Test the Nango-based integration layer across connectors and the file ingestion pipeline including encryption, formatting edge cases, and audit trail continuity.
  • Validate streaming response handling, latency thresholds, and graceful degradation when a model is unavailable or slow.

UX Quality:

  • Test the trust-layer UX onboarding flows, progressive disclosure, uncertainty states, agent activity surfacing, and human-in-the-loop governance interfaces and help shape the standards as we go.
  • Flag anything that would confuse a non-technical enterprise user.

Peach Pilot

Peach Pilot transforms how businesses run with a platform that ingests everything about how a company operates and constructs a Company Brain. It is a funded early-stage AI startup headquartered in Atlanta, Georgia, with a working platform on live infrastructure.

Apply for This Position