Job Description
Train large-language models (LLMs) to write production-grade infrastructure and DevOps code. Help teach AI how to write, debug, and optimize infrastructure code like a top-tier DevOps engineer. Compare and rank Terraform or IaC code snippets, explaining which is more reliable, efficient, or scalable. Refactor or repair AI-generated infrastructure setups for correctness, security, and clarity. Provide structured feedback (edits, test outcomes, architectural notes) that feeds into the RLHF pipeline. The model learns to reason about DevOps the way you doβsmart, scalable, and safe. The role involves ranking, editing, and explaining code, converting feedback into reward signals, and reinforcement learning tunes the model to think like a real infra engineer.