Job Description
We're seeking a Member of Technical Staff, DevOps / Infrastructure Engineering to architect, automate, and scale the infrastructure that underpins our large-scale model training and research workflows. This role spans both cloud environments (AWS) and HPC infrastructure (Buzz & Lambda HPC GPU clusters with high-speed interconnects), requiring you to design and codify the systems, pipelines, and automation that enable our researchers and engineers to move fast with confidence. This is not a "click in the console" role - you'll bring strong fundamentals in Unix/Linux, experience in CI/CD and infrastructure-as-code, and a systems mindset to build automation and establish the standards that power breakthrough scientific discoveries. You'll be instrumental in building the reliable, scalable foundation that powers our autonomous AI Physicist while partnering closely with training engineers and researchers.
About FirstPrinciples
FirstPrinciples is a non-profit organization building an autonomous AI Physicist designed to advance humanity's understanding of the fundamental laws of nature.