Assume a pivotal role in supporting our federal government client, transforming the client's infrastructure to be more efficient, secure, and aligned with business objectives. This position places significant emphasis on implementing everything-as-code, DevSecOps best practices, and cloud security. Collaborate with key stakeholders and lead secure cloud projects that align with client objectives and regulations.
Job listings
Manage the roadmap for observability and regularly communicate and align progress with cross-functional partners. Initiate and drive collaborative discussions with Engineers, Voice of Customer, UAT, CX, Analytics, Product Owners, etc. to identify gaps in alerts and monitoring. Leverage front-end engineering knowledge to investigate customer struggles surfaced in session replay tools.
The Veeva RTSM team is expanding and is looking for a Technical Operations associate to help scale its world-class IRT/RTSM system, focusing on Systems Administration, Development Operations, Site Reliability Engineering, and Release Management. We want people who solve complex problems, work together as a team, share knowledge, and enjoy implementing creative solutions to address business needs.
Drive the design of our next-generation AI infrastructure. In this high-impact, hands-on role, you will make end-to-end architectural decisions across compute, networking, and storage, ensuring our platforms can meet the massive scale, performance, and reliability requirements of modern AI workloads. As the architect, you'll define how tens of thousands of GPUs are interconnected and optimized across multiple data center sites.
As a Machine Learning Operations Engineer at Field AI, you will play a pivotal role in ensuring the scalability, efficiency, and reliability of our machine learning systems. You will manage and utilize data to optimize the performance of robots and drive innovation across industries, bridging the gap between machine learning models and production systems. This role offers the opportunity to work with cutting-edge technologies, solve complex problems, and contribute to the success of large-scale, real-time data systems.
Seeking an experienced Senior DevSecOps Engineer to support a federal modernization effort focused on transforming legacy systems into secure, scalable, and resilient cloud-native architectures. This role is integral to advancing enterprise DevSecOps maturity and enhancing cloud-based infrastructure reliability across hybrid environments using AWS. The ideal candidate will possess deep expertise in infrastructure-as-code (IaC), CI/CD pipelines, container orchestration, and AWS service architecture.
As an AI Automation Engineer III, you'll work on challenging projects across the company with AI-first thinking and data-driven decisions to drive innovation. You will translate operational needs into scalable, automated systems while collaborating closely with internal stakeholders and external development partners to ensure effective implementation. You will also help to manage business-critical assets.
As the sole Senior DevOps Engineer, you will own the design, deployment, and optimization of our AWS cloud infrastructure using Terraform. You'll collaborate with product and engineering teams to build scalable, secure, and cost-efficient systems while establishing best practices for automation, monitoring, security, and governance.
Agile Six is looking for a DevOps Engineer to join a newly formed team to design and implement infrastructure-as-code, container orchestration, and deployment automation that power critical services. The role involves crafting cloud-native build, test, and release workflows that support goals for CI/CD automation, infrastructure, deployments, and security/compliance.
We're hiring a Staff Software Engineer, Site Reliability to lead reliability across our production platform. As a Staff-level individual contributor, you will drive strategy and hands-on execution across incident response, SLO/SLI programs, and production readiness, directly owning highly available services in AWS, all while partnering with Platform/Infra to build paved-road tooling in our monorepo.