Job Description
VetsEZ is seeking a Site Reliability Engineer (SRE) to ensure the reliability, scalability, and seamless integration of cloud-native microservices supporting Decision Precision Plus (DP+) and backend Health Service Mesh (HSM) services for federal healthcare modernization. This role will drive operational excellence, automation, and secure integration across VA EHR platforms.
Responsibilities include designing, implementing, and maintaining highly available, scalable, and secure cloud-native microservices in Kubernetes and AWS environments. You'll lead integration efforts for DP+, CDS Hooks, and backend services within the Health Service Mesh, ensuring interoperability with VA EHR systems (CPRS, Cerner Oracle, VistA).
Additional responsibilities include setting up, configuring, and managing CI/CD pipelines (Jenkins), automated deployments, and infrastructure as code; monitoring system performance, reliability, and security with observability tools (Prometheus, Dynatrace, ELK/EFK); and troubleshooting production issues while ensuring compliance with federal security standards.