Job Description
As a Site Reliability Engineer, you will be responsible for driving the effort to identify, design, and develop the best technical and field solutions to automate our production systems. This position will collaborate often with various internal and external business and engineering teams. You will also have an opportunity to eventually lead efforts to champion and instill a culture of Site Reliability Engineering at Skyflow.
You will utilize programming languages like Python and Go, Container Orchestration services including Docker and Kubernetes, CM tools including Terraform and Helm, and a variety of AWS tools and services on a daily basis. You will develop and maintain CI/CD pipelines to enable automated testing, building, and deployment of applications. You will also collaborate with cross-functional teams and clients to deliver robust cloud-based solutions that drive best-in-class experiences to Skyflow customers.
The role also involves automating and maintaining tools/systems involving software builds, continuous testing, automated deployments, software health monitoring and software releases and evaluating reliability, performance, scalability, and engineering aspects to ensure a smooth software production rollout and delivery. You will be a thought leader and key contributor within our SRE team and help build an SRE culture.
About Skyflow
Skyflow is a data privacy vault company built to radically simplify how companies isolate, protect, and govern their customersβ most sensitive data.