Technical Responsibilities:
- Design, deploy, and scale cloud infrastructure primarily on AWS, including EC2, VPC, IAM, and S3.
- Build and automate CI/CD pipelines and deployment systems using infrastructure as code tools like CDK and Terraform.
- Write code in Python, Go, or similar to create tools, manage containers with Docker, and handle orchestration via Kubernetes or ECS.
System and Operations Focus:
- Manage and optimize Linux-based production systems, configuring networking, DNS, firewalls, and load balancing.
- Implement comprehensive monitoring, logging, and alerting systems to ensure reliability and performance.
- Continuously refine infrastructure for performance, reliability, and cost-efficiency, improving developer experience and internal tooling.
AI-Driven Workflow and Collaboration:
- Actively leverage AI tools like Claude and ChatGPT to accelerate operations, debugging, and engineering workflows.
- Manage bi-weekly releases and work with sales engineering to optimize customer onboarding and automation processes.
- Thrive in a fast-moving, ambiguous environment, communicating across silos to coordinate and achieve resolution.