Manage and maintain our AWS infrastructure (ECS, SQS, RDS, Lambda, etc.). Implement and optimize CI/CD pipelines for multiple environments. Automate infrastructure provisioning with Terraform or similar IaC tools. Monitor and manage resource usage, scaling, and cost optimization.
Job listings
Design, implement, and manage AI Platform architecture. Control AI-related costs, including models, GPUs, and other resources. Work closely with Product teams to provide technical expertise and propose innovative solutions. Guarantee highly available AI services through best practices and automation. Collaborate with ML teams to operationalize AI models and integrate them into systems. Troubleshoot critical issues and continuously optimize system performance.