Similar Jobs
See allPlatform Engineer
HHAexchange
AWS
Terraform
CI/CD
Senior ML Operations (MLOps) Engineer
Jobgether
US
Python
PyTorch
TensorFlow
Senior MLOps Engineer
Clutch
Global
Python
AWS
Terraform
MLOps Engineer
Dv01
US
Python
Kubernetes
Terraform
Staff ML Systems Engineer
Hims & Hers
US
Kubernetes
Terraform
Python
Platform Operations & Reliability:
- Provide technical leadership for AI/ML platforms ensuring reliability, scalability, and security of production workloads.
- Establish operational standards, support models, and governance practices.
MLOps, Automation & Observability:
- Develop dashboards, health metrics, alerts, and operational runbooks.
- Enhance CI/CD pipelines, infrastructure-as-code, and model release processes.
Incident Management & Problem Resolution:
- Lead root cause analysis and drive corrective actions for platform stability.
- Solve performance, availability, and integration issues across AI ecosystems.
Technical Leadership & Collaboration:
- Mentor engineers and influence platform strategy and architecture decisions.
- Communicate complex technical concepts to both technical and non-technical audiences.
CSAA Insurance Group
CSAA Insurance Group, a AAA insurer, offers personal lines of property and casualty insurance to AAA members across 23 states and DC. Founded in 1914, they are one of the top personal lines insurers in the US with over 3,800 employees, known for a values-based culture and recognition in leadership development and community involvement.