Building world-class AI infrastructure to support a 100+ person research team.
Designing and scaling multi-cloud systems that support high-performance model training and inference.
Improving monitoring, alerting and system observability for AI workloads.
Canva is redefining how the world experiences design. They have campuses in Sydney and Melbourne, co-working spaces in Brisbane, Perth, Adelaide and Auckland, and trust their employees to choose the balance that empowers them and their team to achieve their goals.
Implement and maintain observability tools and dashboards using [e.g., AWS CloudWatch, Datadog, Sentry, OpenTelemetry].
Assist with cloud cost visibility and optimization, analyze infrastructure usage patterns to identify waste and implement aggressive tagging strategies.
Manage the tooling and processes for deploying applications to AWS EKS / Kubernetes / ECS / Serverless and facilitate modern deployment strategies.
True is a global platform of companies that optimizes value creation by placing executive talent, developing business leaders, creating diverse and inclusive networks, and using innovative technology to advance executive talent priorities. True was founded on the belief that doing good is the pathway to doing well and their growth and success are a by-product of their values treating people right, listening to new ideas and keeping culture at the heart of their business.
Building monitoring, alerting, logging, and observability from the ground up.
Improving our security posture across auth, IAM, policies, and data access.
Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, embracing openness, respect, and grit. They combine employment with enjoyment in their culture.
Lead the Reliability & Operations function within the Developer & Production Enablement (DPE) division of RWS’s Product & Technology organization. Take ownership of global production operations and lead the transition from manual, ticket-based workflows to platform-integrated automation. Ensure stability today, while designing for scalability and autonomy in the future.
RWS's purpose is to unlock global understanding, valuing every language and culture, and celebrating diversity and inclusion to make the company strong.
Drive FinOps and DevOps initiatives across multi-cloud environments.
Enhance cost visibility, governance, and resource optimization.
Design automated workflows, policy-as-code, and reporting dashboards.
This position is posted by Jobgether on behalf of a partner company; Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly.
Deploy and manage cloud infrastructure across AWS, Azure, and GCP using Terraform and Infrastructure as Code (IaC) principles.
Architect, build, and maintain CI/CD pipelines using GitHub Actions and ArgoCD to support continuous delivery and software deployments.
Monitor, test, and maintain observability and alerting coverage across infrastructure and platforms.
Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements.
Manage and execute releases across multiple platforms (Web, Android, iOS, CLIs). Develop and maintain release automation tools and bots. Troubleshoot and resolve release automation issues, including rollback procedures.
Founded in 2014, League is the leading healthcare consumer experience (CX) platform, powered by artificial intelligence (AI), reaching more than 63 million people around the world.