You would be working on our pre-training team, focused on building out distributed training and inference for Large Language Models (LLMs). This is a hands-on role that focuses on software development best practices, maintenance, and code architecture. You will have access to thousands of GPUs to verify changes.
The engineering organization is a dynamic group of builders, thinkers, and problem-solvers dedicated to delivering scalable, AI-powered software products. As a Principal Software Engineer, you will participate in all technical aspects of team deliverables, communicate technical decisions, and evolve ServiceNow’s end-to-end CI/CD pipeline. You will also implement AI assistance tools and optimize the performance and reliability of our mission-critical developer pipeline.
As a Site Reliability Engineer (SRE) at Alpaca, you will be responsible for ensuring the reliability, scalability, and performance of our systems and services. You will work closely with development, operations and DevOps teams to build and maintain robust applications, ensuring they run smoothly and efficiently. This role requires a blend of software engineering and operations skills, with a strong ability to troubleshoot technical issues and resolve problems before they impact our users.
Pioneer new ways to improve developer productivity by leveraging emerging technologies like AI to optimize workflows, improve test coverage and reduce friction in the development lifecycle. Shape the next generation of our development environments and modernize core internal libraries and tooling. Define and measure operational excellence at scale and elevate our testing infrastructure.
We’re seeking an experienced Platform Engineer to join the Edge Services Development team, working on the platform that powers the Megaport Cloud Router and Megaport Virtual Edge products. You’ll be instrumental in scaling our global infrastructure and enabling our engineering teams to deliver innovative products to our customers worldwide.
We’re looking for a Cloud Operations Engineer to join our Infrastructure Team and help support the backbone of how we deliver for our clients. You’ll work hands-on in our AWS environments every day, helping provision and tune systems, digging into performance issues, and supporting the integrations and configurations that keep projects moving through implementation, testing, and launch.
The Cloud Developer is responsible for designing, building and maintaining cloud-hosted services and platforms to support Smile Digital Health’s SaaS offering. This includes owning the development and maintenance of deployment artifacts such as Helm charts, Docker container and Compose configurations, infrastructure as code and build/deployment/automation pipelines. The role works closely with platform, infrastructure, architecture and security teams to ensure cloud deployments are scalable, reliable, secure and aligned with enterprise architecture patterns.
As a Principal Technical Consultant specializing in ServiceNow DevOps and Integrations, you will serve as both a technical leader and a business process advisor. You will guide customers through their DevOps transformation journey—ensuring solutions are technically sound, process-aligned, and strategically impactful. This hybrid role combines deep technical expertise in DevOps, toolchain integrations, and ServiceNow architecture with business process consulting to ensure technology delivers measurable business outcomes.