Contribute to the design and evolution of hybrid infrastructure systems.
Build and enhance internal tools and automation to improve scalability.
Partner with Dev, DevOps, and QA teams to resolve infrastructure or deployment blockers.
DataDirect Networks (DDN) is a global market leader renowned for powering many of the world's most demanding AI data centers. DDN's cutting-edge data intelligence platform is designed to accelerate AI workloads, enabling organizations to extract maximum value from their data.
Build and operate production-grade model serving infrastructure using frameworks such as vLLM, TGI, Triton, or equivalent.
Design and implement robust deployment pipelines with blue/green and canary rollout strategies for ML models.
Develop and maintain auto-scaling systems, multi-model serving architectures, and intelligent request routing layers.
Pragmatike is recruiting on behalf of a fast-scaling, well-funded distributed cloud infrastructure startup building next-generation AI-native cloud services. The company is redefining how compute is delivered by providing GPU-powered infrastructure for AI/ML workloads, secure storage, and high-speed data transfer through a decentralized architecture that significantly reduces environmental impact compared to traditional cloud providers.
Conduct deep technical discovery in selected strategic accounts to assess platform readiness.
Lead architecture and delivery design for complex enterprise environments.
Design and build bounded proofs, prototypes, deployment patterns, and reusable accelerators.
GitLab is the intelligent orchestration platform for DevSecOps. They enable organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation. GitLab embraces AI as a core productivity multiplier.
Develop and maintain observability solutions using platforms like Datadog, Prometheus, and Grafana.
Take a leading role in incident management, including coordinating response efforts, troubleshooting issues, and identifying follow-up actions.
Partner with product engineering teams to architect reliable systems, recover from incidents, and learn from mistakes.
Ditto is redefining how data moves at the edge, aiming to make resilient, real-time applications seamless for developers, regardless of network conditions. It's a globally distributed and fast-growing startup with over $145 million in funding that is committed to building a diverse and inclusive team.
Rackner is seeking a DevSecOps (Kubernetes) Engineer SME to support a US Air Force program called Platform One.
Big Bang provides the tooling for mission application owners to create a Platform as a Service in their own Kubernetes cluster running in a cloud or datacenter.
We're looking for a DevSecOps Engineer with deep experience in Kubernetes, Terraform, and CI/CD pipelines to join our team.
Rackner is a software consultancy that builds cloud-native solutions for startups, enterprises, and the public sector. They are an energetic, growing consultancy with a passion for solving big problems for both startups and enterprises.
Partner with customers to assess infrastructure needs and deployment preferences.
Design and execute deployment strategies for cloud and on-premise infrastructure.
Troubleshoot deployment issues across diverse infrastructure environments.
PhysicsX is a deep-tech company rooted in numerical physics and Formula One, dedicated to accelerating hardware innovation at software speed. They are building an AI-driven simulation software stack for engineering and manufacturing across advanced industries, with customers including leading innovators in Aerospace & Defense, Materials, Energy, Semiconductors, and Automotive.
Build and maintain infrastructure-as-code for our AWS EKS and GCP GKE clusters, plus on-premises deployments.
Own CI/CD pipelines and drive GitOps adoption.
Deploy, scale, and optimize ML/NLP inference workloads.
Vectara is the Enterprise Agent Platform that enables businesses to build and deploy governed, grounded, auditable AI agents across SaaS, VPC, and on-prem. We’re a passionate team that’s hyper-focused on solving enterprise-level technology and business problems with AI.