Design, build, and maintain CI/CD pipelines and Infrastructure as Code using tools like CloudFormation, Ansible, and Terraform.
Monitor and respond to infrastructure and application health, troubleshoot operational issues, and provide on-call support.
Maintain operational documentation, communicate proactively with teams, and ensure service delivery meets client expectations.
NICE Ltd. provides software used by 25,000+ global businesses, including 85 of the Fortune 100, to deliver customer experiences, fight financial crime, and ensure public safety. With over 8,500 employees across 30+ countries, NICE is recognized as a market leader in AI, cloud, and digital innovation.
Design, deploy, and manage Kubernetes-based platforms in production.
Implement and manage automation frameworks for infrastructure provisioning and operations.
Administer and optimize VMware environments (vSphere, ESXi, vCenter).
EPlus believes technology is a people business and delivers solutions that make a real difference. Their team is passionate, skilled, and driven, valuing collaboration, innovation, and extraordinary results and dedicated to fostering, cultivating, and preserving a culture that represents diversity, enables inclusion.
Automate CI/CD pipelines for software build and deployment.
Provision and maintain infrastructure including load balancers, firewalls, and databases.
Troubleshoot infrastructure and application issues to ensure platform stability.
Granicus builds cloud-based digital solutions for government, serving over 5,500 agencies and 300 million citizens. The company has appeared on the GovTech 100 list and been recognized as a top workplace by BuiltIn, fostering a transparent and inclusive culture.
Support and improve hybrid production infrastructure for 15+ development teams handling 100+ products, 10K+ domains, and billions of hits per day.
Architect and plan improvements of a multi-datacenter development environment, advocating for migration to automated, elastic infrastructures using cloud, Kubernetes, and serverless technologies.
Aylo is a tech pioneer that offers world-class adult entertainment and games on safe, popular platforms. With an international team of dynamic innovators, the company focuses on trust-and-safety protocols and has offices in Montreal, Austin, and Nicosia.
Oversee a specialized SRE team focused on the design, deployment, and maintenance of automation toolsets.
Establish and enforce standards for IaC to ensure consistent, repeatable, and secure deployments.
Drive the automated lifecycle of both physical and virtual assets, from initial template creation/deployment to automated patching, scaling, and decommissioning.
Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress in finance and artificial intelligence. Led by CEO and Founder Michael Novogratz, their team blends deep crypto expertise with institutional experience and a shared commitment to shaping the future of Web3 and AI.
Design, deploy, and manage production Kubernetes clusters with workload scheduling, resource quotas, network policies, and RBAC.
Build and optimize CI/CD pipelines using Infrastructure as Code and GitOps principles.
Implement observability solutions using Prometheus, Grafana, and OpenTelemetry for performance tuning and reliability.
VerTALENTS is a subsidiary of VerSprite Cybersecurity, specializing in technology staffing. The company connects top technical talent with industry clients through various methods, adding value to both clients and candidates for full-time and contracting opportunities.
Assist in managing multiregion and multicloud infrastructure, ensuring resiliency, scalability, and performance.
Support infrastructure provisioning and deployments primarily on GCP, while gaining exposure to other cloud providers.
Collaborate with development teams to design and maintain CI/CD pipelines in GitLab CI and contribute to GitOps-based deployments using ArgoCD.
Learneo is a platform of builder-driven businesses, including Course Hero, CliffsNotes, LitCharts, Quillbot, Symbolab, and Scribbr, united around supercharging productivity and learning. Each team innovates independently, supported by centralized corporate operations functions, and the company values collaboration and growth.
Build and maintain Python fleet tracking system that manages the full lifecycle of servers.
Build server management tooling that automates provisioning, health checks, GPU diagnostics, recovery and alerting.
Create and maintain metrics, dashboards, and alerting for hardware health across the fleet.
FAL is committed to keeping a large fleet of GPU servers healthy and productive. They offer a collaborative and supportive culture with learning and growth opportunities.
Deploy and maintain infrastructure using Terraform on AWS.
Operate and govern production-grade platforms running on Kubernetes / EKS.
Build and maintain CI/CD pipelines using GitHub Actions.
Muttdata is a dynamic startup committed to crafting innovative systems using cutting-edge Big Data and Machine Learning technologies. They are looking for a hands-on DevOps to join a strategic initiative focused on deploying and operating Data & AI platforms.
Build internal tooling to help other engineers and the rest of the company understand and operate our system.
Design and implement security best practices for our team and infrastructure.
Reduce toil through automation, including building and maintaining CI/CD infrastructure.
Openly is rebuilding insurance from the ground up by re-envisioning and enhancing every aspect of the customer experience. They are a rapidly growing team of exceptional, curious, empathetic people with a wide range of skill sets, spanning many departments.
Build and maintain CI/CD pipelines and deployment infrastructure.
Leverage AI to automate analysis and resolution of production issues.
Fal is the generative media ecosystem powering the next generation of AI products. They build the infrastructure, tools, and model access that teams need to move from idea to production.
Design, build, and implement robust infrastructure solutions aligned with business needs and security best practices.
Automate resource deployment, compute and storage allocation, and optimize delivery of key infrastructure services.
Troubleshoot escalated issues, perform root-cause analyses, and drive process improvements using AI and automation.
Hyland is the pioneer of the Content Innovation Cloud™, delivering ubiquitous enterprise intelligence to organizations through solutions that unlock actionable insights and drive automation. Trusted by thousands of organizations worldwide, including many of the Fortune 100, Hyland has grown to nearly 4,000 employees with a culture focused on employee initiatives, wellbeing, and innovation.
Design, build, and maintain scalable, reliable systems on GCP.
Develop automation for infrastructure provisioning using Terraform, Ansible, or Deployment Manager.
Manage incident response, conduct postmortems, and implement improvements to reduce recurrence.
SupplyHouse.com is an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004. They value every individual team member and cultivate a community where people come first with Generosity, Respect, Innovation, Teamwork, and GRIT.
Construct infrastructure as code, developing and enforcing best practice across configurations while preventing drift between Terraform configurations and infrastructure deployments.
SentiLink provides innovative identity and risk solutions, empowering institutions and individuals to transaction with confidence. They are building the future of identity verification in the United States replacing a clunky, ineffective, and expensive status quo with solutions that are 10x faster, smarter, and more accurate.
Maintain and develop secure, reliable, and scalable AWS cloud infrastructure to meet business and development needs.
Deploy and operate microservices running on EC2 (Docker Compose + Caddy) and Kubernetes (EKS + Karpenter).
Write and maintain Terraform modules and stacks for EC2, RDS, EKS, ECR, S3, IAM, VPC, and Secrets Manager.
INFUSE is a digital marketing company headquartered in the US and operating worldwide, providing services in demand generation. Our team is dispersed across 20 countries, and we are committed to giving each candidate a fair and detailed assessment.
Own the delivery of developer platform capabilities end-to-end, including design, implementation, rollout, and iteration.
Build and evolve paved roads that make it easy to deploy, operate, and scale services.
Drive improvements to GitOps workflows and harden CI/CD to improve pipeline performance and developer ergonomics.
Phaidra is building the future of industrial automation with AI-powered control systems. They are a 100% remote company with employees located throughout the USA, Canada, UK, Sweden, Spain, Portugal, the Netherlands, Singapore, Australia, and India.
Own the technical direction of Remote's SRE/Platform domain.
Define and drive the reliability strategy across the platform.
Identify and lead AI enablement initiatives across the engineering organisation.
Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.
Design and develop CI/CD systems for websites, services, and release workflows, and operate an EKS-based Kubernetes platform.
Diagnose debug production incidents, drive root-cause analysis, and implement improvements to enhance system reliability.
Write and maintain infrastructure as code using Pulumi or Terraform/OpenTofu across multiple AWS accounts with security-conscious practices.
Thunderbird is one of the world’s most trusted open-source email applications, empowering more than 20 million people globally. Our small but growing distributed team includes 65+ people across seven countries, and we build privacy-respecting communication tools with a collaborative, inclusive, and user-first spirit.
Build end-to-end automation solutions using GitLab CI, AKS, Terraform, and Ansible with security controls built in from the start.
Design, deploy, and secure MCP servers on Azure, exposing tools and data for AI agents with attention to access boundaries.
Integrate AI agent skills, orchestrate multi-step workflows, and enable autonomous interactions within defined security guardrails.
General Dynamics Mission Systems engineers a diverse portfolio of high technology solutions for defense and scientific missions. With a global team of 12,000+ professionals, they value trust, honesty, and transparency, offering a flexible work environment and competitive benefits.