Remote Devops Jobs · Python

Job listings

$160,000–$180,000/yr

  • Responsible for availability, latency, performance, efficiency, monitoring/observability, emergency response, capacity planning.
  • Analyze, troubleshoot and resolve operational challenges contributing to defined SLO's.
  • Manage site stability, performance, reliability, and maintain uptime for production environments.

CentralReach provides autism and IDD care software for Applied Behavior Analysis (ABA), multidisciplinary therapy, and special education. They are trusted by more than 200,000 users and is backed by Roper Technologies, Inc. (Nasdaq: ROP). Their culture is centered around impact, inclusion, and flexibility.

$160,000–$180,000/yr

  • Champion the teams to become best-in-class in cloud-based software development while promoting approaches that greatly improve customer experience.
  • Leverage an obsession for the customer to lead and maintain a world-class SaaS, PaaS, IaaS, Cloud Infrastructure.
  • Own the build & deploy lifecycle; drastically reduce build, deploy & rollback times while simultaneously reducing risk and exposure.

CentralReach is a leading provider of autism and IDD care software for Applied Behavior Analysis (ABA), multidisciplinary therapy, and special education. Recognized as one of the best places to work over 10 times, CentralReach's culture is centered around impact, inclusion, and flexibility.

  • Build Reliable Cloud Infrastructure: Implement and maintain AWS infrastructure using Terraform across EKS, Lambda, EC2, and S3.
  • Improve Developer Workflows: Contribute to CI/CD pipelines, starter kits, and internal tooling that reduce manual effort and improve deployment confidence.
  • Strengthen Observability & Operations: Add monitoring, logging, and alerting (DataDog) to platform services and participate in an on-call rotation.

Spreetail helps brands increase their ecommerce market share globally while improving operational costs. They are building one of the fastest-growing ecommerce companies in history with a focus on innovation.

$141,000–$230,000/yr

  • Collaborate with engineering teams to design and implement scalable, secure systems.
  • Establish and manage service level objectives (SLOs) and service level agreements (SLAs).
  • Enhance incident response processes and post-mortem analysis for outages.

ClickHouse, recognized on the 2025 Forbes Cloud 100 list, is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads.

  • Architect, implement, and maintain sophisticated developer tooling, frameworks, and automation to streamline and enhance software development processes.
  • Lead improvements and optimizations of CI/CD pipelines to ensure fast, reliable, and secure software deployments.
  • Proactively identify, troubleshoot, and resolve bottlenecks within development workflows, continuously improving developer productivity and satisfaction.

Galaxy is a global leader in digital assets and data center infrastructure, delivering solutions that accelerate progress in finance and artificial intelligence. Their team blends deep crypto expertise with institutional experience and a shared commitment to shaping the future of Web3 and AI.

$172,614–$172,614/yr

  • Design infrastructure, networking, and software platform architecture.
  • Build and maintain automation of Continuous Integration and Continuous Deployment pipelines.
  • Troubleshoot infrastructure, internal applications, networking, and security issues.

Loadsmart is a technology company focused on the logistics and supply chain industry. They leverage data and technology to automate and optimize freight transportation, connecting shippers and carriers to streamline the shipping process. They are a mid-sized company passionate about transforming the future of freight.

  • Set up and manage GPU cluster infrastructure on major cloud providers.
  • Build and operate job orchestration and scheduling systems.
  • Integrate and maintain ML training frameworks and post-training pipelines.

Snorkel AI helps enterprises transform expert knowledge into specialized AI at scale. They started as a research project in the Stanford AI Lab and work with some of the world’s largest organizations to empower scientists, engineers, financial experts, product creators, journalists, and more to build custom AI with their data faster than ever before.

$106,500–$202,500/yr

  • Architect new and existing systems to enhance performance, reliability, and scalability.
  • Build, implement, iterate over CI/CD pipelines.
  • Assist with the Management, Development, Design, and Deployment of microservice and containerized applications.

AbbVie's mission is to discover and deliver innovative medicines and solutions that solve serious health issues today and address the medical challenges of tomorrow. They strive to have a remarkable impact on people's lives across several key therapeutic areas.

US 6w PTO

  • Design, implement, and maintain scalable integrations for metrics, logs, and traces across cloud and Kubernetes environments.
  • Build middleware, libraries, and services to simplify development and observability workflows.
  • Lead technical direction and strategic planning for observability projects.

They are currently looking for a Staff Software Engineer - Grafana Cloud Observability, Kubernetes Monitoring in United States. This role offers a unique opportunity to shape and advance cloud observability solutions for large-scale systems, focusing on metrics, logs, and traces.