Source Job

  • Design and implement foundational patterns and libraries for Python applications.
  • Develop and maintain robust CI/CD pipelines using tools such as Jenkins, ArgoCD.
  • Instrument observability through tools such as CloudWatch and DataDog to monitor and optimize application performance across multiple environments.

Python AWS Kubernetes Jenkins

20 jobs similar to Staff Platform Engineer - Infra + DevOps

Jobs ranked by similarity.

$125,000–$169,000/yr
Unlimited PTO

  • Design, scale, and operate resilient, cloud-native infrastructure in AWS with an emphasis on EKS, IAM, RBAC, and modern security-first practices.
  • Build and optimize CI/CD pipelines with GitHub Actions and GitHub Advanced Security enabling velocity without compromising safety.
  • Own observability across the stack using Datadog (metrics, logging, alerting, and tracing).

DexCare optimizes time in healthcare, streamlining patient access, reducing waits, and enhancing overall experiences. They are committed to creating an inclusive workplace where diversity drives innovation and belonging strengthens collaboration, enabling everyone to thrive.

US Unlimited PTO

  • Implement and maintain observability tools and dashboards using [e.g., AWS CloudWatch, Datadog, Sentry, OpenTelemetry].
  • Assist with cloud cost visibility and optimization, analyze infrastructure usage patterns to identify waste and implement aggressive tagging strategies.
  • Manage the tooling and processes for deploying applications to AWS EKS / Kubernetes / ECS / Serverless and facilitate modern deployment strategies.

True is a global platform of companies that optimizes value creation by placing executive talent, developing business leaders, creating diverse and inclusive networks, and using innovative technology to advance executive talent priorities. True was founded on the belief that doing good is the pathway to doing well and their growth and success are a by-product of their values treating people right, listening to new ideas and keeping culture at the heart of their business.

Brazil 26w maternity 4w paternity

Support the evolution of our platform by improving scalability, reliability, observability, and security. Proactively identify bottlenecks and unlock the autonomy of the entire engineering team. Maintain infrastructure & deployment pipelines and collaborate with engineering teams on architectural decisions and production-readiness practices.

Feegow joined the Docplanner Group, a health-tech company, in 2022 and is dedicated to developing innovative solutions for physicians and managers.

Design, implement, and maintain cloud infrastructure and deployment pipelines across AWS environments. Ensure efficient CI/CD operations and infrastructure automation. Uphold high platform reliability and security standards.

Software Mind develops solutions that make an impact for companies around the globe.

  • Design, implement, and manage infrastructure for our cloud-based platforms (AWS).
  • Create and automate deployment pipelines using CI/CD tools (Gitlab / Github Actions).
  • Ensure system scalability, availability, and reliability through proactive monitoring and automation.

Prompt is revolutionizing healthcare by delivering highly automated and modern B2B enterprise software to rehab therapy businesses, the teams within, and the patients they serve.

US Unlimited PTO

  • Deploy and manage cloud infrastructure across all three clouds using Terraform IaC.
  • Architect, build, and maintain reliable CI/CD pipelines in Github Actions and ArgoCD.
  • Contribute to decisions around our departmental roadmap and project priorities.

Coalesce is the only data transformation and governance platform designed for the AI era, improving data professionals' lives since its founding in 2020.

Europe

Heavily contribute to the architecture and migration of our CI/CD platform. Act as a pragmatic driver and senior contributor, responsible for designing and implementing solutions. Design and build the paved path as a product, ensuring they are reliable, secure, and well-documented.

Glia is the leading AI customer service solution for banks and credit unions offering AI and human agents across every voice and digital conversation.

US

  • Ramp on AWS architecture, Terraform patterns, Kubernetes setup, CI/CD pipelines, and observability stack.
  • Take ownership of an infrastructure area: CI/CD pipelines, observability stack, Kubernetes platform, or AWS security/networking.
  • Shape infrastructure direction with design docs, RFC proposals, and mentoring engineering teams.

Bastion enables financial institutions and enterprises to issue regulated stablecoins, generate revenue on reserves, and expand their ecosystems.

UK

Run the production environment by monitoring availability and taking a holistic view of system health. Build software and systems to manage platform infrastructure and applications. Improve reliability, quality, and time-to-market of our suite of software solutions.

NICE software products are used by 25,000+ global businesses to deliver extraordinary customer experiences, fight financial crime and ensure public safety.

Design, implement, monitor and maintain Sysdig's Infrastructure at scale on different clouds and on-prem. Collaborate with development teams to improve system reliability, performance, and scalability. Participate in on-call rotation, respond to incidents, conduct root cause analyses, and implement preventive measures.

Sysdig helps organizations secure innovation in the cloud with runtime insights, open innovation, and agentic AI, trusted by over 60% of the Fortune 500.

  • Lead the design, implementation, and continuous improvement of our cloud-native platform infrastructure.
  • Create and maintain tooling and automation that improves efficiency and developer experience.
  • Drive platform optimization initiatives focused on performance, cost efficiency, and reliability.

Intelerad's medical imaging solutions streamline the flow of information, simplifying complex processes, maximizing efficiencies, and shining a light on the unknown.

$120,000–$140,000/yr

  • Design and plan cloud-native systems aligned with business goals and security best practices.
  • Implement and support AI-based automation tools and services.
  • Continuously tune cloud and automation workloads to improve reliability and performance.

PerfectServe offers unified healthcare communication solutions to help physicians, nurses, and care team members provide exceptional patient care.

$174,600–$220,000/yr
US

  • Lead capacity planning, autoscaling, and performance optimization across our application.
  • Define and enforce best practices for scalability, reliability, observability, and infrastructure resilience.
  • Conduct architectural reviews and propose improvements to enhance performance and cost efficiency.

Hypori Inc., a leading provider of SaaS cybersecurity solutions, is a disruptive technology company transforming secure mobility for government and commercial customers.

US Unlimited PTO

Architect, build, and maintain secure, scalable, HIPAA- and HITRUST-compliant infrastructure on multiple cloud platforms (AWS and Azure). Design, implement, and manage scalable, secure, and highly available cloud infrastructure. Collaborate with engineering, product, and security teams to design robust infrastructure solutions.

Abacus Insights is changing the way healthcare works for you and is on a mission to unlock the power of data.

US

  • Design and implement the next generation of our Continuous Integration and Continuous Delivery (CI/CD) pipelines, focusing on security, speed, and reliability.
  • Maintain and optimize the health of our monorepo, ensuring scalable dependency management and fast incremental builds.
  • Work with GCP to architect secure, scalable runtime environments.

Anchorage Digital is building the world’s most advanced digital asset platform for institutions to participate in crypto. As a diverse team of more than 600 members, they are united in one common goal: building the future of finance by providing the foundation upon which value moves safely in the new global economy.

  • Develop and maintain RESTful APIs that facilitate the effective management of GPU clusters, virtual machines, and dedicated servers.
  • Enhance CI/CD processes and infrastructure reliability through proactive service monitoring and problem resolution.
  • Design and implement system architectures that support high availability and disaster recovery principles.

Gcore is a global provider of infrastructure and software solutions for AI, cloud, network, and security, powering everything from real-time communication.

Contribute to AWS Architecture by designing and refining our AWS stack, ensuring scalability, reliability, and cost-effectiveness. Implement Infrastructure Automation using Terraform, CloudFormation, or similar tooling to automate deployments and enforce consistency across environments. Support Developer Experience by helping establish and maintain efficient CI/CD pipelines, streamlining deployments and enhancing overall developer productivity.

Oddin.gg is a leading B2B provider of esports betting solutions with a comprehensive ecosystem that helps their partners grow.

$150,100–$188,100/yr
US Canada 2w PTO 12w maternity 12w paternity

  • Create and test reliable cloud infrastructure services that support Webflow’s range of products.
  • Balance reliability, scalability, and cost efficiency concerns while refactoring and modernizing existing services.
  • Collaborate with product engineering teams to deliver new solutions for services and ways of working that might not exist yet.

Webflow is the leading visual development platform for building powerful websites without writing code.

Canada

  • Build and deploy better services in partnership with Development groups.
  • Implement system and service telemetry to improve reliability and availability.
  • Design and evolve deployment systems and pipelines for reliability, security, and efficiency.

Jobgether is a platform that connects job seekers with companies. They utilize AI-powered matching to ensure applications are reviewed quickly and objectively.

$105,271–$131,588/yr
US 4w PTO 4w paternity

  • Responsible for administration, support, troubleshooting and implementation of Azure DevOps.
  • Implement DevOps principles at an enterprise level and enable continuous integration and continuous delivery.
  • Streamline and optimize the application lifecycle, adding visibility to technical debt and increasing software delivery speed.

Lumicera Health Services is defining the “new norm” in specialty pharmacy to optimize patient well-being through our core principles of transparency and stewardship.