As a DevOps Engineer, you'll play a key role in maintaining and scaling the cloud infrastructure that powers our AI-driven platform. This is a long-term, hands-on engineering position focused on automation, observability, and security. You'll operate and harden AWS environments, automate deployments using Infrastructure-as-Code, and build resilient CI/CD and monitoring workflows that ensure stability and compliance as we grow.
Job listings
As we expand our infrastructure and automation capabilities, we need a senior-level DevOps engineer who can build and maintain reliable, secure, and scalable systems to support continuous delivery, monitoring, and growth, while collaborating closely with engineers and product teams.
Manage and improve our growing AWS and data center infrastructures; design, implement, and maintain a CI/CD pipeline to improve developer workflows; utilize centralized monitoring and logging to improve visibility across the team; assist development teams in solving issues around scaling and bottlenecks; work with teammates to develop high-quality software, balancing security, reliability, and operational concerns.
We're seeking an experienced Senior DevOps Engineer to audit our current infrastructure and Python/Django application, provide expert recommendations, and guide the transition to a shared tenant (multi-tenant) architecture. Our existing DevOps team handles day-to-day operations, but we need your specialized expertise in high-availability, high-traffic systems to optimize for peak enrollment cycles and ensure enterprise-grade performance as we scale.
Deploy and operate containerized services using orchestration frameworks to ensure scalability and resilience, automate infrastructure through Infrastructure as Code (IaC) to provide consistent and repeatable environments, and deploy and monitor workloads in cloud environments.
We are looking for an experienced Software Engineer to contribute to our Experian Assistant for Model Risk Management product, a COTS application with a complex microservice architecture running on Kubernetes within our Ascend Platform. You will help guide innovation, streamline deployment workflows, and enhance team efficiency through established CI/CD pipelines and industry best practices. You will report to the Engineering Director.
As a Lead Site Reliability Engineer within the newly created "Product Reliability" team, you'll be responsible for ensuring the availability, performance, and scalability of the products on our platform. Your proficiency in leading technical teams that support products serving millions of customers will ensure stability and high performance for our brands and clients. You will keep up with best practices in building products for scale.
As our ML Engineer Intern, you'll be the technical backbone powering our content platform by building ML systems that scale. You'll design scalable ML infrastructure and pipelines that handle massive media datasets and implement inference systems for content optimization.
NBCUniversal is seeking a Site Reliability Engineer to support live channel distribution on the Video Streaming Engineering team. This role involves 24x7 support and maintenance of distribution systems, diagnosing and preventing on-air issues. Responsibilities include investigating issues, creating documentation, assisting in deployment and testing, and providing 24x7 on-air systems support.
As the manager of the Poe Platform Team, you will drive technical excellence, mentor engineers, and collaborate across functions to deliver high-quality solutions. You'll define team roadmaps aligned with business goals while fostering an inclusive and innovative engineering culture. This role involves leading a team, improving developer velocity, and driving high engineering standards across Poe teams.