Key Responsibilities:
- Provide deep technical leadership across LustreFS subsystems including metadata, object storage, and LNet.
- Own complex root-cause analysis for difficult customer and production issues across kernel, filesystem, and network layers.
- Lead design and implementation of new features, reliability improvements, and performance optimizations.
Qualifications:
- 15+ years of experience in distributed systems or Linux kernel development.
- Strong hands-on expertise in LustreFS internals and production operations.
- Proficiency in C programming and Linux debugging tools like gdb, crash, and eBPF.
What You Will Work On:
- Complex customer escalations and deep production issues involving failover, recovery, and performance regressions.
- Architecture and implementation of new LustreFS capabilities for scale and resilience.
- Building AI-enabled engineering workflows that accelerate triage, debugging, and design iteration.
DataDirect Networks
DataDirect Networks (DDN) is a global market leader in AI and high-performance data storage innovation, powering many of the world's most demanding AI data centers across industries like life sciences, healthcare, and financial services. DDN has been at the forefront of AI infrastructure for over two decades, with a team of passionate professionals driven by innovation, customer-centricity, and a commitment to shaping the future of data management.