Key Responsibilities:
- Design, build, and operate the network fabric that interconnects our GPU fleet, including spine-leaf architectures, RDMA/RoCEv2 networks for distributed inference, and overlay networks for tenant isolation.
- Own L2/L3 network design across bare-metal and cloud environments, including BGP peering, ECMP, VXLAN/EVPN, and high-bandwidth interconnects between data centers.
- Develop and maintain network automation using Ansible, Terraform, and custom tooling to provision, configure, and validate switches, routers, DPUs, and SmartNICs at scale.
Requirements:
- 8+ years of experience building and operating large-scale networks, ideally in GPU cloud, HPC, or hyperscale environments.
- Deep expertise in Linux networking internals: kernel networking stack, iptables/nftables, tc, eBPF, network namespaces, bonding/teaming, and SR-IOV.
- Strong command of routing and switching protocols and technologies: BGP, OSPF, ECMP, VXLAN, EVPN, MPLS, and segment routing.
What we offer at fal:
- Interesting and challenging work
- Competitive salary and equity
- Visa sponsorship
Fal
Fal's platform orchestrates AI inference workloads across thousands of GPUs spread over multiple data centers and cloud providers. They offer visa sponsorship and will help you relocate to San Francisco.