We are seeking a highly skilled Network Engineer to join our Infrastructure team. This role is essential to designing, securing, and scaling the network infrastructure that powers Scale AI’s global operations, from high-throughput datacenters to office environments. You will own the architecture and implementation of scalable, redundant, and secure Layer 2 and Layer 3 networks to support massive data transfer, including multi-gigabit connectivity and direct cloud integrations. This role requires a deep understanding of datacenter best practices, structured cabling, secure routing, and network automation. You’ll work across teams to ensure our infrastructure remains performant, fault-tolerant, and secure, even under petabyte-scale load.
You will:
- Design, build, and maintain high-performance, multi-site network infrastructure supporting 10–100Gbps+ throughput.
- Lead physical and logical network deployments in both enterprise and colocation environments.
- Define and enforce secure network segmentation, routing policy, firewall zones, and access control models.
- Configure and manage Layer 2/3 connectivity, including VLANs, link aggregation (LAG/LACP), and dynamic routing protocols (BGP, OSPF, etc.).
- Implement telemetry and observability systems for real-time performance monitoring and alerting.
- Manage the deployment and provisioning of switches, optics, cabling, and rack-level power infrastructure.
- Collaborate with hardware and software teams to support scalable, fault-tolerant data capture and upload workflows.
- Using Infrastructure-as-Code tools such as Terraform to manage cloud and network infrastructure.
- Troubleshoot performance bottlenecks, carrier handoffs, and hardware-level issues using tools like iperf, ethtool, tcpdump, mtr, and nmap.
- Interface with datacenter providers and carriers to coordinate cross-connects and bandwidth services.
- Ensure compliance with best practices in datacenter operations, structured cabling, airflow containment, etc.
Ideally, you’d have:
- Proven experience designing and managing high-throughput networks in office and datacenter environments.
- Deep knowledge of switching, routing, and structured cabling best practices across L2/L3 protocols.
- Experience with network hardware from multiple vendors (Cisco, Juniper, etc.); Junos or IOS familiarity preferred.
- Strong understanding of network security principles: MACsec, VPNs, firewall policy, Zero Trust segmentation.
- Hands-on experience with 10/25/40/100GbE hardware, SFP/QSFP transceivers, and fiber/copper cabling.
- Experience automating network provisioning and configuration using scripting (e.g., Python) and version-controlled workflows.
- Understanding of cloud networking and edge integrations across cloud providers; AWS experience is a plus.
- Strong debugging and troubleshooting skills, from physical layer issues to protocol misconfigurations.
- Familiarity with datacenter deployment and vendor environments (e.g., Equinix, Digital Realty, etc.).
- Excellent documentation and communication skills with both technical and cross-functional teams.