Network Engineer
US
Overview
We are looking for a Network Engineer seeking to apply their technical skills in a fast-paced and complex HPC environment. Working knowledge of server and network hardware and software and the desire to participate in projects at a large-scale data center is central to this role. This position will work with various engineering teams to resolve and diagnose network issues at scale. Adaptability and flexibility within the environment will be key to the candidate’s success. Penguin supports HPC environments where network performance is critical.
Responsibilities
- Monitor and perform on-going maintenance and upgrades to network equipment.
- Provide support to staff, as well as respond to server and network issues.
- Manage and maintain both InfiniBand and Ethernet networks.
- Run hardware diagnostics and replace failing parts in a timely manner.
- Monitor all network processes to ensure the smooth flow of data across the network.
- Collaborate with software and network engineering teams on cybersecurity and network efficiency.
- Support a Linux-based, high-performance computing (HPC) and artificial intelligence (AI) environment, featuring a wide range of technologies.
- Maintain meticulous documentation for internal and external build instructions - such as configuration examples, guidance on technical details, and best practices.
- Develop automation and other tools to improve operations.
- Supervise on-site staff in updating cards and other components in the environment.
- Respond to network and server errors, sometimes after hours.
- Stay up-to-date with advancements in data center infrastructure and technologies.
Qualifications
- 5+ years of hands-on experience with enterprise scale networks.
- In-depth knowledge of data center environments, servers, and network equipment.
- Experience administering both InfiniBand (Mellanox/NVIDIA) and Ethernet (Cumulus/SONiC) networks.
- Proficiency in documenting network processes and diagrams.
- Thorough understanding of L2 and L3 network protocols.
- Exceptional ability to work as part of a team, provide IT support, and resolve network errors.
Preferred Skills
- Extensive experience in installing, monitoring, and maintaining data center networks.
- Hands-on experience configuring and supporting large scale Ethernet and InfiniBand networks.
- Demonstrated practice with low-latency/high-bandwidth networking performance optimization.
- Experience with VXLAN and EVPN architectures and routing protocols such as BGP and OSPF.
- Knowledge of communication libraries such as NCCL, UCX and MPI.
- Familiarity with UFM, OpenSM and NetQ or other network management software.
Location
This is a remote position in the United States.
Travel
Minimal travel may be required.
Compensation & Benefits
The base pay range that the Company reasonably expects to pay for this position in the United States is $97,000 - $114,000; the pay ultimately offered may vary based on business considerations, including job-related knowledge, skills, experience, and education. The position is bonus-eligible, and there are medical, dental, and vision benefits available. There is a 401k saving plan and other benefits, such as Paid Time Off, Life Insurance, and an Employee Assistance Plan.
Inclusion & Belonging Statement
We are committed to creating an inclusive environment that embraces differences and fosters belonging for all.
Equal Opportunity Statement
We are an Affirmative Action/Equal Opportunity Employer and strongly committed to all policies which will afford equal opportunity employment to all qualified persons without regard to age, national origin, race, ethnicity, creed, gender, disability, veteran status, or any other characteristic protected by law.