Systems Engineer
US
Overview
Penguin Solutions Managed Services provides dedicated, remote, Linux systems administrators for complex, integrated environments involving high-performance computing, cloud, and enterprise systems. This position requires highly technical skills and the ability to understand, document, configure, administer, troubleshoot, and resolve issues in Linux environments. This is a customer-facing position.
Responsibilities
- Support a Linux-based, High-Performance Computing (HPC) and Artificial Intelligence (AI) environment featuring a wide range of technologies.
- Maintain, administer, and patch Linux Operating Systems and associated software.
- Work as part of a team to render professional, timely, and expert customer support.
- Analyze system log files and perform comprehensive troubleshooting.
- Design, implement and maintain systems automation using Ansible.
- Document configurations, processes, and troubleshooting procedures for knowledge sharing and operational efficiency.
- Respond to alerts from monitoring systems.
- Participate in an on-call rotation to provide critical support for AI and HPC operations.
- Manage and maintain containerized and virtual environments such as OpenStack, Kubernetes, Singularity, and VMware.
- Follow and improve procedures to meet SLAs.
- Mentor junior employees.
Qualifications
- BA/BS in Information Technology, Computer Science, or a related field of study (or equivalent experience in systems administration/engineering).
- 5+ years of hands-on experience with UNIX/Linux (RedHat preferred) server environments.
- Practical knowledge of the administration of High-Performance Computing (HPC) technologies or similar clustered Linux environments.
- Linux systems administration skills and experience with open-source technologies.
- Familiarity with IPMI tools and BMC configuration and troubleshooting.
- Understanding of Linux networking implementation and protocols.
- Ability to work in ITIL operating models.
- Strong focus on accuracy and consistency.
- Proven experience in scripting to support automation and system administration activities (i.e. Ansible, Python, BASH, Perl).
- Ability to communicate clearly and effectively with team members and clients.
- Demonstrated experience installing, configuring, and tuning software applications and providing overall support.
- Desire to take initiative to refer to Application OEM/Vendor resources for operations, features, functions, and questions.
- Outstanding verbal, written, and interpersonal communication skills.
- US Citizenship is required for this role.
Preferred Qualifications
- Experience with HPC application support, optimization and/or installation.
- Knowledge of the administration of HPC technologies, including cluster resource management, job scheduling, Ethernet/InfiniBand networks, licensing services, GPUs, etc.
- Experience with containers and orchestration such as Singularity, Docker, Kubernetes, etc.
- Familiarity with high-performance storage and parallel file systems used in HPC/AI and Cloud (Weka, Ceph, Lustre, GPFS).
- Deep Linux administration skills, including kernel tuning, system security, and package management.
- In-depth knowledge of Linux cluster technologies and optimization techniques.
- Understanding of HPC scheduling systems (SLURM, PBS, LSF).
- RedHat certifications and/or other Linux-related certifications will be looked upon favorably.
Location
This is a remote position in the United States.
Travel
Minimal travel may be required.
Compensation & Benefits
The base pay range that the Company reasonably expects to pay for this position in the United States is $96,000 - $112,000; the pay ultimately offered may vary based on business considerations, including job-related knowledge, skills, experience, and education. The position is bonus-eligible, and there are medical, dental, and vision benefits available. There is a 401k saving plan and other benefits, such as Paid Time Off, Life Insurance, and an Employee Assistance Plan.
Inclusion & Belonging Statement
We are committed to creating an inclusive environment that embraces differences and fosters belonging for all.
Equal Opportunity Statement
We are an Affirmative Action/Equal Opportunity Employer and strongly committed to all policies which will afford equal opportunity employment to all qualified persons without regard to age, national origin, race, ethnicity, creed, gender, disability, veteran status, or any other characteristic protected by law.