Data Center Technician
GA, US
At Penguin Solutions (Nasdaq: PENG) – The AI Factory Platform Company – we’re building a team of innovators who thrive on collaboration, creativity, and the opportunity to help shape the future of AI. As part of the AI technology revolution, our teams design, build, deploy, and manage AI factories for enterprises, sovereign AI initiatives, and neocloud providers worldwide.
Headquartered in Silicon Valley, California, Penguin Solutions operates globally through a network of R&D, manufacturing, and sales locations. For nearly three decades, we have operated at the intersection of memory and AI/HPC infrastructure. That engineering expertise positions us to power the next generation of AI workloads, from training to inference and agentic AI at scale.
Penguin Solutions brings together differentiated infrastructure software, advanced memory, compute systems, end-to-end services, and industry-leading partner solutions in a full-stack AI factory platform designed to help customers deploy and scale AI workloads with speed and precision.
At Penguin Solutions, we value ideas over hierarchy and empower employees to take ownership, drive innovation, and grow through challenging work, continuous learning, and exposure to advanced AI tools and technologies. With flexibility where it matters and a strong focus on outcomes, Penguin Solutions is a place to do your best work, grow your career, and make a meaningful impact.
Job Overview
Penguin Solutions is looking for a Data Center Technician looking to apply their technical skills in a fast-paced and complex environment. A strong working knowledge of server hardware and data center infrastructure is essential to this role. This position will work to resolve and diagnose compute issues at scale, escalate issues, and work with remote engineering teams. Additionally, this role will support rack lifecycle processes with a focus on helping build out and support cloud scale compute and storage environments. Solid communication, adaptability, and flexibility are a requirement for this role.
Responsibilities
- Perform routine monitoring and basic maintenance on servers and network hardware, including component replacements (e.g., drives, cables, memory) following established procedures.
- Execute standard hardware diagnostics and replace failing parts in alignment with service level agreements.
- Collaborating with software and network engineering teams on overall high-performance computing cluster health.
- Works within the client ticketing system for all hardware related high performance computing cluster observations.
- Upgrading internal system components, including CPUs, memory, hard drives, and network cables.
- Providing technical support to staff and customers, as well as responding to the server and network hardware issues.
- Maintaining, creating, and updating documentation where needed; looking for gaps in procedures and contributing to run books and guides.
- Troubleshooting system issues through logs and tooling and diagnosing rack issues.
- Providing support with root cause analysis and continued improvement.
- Coordinating with logistics / inventory management teams for hardware removal of components from the high-performance computing cluster.
Qualifications
- 2-3 years of experience as a data center technician or similar role.
- CompTIA Server+, CompTIA Network+, or RHCSA recommended.
- Familiarity with ERAD and compliance standards preferred.
- Must be a U.S. Citizen.
- Technical Skills:
- Hardware knowledge: Basic understanding of server hardware, including components and troubleshooting.
- Linux/Unix: Familiarity in Linux/Unix operating systems, including basic command-line interfaces and scripting.
- Networking fundamentals: Understanding of networking concepts, including TCP/IP, DNS, and DHCP.
- Scripting languages: Familiarity with scripting languages such as Python, Bash, or Perl.
- Troubleshooting and Repair:
- Troubleshooting methodologies: Ability to apply structured troubleshooting methodologies to resolve issues.
- Repair techniques: Knowledge of repair techniques for server hardware, including component replacement and upgrade procedures.
- Root cause analysis: Ability to identify root causes of issues and implement corrective actions.
- Physical Requirements: Must be able to lift and move equipment weighing 50 pounds or more, as required by this role.
- Communication and Collaboration:
- Effective communication: Strong written and verbal communication skills, with ability to articulate technical information to non-technical stakeholders.
- Collaboration: Ability to work effectively with cross-functional teams, including engineering, operations, and management.
- Documentation: Strong documentation skills, with ability to create and maintain technical documents and knowledge bases.
Location
This is an onsite position located in Atlanta, GA.
Travel
As necessary.
Compensation & Benefits
The base pay range that the Company reasonably expects to pay for this position in Atlanta, GA is $76,000 - $83,000; the pay ultimately offered may vary based on business considerations, including job-related knowledge, skills, experience, and education. The position is bonus-eligible, and there are medical, dental, and vision benefits available. There is a 401k saving plan and other benefits, such as Paid Time Off, Life Insurance, and an Employee Assistance Plan.
Inclusion & Belonging Statement
We are committed to creating an inclusive environment that embraces differences and fosters belonging for all.
Equal Opportunity Statement
We are an Affirmative Action/Equal Opportunity Employer and strongly committed to all policies which will afford equal opportunity employment to all qualified persons without regard to age, national origin, race, ethnicity, creed, gender, disability, veteran status, or any other characteristic protected by law.