Voltage Park is building a machine learning cloud infrastructure business from the ground up. As part of this effort, we’re looking for a Data Center Technician to join our small team. You’ll be monitoring some of the world’s most advanced equipment and systems within our new HPC data center. As an early hire, you’ll also have significant influence in defining our company culture and ensuring our success.
This is a full-time role, on-site in Sterling, VA. We are not able to provide sponsorship for this position.
What You’ll Do
- Help maintain hardware in the data center, including tasks such as configuration, maintenance, monitoring, and troubleshooting assistance
- Help diagnose and resolve system/compute/network issues to restore functionality
- Ensure new server, storage and network infrastructure is properly racked, labeled, cabled, and configured
- Troubleshoot server hardware and software issues
- Become a subject matter expert on all the physical technical aspects of the data center build, from power deployment to containment solution to cabling infrastructure
- Proactively monitor onsite hardware to get ahead of any failures
- Work closely with HW Support team to ensure data center infrastructure-related support tickets are resolved
- Work with RMA team to ensure faulty parts are returned and replacements are ordered
- Create installation standards and documentation for placement, labeling, and cabling to drive consistency and discoverability across all data centers
Requirements
- 5+ years of data center experience, with technical proficiency in the critical infrastructure systems that support data centers, such as power distribution, air flow management, environmental monitoring, capacity planning, DCIM software, structured cabling, and cable management
- Strong Linux administration experience
- Experience setting up networking appliances (Ethernet and InfiniBand)
- Attention to detail: precision in debugging work (such as cabling, patching, or configuring equipment) and in documenting fixes and changes
- Resourceful problem-solver: doesn’t need to be explicitly told to solve an issue, can figure out solutions under constraints
- Ability to work in an environment that operates 24/7, including participating in an on-call rotation and providing after-hours support as needed
- Must be able to prioritize tasks and react quickly to issues
- Strong analytical and communication skills: collaborate closely with our network engineers, SREs, on-site remote hands, and operations team to resolve issues and ensure smooth deployments
Nice to have:
- Experience in HPC environments
- Experience working with Dell or Nvidia equipment
Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter.
Compensation Range: $65K - $85K
What We Do
The market for cutting-edge ML compute is broken. Startups, researchers and even big AI labs are scrambling to buy or rent access to the latest chips for ML training. But demand far outstrips supply, and what’s available is only accessible to the well-resourced, placing an artificial damper on innovation.
To solve this challenge, we've launched Voltage Park, and we’re on a mission to make machine learning infrastructure accessible to all, from large enterprises and research universities to seed-stage startups and nonprofits.
With around 24,000 NVIDIA H100 GPUs, the Voltage Park cloud is one of the most powerful collections of cutting-edge ML compute in the world. Our clusters consist of 80GB H100 SXM5 GPUs fully interconnected with 3.2T InfiniBand.
Why Work With Us
You’ll play a pivotal role as a member of the founding team that will change the face of machine learning infrastructure. As an early hire, you’ll have outsize influence in defining the company’s culture and ensuring mission success.