Platform & HPC Data Engineer

Posted 7 Days Ago
Herndon, VA
Senior level
Information Technology • Consulting
The Role
As a Platform & HPC Data Engineer, you will design and optimize data management systems for HPC environments, ensuring efficient storage, compliance, and integration with HPC clusters.
Summary Generated by Built In

RGi is searching for a talented Platform and HPC Data Engineer to join our team, where you will play a key role in the design, implementation, and optimization of data management solutions within high-performance computing (HPC) environments. We are looking for a candidate with substantial experience in diverse file systems, data labeling and tagging systems, as well as the configuration of various storage appliances.

In this position you will be responsible for ensuring that data workflows, storage configurations, and metadata management are not only efficient and scalable but also adhere to organizational and government security standards. As part of a dynamic, cross-disciplinary team, you will help address the technical demands of HPC platforms, effective data management, and large-scale computational workflows. Join us in advancing our innovative solutions and shaping the future of high-performance computing!


Clearance:

Active Top Secret clearance with willingness and ability to obtain an SCI and CI polygraph

US Citizenship required

As a Platform & HPC Data Engineer you will...

  • Design and implement data management systems and architectures for HPC platforms, focusing on optimizing data flow, storage, and access in large-scale computing environments.
  • Oversee the configuration, maintenance, and optimization of distributed file systems (e.g., Lustre, IBM Spectrum Scale, NFS, GPFS) and storage solutions used in HPC environments to ensure efficient performance, scalability, and reliability.
  • Implement and manage metadata-driven systems for data labeling/tagging. This includes the development of strategies for classifying, indexing, and organizing datasets to enhance data discoverability, access control, and auditing.
  • Configure and maintain various storage appliances (e.g., NetApp, Dell EMC, HPE) and integrated storage solutions. Ensure that storage devices are optimized for performance, capacity, and availability within the HPC ecosystem.
  • Integrate data storage and management systems with HPC clusters, ensuring seamless data flow between compute nodes and storage appliances. Optimize data pipelines to support high-throughput workloads and minimize bottlenecks in I/O performance.
  • Monitor and improve the performance of storage systems, focusing on I/O throughput, latency, and efficient resource allocation. Use performance metrics to guide optimizations across storage appliances and file systems.
  • Implement security best practices for data access, protection, and management, ensuring compliance with government regulations and internal data governance policies. Configure encryption, access control, and secure data sharing methods.
  • Develop and maintain automation scripts (e.g., using Python, Bash, or Perl) to streamline storage configurations, data labeling/tagging, and system monitoring tasks. Automate processes related to data integration and HPC platform management.
  • Work closely with data scientists, HPC administrators, software developers, and other technical staff to support ongoing projects. Provide expertise in troubleshooting data storage issues and ensuring optimal system performance.
  • Maintain thorough documentation for storage configurations, file system setups, data labeling/tagging procedures, and performance optimization strategies. Provide regular reports on system health, data management processes, and any improvements made.

Platform & HPC Data Engineer Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field. A Master’s degree or higher is a plus.
  • 7+ years of experience in managing data infrastructure in HPC environments, with expertise in file systems, storage appliances, and data workflows.
  • Hands-on experience with distributed file systems, including Lustre, IBM Spectrum Scale (GPFS), NFS, and others commonly used in HPC settings.
  • Proven experience with storage appliance configuration (e.g., NetApp, Dell EMC, HPE, or similar systems), including performance tuning, capacity management, and reliability.
  • Strong experience in implementing data labeling/tagging systems, metadata management, and structuring large datasets for efficient access and compliance.
  • Knowledge of high-performance networking protocols (e.g., InfiniBand, RDMA) and their role in data transfer and storage optimization.
  • Familiarity with data access protocols like GridFTP, rsync, and NFS for large-scale data transfer.

Additional Skills We'd Like to See:

  • Experience with cloud storage integration or hybrid cloud environments, with knowledge of cloud-native storage solutions (e.g., AWS S3, Ceph, OpenShift).
  • Familiarity with high-performance computing (HPC) schedulers (e.g., SLURM, PBS, Torque) and their interaction with data storage systems.
  • Understanding of data protection mechanisms, including data replication, backup strategies, and disaster recovery in HPC environments.
  • Experience with containerization (Docker, Singularity) in an HPC context for data processing and application deployment.
  • Experience with machine learning or data science workflows in HPC environments.

Who we are:

Reinventing Geospatial, Inc. (RGi) is a fast-paced small business that has the environment and culture of a start-up, with the stability and benefits of a well-established firm. We solve complex problems within geospatial software development and national defense to make an Immediate Impact for our nation’s soldiers and analysts.


We pride ourselves on giving employees an exceptional life experience, where creativity thrives, and challenges are simply part of the fun. We provide truly excellent benefits, including:


·        100% paid employee healthcare & dental insurance

·        Paid parental leave

·        401k with matching

·        Escalating vacation time

·        Referral bonuses

·        Tuition reimbursement

·        Professional development training

·        Free beverages and snacks

·        Weekly catered lunches and breakfast on Fridays

 

Grow to be our next leader:

At RGi, fostering a strong and organic corporate culture is paramount and serves as a compass on the decisions we make and how we operate the company. We believe our culture of camaraderie, innovation, and collaboration reflects the caliber of our employees and their dedication to the mission of providing quality software to our customers. As such, we want our employees to feel empowered to seek growth and leadership opportunities within the company and position us to maintain our culture as we grow. RGi provides opportunities, resources, training, and mentorship to all our employees to let them take control of their careers and become a leader or a crucial member of our company. If this is what you are looking for in a company, then you are what we are looking for in an employee.


Reinventing Geospatial, Inc. is an Equal Opportunity Employer committed to hiring and retaining a diverse workforce. We are an Equal Opportunity Employer, making decisions without regard to race, color, religion, sex, national origin, age, veteran status, disability, or any other protected class. U.S. Citizenship is required for all positions.

Top Skills

Aws S3
Bash
Ceph
Dell Emc
Docker
Gpfs
Gridftp
Hpc
Hpe
Ibm Spectrum Scale
Infiniband
Lustre
Netapp
Nfs
Openshift
Perl
Python
Rdma
Rsync
Singularity
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Denver, , Colorado
49 Employees
On-site Workplace
Year Founded: 2009

What We Do

Reinventing Geospatial (RGi) is a leading geospatial expert working with Defense, Intelligence, and Federal clients to achieve mission success and solutions for varied mission-critical programs. Projects at RGi span a wide range of software and analytical methodologies and lie at the intersection of software development and geospatial intelligence. We work with soldiers and geospatial analysts to produce solutions that allow them to develop better situational understanding of complex operational pictures. We do everything from data collection to UI/UX development and advanced deep learning, using a broad toolset including Python, C#, and ArcGIS. Our projects include cutting-edge R&D efforts, as well as large mission programs incorporating data processing/optimization, data dissemination techniques, visualization, and collaboration across the commercial and government sectors.

At RGi, we also pride ourselves on our company culture. We ensure that our work environment is a welcoming and fun place for everyone. In addition to offering competitive benefits, including company-paid healthcare and company vacations, we also embody the following values:

BE UNIQUELY YOU: You are a name, not a number…quirks and all. We embrace diversity and individuality.

TRUST IS A TWO-WAY STREET: Our leadership proves they have our best interests in mind through their actions. They put people over profits. There is earned mutual trust.

JUMP THROUGH FLAMING HOOPS: We feel a part of the mission, we don’t just do what is expected. We go above and beyond to innovate and create solutions…making an immediate and important impact.

SPEAK UP: We embrace and maintain open communication across the company. We voice our concerns, questions, comments, suggestions, and praise.

PAY IT FORWARD: We give back to our community, not because we have to but because we genuinely care.

Similar Jobs

The Aerospace Corporation Logo The Aerospace Corporation

Space Tactics and RPO Engagement Modeling Analyst

Aerospace • Artificial Intelligence • Cloud • Machine Learning • Software • Cybersecurity • Defense
Hybrid
3 Locations
4600 Employees
95K-166K Annually

The Aerospace Corporation Logo The Aerospace Corporation

2025 Communications Architecture Analyst

Aerospace • Artificial Intelligence • Cloud • Machine Learning • Software • Cybersecurity • Defense
Hybrid
2 Locations
4600 Employees
95K-120K Annually

STR Logo STR

Senior Vulnerability Researcher

Machine Learning • Security • Software • Analytics • Defense
Easy Apply
Arlington, VA, USA
600 Employees

Capital One Logo Capital One

Principal Associate, Ontology and Data Modeling- Retail Bank

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
3 Locations
55000 Employees
116K-159K Annually

Similar Companies Hiring

InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees
Quantum Rise Thumbnail
Software • Professional Services • Natural Language Processing • Machine Learning • Consulting • Automation • Artificial Intelligence
Chicago, Illinois
17 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account