HPC System Administrator - Paris

Posted 11 Days Ago
Paris, Île-de-France
Hybrid
Mid level
Artificial Intelligence
The Role
As an HPC System Administrator, you will manage high-performance computing clusters, ensuring their smooth operation for all users and sites. Responsibilities include optimizing system performance, troubleshooting technical issues, maintaining security compliance, and documenting infrastructure procedures. Close collaboration with engineers and researchers is essential.
Summary Generated by Built In

About Mistral 

- At Mistral AI, we are a tight-knit, nimble team dedicated to bringing our cutting-edge AI technology to the world. Our mission is to make AI ubiquitous and open. 

- We are creative, low-ego, team-spirited, and have been passionate about AI for years.

- We hire people who thrive in competitive environments, because they find them more fun to work in.

- We hire passionate women and men from all over the world.

- Our teams are distributed across France, the UK and the USA.


Role Summary 

- We work at the cutting edge of science and technology, combining modern cloud environments with High-Performance Computing (HPC) standards. Our clusters use the latest available GPUs at large scale.

- As an HPC System Administrator, you will be responsible for managing our clusters and ensuring their smooth operation for all users across all sites.

- You will be the interface between our research and production users and our various cloud providers, addressing node issues, capacity requests, image upgrades and tooling needs.

- Location: Paris, France.


Key Responsibilities 

- Oversee the strategic design, system performance, resource allocation, configuration management and operational support for both our hardware and software systems.

- Solve and troubleshoot complex technical problems with a proactive approach to system optimization and issue resolution.

- Ensure compliance with standard HPC security practices.

- Maintain comprehensive documentation for infrastructure, configurations and procedures.

- Collaborate effectively across teams of engineers and researchers.


Qualifications & profile


We’re looking for a blend of experience with:

- High-performance networking

- GPUs in large-scale distributed networks

- Large-scale distributed storage file systems and providers

- Linux kernel and OS

- Virtualization and container architecture in cloud environments (Docker, Kubernetes, OpenStack…)

- SLURM

- Software, hardware and network failure troubleshooting


It would be ideal if you also had experience with:

- LLM training

- AI/ML frameworks

- GPU programming (CUDA)


We’re also looking for people who are:

- Passionate

- Self-directed 

- Low-ego 

- Team players


Benefits


- Daily lunch vouchers 

- Contribution to a Gympass subscription 

- Monthly contribution to a mobility pass 

- Full health insurance for you and your family 

- Generous parental leave policy 


Top Skills

Linux
The Company
HQ: Paris
92 Employees
On-site Workplace
Year Founded: 2023

What We Do

Fast, open-source and secure language models. Facilitated specialisation of models on business use-cases, leveraging private data and usage feedback.

Built by a world-class team in Europe, targeting the global market. Join the team! https://jobs.lever.co/mistral/
