Cloud Platform Engineer

Sorry, this job was removed at 03:36 p.m. (CST) on Tuesday, Nov 26, 2024
Be an Early Applicant
Bengaluru, Bengaluru Urban, Karnataka
Internship
Artificial Intelligence • Hardware • Robotics • Software • Metaverse
The Role

NVIDIA is looking for outstanding software and systems engineers to help us develop and operate our enterprise GPU infrastructure management systems across Clouds. In this role, you will work closely with the broader NVIDIA team to operate, design and build infrastructure management systems, Kubernetes operators, and end-to-end HPC integration solutions that combine GPUs with the rest of the datacenter software management ecosystem. We are focused on supporting NVIDIA products across HPC, Cloud, and enterprise on both bare metal and virtualized platforms as the role of GPUs in all of these environments expands. Your contributions will span many aspects of GPU systems management, including Cloud provisioning, observability, operations and incident response. The systems you operate will support single-node developer systems through large clusters with thousands of nodes deployed on multiple Cloud providers.

To succeed, you must have a strong system and software development background, familiarity with modern distributed systems especially the Cloud-native ecosystem, and a proven work ethic. You will be expected to jump in quickly and provide valuable contributions from day one. This is a dynamic work environment with many exciting opportunities awaiting. NVIDIA GPUs are central to many hot enterprise, cloud, and datacenter trends. Come join us as we craft the future of accelerated computing and AI.

What you'll be doing:

  • Enable GPU provisioning and life-cycle with state-of-the-art Cloud-Native open-source ecosystem solutions, including Kubernetes, Docker, Prometheus, TerraForm and Crossplane.

  • Develop, maintain and/or operate robust, scalable Go programs in a Kubernetes environment.

  • Develop the next-generation multi-cloud infrastructure management systems to support GenAI.

  • Support internal and external users through bug fixes, documentation, and feature improvements.

  • Maintain high-quality products through robust test coverage and Day 2 capabilities..

What we need to see:

  • BS or higher in Computer Science or equivalent experience.

  • 5+ years of meaningful industry experience with a strong Kubernetes and DevOps background

  • Deep understanding and execution skills of all aspects of the software development lifecycle

  • Experience with OpenAPI and Kubernetes Custom Resource Definitions

  • Outstanding written and verbal interpersonal skills

  • Strong motivation and commitment to learn new skills

  • Ability to manage time in a fast, heavily multitasked environment

Ways to stand out from the crowd:

  • Open Source contributions to the Cloud-Native community.

  • Strong experience with GitHub/GitLab CI/CD pipelines and application configuration..

  • Strong knowledge of container technologies, orchestration frameworks and observability systems.

  • Exposure to GPU programming with CUDA a plus.

  • Experience with managing and operating HPC schedulers and/or working across multiple Cloud providers.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!

The Company
HQ: Santa Clara, CA
21,960 Employees
On-site Workplace
Year Founded: 1993

What We Do

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, NVIDIA is increasingly known as “the AI computing company.”

Similar Jobs

Atlassian Logo Atlassian

Senior Principal Engineer, Enterprise Cloud

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
Bengaluru, Karnataka, IND
11000 Employees

Guidewire Software Logo Guidewire Software

Senior Software Engineer - Product Development

Cloud • Information Technology • Insurance • Software • Analytics
Bangalore, Bengaluru, Karnataka, IND
3400 Employees

Atlassian Logo Atlassian

Senior Principal Engineer, Cloud Transition

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
Bengaluru, Karnataka, IND
11000 Employees

Atlassian Logo Atlassian

Senior Engineering Manager, Micros Foundations

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
Bengaluru, Karnataka, IND
11000 Employees

Similar Companies Hiring

TrainingPeaks (A Peaksware Company) Thumbnail
Software • Fitness
Louisville, CO
69 Employees
bet365 Thumbnail
Software • Gaming • eSports • Digital Media • Automation
Denver, Colorado
6100 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account