Engineering Manager, GPU Platform

Posted 11 Days Ago
Be an Early Applicant
San Francisco, CA
Expert/Leader
Artificial Intelligence • Machine Learning • Generative AI
The Role
Lead the GPU platform team, managing engineers to build and scale infrastructure for AI workloads, ensuring security and performance. Collaborate with various stakeholders and guide technical direction.
Summary Generated by Built In

About the Team

Our team runs the GPU fleet that serves the models backing ChatGPT and the API. We build automation to provision and manage one of the largest cutting edge GPU inference fleets in the world, exposing it as a singular platform for other OpenAI teams to seamlessly run production applied AI workloads. 

We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.

About the Role

We are looking for an experienced engineering manager to help lead our GPU platform team. You’ll help build and scale one of the largest inference fleets in the world. You will also collaborate closely with product and infrastructure teams to help ship reliable products quickly. 

In this role, you will:

  • Manage and build a diverse team of high performing infrastructure engineers

  • Guide the roadmap for automation for a fleet that can grow an order of magnitude in size or more

  • Build a world-class, secure compute fleet that serves users at scale

  • Set technical direction on evolving our compute and abstractions to support a growing business

  • Collaborate closely with a broad set of stakeholders, including product engineering, inference, security, research and finance

  • Work with external partners to unlock bleeding edge compute and making it available as a turnkey resource for scheduling workloads

  • Coach and nurture engineers to accelerate their growth and learning

You might thrive in this role if you:

  • Have 10+ years of experience in infrastructure software engineering, including 5+ years of experience in engineering management

  • Have prior experience building out high performance computing infrastructure teams at scale

  • Have worked with provisioning bare metal server data centers that interconnect across a WAN

  • Have experience building hybrid-cloud platforms

  • Care deeply about diversity, equity, and inclusion, and have a track record of building inclusive teams

  • Own problems end-to-end, and are willing to pick up whatever knowledge you're missing to get the job done. You will be expected to be able to be hands-on to help the team debug issues or manage systems from time to time as needed.

  • Have the ability to move fast in an environment where things are sometimes loosely defined and may have competing priorities or deadlines

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. 

OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Top Skills

Automation
Gpu
High Performance Computing
Hybrid-Cloud Platforms
Infrastructure Software
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
224 Employees
On-site Workplace
Year Founded: 2015

What We Do

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. AI is an extremely powerful tool that must be created with safety and human needs at its core. OpenAI is dedicated to putting that alignment of interests first — ahead of profit.

To achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. Our investment in diversity, equity, and inclusion is ongoing, executed through a wide range of initiatives, and championed and supported by leadership.

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Similar Jobs

Red 6 Logo Red 6

Senior Optical Software Engineer

Aerospace • Hardware • Software • Virtual Reality • Defense
Santa Monica, CA, USA
113 Employees
170K-220K Annually

Cloudflare Logo Cloudflare

Technical Account Manager, Developer Platform

Cloud • Information Technology • Security • Software • Cybersecurity
Hybrid
3 Locations
3900 Employees
149K-183K Annually

HRL Laboratories Logo HRL Laboratories

Senior Physical Security Applications Engineer

Computer Vision • Hardware • Machine Learning • Software • Semiconductor
Hybrid
Calabasas, CA, USA
1050 Employees
110K-137K Annually

Klaviyo Logo Klaviyo

Salesforce Experience Cloud Developer

Consumer Web • eCommerce • Marketing Tech • Retail • Software • Analytics • Generative AI
Hybrid
San Francisco, CA, USA
2000 Employees
100K-150K Annually

Similar Companies Hiring

HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account