Gemini Large-Scale Pre-Training TPM

Posted 16 Days Ago
Mountain View, CA
168K-266K
Senior level
Artificial Intelligence
The Role
The Technical Program Manager will oversee large-scale model pre-training projects, ensuring timely execution and effective resource management while collaborating with cross-functional teams.
Summary Generated by Built In

Snapshot

The Gemini Pre-Training team is developing the next generation of large models, pushing the boundaries of what's possible in AI. This team is responsible for the critical first step: pre-training these models at a massive scale. This requires incredibly precise planning and execution, as each training run is a significant investment. As a Technical Program Manager, you'll play a vital role in ensuring these complex projects run smoothly and efficiently, collaborating with researchers, engineers, and other teams across Google DeepMind.

About us

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

The role

As a Technical Program Manager on the Gemini Pre-Training team, you will be a key contributor to the successful execution of large-scale model pre-training runs. You will work closely with research scientists, engineers, and other TPMs to plan, track, and manage the various aspects of these complex projects. You'll be responsible for ensuring that projects stay on schedule, resources are used effectively, and risks are proactively identified and mitigated. This role requires a strong technical understanding of machine learning, excellent organizational skills, and the ability to work effectively in a fast-paced, collaborative environment.

Key responsibilities

  • Program Planning & Execution: Contribute to the planning and execution of large-scale pre-training runs. This includes developing detailed project plans, supporting the creation of technical roadmaps in collaboration with engineering stakeholders, defining milestones, tracking progress, and managing dependencies.
  • Risk Management: Identify potential risks and issues that could impact project timelines or success. Develop and implement mitigation strategies.
  • Process Improvement: Identify opportunities to streamline the pre-training process and improve efficiency. Contribute to the development and implementation of repeatable processes and automated workflows.
  • Cross-Functional Collaboration: Work closely with research scientists, engineers, product managers, and infrastructure teams to ensure alignment on goals, priorities, and dependencies. Facilitate clear communication and collaboration across these teams.
  • Resource Coordination: Assist in managing and coordinating access to shared resources, such as compute infrastructure, data, and code repositories.
  • Status Reporting: Provide regular updates to stakeholders on project progress, highlighting key milestones, risks, and issues.
  • Documentation: Maintain clear and comprehensive documentation for processes, plans, and project status.
  • Program Closure: Contribute to the assessment of program viability and provide recommendations for closure when necessary, in the best interest of GDM.
  • Team Support: Proactively share knowledge, best practices, and guidance with other team members to enhance overall team performance.

About you

In order to set you up for success as a Technical Program Manager at Google DeepMind, we look for the following skills and experience:

  • Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • 5+ years of experience as a Technical Program Manager, Project Manager, or in a similar role involving the coordination and management of technical projects.
  • Demonstrated experience managing complex projects with multiple stakeholders and dependencies.
  • Strong understanding of project management methodologies and best practices.
  • Solid technical understanding of machine learning concepts; familiarity with large models is a significant plus.
  • Excellent communication, interpersonal, and presentation skills. Able to communicate technical information effectively to both technical and non-technical audiences, clearly articulating risks and opportunities to influence decision-making.
  • Strong organizational skills and attention to detail.
  • Ability to thrive in a fast-paced, dynamic, and collaborative environment.
  • Proven ability to proactively identify and solve problems.


In addition, the following would be an advantage:

  • Experience working with large models and pre-training techniques.
  • Experience with managing large-scale compute resources.
  • Experience with computing platforms (e.g., Google Cloud Platform).
  • Familiarity with automation tools and scripting languages (e.g., Python).


The US base salary range for this full-time position is between $168,000 and $266,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Application deadline: 12pm GMT Friday 23rd May 2025

Note: In the event your application is successful and an offer of employment is made to you, any offer of employment will be conditional on the results of a background check, performed by a third party acting on our behalf. For more information on how we handle your data, please see our Applicant and Candidate Privacy Policy.


At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

 

Top Skills

Google Cloud Platform
Python

The Company
1,218 Employees
On-site Workplace
Year Founded: 2010

What We Do

We’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

Our long-term aim is to solve intelligence, developing more general and capable problem-solving systems, known as artificial general intelligence (AGI).

Guided by safety and ethics, this invention could help society find answers to some of the world’s most pressing and fundamental scientific challenges.

We have a track record of breakthroughs in fundamental AI research, published in journals like Nature, Science, and more. Our programs have learned to diagnose eye diseases as effectively as the world’s top doctors, to save 30% of the energy used to keep data centres cool, and to predict the complex 3D shapes of proteins, which could one day transform how drugs are invented.

