ML Performance Engineer

Posted 16 Days Ago
Mountain View, CA
Entry level
Artificial Intelligence • Hardware • Software
The Role
The ML Performance Engineer will build performance models and tools for ML model scheduling, create production-grade libraries for efficient distributed training, and collaborate with teams to enhance ML solutions from architecture to implementation.
Summary Generated by Built In

MatX's mission is to make the world’s best AI models run as efficiently as allowed by physics, bringing the world years ahead in AI quality and availability. We are developing vertically integrated full-stack solutions from silicon to systems, including hardware and software, to train and run the largest ML workloads for AGI. We are looking for people who are excited about systems-focused ML research.

Responsibilities include:

  • Build performance models and tooling to validate and guide scheduling decisions for current and future ML models.
  • Write production-grade libraries for efficient distributed training and serving.
  • Collaborate with architects and with hardware and software teams to drive solutions from model to metal.

Requirements:

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Strong programming skills in Python.
  • Expertise in ML frameworks such as JAX, PyTorch, or TensorFlow.
  • Deep knowledge of the Transformer architecture.
  • Experience with distributed computing, high-performance networking, or large-scale ML systems.

Preferred Skills: 
Any of the following:

  • Hands-on experience with flash attention, quantization, pruning, or other systems performance optimizations.
  • Fluency in parallelism strategies that balance computation, communication, and memory to improve throughput and latency of large models.
  • Experience with performance analysis tools and profilers for large-scale systems.
  • Solid understanding of computer architecture and low-level optimization techniques.
  • A track record of impactful ML systems research, through publication and/or industrial practice.
  • Experience with pre-silicon exploration, post-silicon bringup, and fleet debug.

Compensation: The US base salary for this full-time position is $120,000 - $400,000 + equity + benefits

As part of our dedication to the diversity of our team and our focus on creating an inviting and inclusive work experience, MatX is committed to a policy of Equal Employment Opportunity and will not discriminate against an applicant or employee on the basis of race, color, religion, creed, national origin or ancestry, sex, gender, gender identity, gender expression, sexual orientation, age, physical or mental disability, medical condition, marital/domestic partner status, military and veteran status, genetic information or any other legally recognized protected basis under federal, state or local laws, regulations or ordinances.

All candidates must be authorized to work in the United States and must work from our offices in Mountain View Tuesdays through Thursdays.

This position requires access to information that is subject to U.S. export controls. This offer of employment is contingent upon the applicant's capacity to perform job functions in compliance with U.S. export control laws without obtaining a license from U.S. export control authorities.

Top Skills

Python
The Company
HQ: Mountain View, CA
19 Employees
On-site Workplace

What We Do

MatX designs hardware tailored for the world’s best AI models: We dedicate every transistor to maximizing performance for large models. For these models, we deliver 10× more computing power, enabling AI labs to make models an order of magnitude smarter and more useful. Our hardware would make it possible to train GPT-4 and run ChatGPT, but on the budget of a small startup.

A world with more widely available intelligence is a happier and more prosperous world—picture people of all socioeconomic levels having access to an AI staff of specialist MDs, tutors, coaches, advisors, and assistants.

