ML Runtime Optimization Engineer

Posted 3 Days Ago
Mountain View, CA
Mid level
Automotive
The Role
The role involves optimizing ML models for deployment in embedded environments, improving compute efficiency and reducing latency while collaborating with ML engineers on model solutions.
Summary Generated by Built In
About Applied Intuition

Applied Intuition is a vehicle software supplier that accelerates the adoption of safe and intelligent machines worldwide. Founded in 2017, Applied Intuition delivers the AI-powered ADAS/AD toolchain, Vehicle OS, and autonomy stack to help customers shorten time to market, build high-quality systems, and create next-generation consumer experiences. 18 of the top 20 global automakers trust Applied Intuition’s solutions to drive the production of modern vehicles. Applied Intuition serves the automotive, trucking, construction, mining, agriculture, and defense industries and is headquartered in Mountain View, CA, with offices in San Diego, CA, Ft. Walton Beach, FL, Ann Arbor and Detroit, MI, Washington, D.C., Stuttgart, Munich, Stockholm, Seoul, and Tokyo. Learn more at appliedintuition.com.

We are an in-office company, and our expectation is that employees primarily work from their Applied Intuition office 5 days a week. However, we also recognize the importance of flexibility and trust our employees to manage their schedules responsibly. This may include occasional remote work, starting the day with morning meetings from home before heading to the office, or leaving earlier when needed to accommodate family commitments. (Note: This does not apply to EpiSci roles.)

 

About the role

We are looking for a software engineer with deep experience in optimizing ML models and deploying them on production-grade embedded runtime environments. You’ll work across the entire ML framework stack (e.g. PyTorch, JAX, ONNX, TensorRT, CUDA, XLA, Triton).

At Applied Intuition, you will:

  • Drive ML performance optimization on multiple technologies for on-road and off-road ADAS / AD stacks targeting deployment on a variety of embedded compute platforms 
  • Develop compute usage strategies to optimize efficiency and latency of model inference for compute boards selected by our customers
  • Work on model pruning and quantization, and support deployment on memory-constrained platforms
  • Collaborate closely with ML engineers and software developers to identify and optimize efficient model architectures
  • Set up methodologies to profile model performance on target embedded compute platforms and identify performance bottlenecks as part of stack integration

We're looking for someone who has:

  • Bachelor's degree in Electrical Engineering, Computer Science, Mathematics, Physics, or a related field
  • 3+ years of experience with ML accelerators and with GPU, CPU, and SoC architecture and microarchitecture
  • Strong software development skills with a focus on embedded programming
  • Experience profiling and optimizing model performance on embedded compute platforms
  • Experience working with deep learning frameworks (e.g., PyTorch, JAX, ONNX)

Nice to have:

  • M.Sc. or PhD in an ML-related area
  • Experience building an ML optimization framework from scratch
  • Experience deploying ML solutions to embedded chips for real-time robotics applications

Compensation at Applied Intuition for eligible roles includes base salary, equity, and benefits. Base salary is a single component of the total compensation package, which may also include equity in the form of options and/or restricted stock units, comprehensive health, dental, vision, life and disability insurance coverage, 401k retirement benefits with employer match, learning and wellness stipends, and paid time off. Note that benefits are subject to change and may vary based on jurisdiction of employment.

Applied Intuition pay ranges reflect the minimum and maximum target base salary for new hires in the position. The actual base salary offered to a successful candidate will also be influenced by a variety of factors, including experience, credentials and certifications, educational attainment, skill level requirements, interview performance, and the level and scope of the position.

Please reference the job posting’s subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the location listed is: $159,053 - $199,295 USD annually. 

Don’t meet every single requirement? If you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.

Applied Intuition is an equal opportunity employer and federal contractor or subcontractor. Consequently, the parties agree that, as applicable, they will abide by the requirements of 41 CFR 60-1.4(a), 41 CFR 60-300.5(a) and 41 CFR 60-741.5(a) and that these laws are incorporated herein by reference. These regulations prohibit discrimination against qualified individuals based on their status as protected veterans or individuals with disabilities, and prohibit discrimination against all individuals based on their race, color, religion, sex, sexual orientation, gender identity or national origin. These regulations require that covered prime contractors and subcontractors take affirmative action to employ and advance in employment individuals without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status or disability. The parties also agree that, as applicable, they will abide by the requirements of Executive Order 13496 (29 CFR Part 471, Appendix A to Subpart A), relating to the notice of employee rights under federal labor laws.

Top Skills

CUDA
JAX
ONNX
PyTorch
TensorRT
Triton
XLA

The Company
Sunnyvale, CA
472 Employees
On-site Workplace
Year Founded: 2017

What We Do

As the foremost enabler of autonomous vehicle development, Applied Intuition equips engineering and product teams with software that makes it faster, safer, and easier to bring autonomy to market.
Applied’s suite of products, focused on simulation and analytics, delivers sophisticated infrastructure built for scale. Companies of all sizes use Applied to comprehensively test and rapidly accelerate their autonomous vehicle development.
Headquartered in Silicon Valley with offices in Detroit, Tokyo, and Munich, Applied is composed of software and automotive experts from the top companies in the world (such as Google, Amazon, Apple, Waymo, Tesla, Delphi, GM, and Bosch).

