LLM Algorithmic Optimization Engineer - Intern

Posted 15 Days Ago
San Jose, CA
Internship
Automotive
The Role
As an LLM Algorithmic Optimization Engineer Intern, you will research and apply technologies to optimize Large Language Models (LLMs) for efficient inference and deployment across diverse hardware. The role involves collaboration with teams to integrate optimized models into automotive applications and covers the entire pipeline from research to deployment on GPUs and other systems.
Summary Generated by Built In

JOB DESCRIPTION

About NIO

NIO is a pioneer and a leading company in the premium smart electric vehicle market. Founded in November 2014, NIO’s mission is to shape a joyful lifestyle. NIO aims to build a community starting with smart electric vehicles to share joy and grow together with users.

NIO designs, develops, jointly manufactures and sells premium smart electric vehicles, driving innovations in next-generation technologies in autonomous driving, digital technologies, electric powertrains and batteries. NIO differentiates itself through its continuous technological breakthroughs and innovations, such as its industry-leading battery swapping technologies, Battery as a Service, or BaaS, as well as its proprietary autonomous driving technologies and Autonomous Driving as a Service, or ADaaS.

NIO’s product portfolio consists of the ES8, a six-seater smart electric flagship SUV, the ES7 (or the EL7), a mid-large five-seater smart electric SUV, the ES6, a five-seater all-round smart electric SUV, the EC7, a five-seater smart electric flagship coupe SUV, the EC6, a five-seater smart electric coupe SUV, the ET7, a smart electric flagship sedan, and the ET5, a mid-size smart electric sedan.

Roles and Responsibilities:

  • Research and apply cutting-edge technologies to optimize Large Language Models (LLMs) and multimodal models, exploring and implementing core algorithmic optimizations on heterogeneous architectures for highly efficient LLM inference and deployment across distributed and heterogeneous hardware environments.
  • Focus on model optimization from a systems perspective, ensuring efficient deployment in the vehicle’s digital cockpit and advanced driving (AD) domain.
  • Collaborate with cross-functional teams to ensure the integration of optimized models into real-world automotive applications.
  • Contribute to the entire pipeline from research, development, and testing, through to deployment on hardware, including GPUs and other distributed systems.

Qualifications:

  • Currently pursuing or completed a PhD or Master’s degree in Computer Science, Computer Engineering, Applied Mathematics, Communications, Electronics, or a related field with relevant research projects and publications.
  • Strong understanding of GPU/NPU architecture and optimization techniques to identify and address bottlenecks.
  • Proficiency in LLM and VLM architectures and algorithms, and familiarity with transformer-based NLP, audio, and CV algorithms and technologies.
  • Proficiency in Python and experience with AI-related training and inference tools such as PyTorch.
  • Proficiency in C/C++ programming and familiarity with at least one commonly used LLM inference engine.
  • Hands-on experience with model exchange and serving tools such as Open Neural Network Exchange (ONNX).
  • Familiarity with debugging code in distributed computing environments.
  • Experience in LLM inference optimization on resource-constrained edge devices is a plus.

Preferred Qualifications:

  • PhD in computer science, artificial intelligence, or a related field; or a Master’s degree plus 3 years of relevant industry experience

  • Experience with inference optimization techniques for deep learning models or libraries on hardware architectures

  • Familiar with microkernel architecture, Linux kernel, hypervisor, middleware, and application framework

  • A strong publication record, including high-impact, innovative papers, is preferred

Compensation:

The US base salary range for this full-time position is $38.00 - $46.00.

  • Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

  • Please note that the compensation details listed in US role postings reflect the base salary only. They do not include discretionary bonus, equity, or benefits.

Top Skills

C/C++
Python

The Company
San Jose, CA
2,732 Employees
On-site Workplace
Year Founded: 2014

What We Do

NIO’s mission is to shape a joyful lifestyle by offering premium smart electric vehicles and providing the best user experience. NIO was founded in November 2014 as a global electric vehicle company. The company has over 7,000 employees working across world-class research and development, design and manufacturing centers in Shanghai, Beijing, San Jose, Munich, London and six other locations. In 2015, NIO was the title sponsor for the Drivers’ Championship winning team during the inaugural ABB FIA Formula E season. In 2016, NIO unveiled one of the fastest electric cars in the world, the EP9. The EP9 set the lap record for an electric vehicle at the Nürburgring Nordschleife and three other world-renowned tracks. In 2017, NIO unveiled its vision car EVE and announced that the NIO EP9 set a new world speed record for an autonomous vehicle at the Circuit of the Americas. NIO officially began deliveries of the ES8, the high-performance electric flagship SUV, on June 28, 2018. NIO was listed on the New York Stock Exchange on September 12, 2018. NIO officially launched the high-performance long-range electric SUV, NIO ES6, at NIO Day on December 15, 2018. On May 28, 2019, the first production model ES6 rolled off the line at the JAC NIO Advanced Manufacturing Center. NIO officially began deliveries of the ES6 on June 18, 2019. NIO officially launched the EC6, a 5-seater smart premium electric coupe SUV, in December 2019 and began deliveries of the EC6 in September 2020. On January 9, 2021, NIO ET7, the smart electric flagship sedan and NIO’s first autonomous driving model, was officially launched.
