Researcher, Machine Learning

Mountain View, CA
151K-208K Annually
Mid level
Software
The Role
The role involves developing fast LLM inferencing techniques and optimizing AI technology stacks. Key responsibilities include developing speculative decoding algorithms for LLMs, optimizing GPU/TPU utilization, analyzing inference bottlenecks, and collaborating on multidisciplinary projects. Researchers are expected to publish findings and contribute to open-source projects while staying up to date with industry trends.
Summary Generated by Built In

Lab Summary:

Samsung Research America is looking for outstanding researchers to join our Emerging Technologies (ET) Group. The ET Group is uniquely positioned at the heart of Samsung Research America’s innovation engine, which aims to advance Samsung’s product offerings across smartphones, wearables, TVs, and XR devices, and to identify the next big growth drivers for Samsung across a range of emerging technologies.

Position Summary: 

We are looking for highly skilled and motivated researchers/engineers who can contribute to the development of fast LLM inferencing techniques, AI technology stack optimization, neuro-symbolic AI models, temporal knowledge graphs, and multimodal reasoning.

Position Responsibilities

  • Develop speculative decoding algorithms tailored for LLMs
  • Optimize GPU/TPU utilization to minimize latency in token generation pipelines
  • Analyze bottlenecks in model inference (e.g., memory bandwidth, compute constraints), propose solutions, and benchmark model performance before and after optimization
  • Lead integration of multiple projects at the intersection of Cognitive Modeling, Machine Learning, Knowledge Representation and Reasoning  
  • Collaborate with a multidisciplinary team of researchers across different teams
  • Stay ahead of industry trends in LLM acceleration (e.g., dynamic batching, quantization, kernel fusion)
  • Publish findings and contribute to open-source projects
  • Generate creative solutions (patents) and publish research results at top conferences (papers)

Required Skills:

  • PhD in CS, EE, or a related field, or an equivalent combination of education, training, and experience
  • 2+ years of work experience after the PhD
  • Expertise in knowledge graphs and retrieval-augmented generation (RAG) models
  • Experience in contextual intelligence
  • Experience with large language models (LLMs), including the Transformer architecture, attention mechanisms, decoder-only LLMs, and autoregressive model optimization
  • Proficiency in PyTorch, CUDA, and distributed training/inference frameworks (e.g., DeepSpeed, vLLM)
  • Hands-on experience profiling and optimizing LLMs on GPUs/TPUs
  • A strong publication record in top-tier AI and NLP conferences is a plus

Special Attributes:

  • Ability to debug complex, latency-critical systems
  • Strong analytical and problem-solving skills, with a keen attention to detail and a passion for pushing the boundaries of AI capabilities
  • Excellent written and verbal communication skills, with the ability to present complex concepts and research findings in a clear and concise manner
  • Demonstrated ability to work independently as well as collaboratively in a fast-paced research and development environment

Our total rewards programs are designed to motivate and engage exceptional talent. The base pay range for roles at this level is listed below, but may be higher or lower in other states due to geographic differentials in the labor market. Within the base pay range, individual rates depend on a number of factors—including the role’s function and location as well as the individual’s knowledge, skills, experience, education and training. This is part of our comprehensive compensation package with annual bonus eligibility and generous benefits to help you live life well.

Base Pay Range

$151,200 - $207,750 USD

Additional Information

Be careful not to disclose information related to the trade secrets of your previous or current employer(s).

Essential Job Functions

This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, and frequently operate standard office equipment, such as telephones and computers.

Samsung Research America is committed to complying with all Federal, State and local laws related to the employment of qualified individuals with disabilities. If you are an individual with a disability and would like to request a reasonable accommodation as part of the employment selection process, please contact the recruiter or email [email protected].

Affirmative Action / Equal Opportunity

Samsung Research America is an Affirmative Action and Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, disability, or status as a protected veteran.

For more information regarding protection from discrimination under Federal law for applicants and employees, please refer to the links below.

Know Your Rights  |  Pay Transparency

Top Skills

Python

The Company
HQ: Mountain View, CA
455 Employees
On-site Workplace
Year Founded: 1988

What We Do

Founded in October 1988, Samsung Research America (SRA) builds upon Samsung’s 40-year history in the Bay Area. Headquartered in Silicon Valley, with offices across the United States and Canada, SRA is engaged in researching emerging technology to create new businesses and developing core technology to enhance the competitiveness of Samsung products. We also play a key role in providing the infrastructure to support Samsung’s open innovation and university collaboration activities.
