Research Engineer (Pre-training & Post-training)

Posted 2 Days Ago
Be an Early Applicant
San Francisco, CA
Mid level
Artificial Intelligence • Software
The Role
The Research Engineer will manage the lifecycle of AI models, focusing on large-scale data pipelines, model optimization, and collaboration across multimodal systems.
Summary Generated by Built In

Job title: Research Engineer (Pre-training & Post-training) / Member of Technical Staff

Who We Are
WaveForms AI is an Audio Large Language Models (LLMs) company building the future of audio intelligence through advanced research and products. Our models will transform human-AI interactions making them more natural, engaging and immersive.

Role overview: The Research Engineer – Pre-training & Post-training role integrates responsibilities across all phases of the AI model lifecycle, including pre-training, post-training, and data preparation. This position involves building and optimizing large-scale data pipelines, handling multimodal datasets (audio and text), conducting pre-training with a focus on compute efficiency and scalability, and refining models with cutting-edge techniques like supervised fine-tuning (SFT), reinforcement learning from human feedback (RLHF) and generative modeling. The ideal candidate will leverage advanced methods, including GANs and diffusion models, to push the boundaries of multimodal AI systems focused on audio and text.

Key Responsibilities

  • Lead the pre-training and fine-tuning of large-scale language models (LLMs), maximizing compute efficiency and scaling infrastructure.

  • Optimize model performance using advanced techniques, including RLHF, reward modeling (RM), instruction-tuning, distillation, GANs, and diffusion models.

  • Develop robust evaluation pipelines to monitor, refine, and improve model performance throughout training phases.

  • Build and optimize scalable, distributed data pipelines to support multimodal (audio + text) AI training.

  • Handle and process massive datasets (PiB scale) for pre-training and post-training, ensuring efficient preparation, annotation, and data flow.

  • Collaborate with research and engineering teams to ensure seamless integration of data preparation and training workflows for multimodal systems.

Required Skills & Qualifications

  • Proven experience in training large language models (LLMs), including pre-training, fine-tuning, and post-training optimization.

  • Strong background in distributed systems, compute efficiency, and scaling model training infrastructure.

  • Expertise in designing and managing large-scale, distributed data pipelines for multimodal datasets, particularly audio + text.

  • Proficiency in advanced techniques such as RLHF, instruction-tuning, reward modeling, distillation, GANs, and diffusion models.

  • Proficiency in Python, PyTorch, and distributed frameworks (e.g., Fully Sharded Data Parallel)

  • Familiarity with cloud platforms like AWS, GCP, or Azure for managing distributed environments.

  • Knowledge of multimodal AI systems combining audio and text for training and evaluation.

Minimum Experience

  • 4-5 years of relevant professional experience is required

Top Skills

AWS
Azure
GCP
Python
PyTorch
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
9 Employees
On-site Workplace
Year Founded: 2024

What We Do

WaveForms AI is an Audio LLM research and product company aiming to solve the Speech Turing Test and create Emotional General Intelligence. Learn more at waveforms.ai/about.

Similar Jobs

Finch Logo Finch

Implementation Engineer

Fintech • HR Tech • Software
Hybrid
2 Locations
68 Employees

Atlassian Logo Atlassian

Senior Data Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
San Francisco, CA, USA
11000 Employees
136K-218K Annually

Voltage Park Logo Voltage Park

Software Engineer, Kubernetes & Virtualization Platform

Artificial Intelligence • Cloud • Hardware • Machine Learning • Other • Software • Infrastructure as a Service (IaaS)
Remote
2 Locations
51 Employees
120K-180K Annually

Veeva Logo Veeva

Senior DevOps Engineer

Big Data • Cloud • Healthtech • Software • Big Data Analytics
Remote
San Luis Obispo, CA, USA
6000 Employees
125K-275K Annually

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account