Senior Research Engineer (Data)

Posted 16 Days Ago
Palo Alto, CA
Mid level
Information Technology
The Role
The Senior Research Engineer (Data) will lead data acquisition and management systems essential for AI research, develop pipelines for dataset preparation, collaborate with teams to enhance dataset quality, and drive improvements in data quality for video generation models.
Summary Generated by Built In

We are seeking a Senior Software Engineer to spearhead our data acquisition and management systems, critical to our advanced AI research. In this role, you will architect and maintain efficient pipelines for sourcing, processing, and organizing the extensive datasets that fuel our generative AI models. Your expertise will have a direct and transformative impact on the quality and capabilities of our technology.

Responsibilities

  • Partner with research teams to understand and address model performance gaps by identifying and leveraging novel data sources.
  • Develop and implement robust data pipelines for acquisition, deduplication, filtering, and pre-training dataset preparation.
  • Collaborate with annotation operations teams to design innovative data filtering strategies and enhance dataset quality.
  • Apply and integrate advanced methodologies such as self-supervised active learning to scale data systems.
  • Lead research projects to improve data quality and drive advancements in video generation models.

Qualifications

  • Education: Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.
  • Experience: 3+ years of experience in managing and curating large-scale datasets, particularly in fields like computer vision, NLP, robotics, or self-driving technologies.

Key Skills:

  • Strong proficiency in Python and familiarity with deep learning frameworks such as PyTorch.
  • Experience with large-scale data processing tools, such as SQL or Spark.
  • Hands-on expertise in designing and working with distributed systems.
  • Proven ability to thrive in a fast-paced, research-focused environment and deliver end-to-end project solutions.

Note: This position is not intended for recent graduates.

Compensation

The salary range for this role in California is $175,000–$250,000 per year. Actual base pay may vary based on factors such as job-related expertise, skills, experience, and candidate location. Additionally, we provide competitive equity packages through stock options and a comprehensive benefits plan.

Top Skills

Python
The Company
29 Employees
Remote Workplace
Year Founded: 2023

What We Do

An idea-to-video platform that brings your creativity to motion

Similar Jobs

Atlassian Logo Atlassian

Data Engineer Intern, 2025 Summer U.S.

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
San Francisco, CA, USA
11000 Employees

The Walt Disney Company Logo The Walt Disney Company

Lead Data Engineer

AdTech • Digital Media • News + Entertainment
Hybrid
Santa Monica, CA, USA
200000 Employees
149K-210K Annually

NVIDIA Logo NVIDIA

Senior Research Engineer, ML Data Pipelines

Artificial Intelligence • Hardware • Robotics • Software • Metaverse
Santa Clara, CA, USA
21960 Employees

Kumo Logo Kumo

Data Engineer

Artificial Intelligence • Cloud • Machine Learning • Software • Database
Hybrid
Mountain View, CA, USA
38 Employees

Similar Companies Hiring

Silverfort Thumbnail
Security • Sales • Information Technology • Cybersecurity • Automation
GB
357 Employees
Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
InCommodities Thumbnail
Renewable Energy • Machine Learning • Information Technology • Energy • Automation • Analytics
Austin, TX
234 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account