Founding Machine Learning Engineer - NomadicML

Posted 16 Days Ago
Be an Early Applicant
San Francisco, CA
Expert/Leader
Angel or VC Firm
The Role
As a Founding Machine Learning Engineer, you'll develop and refine RAG workflows for video-understanding models, using statistical analysis, while collaborating on front-end tools.
Summary Generated by Built In

About Us:

Mustafa and Varun met at Harvard, where they both did research in the intersection of computation and evaluations. Between them, they have authored multiple published papers in the machine learning domain and hold numerous patents and awards. Drawing on their experiences as tech leads at Snowflake and Lyft, they founded NomadicML to solve a critical industry challenge: elevate critical operations of video-ingesting enterprises with domain-specific semantic reasoning.

At NomadicML, we leverage advanced techniques, such as retrieval-augmented generation, adaptive fine-tuning, and compute-accelerated inference, to significantly improve machine learning models in the domain of real-time video understanding. Backed by leading investors and enterprises (such as Pear VC, BAG VC, Confluent and Cognition AI), we’re committed to building cutting-edge infrastructure that helps teams realize the full potential of their video insights.

About the Role:

As a Founding Machine Learning Engineer, you will shape the next generation of semantic video reasoning AI agents, blending cutting-edge research with practical implementation. You’ll design, implement, and refine Retrieval-Augmented Generation (RAG) pipelines, enabling our models to adapt in real-time to changing data and user needs. This will involve working with text, video, and other high-dimensional inputs, as well as exploring advanced embeddings, vector databases, and GPU-accelerated infrastructures. You’ll apply statistical rigor—using significance testing, distributional checks, and other quantitative methods—to determine precisely when and how to retune models, ensuring that updates are timely yet never arbitrary.

Beyond the core ML tasks, you’ll also be a key contributor to our research initiatives. You’ll evaluate and experiment with new model architectures, foundational models, and emerging techniques in large-scale machine learning and optimization. As part of the full-stack experience, you’ll work closely with the other team members to build intuitive front-end interfaces, dashboards, and APIs. These tools will enable rapid iteration, real-time monitoring, and easy configuration of models and pipelines, making it possible for both technical and non-technical stakeholders to guide model evolution effectively.

Key Responsibilities:

  • Research, prototype, and integrate new model architectures and foundational models into our pipeline.

  • Develop and maintain real-time RAG workflows, ensuring efficient adaptation to new text, video, and streaming data sources.

  • Implement statistical methods to determine when models need retuning, leveraging metrics, significance tests, and distributional analyses.

  • Collaborate with Software Engineers to build front-end interfaces and dashboards for monitoring performance and triggering model updates.

  • Continuously refine embeddings, vector databases, and model architectures to drive improved accuracy, latency, and stability.

Must Haves:

  • Strong Proficiency in Python 

  • Deep understanding of ML model development (e.g., LLMs, embedding techniques)

  • Experience with Retrieval-Augmented Generation (RAG) pipelines, fine tuning APIs, and similar ML workflows.

  • Strong statistical background for evaluating model performance 

Nice to Haves:

  • Proficiency in frameworks like PyTorch or TensorFlow

  • Knowledge of vector databases, embedding stores, and scalable ML serving platforms

  • Experience with CI/CD tools and ML workflow management (MLflow, Kubeflow)

  • Prior research background (publications, patents) in ML, especially in foundational models or large-scale adaptation techniques

What We Offer:

  • Competitive compensation and equity

  • Apple Equipment

  • Health, dental, and vision insurance.

  • Opportunity to build foundational machine learning infrastructure from scratch and influence the product’s technical trajectory.

  • Primarily in-person at our San Francisco office with hybrid flexibility.

Top Skills

Kubeflow
Mlflow
Python
PyTorch
Retrieval-Augmented Generation
TensorFlow
Vector Databases
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Menlo Park, CA
154 Employees
On-site Workplace
Year Founded: 2013

What We Do

We’re specialists in pre-seed and seed. The startups we back go far. Best-in-class founders do not come around every day. When they do, we jump at the opportunity to work together. Our approach is to work with just a small number of best-in-class founders so we can dig in and go deep.

Similar Jobs

San Francisco, CA, USA
154 Employees

MongoDB Logo MongoDB

Forward Deployed Engineer, AI

Big Data • Cloud • Software • Database
San Francisco, CA, USA
2382 Employees
118K-231K Annually

Anduril Logo Anduril

Sr. Supplier Quality Engineer - PCB/PCBA

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
Costa Mesa, CA, USA
4500 Employees
142K-213K Annually

Relativity Space Logo Relativity Space

Structures Manufacturing Engineer I, Mechanical Assembly

3D Printing • Aerospace • Hardware • Robotics • Software • Manufacturing
Easy Apply
Hybrid
Long Beach, CA, USA
1300 Employees
97K-124K Annually

Similar Companies Hiring

Cie Thumbnail
Software • Enterprise Web • Digital Media • Consulting • Co-Working Space or Incubator • Angel or VC Firm • Agency
Irvine, CA
65 Employees
M13 Thumbnail
Angel or VC Firm
New York, NY
40 Employees
The HEICO Companies, LLC Thumbnail
Manufacturing • Industrial • Angel or VC Firm
Warrenville, IL
9000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account