Pear VC

Founding Machine Learning Engineer - NomadicML

Posted 16 Days Ago

Be an Early Applicant

San Francisco, CA

Expert/Leader

Angel or VC Firm

The Role

As a Founding Machine Learning Engineer, you'll develop and refine RAG workflows for video-understanding models, using statistical analysis, while collaborating on front-end tools.

Summary Generated by Built In

About Us:

Mustafa and Varun met at Harvard, where they both did research in the intersection of computation and evaluations. Between them, they have authored multiple published papers in the machine learning domain and hold numerous patents and awards. Drawing on their experiences as tech leads at Snowflake and Lyft, they founded NomadicML to solve a critical industry challenge: elevate critical operations of video-ingesting enterprises with domain-specific semantic reasoning.

At NomadicML, we leverage advanced techniques, such as retrieval-augmented generation, adaptive fine-tuning, and compute-accelerated inference, to significantly improve machine learning models in the domain of real-time video understanding. Backed by leading investors and enterprises (such as Pear VC, BAG VC, Confluent and Cognition AI), we’re committed to building cutting-edge infrastructure that helps teams realize the full potential of their video insights.

About the Role:

As a Founding Machine Learning Engineer, you will shape the next generation of semantic video reasoning AI agents, blending cutting-edge research with practical implementation. You’ll design, implement, and refine Retrieval-Augmented Generation (RAG) pipelines, enabling our models to adapt in real-time to changing data and user needs. This will involve working with text, video, and other high-dimensional inputs, as well as exploring advanced embeddings, vector databases, and GPU-accelerated infrastructures. You’ll apply statistical rigor—using significance testing, distributional checks, and other quantitative methods—to determine precisely when and how to retune models, ensuring that updates are timely yet never arbitrary.

Beyond the core ML tasks, you’ll also be a key contributor to our research initiatives. You’ll evaluate and experiment with new model architectures, foundational models, and emerging techniques in large-scale machine learning and optimization. As part of the full-stack experience, you’ll work closely with the other team members to build intuitive front-end interfaces, dashboards, and APIs. These tools will enable rapid iteration, real-time monitoring, and easy configuration of models and pipelines, making it possible for both technical and non-technical stakeholders to guide model evolution effectively.

Key Responsibilities:

Research, prototype, and integrate new model architectures and foundational models into our pipeline.
Develop and maintain real-time RAG workflows, ensuring efficient adaptation to new text, video, and streaming data sources.
Implement statistical methods to determine when models need retuning, leveraging metrics, significance tests, and distributional analyses.
Collaborate with Software Engineers to build front-end interfaces and dashboards for monitoring performance and triggering model updates.
Continuously refine embeddings, vector databases, and model architectures to drive improved accuracy, latency, and stability.

Must Haves:

Strong Proficiency in Python
Deep understanding of ML model development (e.g., LLMs, embedding techniques)
Experience with Retrieval-Augmented Generation (RAG) pipelines, fine tuning APIs, and similar ML workflows.
Strong statistical background for evaluating model performance

Nice to Haves:

Proficiency in frameworks like PyTorch or TensorFlow
Knowledge of vector databases, embedding stores, and scalable ML serving platforms
Experience with CI/CD tools and ML workflow management (MLflow, Kubeflow)
Prior research background (publications, patents) in ML, especially in foundational models or large-scale adaptation techniques

What We Offer:

Competitive compensation and equity
Apple Equipment
Health, dental, and vision insurance.
Opportunity to build foundational machine learning infrastructure from scratch and influence the product’s technical trajectory.
Primarily in-person at our San Francisco office with hybrid flexibility.

Top Skills

Kubeflow

Mlflow

Python

PyTorch

Retrieval-Augmented Generation

TensorFlow

Vector Databases

View all jobs at Pear VC

View Pear VC Profile

Report Job

Am I A Good Fit?

beta

Get Personalized Job Insights.

Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company

HQ: Menlo Park, CA

154 Employees

On-site Workplace

Year Founded: 2013

What We Do

We’re specialists in pre-seed and seed. The startups we back go far. Best-in-class founders do not come around every day. When they do, we jump at the opportunity to work together. Our approach is to work with just a small number of best-in-class founders so we can dig in and go deep.