Sourcegraph

ML Engineer [IC3]

Posted 10 Days Ago

San Francisco, CA

Senior level

Information Technology • Software

A code search + AI platform to help you understand, fix, and automate across all your code.

The Role

As a full-stack ML Engineer, you will develop and deploy large-scale ML models, manage data pipelines, and push AI boundaries while collaborating closely with product teams.

Summary Generated by Built In

Who we are

Our mission at Sourcegraph is to make it so that everyone can code, not just ~0.1% of the population.

We are transforming how the world’s most important companies build software by industrializing development with AI. Today, most professional developers spend a disproportionate amount of time understanding code and performing repetitive, low-level tasks—leaving less time for innovation and meaningful impact.

We’re changing that. Sourcegraph brings AI-powered search and agents to the enterprise, helping teams automate the mundane and amplify what developers do best: solving hard problems and creating great products.

Here’s how we’re making a difference:

Accelerating developers with AI agents that deliver insights and precision, enabling 5x faster test creation, a 30% increase in merge requests, and savings of 20 minutes per developer daily.
Automating repetitive tasks, from remediating vulnerabilities (saving teams 1,000+ hours annually) to cutting migrations that would take years down to months.
Enabling innovation by addressing complex problems like automated bug triage, vulnerability detection, and AI-driven code reviews seamlessly integrated into workflows.

We are trusted by 7 of the 10 top software companies by market cap, 4 of the 6 top US banks, and many of the companies leading global innovation, like Stripe, Indeed, Tesla, and 1Password. With $225M in funding from investors like a16z, Sequoia, and Redpoint, we are building the tools that will define the next era of enterprise software development.

If you’re passionate about solving the hardest problems in software and shaping the future of technology, join us. Let’s build something extraordinary together.

Hours & location

🌎 While we hire almost anywhere in the world, we have office hubs in select locations and expect this person to reside in the San Francisco Bay Area to be able to work from our office for several days per week.

Required locations:

San Francisco, CA

We do not subscribe to “I do my best work when I work 40 hours a week.” People we hire at Sourcegraph believe that building outstanding things means working very hard — smarter and more hours than the competition.

Why this job is exciting

Many companies are trying, but Sourcegraph is uniquely differentiated by our rich code intelligence data and powerful code search platform. In the world of prompting LLMs, context is everything, and Sourcegraph’s context is simply the best you can get: IDE-quality, global-scale, and served lightning fast. Our code intelligence, married with modern AI, is already providing a remarkable alpha experience, and you can help us unlock its full potential.

We are looking for an experienced full-stack ML engineer with demonstrated industry experience productionizing large-scale ML models. And if you happen to have an entrepreneurial streak, you’re in luck: we have an enterprise distribution pipeline, so whatever you build can be deployed straight to enterprise customers with some of the largest code bases in the world, without all the go-to-market hassle you’d encounter in a startup.

You will be tightly integrated with our product teams and pushing the boundaries of what AI can do. You will have the full power of Sourcegraph’s Code Intelligence Platform at your disposal, and you’ll be working on coding agents to multiply dev productivity to unprecedented levels.

📅 Within one month, you will…

Start building a trusting relationship with your peers, and learning the company structure.
Be set up to do local development, and be actively prototyping.
Dive deep into how AI and ML are already used at Sourcegraph and identify ways to improve them.
Develop simulated datasets using Gym-style frameworks across a number of Cody use cases.
Experiment with changes to Cody prompts and context sources, and evaluate those changes with offline experimentation datasets.
Ship a substantial new feature to end users.

📅 Within three months, you will…

Be building out feature computation, storage, monitoring, analysis, and serving systems for features required across our Cody LLM stack.
Be contributing actively to the world’s best coding agents.
Be developing distributed training and experiment infrastructure over Code AI datasets, and scaling distributed backend services to reliably support high-QPS, low-latency use cases.
Be following all the relevant research, and conducting research of your own.

📅 Within six months, you will…

Be fully ramped up and owning key pieces of the agents.
Be ramped up on other relevant parts of the Sourcegraph product.
Be helping design and build what might become the biggest dev accelerator in 20 years.
Be owning a number of ML systems and building the core data and model metadata systems that power the end-to-end ML lifecycle.
Be developing a highly scalable, high-QPS inference service that delivers low-latency performance using a mix of CPU and GPU hardware to use resources as efficiently as possible.
Be driving the technical vision and owning a couple of major ML components, including their modeling and ML infra roadmap.

About you

You are an experienced full-stack ML engineer with demonstrated industry experience in formulating ML solutions, developing end-to-end data orchestration pipelines, deploying large-scale ML models, and experimenting offline and online to drive business impact for Sourcegraph users. You want to be part of a world-class team to push the boundaries of AI, with a particular focus on leveraging Sourcegraph’s code intelligence to leapfrog competitors.

You have 5-8 years of industry experience.
You are a backend-focused ML engineer who has worked on the entire ML lifecycle.
You have deployed ML models to production for real users and have developed feature pipelines.
You understand the nuances of applying ML in user-facing products to move metrics forward.

Level

📊 This job is an IC3. You can read more about our job leveling philosophy in our Handbook.

Compensation

💸 We pay you an above-average salary because we want to hire the best people who are fully focused on helping Sourcegraph succeed, not worried about paying bills. As an open and transparent company that values competitive compensation, our compensation ranges are visible to every single Sourcegraph teammate.

Your salary is determined by your pay band for the IC3 job level. To determine pay bands, we use a number of market and data-driven salary sources, along with your location zone, and target the high end of the range to ensure we’re always paying above market regardless of where you live in the world. Both U.S. and international locations are divided into one of four zones, determined by the cost of labor index for each area. The salary for a successful candidate will be based on level, job-related skills, experience, qualifications, and location zone. Please note that the salaries below may be adjusted in the future.

💰 The target compensation for this role is based on the IC3 pay band for your zone. The start of the IC3 pay band for each zone is listed below:

Zone 1: $195,000 USD

Please speak with a recruiter for additional information regarding zone locations.

📈 In addition to our cash compensation, we offer equity (because when we succeed as a company, we want you to succeed, too) and generous perks & benefits.

Interview process

Below is the interview process you can expect for this role (you can read more about the types of interviews in our Handbook). It may look like a lot of steps, but rest assured that we move quickly and the steps are designed to help you get the information needed to determine if we’re the right fit for you… Interviewing is a two-way street, after all!

We expect the interview process to take 5.5 hours in total.

👋 Introduction Stage - we have initial conversations to get to know you better…

[30m] Recruiter Screen
[45m] Technical Deep Dive

🧑‍💻 Team Interview Stage - we then delve into your experience in more depth and introduce you to members of the team, including cross-functional partners…

[60m] Resume Deep Dive
[60m] Architecture Interview
[15m + async] Pairing Exercise

🎉 Final Interview Stage - we move you to our final round, where you gain a better understanding of our business and values holistically…

[30m] Values
[30m] Leadership with co-founder
We check references and conduct your background check

Please note: you are welcome to request additional conversations with anyone you would like to meet but didn’t get to meet during the interview process.

Learn more about us

You can learn more about what it is like to work at Sourcegraph by reading our handbook.

We are an ambitious team who are collectively working hard to build the most influential company in the world. You can read more about our culture, competitive compensation and benefits here.

Sourcegraph is an equal opportunity workplace; we welcome people from all backgrounds.

Sourcegraph participates in E-Verify for U.S. Employees.

Top Skills

Coding Agents

Data Orchestration Pipelines

GPU Hardware

Gym-Style Frameworks

ML Models

The Company

San Francisco, California

180 Employees

On-site Workplace

Year Founded: 2013

What We Do

Sourcegraph is a code AI platform that makes it easy to read, write, and fix code, even in big, complex code bases. Meet our 2 products:

Cody: Write, fix, and maintain code with the most powerful & accurate AI coding assistant, Cody. Cody uses the code graph to understand your entire codebase and help developers write and ship code with autocomplete and commands.

Code Search: Search your entire codebase—every code host and repository, at any scale—in a single place. Code Search makes it easy for developers to onboard to new codebases, understand code faster, and find & fix security risks.

Why Work With Us

We're developing the world's most advanced code AI platform with a team of brilliant people across the globe (100% remote). Our company values are the beliefs and principles that help us achieve our goals and build an inclusive team. We provide total rewards that are highly competitive and allow you to thrive both personally and professionally.