Staff Applied Research Scientist

Posted Yesterday
2 Locations
Remote
Senior level
Software
The Role
As a Staff Applied Research Scientist, you will oversee the research process, collaborate with your team, contribute to the design of research roadmaps, and train other members while solving complex problems in media creation using deep learning techniques.
Summary Generated by Built In

 

Our vision is to build the next-generation platform for fast and easy creation of audio and video content. In November 2022, we took a huge leap forward by launching Storyboard, the video editing foundation that we’re excited to build atop AI capabilities like Overdub, Studio Sound, and other generative AI capabilities in the near future. We're used by some of the world's top podcasters and influencers as well as businesses such as BBC, ESPN, Hubspot, Shopify and Washington Post for communicating via video. We've raised $100M from some of the world's best investors like Andreessen Horowitz, OpenAI Startup fund, Redpoint Ventures and Spark Capital.

 

We need great people to help us build these cutting-edge technologies and guide its development. In particular, we’re always looking to hire smart applied research scientists. You will join a team of around a dozen researchers specialized in generative models and deep learning.

 

Some of our research publications:

  • SampleRNN: An Unconditional End-to-End Neural Audio Generation Model.
  • Char2Wav: End-to-end Speech Synthesis.
  • ObamaNet: Photo-realistic lip-sync from text.
  • MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis.
  • Chunked Autoregressive GAN for Conditional Waveform Synthesis
  • Wav2CLIP: Learning Robust Audio Representations From CLIP

This role can be remote in the US or based in the SF Bay Area.

Responsibilities

  • Oversee research process implementation, from problem definition all the way to running and analyzing concrete research experiments.
  • Collaborate and communicate clearly and efficiently with the rest of the team about the status, results, and challenges of your current tasks.
  • Contribute to designing the research roadmap of the company
  • Train and mentor other members of the team.
  • Own the research function of specific product features.

Challenges

As a core member of our research team, you'll play an integral role in challenges such as:

  • Using deep learning (including but not limited to NLP, speech processing, computer vision, etc.) to solve problems for media creation and editing.
  • Creating realistic voice doubles using only a few minutes of audio.
  • Creating tools to synthesize photo-realistic videos that match our Overdub (personalized speech synthesis) feature.
  • Designing and developing new algorithms for media synthesis, anomaly detection, speech recognition, speech enhancement, filler word detection, audio and video tagging etc.
  • Coming up with new research directions to improve our product

Requirements

  • Proven experience in designing and implementing deep learning algorithms.
  • PhD or Master’s degree specialized in Deep Learning or equivalent experience.
  • Track record of developing new ideas in machine learning, as demonstrated by one or more first author publications or projects.
  • Good programming skills and experience with deep learning frameworks.
  • Ability to generate more ideas than you can implement.
  • Implementing a given idea is easy and efficient. Once the experiment set up is established, you’re able to implement many ideas per day, evaluate them, and organize your time to be productive and efficient.
  • You wish you had more GPUs to run all the experiments that you wanted!
  • You know Pytorch/Tensorflow inside-out
  • We do not require domain-specific knowledge in computer vision or speech-processing

At least one of the following must be true for the applicant to be considered:

  • Lead author of an accepted publication in one of the top conferences: ICLR, ICML, NeurIPS, ICASSP, ICCV, CVPR, InterSpeech, etc.
  • Played a key role in shipping a feature in production which uses deep learning as a key component.

 

The base salary range for this role is $190,000- $300,000/year. Final offer amounts will carefully consider multiple factors, including prior experience, expertise, location, and may vary from the amount above.

About Descript

Descript is building a simple, intuitive, fully-powered editing tool for video and audio — an editing tool built for the age of AI. We are a team of 150 — with a proven CEO and the backing of some of the world's greatest investors (OpenAI, Andreessen Horowitz, Redpoint Ventures, Spark Capital). 

Descript is the special company that's in possession of both product market fit and the raw materials (passionate user community, great product, large market) for growth, but is still early enough that each new employee has a measurable influence on the direction of the company.

Benefits include a generous healthcare package, catered lunches, and flexible vacation time. Our headquarters are located in the Mission District of San Francisco, CA. We're looking to hire people who are local and able to join us at the office when needed. We're flexible, and you're an adult—we don't expect or mandate that you're in the office every day. But we do believe there are valuable and serendipitous moments of discovery and collaboration that come from working together in person. 

Descript is an equal opportunity workplace—we are dedicated to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. We believe in actively building a team rich in diverse backgrounds, experiences, and opinions to better allow our employees, products, and community to thrive. 

Top Skills

PyTorch
TensorFlow
The Company
HQ: San Francisco, CA
90 Employees
On-site Workplace
Year Founded: 2017

What We Do

Descript builds simple and powerful collaborative tools for new media creators. We strive to eliminate the tedious work that often stands between an idea and its expression, so that creators can focus on developing their craft instead of their usage of tools.

Descript is a next generation digital media platform that offers a new "engine"​ that lets you edit audio by editing text (instead of waveforms).

That means it does transcription (both automated and human-powered, whichever you prefer), but more interestingly, it lets you move the audio around by simply editing the transcript.

If you're new to editing audio, you'll find it far easier - it's basically like using a word processor. If you're experienced, Descript is faster than waveform editing - and more fun.

If you work with voice audio (as opposed to music), Descript is what the future looks like...join us and change the world.

Similar Jobs

Upstart Logo Upstart

Research Scientist, Personal Loans

Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
Easy Apply
Remote
2 Locations
1500 Employees
123K-171K Annually

Anthropic Logo Anthropic

Research Scientist, Interpretability

Artificial Intelligence • Natural Language Processing • Generative AI
Remote
3 Locations
57 Employees
280K-625K Annually
Remote
2 Locations
76 Employees
94K-141K Annually
Remote
San Francisco, CA, USA
47 Employees

Similar Companies Hiring

Jobba Trade Technologies, Inc. Thumbnail
Software • Professional Services • Productivity • Information Technology • Cloud
Chicago, IL
45 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
Charlotte, North Carolina
53 Employees
Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account