AI Scientist - Palo Alto

Reposted 8 Days Ago
Palo Alto, CA
Hybrid
Senior level
Artificial Intelligence
The Role
As an AI Scientist, you'll modify and enhance large language models for better human interaction, deploy external tools, and lead pre-training efforts. You should have a strong grasp of generative AI, MLOps architecture, and software design, contributing to a collaborative team environment.
Summary Generated by Built In

About Mistral


At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.


We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.


We are a dynamic, collaborative team passionate about AI and its potential to transform society.

Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.


Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.


Mistral AI are hiring experts in the role of pre-training and fine-tuning large language models.


What you will do


- Modifying pre-trained large language models to make them able to interact with humans

- Equipping large language models with the ability of calling external tools

- Aligning large language models based on feedback obtained during their deployment, or going through an ad-hoc annotation process

- Designing ad-hoc annotation processes themselves

- Participating to the pre-training effort.


About you


- High scientific understanding of the field of generative AI. This means a broad knowledge of the field of AI, and specific knowledge or interest in fine-tuning and using language models for applications

- High technical engineering competence. This means being able to design complex software and make them usable in production

- Be able to navigate the full MLOps technical stack, with a focus on architecture development and model evaluation and usage

- Occasionally be able to do front-end development, and have to use complex HPC infrastructure with full autonomy

- Hands-on experience with AI frameworks and tools (e.g., TensorFlow, PyTorch, Jax)

- Have experience working with large distributed systems

- Self-starter and autonomous 

- Low-ego 

- Collaborative and have a real team player mindset


Now, it would be ideal if you have;

- Audio/Speech experience - audio input/out, NLP etc


What We Offer


💰 Competitive salary and bonus structure 

🚀 Generous Equity

🧑‍⚕️ Health : Competitive Healthcare program (Medical Provider: Blueshield of California 100% coverage for employee, 75% for dependents)

👴🏻 Pension : 401K (6% matching)

🏝️ PTO : 18 days 

🚗 Transportation: Reimburse office parking charges, or $120/month for public transport

🤝 Coaching: we offer Betterup coaching on a voluntary basis

🏀 Sport: $120/month reimbursement for gym membership

🥕 Meal stipend: $400 monthly allowance for meals (solution might evolve as we grow bigger)

🪪 Visa sponsorship

Top Skills

Hpc Infrastructure
Jax
PyTorch
TensorFlow
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Paris
92 Employees
On-site Workplace
Year Founded: 2023

What We Do

Fast, open-source and secure language models. Facilitated specialisation of models on business use-cases, leveraging private data and usage feedback.

Built from a world-class team in Europe, targeting global market. Join the team ! https://jobs.lever.co/mistral/

Similar Jobs

Palo Alto, CA, USA
92 Employees

Anduril Logo Anduril

Flight Test Instrumentation Engineer, Group 5

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
Costa Mesa, CA, USA
4500 Employees
142K-200K Annually

Atlassian Logo Atlassian

Machine Learning Engineer

Cloud • Information Technology • Productivity • Security • Software • App development • Automation
Remote
San Francisco, CA, USA
11000 Employees
160K-268K Annually

The Aerospace Corporation Logo The Aerospace Corporation

Space Domain Awareness Research Scientist

Aerospace • Artificial Intelligence • Cloud • Machine Learning • Software • Cybersecurity • Defense
Hybrid
El Segundo, CA, USA
4600 Employees
110K-194K Annually

Similar Companies Hiring

Stepful Thumbnail
Software • Healthtech • Edtech • Artificial Intelligence
New York, New York
60 Employees
HERE Technologies Thumbnail
Software • Logistics • Internet of Things • Information Technology • Computer Vision • Automotive • Artificial Intelligence
Amsterdam, NL
6000 Employees
True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account