Multimodal VLM/LLM Researcher

3 Locations
Remote
Mid level
Artificial Intelligence • Software
The Role
The Multimodal VLM/LLM Researcher will develop and train vision-language and large language models, implement fine-tuning strategies, and innovate by integrating these models into media generation. The role involves conducting research, maintaining knowledge of AI developments, collaborating on model implementation, and sharing findings with teams.

Join Black Forest Labs, the pioneering team behind Stable Diffusion, Stable Video Diffusion, and FLUX.1, as we push the boundaries of generative AI. We're seeking an exceptional researcher to run cutting-edge projects in multimodal vision-language and large language models.

Key Responsibilities

Model Development & Training

  • Lead the development and training of state-of-the-art multimodal vision-language models within the FLUX technology stack
  • Design and implement specialized fine-tuning strategies for VLMs to address specific use cases and performance requirements
  • Develop and optimize LLM implementations for prompt enhancement, content moderation, and novel applications

Research & Innovation

  • Drive innovation by integrating VLM/LLM capabilities into our media generation pipeline
  • Conduct research to creatively combine vision and language models for enhanced generative capabilities
  • Maintain cutting-edge knowledge of the latest developments in multimodal AI and LLM research
  • Evaluate emerging models and architectures for potential integration into our technology stack

Technical Leadership

  • Collaborate with cross-functional teams to implement and deploy models at scale
  • Contribute to architectural decisions and technical roadmap planning
  • Document and share research findings with the broader team

Required Qualifications

  • Demonstrated expertise in training and fine-tuning large-scale vision-language models
  • Strong publication record or practical experience with relevant projects in multimodal AI research
  • Proficiency in PyTorch or similar deep learning frameworks
  • Experience with distributed training systems and large-scale model optimization
  • Track record of implementing and scaling AI models in production environments

Nice to have

  • Experience with diffusion models and generative AI architectures, as well as autoregressive modelling
  • Background in computer vision
  • Contributions to open-source AI projects
  • Experience working in fast-paced startup environments
  • Strong software engineering practices and system design skills
  • Experience with open-source VLM inference frameworks such as vLLM

What We Offer

  • Opportunity to work with the strong technical team at Black Forest Labs
  • Access to state-of-the-art computing resources
  • Collaborative, research-focused environment
  • Competitive compensation package
  • Flexible work arrangements
  • Chance to shape the future of generative AI

We're looking for self-motivated individuals who are passionate about advancing the field of AI and can thrive in a fast-paced, research-driven environment. If you're excited about pushing the boundaries of what's possible in generative AI, we want to hear from you.

Top Skills

PyTorch
The Company
27 Employees
Remote Workplace

What We Do

A new era of creation
