UA - SSr. LLM Engineer - 0026

Posted 7 Days Ago
Be an Early Applicant
Brazil
Mid level
Software
The Role
The LLM Engineer will design, develop, and optimize Retrieval Augmented Generation solutions, support ML teams, and document AI use cases, ensuring effective collaboration across teams.
Summary Generated by Built In

Description

We are looking for a talented and innovative Large Language Model (LLM) Engineer with specialized experience in Retrieval Augmented Generation (RAG) to join our dynamic team. The ideal candidate will be instrumental in the development, optimization, and deployment of RAGbased LLM solutions that meet various business needs. This role requires a deep understanding of Data Science, Machine Learning (ML), Natural Language Processing (NLP), and the fundamentals of LLMs. The successful candidate will have hands-on experience with agent based prompting techniques such as LangChain, APIdriven development architectures for AI products, usage of vector databases, and information retrieval pipelines to extract actionable insights. Additionally, strong listening skills and an ability to understand business requirements are essential for designing effective solutions tailored to stakeholder needs.

Job Responsibilities

  • Designing RAG Solutions: Develop robust RAGbased LLM solutions that enhance data retrieval processes while ensuring high quality output aligned with business objectives.
  • Prompt Engineering: Craft effective prompts that efficiently guide LLMs towards generating outputs that fulfill specific business requirements; iterate on prompt designs based on performance metrics.
  • AI Product Development: Design, develop, and refine AI products based on user feedback and stakeholder requirements; ensure alignment with program priorities through agile methodologies.
  • Support ML Engineering Teams: Collaborate closely with ML engineering teams during the deployment phase of APIs in production environments; assist in troubleshooting issues related to model performance or integration challenges.
  • Documentation Creation: Document Generative AI use cases comprehensively; create manuals detailing workflows, methodologies employed in solution design, and best practices adopted throughout the process.
  • Retrieval Augmented Generation Maintenance: Develop and maintain RAG concepts following established data science principles; ensure adherence to industry standards while fostering innovation.
  • Pipeline Optimization: Ensure efficient operation of data pipelines and processes according to company security policies; promote Responsible AI practices within all project phases.
  • Collaboration & Communication: Work closely with cross functional teams including product managers, data scientists, software engineers, and stakeholders to align goals effectively; communicate technical concepts clearly across diverse audiences.
Requirements
  • A minimum of 3 years' experience working with Python for machine learning applications along with expertise in AWS Cloud Computing services.
  • Familiarity with AWS Bedrock's suite of LLM models along with associated prompting strategies tailored for optimal performance.
  • At least 1 year of direct experience working specifically within NLP frameworks/LLMs focusing on Retrieval Augmented Generation models.
  • Proficiency in Python programming language utilized extensively for data science projects involving AI development.
  • Familiarity with agent based frameworks like LangChain which support advanced interactions between users/models.
  • Experience leveraging AI training frameworks such as TensorFlow or PyTorch alongside Hugging Face Transformers library for model building/training tasks.
  • Demonstrated expertise in prompt engineering techniques aimed at finetuning large language models tailored toward specific applications or industries.
  • Knowledgeable about vector databases including practical experience implementing embedding techniques necessary for enhanced information retrieval capabilities.

Preferred Skills

  • Experience utilizing orchestration tools designed specifically for AI product development (e.g., ArgoCD).
  • Ability to create detailed diagrams illustrating data flows/solutions which can effectively communicate complex ideas/processes among team members/internal stakeholders alike.
  • Prior experience deploying UI/frontend components related directly back into existing large language model systems would be advantageous.

Top Skills

AWS
Aws Bedrock
Hugging Face Transformers
Langchain
Python
PyTorch
TensorFlow
Vector Databases
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
San Francisco, CA
39 Employees
On-site Workplace
Year Founded: 2020

What We Do

Experts in crafting digital products ⚡️

At Thaloz, the mission is to support at every stage of the digital product journey. With a team of over 100 experts and a global presence in 30 countries, we leverage top-tier Latin American talent to deliver exceptional software development solutions that drive success.

Our Services:
→ Product Lab: Comprehensive product development services to build and scale software solutions. From strategy and design to development, testing, and launch, every aspect is handled with expertise.
→ Talent Hub: Accelerate the team-building process by 50% with carefully vetted LATAM talent. Select the team members, and they will be seamlessly integrated into projects under the client's leadership.
→ Enterprise Pod: Optimize operations with streamlined complex integrations and flawless implementations of digital products for B2B companies, ensuring rapid and smooth deployments.

Ready to assist in turning ideas into reality, get in touch through www.thaloz.com/contact-us

Join our community! 👨‍💻
Instagram: @thalozteam
YouTube: @thalozteam
Clutch: @thaloz

Similar Jobs

Dynatrace Logo Dynatrace

Direct Lead Solutions Engineer (Hybrid, Sao Paulo)

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Hybrid
São Paulo, BRA
4700 Employees

Superhuman Logo Superhuman

Senior Site Reliability Engineer

Consumer Web • Enterprise Web • Mobile • Productivity • Software
Easy Apply
Remote
13 Locations
116 Employees

Cargill Logo Cargill

Sr. Application Developer - ERP

Food • Greentech • Logistics • Sharing Economy • Transportation • Agriculture • Industrial
São Paulo, BRA
155000 Employees

Adyen Logo Adyen

Java Engineer (Affirmative Action for Women)

Fintech • Payments • Financial Services
Easy Apply
Hybrid
São José dos Campos, São Paulo, BRA
4196 Employees

Similar Companies Hiring

True Anomaly Thumbnail
Software • Machine Learning • Hardware • Defense • Artificial Intelligence • Aerospace
Colorado Springs, CO
131 Employees
Caliola Engineering Thumbnail
Software • Machine Learning • Hardware • Defense • Data Privacy • App development • Aerospace
Colorado Springs, CO
53 Employees
Red 6 Thumbnail
Virtual Reality • Software • Hardware • Defense • Aerospace
Orlando, Florida
113 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account