LLM Fine Tuning

Parameter Efficient Fine-Tuning (PEFT)

Full fine-tuning of LLMs is challenging: besides the trainable model weights, training must also store gradients, optimizer states, activations, and other temporary variables, which requires a lot of memory.

PEFT methods only update a small number of model parameters.

Examples of PEFT techniques:
• Freeze most model weights, and fine-tune only specific layer parameters (a minimal sketch follows this list).
• Keep existing parameters untouched; add only a few new parameters or layers, and train just these additions.
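The first technique can be sketched in a few lines of PyTorch. The base model and layer names below (GPT-2, its last transformer block, and its final layer norm) are illustrative assumptions:

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")  # illustrative base model

# Freeze every original parameter.
for param in model.parameters():
    param.requires_grad = False

# Unfreeze only a chosen subset, here the last transformer block and the final
# layer norm (parameter names are specific to the GPT-2 implementation).
for name, param in model.named_parameters():
    if name.startswith("transformer.h.11.") or name.startswith("transformer.ln_f."):
        param.requires_grad = True

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"Trainable parameters: {trainable:,} of {total:,} ({100 * trainable / total:.1f}%)")
```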
The trained parameters can account for only 15-20% of the original LLM weights.

Main benefits:
• Decreased memory usage, often requiring just 1 GPU.
• Mitigated risk of catastrophic forgetting.
• Storage limited to only the new PEFT weights.

Multiple methods exist, with trade-offs between parameter and memory efficiency, training speed, model quality, and inference cost.

Three classes of PEFT methods from the literature:
• Selective: fine-tune only specific parts of the original LLM.
• Reparameterization: use low-rank representations to reduce the number of trainable parameters (e.g., LoRA).
• Additive: augment the pre-trained model with new parameters or layers, training only the additions (e.g., adapters, soft prompts).

LoRA

LoRA is a reparameterization method that reduces the number of trainable parameters during fine-tuning by freezing all original model parameters and injecting a pair of rank decomposition matrices alongside the original weights:

h = W0·x + B·A·x

1 - Keep the majority of the original LLM weights (W0) frozen.
2 - Introduce a pair of rank decomposition matrices A and B with a small rank r.
3 - Train the new matrices A and B.

Model weights update for inference:
1 - Matrix multiplication: B × A
2 - Add the product to the original weights: W0 + B × A
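The mechanics can be made concrete with a minimal from-scratch sketch of a LoRA-wrapped linear layer in PyTorch; the dimensions, rank, and alpha scaling below are illustrative assumptions:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen linear layer W0 and adds a trainable low-rank update B·A."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():       # 1 - keep the original weights frozen
            p.requires_grad = False

        d_out, d_in = base.weight.shape
        self.A = nn.Parameter(torch.randn(rank, d_in) * 0.01)   # 2 - rank decomposition
        self.B = nn.Parameter(torch.zeros(d_out, rank))         #     matrices A and B
        self.scale = alpha / rank

    def forward(self, x):
        # h = W0·x + B·A·x (only A and B receive gradients)
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

    def merged_weight(self):
        # For inference, the update can be merged back: W = W0 + B·A
        return self.base.weight + self.scale * (self.B @ self.A)

layer = LoRALinear(nn.Linear(512, 512), rank=8)
out = layer(torch.randn(4, 512))               # 3 - train A and B as usual
```

Because B starts at zero, the wrapped layer initially behaves exactly like the frozen original, and the low-rank update is learned gradually during fine-tuning.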
Additional notes:
• No impact on inference latency.
• Fine-tuning only the self-attention layers with LoRA is often enough to enhance performance for a given task.
• Weights can be switched out as needed, allowing for training on many different tasks.
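In practice LoRA is usually applied through a library. A sketch with the Hugging Face peft package could look like the following; the base model, target modules, and hyperparameters are assumptions to adapt to your own setup:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("gpt2")  # illustrative base model

lora_config = LoraConfig(
    r=8,                            # rank of the decomposition matrices
    lora_alpha=16,                  # scaling factor
    target_modules=["c_attn"],      # GPT-2's fused attention projection; model-specific
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of the base model
```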
Rank choice for the LoRA matrices:
• Trade-off: a smaller rank reduces the number of trainable parameters and accelerates training, but risks lower adaptation quality due to reduced task-specific information capture.
• From the literature, a rank between 4 and 32 appears to be a good trade-off.
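A back-of-the-envelope comparison makes the trade-off tangible; the 4096 × 4096 weight matrix assumed here is typical of the attention projections in 7B-class models:

```python
d_out, d_in = 4096, 4096              # assumed dimensions of one weight matrix
full = d_out * d_in                   # parameters updated by full fine-tuning

for r in (4, 8, 32):
    lora = r * (d_in + d_out)         # parameters in A (r x d_in) and B (d_out x r)
    print(f"rank {r:>2}: {lora:>9,} trainable params "
          f"({100 * lora / full:.2f}% of the {full:,} in the full matrix)")
```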
LoRA can be combined with quantization of the base model (QLoRA).
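A common way to set QLoRA up, sketched with the transformers, bitsandbytes, and peft libraries; the model name, quantization settings, and LoRA hyperparameters are assumptions:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load the frozen base model in 4-bit precision.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
base_model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",        # illustrative model choice
    quantization_config=bnb_config,
    device_map="auto",
)

# Attach full-precision LoRA adapters on top of the quantized weights.
base_model = prepare_model_for_kbit_training(base_model)
model = get_peft_model(
    base_model,
    LoraConfig(r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"],
               task_type="CAUSAL_LM"),
)
```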
SOFT PROMPTS

Unlike prompt engineering, whose limits are:
• the manual effort it requires,
• the length of the context window,

prompt tuning adds trainable tensors to the model input embeddings, commonly known as "soft prompts," optimized directly through gradient descent. The tunable soft prompt (typically 20-100 tokens) is prepended to the embedded input text before it is passed to the pre-trained LLM.

Soft prompt vectors:
• Are equal in length to the embedding vectors of the input language tokens.
• Can be seen as virtual tokens which can take any value within the multidimensional embedding space.

In prompt tuning, the LLM weights are frozen:
• Over time, the embedding vectors of the soft prompt are adjusted to optimize the model's completion of the prompt.
• Only a few parameters are updated.
• A different set of soft prompts can be trained for each task and easily swapped out during inference (occupying very little space on disk).
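A minimal sketch of this in PyTorch, assuming a Hugging Face causal LM that accepts precomputed input embeddings; the model choice, prompt length, and hyperparameters are illustrative:

```python
import torch
import torch.nn as nn
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")

# Freeze every LLM weight; only the soft prompt below will be trained.
for p in model.parameters():
    p.requires_grad = False

num_virtual_tokens = 20                        # typically 20-100 virtual tokens
embed_dim = model.get_input_embeddings().weight.shape[1]
soft_prompt = nn.Parameter(torch.randn(num_virtual_tokens, embed_dim) * 0.01)

def forward_with_soft_prompt(input_ids, labels):
    token_embeds = model.get_input_embeddings()(input_ids)       # (batch, seq, dim)
    prompt = soft_prompt.unsqueeze(0).expand(input_ids.size(0), -1, -1)
    inputs_embeds = torch.cat([prompt, token_embeds], dim=1)     # prepend virtual tokens
    # Mask the virtual-token positions out of the loss with the ignore index -100.
    prompt_labels = torch.full((input_ids.size(0), num_virtual_tokens), -100)
    return model(inputs_embeds=inputs_embeds,
                 labels=torch.cat([prompt_labels, labels], dim=1)).loss

optimizer = torch.optim.AdamW([soft_prompt], lr=1e-3)   # only the soft prompt is trained
batch = tokenizer("Translate to French: cheese", return_tensors="pt")
loss = forward_with_soft_prompt(batch["input_ids"], batch["input_ids"])
loss.backward()                                # gradients flow only into soft_prompt
optimizer.step()
```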
From the literature, at around 10B parameters prompt tuning performs as well as full fine-tuning.

! Interpreting virtual tokens can pose challenges (the nearest-neighbor tokens to the soft prompt location can be used).
