ttkciar@alien.top to LocalLLaMA · 2 years ago
Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)
magazine.sebastianraschka.com
Relevant_Outcome_726@alien.top · 2 years ago
From my experience, here are some other things to know about LoRA:
+ FSDP doesn't work with LoRA, because FSDP requires all parameters within a wrapped unit to be uniformly trainable or frozen (see the first sketch below).
+ For QLoRA, currently only DeepSpeed ZeRO-2 can be used (ZeRO-3 is not supported); see the second sketch below.
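To make the FSDP point concrete, here is a minimal sketch (assuming the Hugging Face `peft` and `transformers` libraries, with `facebook/opt-125m` as an arbitrary small base model) showing that a LoRA-wrapped model mixes frozen base weights with trainable adapter weights inside the same transformer blocks, which is exactly the mixed-`requires_grad` situation FSDP's flattened parameter units can't handle:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Wrap a small base model with LoRA adapters; peft freezes the base
# weights and leaves only the adapter weights trainable.
base = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)

# Each attention block now holds both frozen and trainable parameters,
# so no FSDP unit covering a block can be uniformly trainable or frozen.
flags = [p.requires_grad for p in model.parameters()]
print(any(flags), all(flags))  # True, False -> mixed trainable/frozen
```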
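And here is a minimal sketch of what the ZeRO-2 restriction looks like in practice: a DeepSpeed config limited to stage 2, passed to the Hugging Face `Trainer` via `TrainingArguments`. The output directory, batch size, and `"auto"` values are illustrative assumptions, not a reference setup:

```python
from transformers import TrainingArguments

# ZeRO-2 shards optimizer state and gradients across GPUs but keeps a
# full copy of the weights on each rank. Stage 3 would also shard the
# weights themselves, which is where the quantized QLoRA base weights
# break down.
ds_zero2 = {
    "zero_optimization": {"stage": 2},
    "bf16": {"enabled": "auto"},
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
}

args = TrainingArguments(
    output_dir="qlora-out",          # hypothetical output path
    per_device_train_batch_size=4,
    deepspeed=ds_zero2,              # stage 2 only; do not raise to 3 for QLoRA
)
```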