Is there a good way (or rule of thumb) to decide, when looking at a problem, whether PEFT/LoRA fine-tuning might be successful or whether it only makes sense to do a complete fine-tune of all weights? Given the big difference in cost, knowing whether PEFT/LoRA might work for a problem feels pretty essential.

    • trollbrot@alien.top (OP) · 1 year ago
      Ok, interesting. One obvious use case I could see is that we want to train it on internal documents, so we can interact with those documents in a more dynamic way. That should be easier than learning a new language.

  • sshh12@alien.top · 1 year ago

    My rule of thumb has been to use LoRA (r between 4 and 16) until I'm unsatisfied with the results. It of course depends on the data/task, but IMO most cases don't require a full fine-tune, and the performance/compute ROI of going full fine-tune is low.
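
    For concreteness, here's a minimal sketch of that setup with Hugging Face's peft library. The base model name and target modules are just placeholders (they depend on the architecture you're tuning); the r/alpha values follow the 4-16 range mentioned above.

    ```python
    # Minimal LoRA fine-tuning setup with Hugging Face peft (values are illustrative).
    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    # Placeholder base model; swap in whatever you're actually fine-tuning.
    model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

    lora_config = LoraConfig(
        r=8,                                  # rank in the 4-16 range suggested above
        lora_alpha=16,                        # scaling factor, commonly set to ~2x r
        target_modules=["q_proj", "v_proj"],  # attention projections; model-dependent
        lora_dropout=0.05,
        bias="none",
        task_type="CAUSAL_LM",
    )

    model = get_peft_model(model, lora_config)
    model.print_trainable_parameters()  # sanity-check how few weights actually train
    ```

    If results plateau, bumping r (and lora_alpha) is a cheap next step before falling back to a full fine-tune.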