sshh12@alien.topBtoLocalLLaMA•Which model is best for binary text classification?English
1·
1 year ago+1, when in doubt, LLM it out.
You could also ask for explanations so when it gets it wrong, you can work on modifying your prompts/examples to get better performance.
Potentially you wouldn’t want to do this if:
- Your classification problem is very unusual/cannot be explained by a prompt
- You want to be able to run this extremely fast or on a ton of data
- You want to learn non-LLM deep learning/NLP (in which case I would’ve suggested basically some form of finetuning BERT)
My rule of thumb has been to LoRA (r between 4 and 16) until unsatisfied with results. It of course depends on data/task but imo most cases don’t require full fine-tune and perf/compute ROI is low.