Tool to quickly iterate when fine-tuning open-source LLMs

torque-mcclyde@alien.top · 2 years ago

Tool to quickly iterate when fine-tuning open-source LLMs

kivathewolf@alien.top · 2 years ago

This is really cool! Good choice on starting with the chat model and not the base model. They are much more friendly to alignment with a small dataset. In your post you mention you do QLorA in few mins. I am assuming that’s for a small dataset like <1000 samples? What’s your backend running on? I would love to learn how you are deploying and scaling this for multiple customers. Best of luck!

torque-mcclyde@alien.top · 2 years ago

Yes, our datasets usually have a few hundred examples. We do support arbitrarily large datasets though, the fine-tuning just takes a little longer.

For deploying and scaling we’re using Modal, it’s a “serverless” GPU provider that we found to be very user-friendly.