Hey everyone,
I came across a post recently where someone found it hard to find simple scripts to fine-tune LLMs with their data. So I put together a repo where you can just type out your requirements in a config.yaml file and the training happens flawlessly based on that.
Here’s the repo - LLM-Trainer
It is still a wip so lemme know if guys want some other features added to this.
TIA.
I assume auth_token is for storing the merged model in HF? Seems worth noting/clarifying.
I’ll get back with more feedback when I get to test it.+)