Hey everyone,

I came across a post recently where someone found it hard to find simple scripts to fine-tune LLMs with their data. So I put together a repo where you can just type out your requirements in a config.yaml file and the training happens flawlessly based on that.

Here’s the repo - LLM-Trainer

It is still a wip so lemme know if guys want some other features added to this.

TIA.

  • uhuge@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    I assume auth_token is for storing the merged model in HF? Seems worth noting/clarifying.

    I’ll get back with more feedback when I get to test it.+)