Looking to move on in the next step of my LLM learning journey and:
a) generate a q&A dataset, say with GPT-4
b) use the dataset to instruction fine tune a 7B variant of mistral and evaluate
The Q might be to give me a sumamrised history of a company, with the dataset answer generated by GPT-4, to fine-tune the instruction fine tuned mistral 7B model.
If you know of any good guides for this, I’d highly appreciate, thank-you
EDIT: Reposted to fix title, god damn iPad auto complete!
You must log in or register to comment.