Looking to move on in the next step of my LLM learning journey and:

a) generate a q&A dataset, say with GPT-4

b) use the dataset to instruction fine tune a 7B variant of mistral and evaluate

The Q might be to give me a sumamrised history of a company, with the dataset answer generated by GPT-4, to fine-tune the instruction fine tuned mistral 7B model.

If you know of any good guides for this, I’d highly appreciate, thank-you

EDIT: Reposted to fix title, god damn iPad auto complete!