Point me towards some basic dataset preparation tips for LLM's?

ArtifartX · 1 year ago

Point me towards some basic dataset preparation tips for LLM's?

FPham · 1 year ago

Trained and finetuned - 2 things.

The trained on wikipedia - yes, they feed the wikipedia articles to it - hook and sinker. No Q/A. But that doesn’t mean it will be able to give you answer, unless you fine tune it with Q/A “I want you to behave like this” template - but the kick is - what we all are using to our huge advantage - it can be fine-tuned on a totally different Q/A, it will still be able to answer from wikipedia. It’s a hat trick.

psdwizzard · 1 year ago

I am new to LLMs (I normally train Image Models) so if this is a stupid question let me know.

I have been converting the shadowrun lore wiki into Q and A so i can use that model for a sillytavern character as a contact in my current tabletop game. Do I really need to convert it all to Q and A? If I get a better “Contact” I dont mind.

ArtifartX · 1 year ago

Thanks for the information and explanation