With a 24GB video card, single card system, what is the best LLM that utilizes Exllama2? (for RPG/Chat)

cleverestx · 1 year ago

With a 24GB video card, single card system, what is the best LLM that utilizes Exllama2? (for RPG/Chat)

Gyramuur · 1 year ago

I’m a huge fan of MLewd-ReMM-L2-Chat-20B at the moment. I use the 6-bit quant and have found it at times to be similar in quality to the roleplays I used to have with ChatGPT 3.5 before "Open"AI nerfed it into oblivion. Hardly ever have to reroll.

cleverestx · 1 year ago

So far with the local models, I’ve just done like storybook format, RPGing, without a game system, dice, rolls, etc, which I used to do with chat GPT…

Do you have a prompt template that works well for you that you would be willing to share that gamifies it?

Gyramuur · 1 year ago

Not entirely, I mainly just use it as an open ended story. However, if I’m doing a second person text adventure, sometimes I will place information pertaining to the “game” and put it in memory (or system prompt, if in LM Studio), like this:

Scenario: Second person survival text adventure game set in the African savannah, hundreds of millions of years ago.

Inventory: Stone knife, dried meat, berries (4), half-filled water skin.

Current quest: Go hunting

In my experience so far, it seems to do a really good job of remembering exactly whatever’s in my inventory, and if my character picks something up in game, I add it to the memory.

If you wanted more game-like systems like stats and dice rolls, I suppose you could keep track of those externally, and your rolls could be done through a site like random.org and just tell the LLM your result.