Local LLMs Unable to Sort Lists

External-Salary-4095 · 1 year ago

Local LLMs Unable to Sort Lists

FPham · 1 year ago

At every step LLM was giving you BS. It tells you that it understands every step yet the result is wrong.

The reason is simple: we need more parameters. We are topping at 70b. That’s fine for text, not good enough for non-text.

Goliath is still 70b - merging two 70b models doesn’t make it 140b base. It won’t suddenly have 2 x pre-training.

Unlike words that can be split into one or two tokens, every digit is in llama tokenizer split into a single token. So you need more parameters to find a pattern in numbers when the task is textual - for LLM a longer number is as complicated as entire sentence. It’s a miracle it can add two numbers.