GPT-4's 128K context window tested

Ok_Relationship_9879 · 1 year ago

GPT-4's 128K context window tested

Tiny_Arugula_5648 · 1 year ago

Their needle in a haystack test isn’t very compelling. Sure no test is flawless but a random out of context fact placed at different points in the context window there is a lot of reasons why the model would fail to retrieve that.

Distinct-Target7503 · 1 year ago

Someone compared that with Claude 2 100K?

Also, gpt4 32K have same 100% accuracy in all its context? Is that 64 on 180 “absolute” or relative?

ArtifartX · 1 year ago

If the fact was at the beginning of the document, it was recalled regardless of context length

Lol at OpenAI adding a cheap trick like this, since they know the first thing people will test at high context lengths is recall from the beginning.