Training on the rephrased test set is all you need: 13B models can reach GPT-4 performance in benchmarks with no contamination detectable by traditional methods

lmsys.org

Training on the rephrased test set is all you need: 13B models can reach GPT-4 performance in benchmarks with no contamination detectable by traditional methods

lmsys.org

Covid-Plannedemic_B to

LocalLLaMA@poweruser.forumEnglish · 3 years ago

Catch me if you can! How to beat GPT-4 with a 13B model | LMSYS Org

lmsys.org

Announcing Llama-rephraser: 13B models reaching GPT-4 performance in major benchmarks (MMLU/GSK-8K/HumanEval)! To ensure result validity, we followed Open...

Chat