Clearing up confusion: GPT 3.5-Turbo may not be 20b after all

SomeOddCodeGuy · 1 year ago

ttkciar · 1 year ago

Perhaps someone heard “10x reduction in footprint” and didn’t realize that meant a reduction in bytes, not a reduction in parameters, and concluded it had a tenth as many parameters?
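To make the bytes-versus-parameters distinction concrete, here is a rough sketch of the arithmetic. The parameter count and bit widths are made-up illustrative numbers, not anything known about GPT-3.5-Turbo, and `footprint_gb` is just a hypothetical helper name:

```python
def footprint_gb(n_params: float, bits_per_param: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


# Hypothetical model size, chosen only for illustration.
n_params = 100e9

full = footprint_gb(n_params, 16)   # weights stored at 16 bits each
small = footprint_gb(n_params, 4)   # same weights stored at 4 bits each

# The footprint in bytes shrinks while the parameter count is unchanged.
print(f"{full:.0f} GB -> {small:.0f} GB, still {n_params:.0e} parameters")
```

The point is only that a smaller footprint can come from fewer bits per parameter, so a reduction in bytes by itself says nothing about how many parameters the model has.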

ambient_temp_xeno · 1 year ago

So they, as big-shot Microsoft scientists, just decided that was good enough to stick in a table in their paper?