LocalLLaMA@poweruser.forumEnglish · 1 year ago

Relationship of RAM to context size?

1

Relationship of RAM to context size?

LocalLLaMA@poweruser.forumEnglish · 1 year ago

I understand that a bigger memory means you can run a model with more parameters or less compression, but how does context size factor in? I believe it’s possible to increase the context size, and that this will increase the initial processing before the model starts outputting tokens, but does someone have numbers?

Is memory for context independent on the model size, or does a bigger model mean that each bit of extra context ‘costs’ more memory?

I’m considering an M2 ultra for the large memory and low energy/token, although the speed is behind RTX cards. Is this the best option for tasks like writing novels, where quality and comprehension of lots of text beats speed?

Chat

a_beautiful_rhindB
link
fedilink
English
arrow-up
1·
1 year ago
How much multicard? You can get away with single CPU EPYC boards for 4 and below. For more you need those supermicro 4028/4029 big guns. The older one is still $1000 and under.