Serious inquiry: I've been tinkering a lot with finetuning and was wondering if it would be worth to buy a V100 of my own

holistic-engine · 1 year ago

Fun_Tangerine_1086 · 1 year ago

You want VRAM, like lots of folks have mentioned; there’s some non-obvious things here - you can make smaller VRAM work w/ reduced batch size or non-AdamW optimizers, but you trade off both speed and quality to do so.
You can split training across multiple GPUs; I use 2x 3060 12gb, though a real 24gb card would be better.
I don’t recommend a V100 - you’d miss out on the bfloat16 datatype.