From what I’ve read, Macs somehow use system RAM while Windows uses the GPU’s memory? It doesn’t make any sense to me. Any help appreciated.

  • fallingdowndizzyvrB
    1 year ago

    Yes. MLC Chat runs great with no fuss, the same as running it on Nvidia or AMD. Beyond that, things get more fussy. There’s ooba, fastchat, and of course Intel’s own BigDL. The Arcs can actually run llama.cpp too, via both OpenCL and Vulkan, but it’s dog slow: like half the speed of the CPU. Since it happens with both OpenCL and Vulkan, there’s something about llama.cpp itself that isn’t friendly to the Arc architecture. Vulkan under MLC is fast.
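
    If anyone wants to reproduce the llama.cpp Vulkan numbers on an Arc, a minimal sketch looks something like this. It assumes a recent llama.cpp checkout (where the Vulkan flag is `GGML_VULKAN`; older trees used `LLAMA_VULKAN`), an installed Vulkan SDK, and a GGUF model whose path is a placeholder, not something from this thread:

    ```shell
    # Build llama.cpp with the Vulkan backend enabled
    # (assumes cmake and the Vulkan SDK/drivers are already installed)
    cmake -B build -DGGML_VULKAN=ON
    cmake --build build --config Release

    # Run with all layers offloaded to the GPU (-ngl 99);
    # /path/to/model.gguf is a placeholder you must supply
    ./build/bin/llama-cli -m /path/to/model.gguf -ngl 99 -p "Hello"
    ```

    Comparing the reported tokens/sec from that run against a CPU-only run (drop `-ngl 99`) is how you’d see the roughly half-of-CPU slowdown described above.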