alchemist1e9B to LocalLLaMA@poweruser.forum · English · 1 year ago
ExLlamaV2: The Fastest Library to Run LLMs (towardsdatascience.com) · 22 comments
CardAnarchistBlink · English · 1 year ago
Can you offload layers with this, like you can with GGUF?
I don’t have much VRAM or RAM, so even when running a 7B model I have to partially offload layers.