ninjasaid13B to LocalLLaMA@poweruser.forumEnglish · 1 year agoLQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuningarxiv.orgexternal-linkmessage-square2fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkLQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuningarxiv.orgninjasaid13B to LocalLLaMA@poweruser.forumEnglish · 1 year agomessage-square2fedilink
minus-squaremcmoose1900BlinkfedilinkEnglisharrow-up1·1 year agoAmazing! And they published the code. Also, the omniquant paper they linked is amazing! They hooked some super quantization into MLC, apparently?
Amazing! And they published the code.
Also, the omniquant paper they linked is amazing! They hooked some super quantization into MLC, apparently?