ninjasaid13B to LocalLLaMA@poweruser.forumEnglish · 2 years agoLQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuningarxiv.orgexternal-linkmessage-square2linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkLQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuningarxiv.orgninjasaid13B to LocalLLaMA@poweruser.forumEnglish · 2 years agomessage-square2linkfedilink
minus-squaremcmoose1900BlinkfedilinkEnglisharrow-up1·2 years agoAmazing! And they published the code. Also, the omniquant paper they linked is amazing! They hooked some super quantization into MLC, apparently?
Amazing! And they published the code.
Also, the omniquant paper they linked is amazing! They hooked some super quantization into MLC, apparently?