ninjasaid13B to

LocalLLaMA@poweruser.forumEnglish · 1 year ago

LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning

2

1

LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning

ninjasaid13B to

LocalLLaMA@poweruser.forumEnglish · 1 year ago

2

Chat

mcmoose1900B
link
fedilink
English
arrow-up
1·
1 year ago
Amazing! And they published the code.

Also, the omniquant paper they linked is amazing! They hooked some super quantization into MLC, apparently?