davidmezzetti to LocalLLaMA@poweruser.forum · 10 months ago — RAG in a couple lines of code with txtai-wikipedia embeddings database + Mistral
davidmezzetti (OP) · 10 months ago
It works with GPTQ models as well; you just need to install AutoGPTQ.
You would need to replace the LLM pipeline with llama.cpp for it to work with GGUF models.
See this page for more: https://huggingface.co/docs/transformers/main_classes/quantization