Hi all, I originally posted this to the LangChain sub but haven't had a response yet; could anyone give me some pointers? Thanks.

Basic workflow for querying data locally?

Hi all,

I'm using LangChain JS, and most examples I find use OpenAI, but I'm using Llama. I managed to embed a simple text file and can ask basic questions, but most of the time the model just spits the prompt back out.

I'm running on CPU only at the moment, so it's very slow, but that's OK. I'm experimenting with loading .txt files, .csv files, etc., but it's clearly not going well: I can ask some very simple questions, but most of the time it fails.
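For the CSV case, I mean something like this (a rough sketch, assuming LangChain JS's CSVLoader, which I believe needs the d3-dsv package installed; the file path is a placeholder):

```js
// Rough sketch of loading a CSV; by default CSVLoader produces one
// document per row. The import path may differ across LangChain JS versions.
import { CSVLoader } from "langchain/document_loaders/fs/csv";

const csvDocs = await new CSVLoader("./data/example.csv").load();
```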

My understanding is as follows (there's a rough code sketch after the list):

  1. Load model
  2. Load data and chunk it (a CSV file, for example; I usually chunk with a size of around 200 and split on \n separators)
  3. Load the embeddings (I'm supposed to load a Llama GGUF model here, right? The same one as in step 1, passed as a parameter to LlamaCppEmbeddings?)
  4. Vector store in memory
  5. Create chain and ask question
  6. Console log answer
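
To make the steps concrete, here's a minimal sketch of what I mean, not my exact code: the import paths may differ across LangChain JS versions (newer ones use `@langchain/community/...`), the model and file paths are placeholders, and LlamaCpp/LlamaCppEmbeddings need the node-llama-cpp package installed alongside langchain.

```js
import { LlamaCpp } from "langchain/llms/llama_cpp";
import { LlamaCppEmbeddings } from "langchain/embeddings/llama_cpp";
import { TextLoader } from "langchain/document_loaders/fs/text";
import { RecursiveCharacterTextSplitter } from "langchain/text_splitter";
import { MemoryVectorStore } from "langchain/vectorstores/memory";
import { RetrievalQAChain } from "langchain/chains";

const modelPath = "./models/llama-2-7b.Q4_K_M.gguf"; // placeholder

// 1. Load the model
const model = new LlamaCpp({ modelPath });

// 2. Load the data and chunk it
const docs = await new TextLoader("./data/example.txt").load();
const splitter = new RecursiveCharacterTextSplitter({
  chunkSize: 200,
  chunkOverlap: 20,
  separators: ["\n"],
});
const chunks = await splitter.splitDocuments(docs);

// 3. Load the embeddings (same GGUF model as step 1?)
const embeddings = new LlamaCppEmbeddings({ modelPath });

// 4. Build the vector store in memory
const vectorStore = await MemoryVectorStore.fromDocuments(chunks, embeddings);

// 5. Create a retrieval chain and ask a question
const chain = RetrievalQAChain.fromLLM(model, vectorStore.asRetriever());
const res = await chain.call({ query: "What is this document about?" });

// 6. Log the answer
console.log(res.text);
```

Does anything in that shape look off?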

Is this concept correct, and do you have any tips to help me get better results?

Thank you