Can't handle efficiently RAG with large PDF

Temporary-Size7310 · 1 year ago

Temporary-Size7310 · 1 year ago

Some updates:

I changed to Jina-small-en-v2, base model crash due to lack of RAM under WSL2
Make a parents retriever (chunk 2000) and input it child retriever (chunk 400), 0 overlap (will share the method)
Still use sciphy model but this time using the right template (From the indication from The bloke) by adding a template prompt rather than Alpaca prompt and it resolves the problem of hallucination
Put text oobabooga on instruct by default, loader exllamav2hf

I got a strong 90% of success with the PDF, will send the code when this will be cleaned and optimized, thank you all for the help 😊