So I’m looking for a service I can run that can search the internal contents of multiple PDFs (multiple 1000+ page reference manuals) for a a phrase/word, similar to Adobe acrobats advance search function.
Bonus points of I can control the scope of which documents it searches through through some sort of interface.
You must log in or register to comment.
https://github.com/phiresky/ripgrep-all
Made by the same phiresky that’s been contributing incredible improvements to lemmy
For Linux command line, there is pdfgrep. It can be found e.g. in the official Debian repository.
Look at the subreddit sidebar and find this: awesome-selfhosted, category document management, and more.