Hi folks, I have edited the llama.cpp server frontend and made it look nicer. Also added a few functions. Something I have been missing there for a long time: Templates for Prompt Formats.
Here to the github link: ++camalL
Otherwise here is a small summary:
- UI with CSS to make it look nicer and cleaner overall.
- CSS outsourced as a separate file
- Added a dropdown menu with prompt style templates
- Added a dropdown menu with system prompts
- Prompt Styles and System Prompts are separate files, so editing is very easy.
- Created a script that uses “dialog” to compose the command for the server.
- Script offers the possibility to save and load configs
In planning or already started:
- WIP Multilingual: You will be able to select the language from a dropdown menu. So far language files only for English and German. (concerns UI elements and system prompts).
- Dark Mode
- Templates for the values of the UI options (samplers etc.), e.g. deterministic template, creative template, balanced template etc…
- Zenity start script (like dialog, but gui)
-–
As for the prompt format templates, I just picked a few by feel. The most important are the four to which almost all others can be traced back: Alpaca, ChatML, Llama2, Vicuna.
But if you want more templates for a specific model, feel free to let me know here or on github.
As you can see on the third picture, it should now be easier for beginners to use the llama.cpp server, since a tui dialog will assist them.
Hope you like my work. Feel free to give feedback
ps: I’ve made a pull request, but for now I publish it on my own forked repo.
Does it have min-p sampling?
u/ambient_temp_xeno ah I have now seen that min-p has been implemented in the server anyway, so I have now added it too.
Ah one sidenote: selecting a model via dialog is absolutely not intuitive. If you want to navigate into a folder, you have to press space two times. Do not press enter until you decide to choose a specific folder. It doesnt matter that much if you are in parent folders, since the script will search recursively - but of course if you have many files it could take a long time.
Do you happen to have on hand the docs for the api?