Use case is that I want to create a service based on Mistral 7b that will server an internal office of 8-10 users.
I’ve been looking at modal.com, and runpod. Are there any other recommendations?
You must log in or register to comment.
I can recommend vLLM. Also offers OpenAI compatible API service, if you want that.