struggling to include text prompts along with image-data (multimodal) for inferencing

LyPreto · 1 year ago

struggling to include text prompts along with image-data (multimodal) for inferencing

LyPreto · 1 year ago

I ended up just scrutinizing the server code to understand it better and found that the prompt needs to follow a very specific format or else it won’t work well:

prompt: \A chat between a curious human and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the human’s questions.\nUSER:[img-12]${message}\nASSISTANT:``