CradawxB to LocalLLaMA@poweruser.forumEnglish · 1 year agoShareGPT4V - New multi-modal model, improves on LLaVAsharegpt4v.github.ioexternal-linkmessage-square17fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkShareGPT4V - New multi-modal model, improves on LLaVAsharegpt4v.github.ioCradawxB to LocalLLaMA@poweruser.forumEnglish · 1 year agomessage-square17fedilink
minus-squaremetalman123BlinkfedilinkEnglisharrow-up1·1 year agoThis style of captioning could be amazing for text to image datasets and i wouldn’t be surprised to see them take a jump in quality as well.
This style of captioning could be amazing for text to image datasets and i wouldn’t be surprised to see them take a jump in quality as well.