What with the ongoing turmoil at OAI, has anyone found an alternative for their vision
endpoint that offers comparable functionality? I am aware of LLaVa which seems early in its maturity, but are there any commercial offerings?
What with the ongoing turmoil at OAI, has anyone found an alternative for their vision
endpoint that offers comparable functionality? I am aware of LLaVa which seems early in its maturity, but are there any commercial offerings?
There’s fuyu-8b, but no commercial license.
It can really cover the “GPT-4 reads websites” and stuff like that, helpful with complex charts too. Other than that LLava is your best hope.