What with the ongoing turmoil at OAI, has anyone found an alternative for their vision endpoint that offers comparable functionality? I am aware of LLaVa which seems early in its maturity, but are there any commercial offerings?

  • vatsadevB
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    There’s fuyu-8b, but no commercial license.

    It can really cover the “GPT-4 reads websites” and stuff like that, helpful with complex charts too. Other than that LLava is your best hope.