I have an idea to collect all open-source LLM models to launch one full-fledged service for working with models, that is, we will use one mini LLM model to recognize a task, write code, translate text, solve problems, etc., and then transfer the task to a model designed for this, We will use Petals to run models, we can make your own cryptocurrency and pay people who allocate their servers to run Petals to support models, what do you think of this idea?

  • LocoModB
    link
    fedilink
    English
    arrow-up
    1
    ·
    10 months ago

    This is what my hobby project essentially does. I’m running a single chat from 3 different servers in my network all serving different LLMs that are given a role in the chat pipeline. I can send the same prompt to multiple models so they can work on it concurrently, or have them handoff each other’s responses to continue elaborating, validating, or whatever that LLMs job is. Since each server is serving an API and websocket route, all I need to do is put it behind a proxy and port forward them to the public internet. Anyone here could visit the public URL and run inference workflows in my homelab(theoretically speaking). They could also spin up an instance on their side and we can have our servers talk to each other.

    Of course that’s highly insecure and just bait for bad actors. So I will scale it using overlay network that requires a key exchange and runs over VPN.

    Any startup thinking they are going to profit from this idea will only burn investor money and waste their own time. This will all be free and it’s only a matter of time before the open source community cuts into their hopes and dreams.