In my opinion open-source projects should focus an a very narrow thing, instead of focusing on being a "GPT", that focuses on being able to do everything.

GodEmperor23 · 1 year ago

In my opinion open-source projects should focus an a very narrow thing, instead of focusing on being a "GPT", that focuses on being able to do everything.

a_beautiful_rhind · 1 year ago

The big company gives you a base model and then it’s up to you to do that.

I’ve seen some agent and medical tunes. Smaller models for image and vision or tts, etc. Anyone doing it for a specific business case is probably not posting the model or advertising it.

Are you asking for people to make specialized 1.3b models from scratch? Because I think even that takes a long time on a “few” A100s.

FPham · 1 year ago

The big company give you 13b base to play with and you can fine tune it to fit your specifications.

I agree that people should not focus on OpenAI GPT killer, but mostly because it is a losing proposition, so they are basically wasting time after certain point. You can finetune 13b until it is blue, it will still not be OpenAi GPT.

But then, back to the top - YOU can finetune it to whatever you want. It just happened the other people want to make it a general GPT - toddler. I don’t. And I finetune it in whatever I want.

Still, 13b is playing with a toy car, 33b is playing with a toy truck. From 70b it starts to be more interesting (a toy airplane?) but it also require a bit different setup to play with it. So out of my toy box.

To think we are scratching at the feet of company that got 10B from Microsoft and can hire the brightest minds is unrealistic. We are not even playing in the same field. We are in a sandbox, somewhere near our mama’s home, they are in a big arena sponsored by big money. With a marching band and cheerleaders and beer and everything.