You can use the oobabooga API to do that. I haven’t done it myself, so I can’t say much about it.
You can start by reading Oobabooga’s wiki; I think it’s one of the most beginner-friendly tools. https://github.com/oobabooga/text-generation-webui/wiki/05-‐-Training-Tab
If I can run them all, I’ll just pick the biggest one.
“Write the snake game using pygame”
Thanks for sharing! I have been struggling with the llama.cpp loader and GGUF (using oobabooga and the same model): no matter how I set the parameters or how many layers I offload to the GPUs, llama.cpp is way slower than ExLlama (v1 and v2), not just a bit slower but an order of magnitude slower. I really don’t know why.
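For reference, this is roughly how I’d launch the llama.cpp loader with GPU offload from the command line. The model filename, layer count, and split ratio are placeholders; the flag names are the ones the webui exposes for its llama.cpp options, so double-check them against your version:

```shell
# Sketch only — substitute your own model path and numbers.
# --n-gpu-layers sets how many layers are offloaded to the GPU(s);
#   if it's too low, most of the work stays on the CPU and generation crawls.
# --tensor-split divides the offloaded layers between the two cards.
python server.py --model mymodel.Q4_K_M.gguf --loader llama.cpp \
    --n-gpu-layers 35 --tensor-split 50,50
```

If speed is still an order of magnitude off with all layers offloaded, it’s worth checking that the llama.cpp build actually has GPU support compiled in.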
In my experience it’s the fastest and llama.cpp is the slowest.
Thank you! It looks very deep to me, I will look into it.
Thanks! I have some problems loading GPTQ models with the Transformers loader.
Thanks for sharing!
I have 2 GPUs, and AWQ never works for me on Oobabooga; no matter how I split the VRAM, I get OOM in most cases.
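One thing that sometimes helps with multi-GPU OOM is capping per-GPU memory explicitly rather than letting the loader fill both cards, since some headroom is needed for activations and the KV cache. A sketch of a helper that builds the kind of `max_memory` map transformers’ `device_map="auto"` accepts (the helper name and the 24 GiB card sizes are my assumptions, not anything from oobabooga):

```python
# Hypothetical helper: build a {device_index: "N GiB"} map that leaves
# headroom on each GPU, to pass as max_memory with device_map="auto".
# GPU sizes below are placeholders — substitute your own cards.

def build_max_memory(gpu_sizes_gib, headroom_gib=2):
    """Cap each GPU at (size - headroom) GiB to leave room for activations."""
    return {i: f"{size - headroom_gib}GiB" for i, size in enumerate(gpu_sizes_gib)}

# Two 24 GiB cards, capped at 22 GiB each
print(build_max_memory([24, 24]))
```

The resulting dict can be handed to `AutoModelForCausalLM.from_pretrained(..., device_map="auto", max_memory=...)`; I haven’t verified this fixes AWQ specifically, but it rules out the loader over-allocating one card.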
Maybe. I have not done it yet, so I don’t know. You can Google around.