We are Higgsfield AI. We have a large GPU cluster and want to fine-tune on your dataset.

RiskApprehensive9770 · 2 years ago

herozorro · 2 years ago

Please do something like this, or provide a detailed example of how an open-source framework's API can be added to a coder LLM.

How do we prepare the data (code samples, docs) so the coder LLM learns to do code completions and answer documentation questions?

RiskApprehensive9770 · 2 years ago

You can train on any dataset as long as it follows our format.

Soon we’ll publish a video tutorial.

herozorro · 2 years ago

But what would a proper formatting example for code look like? Just paste in a bunch of files from a repo? Or should it be more of a cheatsheet format?