Base models are not trained for conversations, so you cannot use it as a chat. It’s like GPT-4 and ChatGPT. GPT-4 is the base model, then it’s fine-tuned to be conversational, which is what you see in ChatGPT. Same as Llama vs Chat Llama.
Base models are not trained for conversations, so you cannot use it as a chat. It’s like GPT-4 and ChatGPT. GPT-4 is the base model, then it’s fine-tuned to be conversational, which is what you see in ChatGPT. Same as Llama vs Chat Llama.
The chat model came out today
Hey there! I think this is doing offloading?
If so, it’s not a new thing. Check out https://huggingface.co/docs/accelerate/usage_guides/big_modeling for a guide with code and videos about it