I recently started using the base model of LLaMA-2-70B for creative writing and surprisingly found most of my prompts from ChatGPT actually works for the “base model” too, suggesting it might also be fine tuned a bit on ChatGPT-like instructions.
Curious anyone tried both llama 1 & 2 base model and can share their experiences on creativity ? My hunch is llama 1 might be slightly better at it, assuming it hasn’t go through as much alignment.
That’s an interesting idea … in my experience anything <1 works, >1.2 goes wild and for things we expect to be a bit more deterministic, setting it to 0 is preferred.
What’s your best setup and temperature for creative writing ?