• ThistleknotB
    10 months ago

    RM is the reward model… not the same as the LM. I tried the LM and wasn't impressed; GPT-3.5 did better at summarizing quotes. It was good, but I honestly think OpenHermes and/or Synthia 1.3b do better.