Alien Top
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
LegcorB to LocalLLaMA@poweruser.forumEnglish · 2 years ago

Starling-RM-7B-alpha: New RLAIF Finetuned 7b Model beats Openchat 3.5 and comes close to GPT-4

message-square
message-square
49
link
fedilink
1
message-square

Starling-RM-7B-alpha: New RLAIF Finetuned 7b Model beats Openchat 3.5 and comes close to GPT-4

LegcorB to LocalLLaMA@poweruser.forumEnglish · 2 years ago
message-square
49
link
fedilink

​

​

https://preview.redd.it/3krgd1sg2z2c1.png?width=800&format=png&auto=webp&s=b76c5fb9fa22938c74ec3095f63adaec8ff2219d

​

I came across this new finetuned model based on Openchat 3.5 which is apparently trained used Reinforcement Learning from AI Feedback (RLAIF).

https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha

Check out this tweet: https://twitter.com/bindureddy/status/1729253715549602071

  • sahil1572B
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 years ago

    Every other model nowadays claims to be GPT-4, and they turn out to be < GPT-3. I don’t know what kind of test they use to score .

    • sahil1572B
      link
      fedilink
      arrow-up
      1
      ·
      2 years ago

      LOL GPT4

      https://preview.redd.it/fy2rvgg8v13c1.png?width=1754&amp;format=png&amp;auto=webp&amp;s=8df41b305a0d01be335f406a204b1061ca24b658

LocalLLaMA@poweruser.forum

localllama@poweruser.forum

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@poweruser.forum

Community to discuss about Llama, the family of large language models created by Meta AI.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 4 users / day
  • 4 users / week
  • 4 users / month
  • 4 users / 6 months
  • 3 local subscribers
  • 4 subscribers
  • 1.03K Posts
  • 5.96K Comments
  • Modlog
  • mods:
  • communick@poweruser.forum
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org