https://huggingface.co/deepnight-research

I’m not affiliated with this group at all, I was just randomly looking for any new big merges and found these.

100B model: https://huggingface.co/deepnight-research/saily_100B

220B model: https://huggingface.co/deepnight-research/Saily_220B

600B model: https://huggingface.co/deepnight-research/ai1

They have some big claims about the capabilities of their models, but the two best ones are unavailable to download. Maybe we can help convince them to release them publicly?

  • SomeOddCodeGuyB
    link
    fedilink
    English
    arrow-up
    1
    ·
    10 months ago

    Right. This part right here is very suspicious to me, and I’m taking their claims with a grain of salt.

    No! The model is not going to be available publically. APOLOGIES. The model like this can be misused very easily. The model is only going to be provided to already selected organisations.

    • bot-333B
      link
      fedilink
      English
      arrow-up
      1
      ·
      10 months ago

      I think they changed it to it’s still an experiment and they are finishing evaluations to better understand the model.

        • bot-333B
          link
          fedilink
          English
          arrow-up
          1
          ·
          10 months ago

          I guess they might open source the 600B one? They have different names, so maybe different training approaches.