New Model: Starling-LM-11B-alpha-v1

perlthoughts · 2 years ago

New Model: Starling-LM-11B-alpha-v1

perlthoughts · 2 years ago

I noticed a lot of responses about the mergekit configuration i used to copy layers of 7b model of mistral to 11b. Here is my config.yml for mergekit (link in post description):

slices:
  - sources:
    - model: maywell/Synatra-7B-v0.3-RP
      layer_range: [0, 24]
  - sources:
    - model: maywell/Synatra-7B-v0.3-RP
      layer_range: [8, 32]
merge_method: passthrough
dtype: float16