kryptkpr to LocalLLaMA@poweruser.forum · 2 years ago, replying in "SQLCoder-34b beats GPT-4 at Text-to-SQL":
DeepSeek is not based on any Llama training; it is their own 2T-token pretrain, with 16k context. All of this info is at the top of their model card.
kryptkpr posted to LocalLLaMA@poweruser.forum · 2 years ago:
GoLLIE: Guideline-following Large Language Model for Information Extraction (hitz-zentroa.github.io)