Hi folks,
Simple question, really: what model (fine-tuned or otherwise) have you found that can extract data from a bunch of text?
I’m happy to fine-tune, so if anyone has had success there, I’d really appreciate some pointers in the right direction.
I’m really just looking for a starting point here. I’m aware of the DETR class of models and how Microsoft trained Table Transformer on top of DETR. Can something similar be done with Llama 2 and similar models?
P.S. I can’t use GPT because the data contains sensitive PII.
What are the information extraction evals? Do you have a link?