The Picard Maneuver@lemmy.world to People Twitter@sh.itjust.works · 2 months agoIt's best not to dwell on itlemmy.worldimagemessage-square151fedilinkarrow-up11.89Karrow-down16
arrow-up11.88Karrow-down1imageIt's best not to dwell on itlemmy.worldThe Picard Maneuver@lemmy.world to People Twitter@sh.itjust.works · 2 months agomessage-square151fedilink
minus-squaremelpomenesclevage@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up15·2 months agothat depends on what topic you know and how well you know it.
minus-squaretaladar@sh.itjust.workslinkfedilinkarrow-up11·edit-22 months agoLLMs are actually pretty good for looking up words by their definition. But that is just about the only topic I can think of where they are correct even close to 80% of the time.
minus-squaremelpomenesclevage@lemmy.dbzer0.comlinkfedilinkEnglisharrow-up1·2 months agoyeah. some things I’d be shocked if they were correct 1% of the time. some things, like that, I might expect them to be correct about 80%, yeah.
40% seems low
that depends on what topic you know and how well you know it.
LLMs are actually pretty good for looking up words by their definition. But that is just about the only topic I can think of where they are correct even close to 80% of the time.
yeah. some things I’d be shocked if they were correct 1% of the time. some things, like that, I might expect them to be correct about 80%, yeah.