ByteOnBikes@slrpnk.net to Microblog Memes@lemmy.worldEnglish · 4 天前Critical thinkingslrpnk.netimagemessage-square216linkfedilinkarrow-up11.57Karrow-down130
arrow-up11.54Karrow-down1imageCritical thinkingslrpnk.netByteOnBikes@slrpnk.net to Microblog Memes@lemmy.worldEnglish · 4 天前message-square216linkfedilink
minus-squareTheTechnician27@lemmy.worldlinkfedilinkEnglisharrow-up19arrow-down3·4 天前 It’s a two-pass solution, but it makes it a lot more reliable. So your technique to “make it a lot more reliable” is to ask an LLM a question, then run the LLM’s answer through an equally unreliable LLM to “verify” the answer? We’re so doomed.
minus-squareApepollo11@lemmy.worldlinkfedilinkEnglisharrow-up3arrow-down8·edit-24 天前Give it a try. The key is in the different prompts. I don’t think I should really have to explain this, but different prompts produce different results. Ask it to create something, it creates something. Ask it to check something, it checks something. Is it flawless? No. But it’s pretty reliable. It’s literally free to try it now, using ChatGPT.
minus-squareTheTechnician27@lemmy.worldlinkfedilinkEnglisharrow-up10arrow-down1·4 天前 I don’t think I should really have to explain this, but different prompts produce different results.
minus-squareApepollo11@lemmy.worldlinkfedilinkEnglisharrow-up2·4 天前Hey, maybe you do. But I’m not arguing anything contentious here. Everything I’ve said is easily testable and verifiable.
So your technique to “make it a lot more reliable” is to ask an LLM a question, then run the LLM’s answer through an equally unreliable LLM to “verify” the answer?
We’re so doomed.
Give it a try.
The key is in the different prompts. I don’t think I should really have to explain this, but different prompts produce different results.
Ask it to create something, it creates something.
Ask it to check something, it checks something.
Is it flawless? No. But it’s pretty reliable.
It’s literally free to try it now, using ChatGPT.
Hey, maybe you do.
But I’m not arguing anything contentious here. Everything I’ve said is easily testable and verifiable.