ChatGPT provides false information about people, and OpenAI can’t correct it

Alb_x_008@lemm.ee · 1 year ago

ChatGPT provides false information about people, and OpenAI can’t correct it

VeganCheesecake@lemmy.blahaj.zone · 1 year ago

Uh, I understand the sentiment, but the model doesn’t know anything. And it’s legit really hard to differentiate between factual things and random bullshit it made up.

👍Maximum Derek👍@discuss.tchncs.de · 1 year ago

Yeah, no one can make it say “I don’t know” because it is not really AI. Business bros decided to call it that and everyone smiled and nodded. LLMs are 1 small component (maybe) of AI. Maybe 1/80th of a true AI or AGI.

Honestly the most impressive part of LLMs is the tokenizer that breaks down the request, not the predictive text button masher that comes up with the response.

Kichae@lemmy.ca · 1 year ago

Honestly the most impressive part of LLMs is the tokenizer that breaks down the request, not the predictive text button masher that comes up with the response.

Yes, exactly! It’s ability to parse the input is incredible. It’s the thing that has that “wow” factor, and it feels downright magical.

Unfortunately, that also makes people intuitively trust its output.

DudeDudenson@lemmings.world · 1 year ago

Was gonna say, the AI doesn’t make up or admit bullshit, its just a very advanced a prediction algorithm. It responds with what the combination of words that is most likely the expected answer.

Wether that is accurate or not is part of training it but you’ll never get 100% accuracy to any query

maynarkh@feddit.nl · 1 year ago

If it can name what the most likely combination is, couldn’t it also know how likely that combination of words is?

wahming@monyet.cc · 1 year ago

No, because that requires it to understand the words. It doesn’t.

kent_eh@lemmy.ca · 1 year ago

If it has been trained using questionable sources, or if it’s training data includes sarcastic responses (without understanding that context), it isn’t hard to imagine how confidently wrong some of the responses could be.

DudeDudenson@lemmings.world · edit-2 1 year ago

It’s not actually deciding anything, the AI thinking is marketing fluff really. But yes that’s called confidence rating and it does. But at the scale of something like chatgpt that uses a snapshot of the entire internet and is non mutable there’s no way to train it for every possible question. If you ask about a topic 99% of the internet gets wrong it’ll respond the wrong thing with 99% confidence

givesomefucks@lemmy.world · 1 year ago

It “knows” as in it has access to the information and the ability to provide the right info for the right context.

Any part of that process the AI can just “bullshit” and fills in the gaps with random stuff.

Which is what you want when it’s “learning”. You want it to try so it’s attempt can be rated, and the relevant info added to its “knowledge”.

But when consumers are using it, you want it to say “I can’t answer that”. But consumers are usually stupid and will buy/use the one that says “I can’t answer that” the least.

And it’s legit really hard to differentiate between factual things and random bullshit it made up.

Which is why AI should tell end users “I don’t know” more often.

NounsAndWords@lemmy.world · 1 year ago

Which is why AI should tell end users “I don’t know” more often.

If you feel this is a simple solution, I strongly suggest you write up exactly how you do this and make yourself a billion dollars.

Kichae@lemmy.ca · 1 year ago

It “knows” as in it has access to the information and the ability to provide the right info for the right context.

It doesn’t, though, any more than you have access to the information in a pile of 10 million shredded documents.

givesomefucks@lemmy.world · 1 year ago

Right, in this case that we’re talking about…

Do you not understand how “answer unavailable” is a better answer than taking a small percent of strips of paper at random and filling in the rest with words that sound relevant?

It’s like a mad libs

wahming@monyet.cc · 1 year ago

That is what LLMs do in EVERY conversation. Most of the time you don’t notice it, because it fits your expectations.

then_three_more@lemmy.world · 1 year ago

You know that answer unavailable is better because you have real intelligence, an LLM is just some mathematical functions so it can’t do that. If it could it would be getting much closer to actually being AI.

Ech@lemm.ee · 1 year ago

taking a small percent of strips of paper at random and filling in the rest with words that sound relevant?

It’s like a mad libs

Right. They’re text generators. That’s the technology. It can’t do what you’re demanding because that’s not how it works. LLMs aren’t magic answer machines. They don’t know when to say “answer not available”. They don’t know what they’re being asked. They don’t know anything.