OpenAI says its latest models outperform doctors in medical benchmark

Pro@programming.dev · edit-2 3 months ago

OpenAI says its latest models outperform doctors in medical benchmark

banghida@lemm.ee · 3 months ago

Sure thing

etchinghillside@reddthat.com · 3 months ago

US Healthcare will now be affordable!

orclev@lemmy.world · 3 months ago

Wake me up when someone besides OpenAI says they’re the best at something. When a company releases a benchmark they designed that their own tool that’s generally regarded as not very good is suddenly the best at, that’s not news, at best that’s PR, at worst propaganda. This reeks of “we investigated ourselves and found we did nothing wrong”.

Buffalox@lemmy.world · 3 months ago

I almost feel sad for IBM, this was supposed to be their thing.

etchinghillside@reddthat.com · 3 months ago

Had forgotten we’ve been promised this before.

Kalvin@lemmy.world · edit-2 1 month ago

Removed by mod

TrendigOsthyvel@lemmy.world · 3 months ago

Much wow.

Kalvin@lemmy.world · edit-2 1 month ago

Removed by mod

taladar@sh.itjust.works · 3 months ago

So they created a test so broken and warped that no actual professional can understand it but their AI performs well on it?

Bezier@suppo.fi · 3 months ago

Tl;dr: OpenAI says OpenAI product performs well in a benchmark created by OpenAI.

Opinionhaver@feddit.uk · 3 months ago

The bar exam isn’t created by OpenAI, yet the outdated GPT-4 model still ranked in the 90th percentile on it.