Wikimedia Foundation's plans to introduce AI-generated article summaries to Wikipedia

ɯᴉuoʇuɐ@lemmy.dbzer0.com · edit-2 3 months ago

cotlovan@lemm.ee · 3 months ago

Who exactly asked for this? Wikipedia isn’t publicly traded, they aren’t a for-profit company, why are they trying to shove AI into people’s faces?

For those few who wanted it, there are dozens of bots who can summarize the (already kinda small) Wikipedia articles

coolmojo@lemmy.world · 3 months ago

Is this the same Wikimedia Foundation who was complaining about AI scrapers in April?

AbouBenAdhem@lemmy.world · 3 months ago

IIRC, they weren’t trying to stop them—they were trying to get the scrapers to pull the content in a more efficient format that would reduce the overhead on their web servers.

Lv_InSaNe_vL@lemmy.world · 3 months ago

You can literally just download all of Wikipedia in one go from one URL. They would rather people just do that instead of crawling their entire website because that puts a huge load on their servers.

palordrolap@fedia.io · 3 months ago

Ah, but the clueless code monkeys, script kiddies and C-levels who are responsible for writing the AI companies’ processing code only know how to scrape from someone else’s website. They can’t even ask their (respective) company’s AI for help because it hasn’t been trained yet. (Not that Wikipedia’s content will necessarily help).

They’re not even capable of taking the ZIP file and hosting the contents on localhost to allow the scraper code they got working to operate on something it understands.

So hammer Wikipedia they must, because it’s the limit of their competence.

JackbyDev@programming.dev · edit-2 3 months ago

What’s funny is crawling the site would actually be more difficult and take longer than downloading and reading the archive.

Context for others: Wikipedia is only ~24 GB (compressed and without media or history). https://en.wikipedia.org/wiki/Wikipedia:Size_of_Wikipedia

As of 16 October 2024, the size of the current version including all articles compressed is about 24.05 GB without media.
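To illustrate why reading the archive is simpler than crawling, here is a minimal Python sketch that streams pages out of a bz2-compressed MediaWiki XML export without unpacking it to disk. It is only a sketch under assumptions: the function names `iter_pages` and `demo` are made up for this example, the tiny `SAMPLE` document stands in for a real dump, and the element names (`page`, `title`, `text`) are assumed to match the export format of whichever dump you download.

```python
import bz2
import io
import xml.etree.ElementTree as ET

# Tiny stand-in for a real dump (assumption: real exports use the same
# <page>/<title>/<revision>/<text> layout, possibly with another schema version).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page><title>Alpha</title><revision><text>First article.</text></revision></page>
  <page><title>Beta</title><revision><text>Second article.</text></revision></page>
</mediawiki>"""


def iter_pages(stream):
    """Yield (title, text) for each <page> element in an XML export stream."""
    title, text = None, None
    # iterparse reads incrementally, so the whole file is never parsed at once.
    for _event, elem in ET.iterparse(stream, events=("end",)):
        tag = elem.tag.rsplit("}", 1)[-1]  # drop the XML namespace, if any
        if tag == "title":
            title = elem.text
        elif tag == "text":
            text = elem.text or ""
        elif tag == "page":
            yield title, text
            title, text = None, None
            elem.clear()  # discard the finished page's children to save memory


def demo():
    """Compress the sample in memory and read it back the way a dump would be read."""
    compressed = bz2.compress(SAMPLE)
    with bz2.open(io.BytesIO(compressed), "rb") as stream:
        return list(iter_pages(stream))
```

For a real dump, the same generator should work with `bz2.open(path, "rb")`, where `path` is wherever you saved the downloaded `.xml.bz2` file (a placeholder here, not a specific filename).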

tfm@europe.pub · 3 months ago

Thanks, I hate it.

qevlarr@lemmy.world · 3 months ago

🪦🪦🪦🪦

RIP Wikipedia, we will miss you

miguel@fedia.io · 3 months ago

Well, this inspired me to swing my monthly wikipedia donation over to a world book sub instead. It’s bad enough that wikipedia was a very dubious source of info, but now this is just too much.

doctortofu@reddthat.com · 3 months ago

Et tu, Wikipedia?

My god, why does every fucking piece of text suddenly need to be summarized by AI? It’s completely insane to me. I want to read articles, not their summaries in 3 bullet points. I want to read books, not cliff notes, I want to read what people write to me in their emails instead of AI slop. Not everything needs to be a fucking summary!

It seriously feels like the whole damn world is going crazy, which means it’s probably me… :(

Maeve@kbin.earth · 3 months ago

It’s not you.

“It is no measure of health to be well-adjusted to a profoundly sick society.” Krishnamurti

FourWaveforms@lemm.ee · 3 months ago

Then skip the AI summary.

liv@lemmy.nz · 3 months ago

For those of us who do skip the AI summaries it’s the equivalent of adding an extra click to everything.

I would support optional AI, but having to physically scroll past random LLM nonsense all the time feels like the internet is being infested by something as annoying/useless as ads, and we don’t even have a blocker for it.

FourWaveforms@lemm.ee · 3 months ago

I think it would be best if that’s a user setting, like dark mode. It would obviously be a popular setting to adjust. If they don’t do that, there will doubtless be Greasemonkey and other scripts to hide it.

Dr. Moose@lemmy.world · 3 months ago

This ignorance is my biggest pet peeve today. Wikipedia is not targeting you with this but expanding accessibility to people who don’t have the means to digest a complex subject on their lunch break.

TL;DR: check your privilege

JandroDelSol@lemmy.world · 3 months ago

Giving people incorrect information is not an accessibility feature

Dr. Moose@lemmy.world · 3 months ago

RAG on 2 pages of text does not hallucinate anything though. I literally use it every day.

Redex@lemmy.world · edit-2 3 months ago

Honestly, I think it’s a good idea. As long as it’s clearly highlighted that “this is an AI generated summary”, it could be very useful. I feel like a lot of people here have never tried to e.g. read a maths article without having a PhD in mathematics. I would often find myself trying to remember what a term means or how it works in practice, only to be met by a giant article going into extreme technical detail that I for the life of me cannot understand, but if I were to ask ChatGPT to explain it I would immediately get it.

JandroDelSol@lemmy.world · 3 months ago

People will believe the AI summary without reading the article, and AI hallucinates constantly. Never trust an output from an LLM.

KnitWit@lemmy.world · 3 months ago

Never thought I’d cancel my recurring donation for them, but just sent the email. I hope they change their mind on this, but as I told them, I will not support this.

asudox@lemmy.asudox.dev · 3 months ago

Time to switch to something else? Nutomic developed Ibis wiki for example: https://ibis.wiki/

FaceDeer@fedia.io · 3 months ago

You realize this is just a proposal at this stage? Their proposed next step is an experiment:

If we introduce a pre-generated summary feature as an opt-in feature on a the mobile site of a production wiki, we will be able to measure a clickthrough rate greater than 4%, ensure no negative effects to session length, pageviews, or internal referrals, and use this data to decide how and if we will further scale the summary feature.

Note, an opt-in clickthrough that they intend to monitor for further information on how to implement features like this and whether they should monitor them at all. As befits Wikipedia, they’re planning to base these decisions on evidence.

If “they’re gathering evidence and making proposals” is the threshold for you to jump ship to some other encyclopedia, I guess you do you. It’s not going to be much of an exodus though since nobody who actually uses Wikipedia has seen anything change.

asudox@lemmy.asudox.dev · 3 months ago

Mb. I still don’t see anything good coming out of implementing anything to do with AI though.

sandflavoured@lemm.ee · 3 months ago

My immediate thought is that the purpose of an encyclopaedia is to have a more-or-less comprehensive overview of some topic of interest. The reader should be able to look through the page index to find the section they care about and read that section.

Its purpose is not to rapidly teach anyone anything in full.

It seems like a poor fit as an application for LLMs

katy ✨@lemmy.blahaj.zone · 3 months ago

fucking disgusting. no place should have ai but especially not an encyclopedia.

ace_garp@lemmy.world · 3 months ago

These LLM-page-summaries need to be contained and linked, completely separately, in something like llm.wikipedia.org or ai.wikipedia.org.

If even a few LLM hallucinations were ever uncovered in these summaries, it would cast doubt on the accuracy of all page content in the project.

Keep the generated-summaries visibly distinct from user created content.

bookmeat@lemmynsfw.com · 3 months ago

If it runs without human supervision, it’ll be a gong show.

ɯᴉuoʇuɐ@lemmy.dbzer0.com · 3 months ago

There should be some degree of supervision: users will at a minimum be able to rate the summaries as helpful or unhelpful, and I guess those rated as unhelpful will be removed.