- cross-posted to:
- news@lemmy.world
- cross-posted to:
- news@lemmy.world
Reddit says Microsoft’s Bing, Anthropic, and Perplexity have scraped its data without permission. “It has been a real pain in the ass to block these companies.”
lol. That’s not how any of this works.
If you have content freely and publicly accessible, it will be read freely by humans and bots.
I’ve said once and I’ll say it again. Either the information on your site is free to all or to none. You can’t have some people/entities pay and some not!
You can. We didn’t need to like it but they can. Besides, isn’t that how many magazines work? Pay for articles and such
Not really, the people who write the articles are actually employed by those magazine companies, and everyone who wants to get one, needs to pay for one.
and spez will pay you for creating content.
/s
U/spez the former moderator of r/jailbait? Who might have connections to Ghilisaine Maxwell? Him?
Oh and PS - posting OC porn on Reddit is a very weird process and not transparent
is there even anything of value on reddit?
A lot of older posts are still relevant to specific hobbies. I will look up information on paper, some guitar information, but most posts from the last two years are not worth looking at.
There is also so much regurgitated LLM shit.
Completely. Lemmy is far too small to have the value Reddit does.
I left Reddit due to their API bullshit, but I so miss all of the hobby communities I was part of, that has like-minded members, and a plethora of resources. It’s not easy to impossible to start communities such as reeftanks, homesteading, literature, bookcirclejerk, etc. on a platform as small as Lemmy. And beyond starting one, the quality and quantity will never match Reddit’s because Lemmy just doesn’t have the same reach.
Lemmy is great if you like Linux, like Star Trek, or are trans, but other than that, it’s missing so, so many demographics that make a wholistic platform.
trans
I feel this so hard, the sheer number of openly LGBTQ+ people here really skews the demographics of the site. I’m not saying it’s a problem, just saying that LGBTQ+ people are dramatically over-represented here. It’s an interesting contributor to lemmy culture, and I wonder how much that impacts homogeneity here (e.g. upvotes and downvotes for certain types of content).
But yeah, it’s missing a lot of demographics.
That said, I’m really into Linux (been using for >15 years), so that’s cool I guess.
As a cis straight man I’m taking this as a learning opportunity until the demographics level out. An inherently inclusive bias will be more helpful early on than more niche communities anyways.
Sure. Again, I’m not saying it’s bad, just that the bias seems to exist.
There are certainly worse biases that exist, such as very little representation from people on the right side of the spectrum, so hate against half the population seems to get a pass and downvotes silence constructive comments/posts just due to political bias. That’s incredibly frustrating, and I think the high focus on supporting LGBTQ+ people goes along with that (i.e. the message that conservatives “hate” LGBTQ+ people, which is only true for the more extreme end of conservatism).
That said, I do like the support LGBTQ+ people get, I just wish the demographics were a bit more diverse without sacrificing the culture. I live and work in a conservative area, but my company has built a pretty inclusive culture (at least for the area), so I think it’s totally possible.
Oh man I don’t miss that at all. Moderating out a pervasive delusion isn’t bias, any more than we’re biased in favor of a round Earth. On Reddit there were constant “enlightened centrists” who kept making appeals to moderation.
There’s nothing of value to be gained from conservatives. The “good” ones who don’t say the homophobia out loud are still voting for politicians who do. If it was just the extreme end, then Trump wouldn’t be their nominee. Hate is their normal now.
“If there’s a Nazi at the table and 10 other people sitting there talking to him, then you got a table with 11 Nazis.”
This is exactly what I’m talking about: casually dismissing half of the population based on little more than association. That drives division and pushes people into echo chambers.
Understood. I am disagreeing with you. If that wasn’t obvious, then I fear you may have missed my point.
Half of America supporting fascism is reason to create somewhere - anywhere - where that shit is shut down. You’re free to go associate with freeze peach Nazis on X, Facebook, Nostr, wherever. I don’t want any part of that and prefer a server that moderates them out. Paradox of tolerance and all that.
If you all believed the Earth was flat, then I would prefer the “echo chamber” of people saying “no, we checked, it’s round”. There simply being a lot of believers doesn’t imply an idea has merit, and we don’t have infinity time for BS.
Reddit remains as valuable as ever. It’s amusing that you think it imploded a year ago just because a small number of users migrated here
It sort of did, thousands of useful comments were turned to gibberish, the mobile web site turned to shot, and the mobile app stopped properly working for communitys with specific content warnings.
Yes, absolutely. Any time I need to buy a product I don’t know much about, I look for an enthusiast community with a FAQ. Most of the active, high-quality communities are on Reddit.
I would like decentralized services to replace that, but that’s a slow process, if it happens at all.
An absolutely prodigious back catalog of high quality images, interviews, and explainers. A treasure trove of historical content that’s been heavily indexed and participant-weighted for relevancy. And the bulk of it predates the infestation of AI, so its valuable just as sampling data of original human content for further iterative development of ChatGPT and other LLMs.
We shouldn’t accept this behavior or other companies will follow!
Reddit CEO can shove reddit up his ass sideways. The whole thing.
He can put his dick in /dontputyourdickinthat
Aside: I give Lemmy serious props for not reproducing some of these communities btw.
This must be how they plan to raise revenue, since people don’t buy enough gold and badges to support the site
That isn’t how it works for publicly traded companies. There is no such thing as enough only more
Plus they don’t even need to be profitable. They just have to convince some rich losers that they might be profitable at some undetermined time in the future.
A lot of times it’s more like the old Taxi medallions. You paid 400k for the medallion, operated it for a few years, and then sold it to the next guy for the going rate which is likely around 400k.
Well, they did. I guess the definition of “support” has changed. Probably to include more yachts.
Ok, now I’m miffed that Google caved to Reddit’s demands and paid up.
Because this set a dangerous precedent.
Earlier, Google got a lot of demands from various publications to pay up for indexing the publicly available news sites. And they always responded with “Ok, guess you leave us no other choice than just exclude you from indexing altogether.” Let the site simmer for a while until they went “oh shit, not being indexed by major search engines sucks. we didn’t really mean it please come back”
It’s especially jarring because Reddit doesn’t even produce their own news content anyway. That search engine money isn’t going to the content creators. News sites at least could say they need to pay for their content to be written by their employees.
Other search engines should continue indexing Reddit and take them to court if they issue a cease and desist imo.
I am guessing Google paid for access to their internal archives on posts and comments. Will give them a unique dataset for all the stuff that was deleted during the many exodus runs over the years.
I don’t see why you should be miffed at all, Google can bully publications and unindex them and it will work. Reddit according to this: https://www.semrush.com/blog/most-visited-websites/ is the third most visited website after google and youtube, so they have a bit more power, lots of people google with “site:reddit.com” because it still has some useful content like that and I am going go out on a limb and say that US visitors are the most important for selling ads for Google.
Microsoft will have to make it’s own value calculation whether it’s worth it and they will likely payup, although more and more of reddit is just bots posting stupid shit.
At this point I think Google needs Reddit more than Reddit needs Google. Google search kind of sucks these days. How often do you add site:reddit.com to the end of the query to get any sort of useful result for a specific question? For me it’s pretty often. If Reddit cuts off Google, that goes away and Google search suffers significantly. And that might mean the one thing Google cannot abide- a situation where people in large numbers start actively seeking out other search engines.
Don’t get me wrong, they’re both being super shitty.
Google needs to quit obsessing over AI and a million different cloud products and fix the one product that people actually care about. Reddit needs to stop acting like they own everybody.Steve “Spaz” Huffman has been trying to milk money out of the site that Alexis Ohanian, Aaron Swartz, and pigboy Steven Spaz kinda created collaborating with each other. Aaron was shoved out first by The Spaz, though one could claim rightfully so in that case since Aaron was basically done with the site, and had moved on to his next project, essentially leaving Alexis and Spaz in the lurch as neither of them understood the code that Aaron had written to make the site functional.
In many ways, the users made this possible. Most of us aren’t users in this case. The users that make up the vast majority of the population don’t give one thought to their own personal privacy, after all they have “nothing to hide,” not knowing that they really need to hide almost all of their data.
If the users were to be educated about how much money the various companies like Reddit, Facebook, Microsoft, Apple, and almost every single other “disruptive tech company,” has stolen from them, the socialist revolution would have started in the 1980s
That’s kind of how search engines work, buddy.
“People who fecklessly farm other people’s data upset at other companies are farming their data.”
Does LLMmy have a robots.txt against scrapers?
I don’t think the content on Reddit is their to sell…unless resistors are getting a cut. That site is a dumpster and needs to die already.
On one hand, that’s a shitty, greedy move from Reddit. All of the data, and value is provided by the users.
On the other hand, Microsoft does want to feed it into OpenAI and present it on a Bing search engine result page rather than sending the visitor to Reddit where they might join a community (or view an ad or ten).
The article says OpenAI made a deal with Reddit, so blocking Microsoft isn’t going to keep Reddit’s data from getting fed to OpenAI
“our” data
Yeah this.
Channeling Joe Pesci from “Casino”: “You only have that farking data because WE made that possible!”
“GET THIS THROUGH YOUR HEAD!”
Reddit says “blablabla” . Reddit is just trying to stop its communistic website from losing money.
Reddit says “we will give you access” once you pay us and feather our pockets.
Fuck spez, what a cunt. Delete all your comments etc, and let the AI rot in retarded posts.
I dont think you know what the word communistic means.
Communistic = Things I don’t like.
For example, Flat tires = Communist
I woke up one day and my leg started being communist. I haven’t been able to get rid of it since.
Don’t you have a communist repair kit in your car?
Do you mean a capitalist repair kit? Sometimes you just need to plug the communist tire with a bit of capitalism.
I has a communist once. Not fun.