Jokes on you for crawling mostly synthetic text?
Only 100,000 times? Shit, do I need to be worried about getting sued too?
All porn subreddits are exempted
So Reddit serves free data but Anthropic took too much?
They give out free license for their data, but require following their terms of service.
Obviously Reddit isn’t averse to bots scraping the site for data, just ones that aren’t paying them. I’m regretting not going through and systematically deleting all my posts and comments before deleting my account, but I thought that happened automatically.
I don’t regret not deleting all my comments. For me, It’s a mishmash of helpful/comedic/observational comments that I don’t care that they have sold off for use as training data.
But, I just got shadowbanned, because of my VPN or something, so they aren’t getting any more!
it wouldnt have mattered anyway if you left during the time most people did. Reddit rolled back mass deleted data and manually deleted accounts during that duration so that comments remained without usernames.
The fuckers!
Do you really believe they don’t have backups? Especially since it seems selling content for AI training was their plan for quite a while?
Or that they didn’t make full backups a couple years ago before the protest, anticipating a lot of users would try to delete their comments?
I think the only way to truly delete anything from reddit would be living in EU and enforcing a GDPR request, but even in that case, I believe it would be very difficult to check they actually comply.
I think the only way to truly delete anything from reddit would be living in EU and enforcing a GDPR request, but even in that case, I believe it would be very difficult to check they actually comply.
Wouldn’t work. GDPR is not copyright. Deleting the username is enough, unless you have doxed yourself in some post.
Rather, it can be argued that GDPR requires restoring comments at least in some situations. Comments may be necessary context to understand replies or even other posts.
I deleted all my posts and my account and they restored them all and then permanently banned me. They can recover anything they want to.
Basically nowhere on the internet does delete mean delete. Nearly everywhere it means archive or hide.
Long story short: They are not combatting bots on their platform. They sold training data to google and these guys aren’t paying, that’s why they’re suing.
So if reddit wins, that means the content is theirs. So if the content is theirs, they are liable for any content that is illegal. Is that true?
yes to both regardless of this lawsuit
The wiggle room for large businesses is that they remove content that violates local laws when notified of it
No. I am not aware of any law that makes you liable by holding or claiming the copyright to some content. EG you may have to pay damages for libel, but not because you have copyright to the libelous statement.
Doesn’t quite make sense.
You’re telling me that someone can get popped for mistakenly visiting the dark side of the internet and having whatever-the-fuck horrible shit put on their machine, but owning the content and hosting it on your servers results in nothing?
Not quite.
Generally, sites aren’t liable for user generated content as long as they follow some rules. They need to take down illegal content and provide some way of reporting such content. In the US, that’s the whole DMCA takedown thing. The whole content ID thing, that YouTube does, might not be strictly necessary, but it was rolled out in response to a high-stakes lawsuit. The EU is, as always, more strict in these matters.
People are not punished for things beyond their control (but mind that a fine is not the same as damages). If you are sent illegal content, that you have not requested, you shouldn’t expect formal punishment, though the investigation may be punishing in itself. If you simply don’t know how caching works, you’re probably in trouble.
But this was about copyright. I don’t think you get punished anywhere for holding some copyright. Say some Japanese Manga artist travels to some European state where some of their works are illegal. They’re not going to get arrested for that. Anyone who brings such illegal works into the country will not be so lucky, regardless of copyright.
The content’s theirs whether they win or not, isn’t it? It’s in the EULA when you sign up.
non-exclusive
That means we can license all our content to another company, and Reddit would be forced to allow them to fetch it, as we still own it, right?
Non-exclusive just means you’re free to give a copy of your content to whoever you want. It doesn’t mean Reddit is obligated to distribute it for you.
No. Just because you own a copyright, doesn’t mean that you are entitled to free network services. If you owned the copyright to a movie, would you expect free tickets for any cinema showing it?
It certainly reads that way. Gonna start a Reddit User Collective? Licence it to Anthropic at a discount to undercut Reddit? That could be pretty funny.
That would be legally possible, though, obviously, you would have to pay for your own servers.
In practice, it wouldn’t be worth anyone’s time.
I don’t see why. Users own the content wherever it’s located. Reddit, of course, would be free to remove that content, but that would be cutting off their own nose to spite their face and is also acceptable.
You don’t see why you would need your own servers? Do you see why unauthorized access to a computer system might be illegal?
“The pot sues the kettle, alleging that it is black”
“Reddit’s humanity is uniquely valuable in a world flattened by AI,“ Lee said. ”Now more than ever, people are seeking authentic human-to-human conversation. Reddit hosts nearly 20 years of rich, human discussion on virtually every topic imaginable. These conversations don’t happen anywhere else—and they’re central to training language models like Claude.”
LMAO, reddit’s days of genuine conversations between humans is long gone.
Bots? On Reddit!?
It’s more likely than you think!
Aren’t the copyrights still belonging to the original authors? What is Reddit suing for? The header and the footer? 🤔
This is like one of those cases where I’m kind of hoping they both lose somehow. Neither party are right in this case, Reddit is trying to claim copyright over content they have no rights to, and anthropic shouldn’t be violating copyright without a licence.
But apparently you are actually allowed to violate copyright without a licence if you’re an AI company because apparently llms are the future? So I guess Reddit are going to lose, which will be funny.
Is it violating copyright to browse the web?
“Violating copyright without a licence” is a lovely turn of phrase. You must be the valedictorian of the Lemmy School of Copyright.
Judge finds that anthropic has to pay restitution to the reddit users. Affirms that posts belong to users.
Well, I can dream.
I am squarely on the reddit should lose this side.
Anthropic may be breaking copyright, but not Reddit’s copyright. Sure maybe Anthropic should be sued, but not by Reddit.
Actually this case could be a good thing. The whole question of who owns user generated content needs hashing out, because no one seems to actually know.
Obviously the logical answer would be that the people who created it own the content, but that’s never been officially decided.
The whole question of who owns user generated content needs hashing out, because no one seems to actually know.
It’s billionaires. They know. They just sometimes squable over it like two year olds. But they know. They pay lawyers to make it clear in thousand page terms of service documents.
Because that’s the only common sense conclusion to make, but that doesnt make rich fucks more money
You mean Reddit, the company that would be very happy if Anthropic did the exact same thing, but paid Reddit first?
hope reddit loses
Spez can forever get fucked
while half of reddit is infested with propaganda bots from russia.
Not just Russia.
Israel, US, China, North Korea, India and other countries… Nuclear Lobby, Fossil Fuel Lobby and countless other industry lobbyists… Private companies advertising their products…
But have you seen Rampart?
First of all, I have not seen it, and second off, I don’t want to see it. Lets focus on the Reddit lawsuit.
Fuck Reddit