That’s going to be a lot more work since comments and posts are decentralized here. You can probably easily get some of it but it will be hard to get all of it.
It’s actually even easier than that. Instead of setting up an tool to make up requests for the API, you can just set up a bridge that will dump everything right into your database. The wonders of federation.
Long Live Lemmy
Well, they can (and will) still scrape us if they want. Just nobody’s making a buck off of it.
yet
All better than that piggyboy getting free money
That’s going to be a lot more work since comments and posts are decentralized here. You can probably easily get some of it but it will be hard to get all of it.
It’s actually even easier than that. Instead of setting up an tool to make up requests for the API, you can just set up a bridge that will dump everything right into your database. The wonders of federation.
The reality though is I can train LLMs off Lemmy data all I want and I don’t have to pay ANYONE a dime…