• LalSalaamComrade@lemmy.ml
    link
    fedilink
    arrow-up
    22
    arrow-down
    1
    ·
    edit-2
    11 months ago

    So, correct me if I’m wrong, but Threads, as a federated web app, can get posts from other federated web app instances, and can store them indefinitely, right? Is this going to be a data-collection nightmare?

    • 0xtero@kbin.social
      link
      fedilink
      arrow-up
      47
      ·
      11 months ago

      Not more than it is now. Everything is already public so if they need it, they’ve already been collecting it. This doesn’t really change anything.

      • h3ndrik@feddit.de
        link
        fedilink
        arrow-up
        15
        ·
        edit-2
        11 months ago

        And it’s how federation is supposed to work. Either you want to send your content to other instances or you don’t. But federation is the wrong tool if you want to stay alone. You can defederate and block them if you don’t like their terms.

        • DarkThoughts@kbin.social
          link
          fedilink
          arrow-up
          11
          ·
          11 months ago

          Even without federation / specific protocols. You can just take about any sort of content on the internet pretty easily if you wanted to. Search engine crawlers do something similar, otherwise search results would just not work at all.

        • PropaGandalf@lemmy.world
          link
          fedilink
          arrow-up
          4
          arrow-down
          1
          ·
          11 months ago

          Couldn’t instances or accounts just license their content? Like would it be legally binding if I write in my profile that all the content I wrote here is licensed under a specific CC license?

          • 0x1C3B00DA@kbin.social
            link
            fedilink
            arrow-up
            9
            ·
            11 months ago

            Genius (the lyrics company) tried to license the content on their website and a judge said that can’t be legally binding because there’s no guarantee the scraper read it. It seems like the same would apply here.

            • h3ndrik@feddit.de
              link
              fedilink
              arrow-up
              6
              arrow-down
              1
              ·
              edit-2
              11 months ago

              Seriously doubt that. If I pirate a book, game or TV series and don’t read the copyright, it’s still illegal. Same should apply to other written text like on a website.

              • Zaktor@sopuli.xyz
                link
                fedilink
                English
                arrow-up
                6
                ·
                11 months ago

                Copyright is a law. Everything is copyrighted, with or without the little ©. Licensing is a peer-to-peer contract. Unless you can prove the other side is aware of and agreed to a contract, it doesn’t bind them.

                Notably, licensing often is needed because general copyright exists. The license grants them the right to copy your full text or whatever, and if they didn’t agree to it, then they had no right to copy it. There are exceptions for excerpts and search indexing and the like, but they can’t (legally) just take all your posts because you put them online.

                That all said, big companies have already been doing mass copyright violations for AI, so copyright or licenses don’t necessarily mean anything unless you can force them to comply. There are lawsuits on AI scraping now. Because the end result is either making up some reason that copyright doesn’t ban copying if you do enough of it or making LLMs effectively illegal and putting some massive corporations on the hook for mass violations against basically everyone online, I wouldn’t personally bet the courts ruling against the corporations.

              • 0x1C3B00DA@kbin.social
                link
                fedilink
                arrow-up
                3
                ·
                11 months ago

                It looks like I was mixing up some facts. The Genius case was denied because genius doesn’t own the copyright to the lyrics they were publishing. I can’t find the case now, but there was a case where a judge said scraping was allowed because it wasn’t a given that the scraper had read a ToS.

          • h3ndrik@feddit.de
            link
            fedilink
            arrow-up
            7
            ·
            11 months ago

            We should open a feature request. An additional license selection field upon posting on Lemmy, or a default setting to license every post and comment from a user account would be awesome. And free/libre culture fits well within this ecosystem.

    • Helmic [he/him]@hexbear.net
      link
      fedilink
      English
      arrow-up
      9
      ·
      11 months ago

      It aleady has been. Anything, anything publicly posted and available will be harvested by, at a minimum, Google spiders. The only privacy benefits of any site can only be promises that they won’t share the information that you don’t publicly list.

      And even then,.all it takes is the feds raiding a dipshit Kolektiva admin to get that information illegally anyways.

      Do not organize serious actions over social media. At most, hold sensitive discussions over private, E2E encrypted chats like Matrix.

    • spudwart@spudwart.com
      link
      fedilink
      English
      arrow-up
      7
      ·
      11 months ago

      Honestly. This is a social network platform. Assume all you post and share is being collected via web crawlers and data brokers pulling from the api.