• Zetta@mander.xyz
    12 days ago

    But it’s not the same; you don’t understand how LLM training works. The original piece of work is not retained at all. The training data is used to tune pre-existing numbers, and those numbers change slightly as training goes on.

    At no point in time is anything resembling the training data ever present in the 1’s and 0’s of the model.
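
    A minimal sketch of the "tune pre-existing numbers" idea, assuming a toy model with a single weight and one made-up training example. The function name `sgd_step`, the learning rate, and the values are hypothetical and for illustration only; real LLM training does the same kind of small update across billions of weights.

```python
# Hypothetical toy example: one weight, one training example, plain gradient descent.
def sgd_step(w, x, y, lr=0.1):
    """Return the weight after one gradient step on the squared error (w*x - y)**2."""
    grad = 2 * (w * x - y) * x  # derivative of the loss with respect to w
    return w - lr * grad

w = 0.5          # the pre-existing number (initial weight)
x, y = 1.0, 2.0  # one made-up (input, target) pair standing in for training data
history = [w]
for _ in range(5):
    w = sgd_step(w, x, y)  # each step nudges the weight a little toward the target
    history.append(w)

print(history)
```

    Only the nudged weight is kept after each step. The example itself is used to compute the gradient and is not written into the model.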

    You are wrong. Bring on the downvotes, uninformed haters.

    FYI, I also agree sampling music should be fine for artists.

    • fluxion@lemmy.world
      12 days ago

      Yes, weights for individual words/phrases/tokens which, given a particular prompt or set of keywords, might reproduce the original training data almost in its entirety. Hence why it is so obvious when these models have been trained on copyrighted material.
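
      A deliberately tiny sketch of that point, assuming a bigram count table in place of neural weights (a big simplification, not how an LLM is built). The helper names `train_bigram` and `generate` and the training sentence are made up for illustration. The table stores only which word tends to follow which, never the sentence as a whole, yet a matching prompt walks it back out.

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count how often each token follows each other token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts

def generate(counts, prompt, max_tokens=20):
    """Greedy decoding: always pick the most frequent next token."""
    out = [prompt]
    # Stop at the length cap or when the last token has no known successor.
    while len(out) < max_tokens and counts.get(out[-1]):
        out.append(counts[out[-1]].most_common(1)[0][0])
    return out

# Made-up training text; every word is unique so greedy decoding cannot loop.
training_text = "the quick brown fox jumps over a lazy dog".split()
model = train_bigram(training_text)

generated = generate(model, "the")
print(" ".join(generated))
```

      With so little data the model has nothing else to say, so the prompt "the" regenerates the training sentence. Large models trained on huge corpora are far less extreme, but the mechanism by which heavily repeated text can resurface is similar in spirit.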

      Similarly, I don’t digitally store music in my head verbatim. I store some fuzzy version that I can still reproduce fairly closely when prompted, and I’d still get sued if I charged money for performing or recording it, because the “weightings” in my neurons are just an implementation detail of how my brain works, not some active or purposeful attempt to transform the music in any appreciable way.