• ultranaut@lemmy.world
    link
    fedilink
    English
    arrow-up
    12
    ·
    3 hours ago

    Where does the training data come from seems like the main issue, rather than the training itself. Copying has to take place somewhere for that data to exist. I’m no fan of the current IP regime but it seems like an obvious problem if you get caught making money with terabytes of content you don’t have a license for.

    • FaceDeer@fedia.io
      link
      fedilink
      arrow-up
      1
      ·
      2 hours ago

      A lot of the griping about AI training involves data that’s been freely published. Stable Diffusion, for example, trained on public images available on the internet for anyone to view, but led to all manner of ill-informed public outrage. LLMs train on public forums and news sites. But people have this notion that copyright gives them some kind of absolute control over the stuff they “own” and they suddenly see a way to demand a pound of flesh for what they previously posted in public. It’s just not so.

      I have the right to analyze what I see. I strongly oppose any move to restrict that right.