• 0 Posts
  • 7 Comments
Joined 1 year ago
cake
Cake day: July 1st, 2023

help-circle


  • And you’re absolutely right about that. That’s not the same thing as LLMs being incapable of constituting anything written in a novel way, but that they will readily with very little prodding regurgitate complete works verbatim is definitely a problem. That’s not a remix. That’s publishing the same track and slapping your name on it. Doing it two bars at a time doesn’t make it better.

    It’s so easy to get ChatGPT, for example, to regurgitate its training data that you could do it by accident (at least until someone published it last year). But, the critics cry, you’re using ChatGPT in an unintended way. And indeed, exploiting ChatGPT to reveal its training data is a lot like lobotomizing a patient or torture victim to get them to reveal where they learned something, but that really betrays that these models don’t actually think at all. They don’t actually contribute anything of their own; they simply have such a large volume of data to reorganize that it’s (by design) impossible to divine which source is being plagiarised at any given token.

    Add to that the fact that every regulatory body confronted with the question of LLM creativity has so far decided that humans, and only humans, are capable of creativity, at least so far as our ordered societies will recognize. By legal definition, ChatGPT cannot transform (term of art) a work. Only a human can do that.

    It doesn’t really matter how an LLM does what it does. You don’t need to open the black box to know that it’s a plagiarism machine, because plagiarism doesn’t depend on methods (or sophisticated mental gymnastics); it depends on content. It doesn’t matter whether you intended the work to be transformative: if you repeated the work verbatim, you plagiarized it. It’s already been demonstrated that an LLM, by definition, will repeat its training data a non-zero portion of the time. In small chunks that’s indistinguishable, arguably, from the way a real mind might handle language, but in large chunks it’s always plagiarism, because an LLM does not think and cannot “remix”. A DJ can make a mashup; an AI, at least as of today, cannot. The question isn’t whether the LLM spits out training data; the question is the extent to which we’re willing to accept some amount of plagiarism in exchange for the utility of the tool.


  • Your quote never appears anywhere in any of those citations.

    That the Bible–a collection of religious texts, many of them advancing directly or indirectly the ethnic and national interests of their authors’ people groups–would have stuff in it about killing people for lots of reasons is no surprise.

    But your purported origin of the common proverb in the Bible is a fabrication. It’s not in there, anywhere.

    Also Christianity doesn’t advocate killing non-believers as a matter of doctrine. Plenty of Christians have done that historically, but it’s not a teaching of the religion, and it’s never advocated in the Bible, anywhere.

    There’s plenty to criticize Christians for. I don’t understand why you felt you needed to make something up.




  • Hear, hear. This isn’t a case of Mercedes selling an upgrade. It’s more akin to selling the car pre-booted and then demanding a monthly payment to remove it under threat of returning to re-apply it if a payment is missed. It’s absolutely a protection racket. Sure would be a shame if something happened to those fancy features we installed.

    The good news is that the companies who will float this first are the ones most likely to do business with politicians, and unfortunately I’m cynical enough to believe that the best way to get regulation in place is to personally inconvenience the decision-makers. I hope that results in action.

    If it doesn’t, well, the next step is self-help. If we’re changing the definition of private property, it’s only so long before people begin questioning whether there’s any point in having private property at all.