Malus, which is a piece of “satire” but also fully functional, performs a “clean room” clone of open source software, meaning users could then sell, redistribute, etc. the software without crediting the original developers. But I have a hard time with the “clean room” argument since the LLM doing the behind-the-scenes work has already ingested the entire corpus of open source software – and somehow the output of the LLMs isn’t considered a derivative work.

    • Naich@piefed.world
      link
      fedilink
      English
      arrow-up
      6
      ·
      4 hours ago

      It’s satire which brings attention to the problem. Read the reviews on the site.

  • solrize@lemmy.ml
    link
    fedilink
    arrow-up
    26
    ·
    7 hours ago

    It’s obviously not clean room since the original code was in the training data.

      • slacktoid@lemmy.ml
        link
        fedilink
        English
        arrow-up
        6
        ·
        6 hours ago

        Omg Imagine if half of the case is just an ML course to teach the jury what training data is.

        • searabbit@piefed.social
          link
          fedilink
          English
          arrow-up
          2
          ·
          5 hours ago

          That sounds painful. Absolutely no shade to the lovely older person I overheard today say: “so how do you get the little folks on the screen to know what buttons I pressed?”

          But I’m guessing it could be similar to music copywrite law where jurors don’t have to understand audio engineering to know two samples sound the same.

          • slacktoid@lemmy.ml
            link
            fedilink
            English
            arrow-up
            2
            ·
            4 hours ago

            Nope the class will be dropped down to teach all the basics and then up to vector databases and word embeddings to understand the case.

          • slacktoid@lemmy.ml
            link
            fedilink
            English
            arrow-up
            2
            ·
            4 hours ago

            And you teach all of them the amount of data facebook takes from them. ALL of it.

  • Alex@lemmy.ml
    link
    fedilink
    arrow-up
    4
    ·
    6 hours ago

    It will be fun watching those users who first make the jump to the new project.