Shit in -> shit out 📤

  • kromem@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 year ago

    Well, the ideal would probably be to train a discriminator based on human ratings of generated outputs.

    Take generation 0 (G0), produce output which is accepted or rejected based on humans, train a discriminator to predict those ratings off output, and then use the combined accepted outputs from humans and trained discriminator to train G1.

    Repeat again for G1, G2, G3, etc.

    My guess would be that the end result would continue to get better and better rather than worse.

    The problem is if the diffusion model can’t properly reject weird hands or pupils, those magnify in subsequent rounds.

    But there’s likely adaptive and maladaptive tendencies in the diffusion model, and adding a halfway decent filter between human selection and synthetic selection of outputs separate from the diffusion model itself would effectively curb the magnification here.

    • Ryantific_theory@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 year ago

      It seems like a simple enough fix, though also setting a weird precedent. Instead of directly fixing things, just keep adding layers of machine learning to produce improved outputs.

      The future of AI isn’t spaghetti code, but spaghetti AI chains lol. Probably why people much smarter than me are the ones working on machine learning.