I wasn’t proposing only using curated AI-generated content. If the problem is the loss of “rare data” from the edges, then adding some AI-generated data to a data set that still includes that rare data shouldn’t be a problem.
The article doesn’t say that AI-generated data is somehow “infectious”, just that the data set becomes more and more limited with each cycle since rare information gets lost each time.
We’d need to test and see if AI-generated content that is curated by human quality assurance still causes MADness.
My suspicion is that would only slow down the degradation of the outputs, rather than stop it completely.
I wasn’t proposing only using curated AI-generated content. If the problem is the loss of “rare data” from the edges, then adding some AI-generated data to a data set that still includes that rare data shouldn’t be a problem.
The article doesn’t say that AI-generated data is somehow “infectious”, just that the data set becomes more and more limited with each cycle since rare information gets lost each time.