arXiv:2606.05168v1 Announce Type: cross Abstract: Training on synthetic data causes model collapse, but existing analyses treat this as single-chain degradation. In reality, the AI ecosystem involves cross-contamination: models ingest synthetic data from other models, produce new synthetic text, and contaminate shared corpora. We propose a...
Read the full article at the source.
Comments (0)
No comments yet. Be the first to comment!