• Drun@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    1 day ago

    It would be true if dataset’s composing mechanism was nothing but a blind internet scraping.

    But it’s not so simple. There’s many hundreds of quality and trust filters that were created solely to prevent such things. As I said, it’s a solvable issue, even if most folks here really believe that the smartest AI was buried somewhere in 2021