• mabeledo@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    12 hours ago

    I see that you don’t know what an algorithm is. Because every search engine before Google used algorithms, just not Pagerank.

    Tell me what you are seeing. This page you linked shows all top models for AI research at above 80% and the top ones at 90%.

    On a single benchmark, GPQA, that was first released in 2022 and is now widely considered to be on its way out. This is extremely common in LLM benchmarking: every couple years, old methodologies are weeded out because most recent LLMs score 90% or higher. This could be for many reasons, but more than likely LLMs are trained on the specific corpus these benchmarks test against, which is akin to taking an exam knowing the answers beforehand.

    The aggregates tell a very different story.

    On the NYT article, you missed where it says that on the latest Gemini version, more than half of the summaries are ungrounded, meaning that users couldn’t possibly verify the accuracy of the response given the linked site. They also often provide additional information that’s not true.

    I’m not even going to bother with the “I’m fine paying with my data” bit, because think that’s just a morally broken take.

    • Sabrinamycarpet@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      1
      ·
      8 hours ago

      I guess you just like being super pedantic because apparently you do know the difference between directory based searches and what came after it. Feel free to argue how yahoo and altavista was fine and the world really didn’t need Google search though.

      The aggregates tell a very different story.

      Tell me what aggregates you are seeing. I have no idea what you are looking at on this page that is suppose to prove your point on accuracy.

      Because right now you brought up a benchmark site and then just saying how all benchmarks are skewed. That’s fine and all but can you tell me what you are actually looking at so we can be on the same page??

      And again, at 75% or 70%, or whatever you imagine current accuracy is, what happened to all the memes about Gemini telling you to add bleach to pancakes or something? Low hanging karma posts just literally disappeared. Why do you think that is? A google search for “funny gpt answers” brings up results from 2024. Let that sink in.

      I’m not even going to bother with the “I’m fine paying with my data” bit, because think that’s just a morally broken take.

      I imagine it’s quite the burden being so morally superior to the vast majority of humanity.

      • mabeledo@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        7 hours ago

        JFC this is so tiring. I’m trying to explain the issues from a technical point of view and you dismiss all I’m saying like I’m crazy or I don’t know anything about the topic.

        I get that you like these summaries because you believe that they make your life easier, but the least you could do is educate yourself, read some books, anything.

        • Sabrinamycarpet@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 hours ago

          You’re trying to explain by ignoring my question regarding your source not once but twice.

          And also conveniently ignoring the actual point being made and instead focusing on whether or not its 75% or 85% or what is an algorithm

          And also dismissing arguments because its morally beneath you.