• NotMyOldRedditName@lemmy.world
    1 day ago

    In theory, costs could come down with each new hardware generation if we don't keep pushing models to the max extent of what the hardware can do while also pushing size.

    E.g. Claude Opus today, trained at a similar size and in a similar manner as today, will be cheaper to run on whatever next GPU comes out with higher speeds and processing capabilities, unless of course Nvidia raises the price substantially. Given the current situation I think Nvidia might do that, which would hamper this lowering of costs, but it should still be possible, if slower.

    E.g. 10 years from now it will be cheaper to run an Opus-like model. But 10 years from now everyone will want that era's equivalent of today's Mythos. That won't be cheaper.

    • MintyAnt@lemmy.world
      7 hours ago

      This has been stated since ChatGPT was released and has not happened. The video cards released specifically for LLM usage do not benchmark particularly better than the previous generation. And it’s still unbelievably expensive to run these cards and maintain the facility and, again, you only get like 3 or 5 years out of them! That’s a crazy investment lol