• msage@programming.dev
    link
    fedilink
    arrow-up
    2
    arrow-down
    1
    ·
    12 hours ago

    From what I know, local LLMs take minutes to process a single prompt, not seconds, but I guess that depends on the use case.

    But also games, dunno about maxing GPU in most games. I maxed mine for crypto mining, and that was power hungry. So I would put LLMs closer to crypto than games.

    Not to mention games will entertain you way more for the same time.

    • jsomae@lemmy.ml
      link
      fedilink
      arrow-up
      1
      ·
      edit-2
      1 hour ago

      Obviously it depends on your GPU. A crypto mine, you’ll leave it running 24/7. On a recent macbook, an LLM will run at several tokens per second, so yeah for long responses it could take more than a minute. But most people aren’t going to be running such an LLM for hours on end. Even if they do – big deal, it’s a single GPU, that’s negligible compared to running your dishwasher, using your oven, or heating your house.