What would be the cheapest and most cost-efficient way of self-hosting LLMs?

I have a mini PC running an AMD 5700U where I host some services, including Ollama and Open WebUI.

Unfortunately, ROCm support isn't quite there yet, especially for mobile GPUs like this one.

Surprisingly, prompts do work when Ollama is configured to use the CPU, but the speed is just... well, not good.
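
For reference, it's easy to put a number on "not good": Ollama's generate API returns token counts and timings, so a short script can measure CPU-only tokens per second. Below is a minimal sketch, assuming Ollama is listening on its default port and a model (here called `llama3`, adjust to whatever you have pulled) is available; using `num_gpu: 0` to keep all layers on the CPU is my assumption about the option name, so double-check it against the Ollama docs.

```python
# Rough tokens/s benchmark against a local Ollama instance (default port 11434).
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"

payload = {
    "model": "llama3",          # hypothetical model name; use one you have pulled
    "prompt": "Explain what an eGPU enclosure is in one paragraph.",
    "stream": False,
    "options": {"num_gpu": 0},  # assumption: 0 offloaded layers -> pure CPU inference
}

req = urllib.request.Request(
    OLLAMA_URL,
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(req) as resp:
    result = json.load(resp)

# The non-streaming response includes eval_count and eval_duration (nanoseconds).
tokens = result["eval_count"]
seconds = result["eval_duration"] / 1e9
print(f"{tokens} tokens in {seconds:.1f}s -> {tokens / seconds:.2f} tokens/s")
```

Running this before and after any hardware change makes "decent speed" comparable rather than a gut feeling.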

So, what would be a cheap and energy-efficient setup to run some kind of LLM for personal use while still getting decent speed?

I was thinking about getting an eGPU enclosure, but I'm not sure how solid that setup would end up being.

