Hacker News

I tried, and decided it is not worth it. llama.cpp with a 13B model fits into the RAM of my laptop, but it pushes the CPU temperature to 95 degrees within a few seconds and mightily sucks the battery dry. Besides, the results were slow and rather useless. GPT is the first cloud application I deliberately use to push computing and energy consumption off to an external host that is clearly more capable of handling the request than my local hardware.
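A rough back-of-envelope sketch of why a 13B model can fit into laptop RAM: the footprint is roughly parameter count times bits per weight. The 4-bit quantization assumed here is common for llama.cpp builds but is an assumption, not something the comment states.

```python
# Rough memory estimate for a 13B-parameter model.
# Assumption: 4-bit quantized weights (typical for llama.cpp GGUF
# files; not stated in the comment above).
params = 13e9
bits_per_weight = 4

# bytes = params * bits / 8; convert to GiB
gib = params * bits_per_weight / 8 / 2**30
print(f"~{gib:.1f} GiB")  # roughly 6 GiB, within a 16 GB laptop's RAM
```

At full 16-bit precision the same model would need around four times as much memory, which is why quantization is what makes laptop inference feasible at all.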

I sympathize with the idea of wanting to run a local LLM, but IMO this would require building a desktop with a GPU, plenty of horsepower, and silent cooling, and putting it somewhere in a closet in my apartment. Running LLMs on my laptop is (to me) clearly a waste of my time and its battery/cooling.



So I do actually want a really good games machine, and an AI worker box. Since I can't use inference output and play games at the same time, having one ludicrously over-specced desktop for both uses actually makes sense to me.



