The state-of-the-art chatbots are getting more and more functionality that is not just LLM inference. They can search the web, process files, and integrate with other apps. I think that's why most people will soon consider local LLMs insufficient.
This is a natural response to software enshittification. You can hardly find an iOS app that is not plagued by ads, subscriptions, or hostile data collection. Now you can have your own small utilities that work for you. This sort of personal software might be very valuable in a world where you are expected to pay $5 to click any button.
Yeah, sure, but have you considered that the actual cost of running these models is much greater than whatever you might be shelling out for the ad-free apps? You're talking to someone who hates the slopification and enshittification of everything, so you don't need to convince me of that. However, everything I've seen described in the replies to my initial comment - while cute, and potentially helpful on a case-by-case basis - does NOT warrant the amount of resources we are pouring into AI right now. Not even fucking close. It'll all come crashing down, taxpayers the world over will be left holding the bag, and for what? So that we can all have a less robust version of an app that already exists, but that has the colours we want and the button where we want it?
If AI cost nothing and wasn't absolutely decimating our economy, I'd find what you've shared cute. However, we are putting literally all of our eggs, and the next generation's eggs, and the one after that, AND the one after that, into this one thing, which, I'm sorry, is so far away from everything that keeps on being promised to us that I can't help but feel extremely depressed.
At this point it doesn't matter much whether we use AI or not: the apps are not selling, and they are being produced at an alarming rate.
The number of projects being submitted to Product Hunt is 4x what it was the year before.
The market is shrinking rapidly because more people now make their own apps.
Even when you make a typo and land on a random website, there is a good chance it's selling more AI snake oil. Yet none of these apps are feature complete, and they are easily beaten by apps made by solo developers in the 2010s (tldr & Sketchbook in the drawing space).
The only way to excite investors is to fake ARR by giving out free trials and selling before the first recurring charge occurs.
You are attempting to move the goalposts. There are two different points in this debate:
1) Modern LLMs are an inflection point for coding.
2) The current LLM ecosystem is unsustainable.
This submission's discussion is only about #1, which #2 does not invalidate. Even if the ecosystem crashes, open-source LLMs that leverage the same tricks Opus 4.5 does will just be used instead.
But it's only an inflection point if it's sustainable. When this comes crashing down, how many people are going to be buying $70k GPUs to run an open source model?
I said open-source models, not locally-hosted models. Essentially, more power to inference-only providers such as Groq and Together AI, which host the large-scale OSS LLMs and will be less affected by a crash as long as the demand for coding agents is there.
Ok, and then? Taking a one time discount on a rapidly depreciating asset doesn’t magically make this whole industry profitable, and it’s not like you’re going to start running a GB200 in your basement.
Checked your history. From a fellow skeptic: I know how hard it is to reason with people around here. You and I need to learn to let it go. In the end, the people at the top have set this up so that either way, they win. And we're down here telling the people at our level to stop feeding the monster, but we're told to fuck off anyway.
So cool bro, you managed to ship a useless (except for your specific use-case) app to your iphone in an hour :O
What I think this is doing is confronting people with the fact that most jobs in the modern economy (mine included, btw) are devoid of purpose. This is something that, as a person on the far left, I've understood for a long time. However, a lot (and I mean a loooooot) of people have never even considered this. So when they find that an AI agent is able to do THEIR job for them in a fraction of the time, they MUST understand it as AI being some finality of human ingenuity and progress, given the self-importance they've attributed to themselves and their occupation - all this instead of realizing that, you know, all of our jobs are useless, we all do the exact same useless shit which is extremely easy to replicate quickly (except for a select few occupations), and that's it.
I'm sorry to tell anyone who's reading this with a differing opinion, but if AI agents have proven revolutionary to your job, you produced nothing of actual value for the world before their advent, and still don't. I say this, again, as someone who beyond their PhD thesis (and even then) does not produce anything of value to the world, while being paid handsomely for it.
> if AI agents have proven revolutionary to your job, you produced nothing of actual value for the world before their advent, and still don't.
This doesn’t logically follow. AI agents produce loads of value. Cotton picking was and still is useful. The cotton gin didn’t replace useless work. It replaced useful work. Same with agents.
> I'm sorry to tell anyone who's reading this with a differing opinion, but if AI agents have proven revolutionary to your job, you produced nothing of actual value for the world before their advent, and still don't.
I agree with this, but I think my take on it is a lot less nihilistic than yours. I think people vastly undersell how much effort they put into doing something, even if that something is vibecoding a slop app that probably exists. But if people are literally prompting claude with a few sentences and getting revolutionary results, then yes, their job was meaningless and they should find something to do that they’re better at.
But what frustrates me the most about this whole hype wave isn't just that the powers that be have bet the entire economy on a fake technology; it's that it's sucking all of the air out of the room. I think most people's jobs can actually provide value, and there's so much work to be done to make _real_ progress. But instead of actually improving the world, all the time, money, and energy is being thrown into such a wasteful technology that is actively making the world a worse place. I'm sure it's always been like this and I was just too naive to see it, but I much preferred it when at least the tech companies pretended they cared about the impact their products had on society rather than simply trying to extract the most value out of the same 5 ideas.
Yeah, I do tend to have a rather nihilistic view on things, so apologies.
I really think we're just cooked at this point. The number of people (some great friends whom I respect) who have told me in casual conversation that if their LLM were taken from them tomorrow, they wouldn't know how to do their work (or some flavour of that statement) has made me realize how deep the problem is.
We could go on and on about this, but let's both agree to try to look inward more and keep our own things in order, while most other people get hooked on the absolute slop machine that is AI. Eventually, the LLM providers will need to start ramping up the costs of their subscriptions, and maybe then it will click for people that the shitty code that was generated for their pointless/useless app is not worth the actual cost of inference (which some conservative estimates put at thousands of dollars per month on a subscription basis). For now, people are just putting their heads in the sand and assuming that physicists will somehow find a way to use quantum computers to speed up inference by a factor of 10^20 in the next few years, while simultaneously slashing its costs (lol).
But hey, Opus 4.5 can cook up a functional app that goes into your emails and retrieves all outstanding orders - revolutionary. Definitely worth the many kWh and thousands of liters of water required, eh?
The studies focus on a single representative task, but in a thread about coding entire apps in hours as opposed to weeks, you can imagine the multiples involved in terms of resource conservation.
The upshot is, generating and deploying a working app that automates a bespoke, boring email workflow will be way, way, wayyyyy more efficient than the human manually doing that workflow every time.
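To make that concrete, the kind of bespoke tool being described can be tiny. Here is a minimal sketch using Python's stdlib imaplib; the server, credentials, and the "order" subject filter are all hypothetical placeholders, not anyone's actual workflow:

```python
# Hypothetical sketch: list unread emails whose subject mentions "order".
# Host, login, and search term are placeholders for illustration only.
import email
import imaplib

imap = imaplib.IMAP4_SSL("imap.example.com")
imap.login("me@example.com", "app-password")
imap.select("INBOX")

# Naive filter: unread messages with "order" in the subject line
_, data = imap.search(None, '(UNSEEN SUBJECT "order")')
for num in data[0].split():
    _, msg_data = imap.fetch(num, "(RFC822)")
    msg = email.message_from_bytes(msg_data[0][1])
    print(msg["From"], "-", msg["Subject"])

imap.logout()
```

Twenty-odd lines like this, run on a schedule, versus a human eyeballing an inbox every morning: that's the efficiency comparison being made.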
I want to push back on this argument, as it seems suspect given that none of these tools are creating profit, and so require funds / resources that are essentially coming from the combined efforts of much of the economy. I.e. the energy externalities here are monstrous and never factored into these things, even though these models could never have gotten off the ground if not for the massive energy expenditures that were (and continue to be) needed to sustain the funding for these things.
To simplify, LLMs haven't clearly created the value they have promised, but have eaten up massive amounts of capital / value produced by everyone else. But producing that capital had energy costs too. Whether or not all this AI stuff ends up being more energy efficient than people needs to be measured on whether AI actually delivers on its promises and recoups the investments.
EDIT: I.e. it is wildly unclear at this point that if we all pivot to AI that, economy-wide, we will produce value at a lower energy cost, and, even if we grant that this will eventually happen, it is not clear how long that will take. And sure, humans have these costs too, but humans have a sort of guaranteed potential future value, whereas the value of AI is speculative. So comparing energy costs of the two at this frozen moment in time just doesn't quite feel right to me.
> For now, people are just putting their heads in the sand and assuming that physicists will somehow find a way to use quantum computers to speed up inference by a factor of 10^20 in the next few years, while simultaneously slashing its costs (lol).
GPT-3 Da Vinci cost $20/million tokens for both input and output.
GPT-5.2 is $1.75/million for input and $14/million for output.
I'd call that pretty strong evidence that they've been able to dramatically increase quality while slashing costs, over just the past ~4 years.
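Back-of-envelope, using just the two price points above:

```python
# Implied cost drops, per million tokens, from the prices quoted above
gpt3_in, gpt3_out = 20.00, 20.00  # GPT-3 Da Vinci
new_in, new_out = 1.75, 14.00     # GPT-5.2

print(f"input:  {gpt3_in / new_in:.1f}x cheaper")   # ~11.4x
print(f"output: {gpt3_out / new_out:.1f}x cheaper") # ~1.4x
```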
Isn't that kind of related to the amount of money thrown at the field? If the economy gets worse for any reason, do you think we can still expect this level of cost cutting in the future?
> But hey, Opus 4.5 can cook up a functional app that goes into your emails and retrieves all outstanding orders - revolutionary. Definitely worth the many kWh and thousands of liters of water required, eh?
The thing is, in a vacuum this stuff is actually kinda cool. But hundreds of billions in debt-financed capex that will never see a return, and this is the best we've got? Absolutely cooked indeed.
It's not really a mystery why it happens. LLM APIs are non-deterministic from the user's point of view because your request is going to get batched with other users' requests. The batch behavior is deterministic, but your batch is going to be different each time you send your request.
The size of the batch influences the order of the individual floating-point operations, and because floating-point operations are not associative, the results can differ.
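The non-associativity itself is easy to demonstrate; a minimal Python illustration:

```python
# Floating-point addition is not associative: regrouping the same three
# values (as different batch shapes effectively do) changes the result.
a, b, c = 0.1, 1e20, -1e20

print((a + b) + c)  # 0.0 -- the 0.1 is rounded away against 1e20
print(a + (b + c))  # 0.1
```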
Not convinced. There is an obvious value in having more food or more products for almost anybody on Earth. I am not sure this is the case for software. Most people's needs are completely fulfilled with the amount and quality of software they already have.
> There is an obvious value in having more food or more products for almost anybody on Earth
Quite the opposite is true. For a large proportion of people, eating less would increase both the number of years they live and their quality of life.
I think the days when more product is always better are coming to an end - we just need to figure out how the economy should work.
But how about some silly software just for a giggle? Like "write a website that plays a fart sound when you push a button"? That can be a thing for the kids at school.
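For what it's worth, that one barely even needs an LLM; a sketch, assuming you have a fart.mp3 on hand:

```python
# Write a one-button page, then serve the folder it sits in.
# Assumes a fart.mp3 file exists next to index.html.
from pathlib import Path

Path("index.html").write_text(
    "<!doctype html>"
    "<button onclick=\"new Audio('fart.mp3').play()\">Press me</button>"
)
# Then run: python -m http.server 8000
```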
Do we have a better estimate? I don't think it's particularly difficult to get information from the occupied territories; the people there seem to use the Internet freely.
It's my understanding that this war is really not particularly bloody for civilians, as it is moving so slowly that the Russians are taking months to conquer pretty small towns and cities, and the civilians can usually evacuate or hide. The bombing campaign has some civilian casualties, but I mostly see headlines mentioning <5 dead overall per occasional huge wave of drones and missiles.
Yes, we have better estimates. In Mariupol, for example, estimates are above 20k civilians dead and murdered.
The UN cannot independently verify any of this, though, so it counts them as zero. The real figure should be at least double their estimate.
> It's my understanding that this war is really not particularly bloody for civilians, as it is moving so slowly that the Russians are taking months to conquer pretty small towns and cities, and the civilians can usually evacuate or hide.
Russia's advance has slowed to a crawl, yes, but the number of people murdered in the places where Russia does take control is still very high (see Mariupol as an example). Especially in the early days of the war, they took a lot of land.
> The bombing campaign has some civilian casualties, but I mostly see headlines mentioning <5 dead overall per occasional huge wave of drones and missiles.
5 per day is too low, as that would only add up to around 5.5k civilians, and even the UN's own count is higher than that.
They've been targeting civilians, including schools and hospitals, daily since the war started.
It's almost like both numbers are heavily biased in the UN. Almost. Surely such bias and possible corruption couldn't happen in the esteemed institution, known for its impartial and objective rulemaking. Right?
> It's almost like both numbers are heavily biased in the UN.
Yes, but in the opposite direction. It would be baffling if the UN's claim of 16,000 Ukrainian victims wasn't at least 100,000 in reality.
And, let's be honest, in Gaza, it does not seem realistic that there are even 50,000 civilian victims out of the 70,000 claimed in total. Don't get me wrong, that is a significant number of victims, but much less than reported. And on top of that, it doesn't seem realistic that none of those are militants. I'd guess, say, at least half of those are militants, not civilians.
And on top of that, the UN has no problem stating that of those 16,000, about 70 Ukrainian dead are not victims of Russia but of Ukrainian friendly fire. Again, of the 70,000 claimed dead in Gaza ... let's assume at the very least 300 are victims of Hamas friendly fire (probably more, since Hamas is no stranger to booby-trapping civilian buildings), rather than enemy action.
If you counted in Ukraine the way the UN counts in Gaza, Russia has killed some 400,000 people minimum. Maybe half a million, and of course climbing fast. No distinction between civilians and military, no distinction between accidents vs friendly fire ...
And I guess in the Gaza case I sort of understand. But why downplay Ukrainian victims? Why by a factor of 2 (not counting military deaths, which would make it at least a factor of 5 lower than the real number)? I guess if you discounted everything the same way in Gaza, the numbers would also drop by a factor of 5 there, but still.
My current job cold contacted me via LinkedIn. I use LI minimally, basically only to establish connections with my network, and it already gave me huge value back.
My skills are alright, nothing too crazy. I've had messages that were more spammy/scammy, but the volume was not crazy high, they were usually fairly obvious, and I dealt with them by simply ignoring them. I would say that the random chance of getting an interesting job offer, however small, is probably worth it for most professionals.
However, one thing I haven't mentioned is that I am based in Europe. My small sample of people reaching out to me suggests that US contacts are usually less serious (e.g. ghosting). Maybe the US experience is so much worse overall because of this?
I suspect you may be onto something. I'm a run-of-the-mill dev. At least in the past, when I was last looking for a job and more active on LinkedIn, the volume of contacts did not represent legitimate interest at all.
- LLMs are absolutely abysmal at PyTorch. They can handle basic MLP workflows, but that's more or less it. 0% efficiency gained.
- LLMs are great at short autocompletes, especially when the code is predictable. The typing itself is very efficient. Using vim-like shortcuts is now the slower way to write code.
- LLMs are great at writing snippets for tech I am not using that often. Formatting dates, authorizing GDrive, writing advanced regex, etc. I could do it manually, but I would have to check docs, now I can have it done in seconds.
- LLMs are great at writing boilerplate code, e.g. setting up argparse, printing the results in tables, etc. (see the sketch after this list). I think I am saving hours per month on these.
- Nowadays I often let LLMs build custom HTML visualization/annotation tools. This is something I would never do before due to time constraints, and the utility is crazy good. It allows my team to better understand the data we are working with.
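As promised above, a sketch of the argparse-plus-table boilerplate in question; the flags and metrics are made-up placeholders, not from any particular project:

```python
# Hypothetical boilerplate: CLI args plus a plain-text results table.
import argparse

def main():
    parser = argparse.ArgumentParser(description="Run the experiment.")
    parser.add_argument("--input", required=True, help="path to input file")
    parser.add_argument("--epochs", type=int, default=10)
    parser.add_argument("--verbose", action="store_true")
    args = parser.parse_args()

    results = [("accuracy", 0.91), ("f1", 0.88)]  # placeholder numbers
    print(f"{'metric':<12}{'value':>8}")
    for name, value in results:
        print(f"{name:<12}{value:>8.2f}")

if __name__ == "__main__":
    main()
```

Nothing in it is hard, but it's exactly the kind of rote scaffolding that eats half an hour when written by hand.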
The interesting question is how much more software we actually need. Will software be done one day, all built up, similar to railway networks? With LLMs, software engineering might get cheaper, but that can also lead to increased demand. A resource getting cheaper very often leads to demand skyrocketing, as it becomes accessible to new markets.
Definitely feels like a good amount of dev work is writing the same things over and over, in a different language, codebase, or context. And it seems like LLMs are particularly good at translating, specializing, and contextualizing across existing knowledge.
AI alignment is not a solved problem by any means. As long as LLMs hallucinate, they cannot be considered aligned. You can only be aligned if you have a zero probability of generating hallucinations. The two problems, alignment and hallucinations, can be considered equivalent.
A human who hates maths is different from one who adds up wrong because they think the first digit counts units, second digit how many tens, third digit how many twenties (as one of my uni lecturers recounted of her own childhood).
Alignment is, approximately, "are we even training this AI on the correct utility function?" followed up by the second question "even if we specified the correct utility function, did the AI learn a representation of that function or some weird approximation of that function with edge cases we've not figured out how to spot?"
With, e.g. RLHF, the first is "is optimising for thumbs-up/thumbs-down the right objective at all?", the second is "did it learn the preference, or just how to game the reward?"
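A toy illustration of that second failure, not how RLHF is actually implemented: suppose a proxy reward (predicted thumbs-up) and the true utility disagree on one candidate response, and we greedily optimise the proxy. All the numbers are made up.

```python
# Made-up numbers: proxy reward vs. true utility for three candidate replies
responses = {
    "honest-but-blunt":     {"proxy": 0.60, "true_utility": 0.90},
    "flattering-but-wrong": {"proxy": 0.95, "true_utility": 0.20},
    "hedged-and-vague":     {"proxy": 0.70, "true_utility": 0.50},
}

# Greedy optimisation of the proxy picks the reward-gaming answer
best = max(responses, key=lambda r: responses[r]["proxy"])
print(best)                             # flattering-but-wrong
print(responses[best]["true_utility"])  # 0.2 -- proxy maximised, objective missed
```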