I have no idea how an LLM company can make any argument that their use of content to train models is allowed that doesn't apply equally to distillers using an LLM's output.
"The distilled LLM isn't stealing the content from the 'parent' LLM, it is learning from the content just as a human would, surely that can't be illegal!"...
The argument is that converting static text into an LLM is sufficiently transformative to qualify for fair use, while distilling one LLM's output to create another LLM is not. Whether you buy that or not is up to you, but I think that's the fundamental difference.
The whole notion of 'distillation' at a distance is extremely iffy anyway. You're just training on LLM chat logs, but that's nowhere near enough to even loosely copy or replicate the actual model. You need the weights for that.
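To be concrete about the difference: training on chat logs is plain supervised fine-tuning on sampled text, while distillation proper matches the teacher's full next-token distribution, which you can only compute by running the teacher yourself. A rough PyTorch sketch of the two (the student/teacher callables and step functions here are hypothetical stand-ins, not anyone's real training code):

    import torch
    import torch.nn.functional as F

    def sft_step(student, token_ids, opt):
        # "Distillation at a distance": all you have are tokens copied from
        # another model's chat logs, so this is ordinary next-token training.
        logits = student(token_ids[:, :-1])          # (batch, seq-1, vocab)
        loss = F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                               token_ids[:, 1:].reshape(-1))
        opt.zero_grad(); loss.backward(); opt.step()
        return loss.item()

    def distill_step(student, teacher, token_ids, opt, temp=2.0):
        # Distillation proper: match the teacher's full output distribution,
        # which requires running the teacher forward, i.e. having its weights.
        with torch.no_grad():
            t_logits = teacher(token_ids[:, :-1])
        s_logits = student(token_ids[:, :-1])
        loss = F.kl_div(F.log_softmax(s_logits / temp, dim=-1),
                        F.softmax(t_logits / temp, dim=-1),
                        reduction="batchmean") * temp * temp
        opt.zero_grad(); loss.backward(); opt.step()
        return loss.item()

The second path is strictly richer (a full probability vector per position instead of one sampled token), which is why logs alone can imitate style but can't replicate the model.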
> The U.S. Court of Appeals for the D.C. Circuit has affirmed a district court ruling that human authorship is a bedrock requirement to register a copyright, and that an artificial intelligence system cannot be deemed the author of a work for copyright purposes
> The court’s decision in Thaler v. Perlmutter, on March 18, 2025, supports the position adopted by the United States Copyright Office and is the latest chapter in the long-running saga of an attempt by a computer scientist to challenge that fundamental principle.
I, like many others, believe the only way AI won't immediately get enshittified is by fighting tooth and nail for LLM output to never be copyrightable.
Thaler v. Perlmutter is a weird case because Thaler explicitly disclaimed human authorship and tried to register a machine as the author.
Whereas someone trying to copyright LLM output would likely insist that there is human authorship via the choice of prompts and careful selection of the best LLM output. I am not sure if claims like that have been tested.
The US Copyright Office has published a statement saying they see AI output as analogous to a human contracting the work out to a machine: the machine would hold the copyright, but it can't, so consequently there is none. Which is imho slightly surprising, since your argument about choice of prompt and output seems analogous to the argument that led to photographs being subject to copyright despite being made by a machine.
On the other hand, in a way the opinion of the US Copyright Office doesn't matter; what matters is what the courts decide.
It's a fine line that's been drawn, but this ruling says that AI can't own a copyright itself, not that AI output is inherently ineligible for copyright protection or automatically public domain. A human can still own the output from an LLM.
> I, like many others, believe the only way AI won't immediately get enshittified is by fighting tooth and nail for LLM output to never be copyrightable
If the person who prompted the AI tool to generate something isn't considered the author (and therefore doesn't deserve copyright), then does that mean they aren't liable for the output of the AI either?
I.e. if the AI does something illegal, does the prompter get off scot-free?
When you buy, or pirate, a book, you didn't enter into a business relationship with the author specifically forbidding you from using the text to train models. When you get tokens from one of these providers, you sort of did.
I think it's a pretty weak distinction: by separating the concerns (one company collects a corpus and then "illegally" sells it for training), you can pretty much exactly reproduce the acquire-books-and-train-on-them scenario. But in the simplest case, the EULA does actually make it slightly different.
Like, if a publisher pays an author to write a book, with the contract specifically saying they're not allowed to train on that text, and then they train on it anyway, that's clearly worse than someone just buying a book and training on it, right?
> When you buy, or pirate, a book, you didn't enter into a business relationship with the author specifically forbidding you from using the text to train models.
Nice phrasing, using "pirate".
Violating the TOS of an LLM is the equivalent of pirating a book.
Contracts can't exclude things that weren't invented when the contracts were written.
Ultimately it's up to legislation to formalize rules, ideally based on principles of fairness. Is it fair, in a non-legalistic sense, for all old books to be trainable-on but not LLM outputs?
Not really your point, but I think the skills to create these things take much longer to develop than it takes to produce chips and data centres.
So they couldn't really build any of these projects weekly since the cost of construction materials / design engineers / construction workers would inflate rapidly.
Worth keeping in mind when people say "we could have built 52 hospitals instead!" or similar. Yes, but not really... since the other constraints would quickly reveal themselves.
But by some definition my "Ctrl", "C", and "V" keys can build a C compiler...
Obviously I'm being facetious, but my point is: I find it impossible to judge how impressed I should be by these model achievements, since they don't show how the models perform on a range of out-of-distribution tasks.
Even that is underselling it; jobs are a necessary evil that should be minimised. If we can have more stuff with fewer people needing to spend their lives providing it, why would we NOT want that?
This is already hyperbolic; in most countries where software engineers or similar knowledge workers are widely employed, there are welfare programmes.
To add to that, if there is such mass unemployment in this scenario, it will be because fewer people are needed to produce things, and therefore everything will become cheaper... This is the best kind of unemployment.
So at best: none of us have to work again and will get everything we need for free. At worst, certain professions will need a career switch which I appreciate is not ideal for those people but is a significantly weaker argument for why we should hold back new technology.
If you were to rank all of the C compilers in the world and then rank all of the welfare systems in the world, this vibe-coded mess would be at approximately the same rank as the American welfare system. Especially if you extrapolate this narcissistic, hateful kleptocracy out a few more years.
Yeah, but who can be hurt by this? These are both private companies. So whose interests is he "conflicting" with? I'm sure the shareholders will raise it with him and/or bring a lawsuit if they aren't happy (they probably are happy).
What $$$?! The top tier of Apple One is £36.95 per month. If I spend even an extra 15 minutes a month self-hosting, then it’s immediately not worth it. (Not to mention self-hosting won’t be free.)
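Back-of-envelope on that claim: £36.95 ÷ 0.25 hours ≈ £148/hour is the break-even rate, so if you value your time above that, 15 minutes a month of upkeep alone already exceeds the subscription, before counting hardware and electricity.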
Also, for that price I get: 2TB cloud storage, Apple TV, Apple Music, news, workouts, and arcade, most of which cannot be self-hosted.
Economies of scale are real, it’s possible Apple makes a ton of money and the user is getting a good deal!
> The rendering engine is from-scratch in Rust with HTML parsing, CSS cascade, layout, text shaping, paint, and a custom JS VM.
If I cloned Pixar’s rendering library, called into it, and then added ‘built a renderer from scratch’ to my CV, this would be entirely dishonest…
I use LLMs often and don’t hate Cursor or think they’re a bad company. But it’s obvious they are being squeezed and have little USP (even less so than other AI players). Frankly, they are under extreme pressure to make up lies.
I don’t think I’d resist the pressure either, so not on a high horse here, but it doesn’t make it any less dishonest.
Interestingly, the UK PM (and allies) just blocked a would-be political rival, Andy Burnham, from standing as an MP.
One of the given reasons is that Burnham is currently mayor of Greater Manchester, and running a new election there would cost approx £4m(!!), which would be a huge waste of taxpayer money.
I was surprised that they even gave this as a faux reason, since it seems like the sort of money they would spend on replenishing the water coolers, or buying Bic pens, or... building a static website!
Tangentially, Burnham has a long history with these sorts of public-sector private vampires, having been up to his neck in PFI (of "£200 to change a lightbulb" fame) in his stint leading the NHS.
The fact that a huge amount of money is extracted from the UK government for no (or very little) value is a crying shame.
I know multiple people who work as consultants (hired via private agencies, paid for by Government) who have literally done nothing for six months plus.
They have no incentive to whistleblow, the agency employing them has no incentive to get rid of them as it takes a cut, and the government department hiring them is none the wiser because it has no technical knowledge or understanding of what's being carried out.
Being cynical, I would say it's because Burnham could potentially challenge Starmer. Less cynically, Labour has a big enough majority that they can afford to lose this by-election. The headache of replacing the mayor of Manchester is not worth it.
Why can't he just do both jobs? Boris did it, IIRC.
If memory serves, Dan Jarvis also did it, being both MP and mayor of the South Yorkshire city region or whatever it was called at the time.
It is fairly innately political. No Prime Minister has ever polled as low as Starmer and come back from it, or so is being said in the press. Burnham might be a smart electoral move, but he's not a plaything of the Labour right, so they kept him out.
That's not inconsistency in the rules, that's inconsistency in what being the mayor means. In Sheffield it means you show up wearing funny clothes every so often, in Greater Manchester it means you have a full-time job, a large budget, and actual responsibilities.
For our American brethren, it's like the difference between being the Mayor of NYC vs the Macy’s Thanksgiving Day Parade King.
It's actually the role of Police and Crime Commissioner that prevents them from being an MP simultaneously. In Greater Manchester (and London) the PCC role is combined with that of Mayor, but it isn't in most other city regions.
There's not much actual difference in the mayoral aspect of the roles - Jarvis was the Mayor of the South Yorkshire Combined Authority, not simply the mayor of Sheffield City Council.
Funny, I’m the same. I also like taking walks to think, but I’ve found that I must have my head pointing almost directly down (i.e. looking at my feet). It’s also how I stand thinking in the shower, with the warm water hitting my angled neck. Maybe there’s something beneficial about that neck position, or maybe it’s just habit!
I will also have conversations in my head during my walk; I’ve done this my whole life and I’m not sure to this day whether my lips move during them or not. In any case, I must get some funny looks with my head bolted to the ground, mumbling to myself…
As for the software: I would not want a camera on 24/7 (on any device; a compromise being my doorbell, which isn't cloud-connected). It would defeat the point of the small LED which informs you the camera is on (since it would always be on), and if the machine is compromised, it's a way for personal data to be exfiltrated.
Actually, I'd prefer a hardware kill switch on things like the camera and microphone.
Alas, I'm not alone in meditating and thinking while taking a shower.
It's one of the moments of my day when I recollect what happened, what I need to do, and what not to do.
The problem is that I can get quite lost during this phase, and hot water isn't cheap, so my SO is always threatening to put a big timer in the bathroom.
My pet hypothesis about why the shower is so often praised as a mindful place is that it has not so much to do with water and more to do with the fact that, for many people, life alternates between 1) constant social interaction and interruptions from other people and 2) bathroom time.
How many people these days have a dedicated home office, off limits to anyone else? How many partners sleep in different rooms?
Sure, perhaps the sensory experience plays some role, but if your bathroom is reliably the most interruption-free place for you, naturally you’d form a habit of catching up during the shower on all the “slow thinking” that is most negatively impacted by interruptions.
I’ve seen people with interruption-free solo hobbies (be that hiking in the woods, motorcycling, rock climbing, etc.) describe similarly mindful experiences, but unlike those, the shower is the lowest common denominator, and perhaps the one that happens most routinely.
In my case, though walks help declutter my mind somewhat, for deeper thoughts I have to write things down, sitting or lying in bed in the worst of positions. Thinking too deeply while walking only leaves me anxious in the end, as I tend to get sidetracked a lot in the conversation and always have to restart it over and over again.
I tried doing the same. Sometimes it made my understanding of things much clearer. However, most times I found it worked best when I had a clear idea on paper, either to validate the idea or when I needed an opinion. Otherwise, ChatGPT (in my case) built upon an idea that I hadn't thought through well and confused the shit out of me.
But that’s not the Turing test. In Turing’s setup, the human who can be fooled is explicitly called the “interrogator”.
To pass the Turing test the AI would have to be indistinguishable from a human to the person interrogating it in a back and forth conversation. Simply being fooled by some generated content does not count (if it did, this was passed decades ago).
I think state-of-the-art LLMs would pass the Turing test for 95% of people, if those people could (text) chat with them in a time before LLM chatbots became widespread.
That is, the main thing that makes it possible to tell LLM bots apart from humans is that lots of us have over the past 3 years become highly attuned to specific foibles and text patterns which signal LLM generated text - much like how I can tell my close friends' writing apart by their use of vocabulary, punctuation, typical conversation topics, and evidence (or lack) of knowledge in certain domains.
"The distilled LLM isn't stealing the content from the 'parent' LLM, it is learning from the content just as a human would, surely that can't be illegal!"...
reply