Someone tried this; I saw it on one of the Reddit AI subs. They were training a local model on whatever they could find that was written before $cutoffDate.
I think this is a meta-allusion to the theory that human consciousness developed recently, i.e. that people who lived before [written] language were not conscious because they did not actually think in language. It's a potentially useful thought experiment, because we've all grown up not only knowing highly expressive languages, but also knowing how to read and write.
However, primitive languages were... primitive. Were they primitive because people didn't know or understand the nuances their languages lacked? Or were those simply things that didn't get communicated (effectively)?
Of course, spoken language predates writing, which is part of the point. We know an individual can have a "conscious" conception of an idea if they communicate it, but that consciousness was limited to the individual. Once we have written language, we can perceive a level of communal consciousness of certain ideas. You could say that the community itself had a level of shared consciousness.
With GPTs regurgitating digestible writing, we've come full circle in terms of proving consciousness, and some are wondering... "Gee, this communicated the idea expertly, with nuance and clarity... but is the machine actually conscious? Does it think independently of the world, or is it merely a kaleidoscopic reflection of its inputs? Is consciousness real, or an illusion of complexity?"
I’m not sure why it’s so mind-boggling that people in the year 1225 (Thomas Aquinas) or 1756 (Mozart) were just as creative and intelligent as we moderns are. They simply had different opportunities than we have now. And what some of them did with those opportunities is beyond anything a “modern” person can imagine doing in the same circumstances. _A lot_ of free time over winter in the 1200s for certain people. Not nearly as many distractions either.
Saying early humans weren’t conscious because they lacked complex language is like saying they couldn’t see blue because they didn’t have a word for it.
Well, Oscar Wilde argues in “The Decay of Lying” that there were no stars before an artist could describe them and draw people’s attention to the night sky.
The basic assumption he attacks is that “there is a world we discover” vs “there is a world we create”.
It is a hard paradigm shift, but there is certainly reality in a “shared picture of the world”, and convincing people of a new point of view has real implications for how the world appears in our minds and what we consider “reality”.
It should be almost obligatory to state which definition of consciousness one is talking about whenever the topic comes up, because I, for example, don't see what language has to do with our ability to experience qualia.
Is it self-awareness? There are animals that can recognize themselves in a mirror, and I don't think all of them have a form of proto-language.
Web search often tanks the quality of MY output these days too. Context clogging seems a reasonable description of what I experience when I try to use the normal web.
THIS. I do my best work after a long vigorous walk and contemplation, while listening to Bach sipping espresso. (Not exaggerating much.) If I go on HN or slack or ClickUp or work email, context is slammed and I cannot do /clear so fast. Even looking up something quick on the web or an LLM causes a dirtying.
I feel the same. LLMs using web search ironically seem to have less thoughtful output. Part of the reason for using LLMs is to explore somewhat novel ideas, and with web search they align too strongly to the results rather than to the overall request, making them a slow search engine.
That makes sense. They're doing their interpretation on the fly, for one thing. For another, just because they now have data that is 10 months more recent than their cutoff, they don't have any of the intervening information. That's gotta make it tough.
Web search is super important for frameworks that are not (sufficiently?) in the training data. o3 often pulls info from Swift forums to find and fix obscure Swift concurrency issues for me.
In my experience none of the frontier models I tried (o3, Opus 4, Gemini 2.5 Pro) was able to solve Swift concurrency issues, with or without web search. At least not sufficiently for Swift 6 language mode. They don’t seem to have a mental model of the whole concept and how things (actors, isolation, Tasks) need to play together.
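For the curious, a minimal toy sketch (mine, not from any model) of the kind of thing Swift 6 language mode flags, where actors, isolation, and Sendable all have to play together:

    // A non-Sendable reference type sent into an actor while the
    // caller keeps using it: Swift 6 strict concurrency rejects this.
    final class Counter {          // mutable class, not Sendable
        var value = 0
    }

    actor Store {
        func bump(_ counter: Counter) {
            counter.value += 1
        }
    }

    func run() async {
        let counter = Counter()
        let store = Store()
        // error: sending 'counter' risks causing data races, because
        // the caller still touches it on the next line.
        await store.bump(counter)
        print(counter.value)
    }

The right fix depends on intent (make Counter an actor, make it a Sendable value type, or keep it within one isolation domain), which is exactly the "how things play together" judgment the models seem to lack.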
I haven't tried ChatGPT web search, but my experience with Claude web search is very good. It's actually what sold me and made me start using LLMs as part of my day to day. The citations they leave (I assume ChatGPT does the same) are killer for making sure I'm not being BSd on certain points.
It depends on the question. I was having a casual chat with my dad and we wondered how Apple's revenue was split amongst products; it was just something to chat about, so I didn't check.
On the other hand, I got an overview of Postgres RLS and I checked the majority of those citations since those answers were going to be critical.
That’s interesting. I use the API, and there are zero citations with Claude, ChatGPT, and Gemini. Only Kagi Assistant gives me some, which is why I prefer it when researching facts.
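Could it be that citations only come back when the server-side web search tool is explicitly enabled? A rough sketch of what I mean against the Anthropic messages endpoint; the tool type string and model id are from memory of their docs, so treat both as assumptions to verify:

    import Foundation

    // Assumption: enabling Anthropic's server-side web search tool is
    // what produces cited results; a plain messages call never cites.
    func askWithWebSearch() async throws {
        let body: [String: Any] = [
            "model": "claude-sonnet-4-20250514",   // model id: check current docs
            "max_tokens": 1024,
            "messages": [["role": "user",
                          "content": "How is Apple's revenue split among products? Cite sources."]],
            "tools": [["type": "web_search_20250305",  // assumed tool version string
                       "name": "web_search",
                       "max_uses": 3]]
        ]
        var req = URLRequest(url: URL(string: "https://api.anthropic.com/v1/messages")!)
        req.httpMethod = "POST"
        req.addValue(ProcessInfo.processInfo.environment["ANTHROPIC_API_KEY"] ?? "",
                     forHTTPHeaderField: "x-api-key")
        req.addValue("2023-06-01", forHTTPHeaderField: "anthropic-version")
        req.addValue("application/json", forHTTPHeaderField: "content-type")
        req.httpBody = try JSONSerialization.data(withJSONObject: body)
        let (data, _) = try await URLSession.shared.data(for: req)
        print(String(data: data, encoding: .utf8) ?? "")  // citations arrive in the content blocks
    }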
What software do you use? The native Claude app? What subscription do you have?
Completely opposite experience here (with Claude). Most of my googling is now done through Claude: it can find, digest, and compile information much quicker and better than I'd do myself. Without web search you're basically asking an LLM to pull facts out of its ass; good luck trusting those results.
It still is: not all queries trigger web search, and it takes more tokens and time to do research. ChatGPT will confidently give me outdated information, and unless I know it’s wrong and ask it to research, it won’t know it is wrong. Having a more recent knowledge base can be very useful (for example, knowing who the president is without looking it up, or referencing newer Node versions instead of old ones).
The problem, which perhaps only looks easy to fix, is that the model will choose solutions that are a year old, e.g. treating database/logger versions from December '24 as new and usable in a greenfield project despite newer quarterly LTS releases superseding them. I try to avoid humanizing these models, but couldn't training/post-training teach them to actually respect a timestamp fed in via the system prompt? I've begged models to choose "new" dependencies after $DATE, but they all still snap back to 2024.
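The plumbing half of that is trivial, for what it's worth; the open question is whether post-training makes the model respect it. A sketch of the trivial half (the prompt wording and the idea of dating the system prompt are my own):

    import Foundation

    // Inject the real current date so the model can, in principle,
    // tell "a year old" from "new". Whether it respects this is the
    // open question above.
    let formatter = DateFormatter()
    formatter.dateFormat = "yyyy-MM-dd"
    let today = formatter.string(from: Date())

    let messages: [[String: String]] = [
        ["role": "system",
         "content": "Today's date is \(today). Your training data ends before this date. When recommending dependency versions, assume newer LTS releases may have superseded the ones you remember."],
        ["role": "user",
         "content": "Pick a logger version for a greenfield project."]
    ]
    // `messages` then goes in the body of an OpenAI-style
    // chat-completions request.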
The biggest issue I can think of is code recommendations with out-of-date versions of packages. Maybe the quality of code has deteriorated in the past year and scraping GitHub is not as useful to them anymore?
Knowledge cutoff isn’t a big deal for current events. Anything truly recent will have to be fed into the context anyway.
Where it does matter is for code generation. It’s error-prone and inefficient to try teaching a model how to use a new framework version via context alone, especially if the model was trained on an older API surface.
Still relevant, as it means a coding agent is more likely to get things right without searching. That saves time and money, and improves the accuracy of results.
It absolutely is. Even in coding, for example, new design patterns or language features aren't easy to leverage from context alone.
Web search enables targeted info to be "updated" at query time. But it doesn't get used for every query and you're practically limited in how much you can query.
Isn’t this an issue with, e.g., Cloudflare removing a portion of the web? I’m all for it from the perspective of people not having their content repackaged by an LLM, but it means that web search can’t check all sources.
Right now nothing affects the underlying model weights. They are computed at enormous expense during pretraining, adjusted incrementally during post-training, and then left untouched until the next frontier model is built.
Being able to adjust the weights will be the next big leap IMO, maybe the last one. It won't happen in real time but periodically, during intervals which I imagine we'll refer to as "sleep." At that point the model will do everything we do, at least potentially.
I had 2.5 Flash refuse to summarise a URL that had today's date encoded in it because "That web page is from the future so may not exist yet or may be missing" or something like that. Amusing.
2.5 Pro went ahead and summarised it (though it completely ignored a # reference and so summarised the wrong section of a multi-topic page; that's a different problem).
A funny result of this is that GPT-5 doesn't understand the modern meaning of "vibe coding" (maximising LLM code generation); it thinks it's "a state where coding feels effortless, playful, and visually satisfying" and offers content about adjusting IDE settings and templating.
Maybe OpenAI has a terribly inefficient data ingestion pipeline? (Wild guess.) Basically, taking in new data is tedious, so they do it infrequently and keep using old data for training.
Compare that to:
Gemini 2.5 Pro: knowledge cutoff Jan 2025 (3 months before release)
Claude Opus 4.1: knowledge cutoff Mar 2025 (4 months before release)
https://platform.openai.com/docs/models/compare
https://deepmind.google/models/gemini/pro/
https://docs.anthropic.com/en/docs/about-claude/models/overv...