It doesn't let you type text on the image. That's so important! The main thing I want to do when I take a screenshot is to circle something, or draw an arrow, then type some text about the item I am pointing to.
AI will not do math for us, but maybe eventually it will lead to another mainstream tool for mathematicians, alongside R, Matlab, Sage, GAP, Magma, ...
It would be interesting if in the future mathematicians are just as fluent in some (possibly AI-powered) proof verifying tool, as they are with LaTeX today.
Can AI solve “toy” math problems that computers have not been able to do? Yes. Can AI produce novel math research? No, not yet. So “AI will not do math for us” is only factually wrong under the weaker definition of “doing math for us”; under the stronger definition it isn’t wrong yet.
The bigger problem with that statement is that no timeline is specified. 1 year? Probably not. 10 years? Probably. 20 years? Very likely. 100 years? None of us here will be alive to be proven wrong, but I’ll venture that it’s a certainty.
This is a pretty strong position to take in the comments of a post where a mathematician declared the 5 problems he'd seen to be PhD level, and speculated that the real difficulty with switching from numerical answers to proofs will be finding humans qualified to judge the AI's answers.
I will agree that it's likely none of us here will be alive to be proven wrong, but that's in the 1 to 10 year range.
Solving PhD-level problems != producing new lines of research. PhD students are typically given problems their advisors know are solvable but difficult, and that might contribute in some way to a larger research program the student doesn’t yet understand or hasn’t earned the “right” to explore on their own. PhD students do frequently do their own research exploration, but that doesn’t involve solving these kinds of “PhD-level” problems, which just require knowing the available techniques and some creativity in applying them (as evidenced by the poster noting how they could solve some of these on their own fairly quickly).
I don’t see how my position is so exceptionally strong. I’m saying there’s a 55-70% probability that this happens in the 1-10 year time frame; at 1-20 years it goes up to 70-90%. It’s still important to leave room for doubt that we might be missing something or be unable to build it for a long time. Claiming otherwise seems like an even stronger position to me.
Yeah, I made the right reply, but to the wrong person. bubble12345 was confidently wrong, and bufferoverflow got downvoted for correcting him, but your caveats to his answer were fine; and that PDF is well within the sane range, even if mine is substantially tighter.
For logical reasoning tasks you should use pen and paper if necessary, not just say the first thing that comes to mind.
Comparing one-shot LLM responses with what a human can do in their head doesn’t make much sense. If you ask a person, they would try to work out the answer using a logical process but fail due to a shortage of working memory.
An LLM will fail at the task because it is trying to generate a response token by token, which doesn’t make sense here: the next digit can only be determined by following a sequence of logical steps, not by sampling from a probability distribution over next tokens. If the model were really reasoning, the probability of each incorrect digit would be zero.
And that's why OpenAI o1 will use chain of thought for this particular question rather than hallucinating an approximate answer. And it still works just like before, generating token by token.
No, but you can say "I don't know", "I can't do this in my head", "Why is this important?", "Let me get my calculator" or any other thing that is categorically more useful than just making up a result.
Claude 3.5 just... does the multiplication correctly by independently deciding to go step by step (I don't see a convenient way to share conversations, but the prompt was just "What is 1682671 * 168363?").
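(For reference, the exact product is easy to check outside any model; a quick Python snippet, nothing model-specific:)

    # Ground truth for the prompt above, so the model's answer can be checked.
    print(1682671 * 168363)  # 283299537573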
It's a weird distinction. Part of how they do that is by reading back what they said - someone trained to do so could essentially exploit the same trick themselves, doing the math in a simplified step-by-step way, if they had perfect recall of what they said or wrote.
In other words, don't the LLMs that do that kind of thing well, like gpt-o1, essentially also use 'a pen and paper'?
And this is very good comparison, because o1 indeed does multiply these numbers correctly...
Asking LLMs without built-in chain of thought is the same as asking people to multiply these numbers without pen and paper. And LLMs with chain of thought actually are capable of doing this math.
LLMs have pen and paper: it's their output buffer, capped to a few KBs, which is far longer than necessary to multiply the two numbers.
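To make the "pen and paper" idea concrete: the step-by-step working a chain-of-thought model writes into its output is essentially long multiplication by partial products. A minimal Python sketch of that decomposition (just an illustration of the arithmetic, not of how any model is implemented):

    def long_multiply(a: int, b: int) -> int:
        """Multiply the 'pen and paper' way: one partial product per digit
        of b, shifted by its place value, then summed at the end."""
        total = 0
        for place, digit in enumerate(reversed(str(b))):
            partial = a * int(digit) * 10 ** place
            print(f"{a} x {digit} x 10^{place} = {partial}")
            total += partial
        print("sum of partial products =", total)
        return total

    # Each printed line is the kind of intermediate step that would otherwise
    # have to fit in "working memory" (or in the model's output buffer).
    assert long_multiply(1682671, 168363) == 1682671 * 168363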
If you tell an LLM to explain how to multiply two numbers it will give a flawless textbook answer. However when you ask it to actually multiply the numbers it will fail. LLMs have all the knowledge in the world in their memory, but they can't connect that knowledge into a coherent picture.
They have codified human knowledge in human language, represented by arrays of numbers. They can't access that knowledge in any meaningful way, they can just shuffle numbers to give the illusion of cogency.
Do you think your inner monologue is any different? Because it sure as hell isn’t the same system as the one doing math, or recognising faces, or storing or retrieving memories, to name a few.
The comparison makes sense though. We're trying to build a simulated brain. We want to create a brain that can think about math.
And chain of thought is kind of like giving that brain some scratch space to figure out the problem.
This simulated brain can't access multiplication instructions on the CPU directly. It has to do the computation via its simulated neurons interacting.
This is why it's not so surprising that this is an issue.
LLMs are not simulating brains in any capacity. The words 'neural network' shouldn't be taken at face value. A single human neuron can take quite a few 'neurons' and layers to simulate as a 'neural network'.
Sure, but the basic idea of firing neurons is there, and the connection of these "neurons" to a neural network like an LLM does not allow the network to perform computations directly.
The level of detail of the simulation has little bearing on this. And in fact whether you call it a simulation or something else doesn't matter either. Understanding that the LLM does not compute by using the CPU or GPU directly is what's necessary to understand why computation is hard for LLMs.
Does it have an understanding of the strict rules that govern the problem, and that it needs to produce a result in total accordance with them? (Accordance that is not a matter of degree, but boolean.)
i.e., can it apply a function over a sentence?
The answer is sometimes. Typically it will have forgotten the rules you've given it by the time they'd be useful, because of the context limits of LLMs. Either way, you basically need to be able to tell that it's hallucinating so you can keep applying more rules.
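To make "apply a function over a sentence" concrete, here is the kind of strict, mechanical rule being asked about (the rule itself is made up purely for illustration). The point is that checking the output is boolean: it either satisfies the rule or it doesn't.

    def rule(sentence: str) -> str:
        # An arbitrary strict rule: uppercase every third word.
        words = sentence.split()
        return " ".join(w.upper() if i % 3 == 2 else w for i, w in enumerate(words))

    # The check is pass/fail, not "mostly right":
    out = rule("the quick brown fox jumps over the lazy dog")
    assert out == "the quick BROWN fox jumps OVER the lazy DOG"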
Not accurate: he published relatively few papers (fewer than 20), but several in top journals like the Journal of the AMS. His papers have also been cited plenty.
You are right in the sense that solvability by radicals has no practical importance, especially when it comes to calculations.
It is just a very classical pure math question, dating back hundreds of years. Its solution led to the development of group theory and Galois theory.
Group theory and Galois theory are in turn foundational in all kinds of areas.
Anyway, so why care about solvability by radicals? To me the only real reason is that it's an interesting and natural question in mathematics. Is there a general formula for solving polynomials, like the quadratic formula? The answer is no - why not? When can we solve a polynomial in radicals, and how?
And so on. If you like pure math, you might find solvability by radicals interesting. It's also a good starting point and motivation for learning Galois theory.
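To make the question concrete (these are standard facts, not claims from the thread): degree-2 polynomials always have roots expressible in radicals, while degree 5 and above do not in general.

    % Degree 2: the quadratic formula gives the roots in radicals.
    ax^2 + bx + c = 0 \quad\Longrightarrow\quad x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}

    % Abel--Ruffini: for degree >= 5 there is no such general formula.
    % Galois theory explains why: a polynomial is solvable by radicals iff its
    % Galois group is solvable, and e.g. x^5 - 4x + 2 has Galois group S_5,
    % which is not solvable.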
That's a common myth. See this paper referenced in the wikipedia article:
Rothman, Tony (1982). "Genius and Biographers: The Fictionalization of Evariste Galois". The American Mathematical Monthly. 89 (2): 84–106. doi:10.2307/2320923. JSTOR 2320923
Quite the clickbait. You can only access it from the pay site, unless you can get a school library to access it, which I will do. Only the first page is available for free.
The Snipping Tool works for all of this.