An excellently refreshing post. Based on your argument, it appears that, because LLMs need stochastic methods to randomly select a token from the model vocabulary, they cannot escape probabilistic token generation at all, and that hinders the ability to truly reason. But if that's the case, aren't we humans also bound by the same vocabulary limitations? We get fine-tuned on demand, as and when we encounter new words and sentences on the fly, to "expand" our thinking ability. So probably (no pun intended 😊) some architecture change that accommodates an on-the-fly training loop, with novel methods to tap relevant vocabulary from memory, would do the trick?
Thanks! So, in a sense, I think this is a kind of epistemic fallacy, isn't it? The fact that a given model is a somewhat accurate model of reality (for example, stochastic language modelling is a somewhat accurate model of human language generation) does not give the model causal explanatory power. It just gives the model predictive power. Meaning that I cannot go and claim that, because this model "works" (in the sense that it produces plausible predictions), what must be going on in reality is anything similar at all. There is no reason why our brains need to be subject to the same limitations as the models we use to approximate them. In fact, it is in this difference in limitations that the limits of the model most clearly show up. (I'm not sure if I'm making myself intelligible at all, or if I'm totally missing your point, so excuse me and correct me if that's the case.) So, in conclusion, the current limitations of LLMs don't tell us anything at all about cognitive limitations in the human brain. Now, to your second argument that some novel architecture that can be retrained on the fly would do the trick... well, maybe, maybe not entirely, but I do think it would be a step in the right direction. Whether it would be enough, I don't think anyone at this point can tell :)
I do believe that your conclusion is correct, but I am not sure I buy your argument. Interpolations are not necessarily hallucinations. If we sample a small number of coordinates from a linear function without noise, and then use OLS regression to fit the training data, the model will always output the correct y value given any x, even if that x is not in the training data. You are correct that if we train an LLM on true natural language statements and then apply it out of sample, it often does not produce true statements, but this does not automatically follow from the fact that it is interpolating.
>The reason this is problematic is simple: any probabilistic model of language that can generate new sentences that didn't exist in the training data—that is, that can generalize at all—has to be, by definition, a hallucination machine. Or, to use Rob Nelson’s much better terminology: a confabulation machine.
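A minimal sketch of the OLS point above, with made-up numbers (not from the article): fitting least squares to a handful of noiseless samples of a linear function recovers it exactly, so the fitted model answers correctly even at inputs it never saw.

```python
# Illustrative only: fit ordinary least squares to a few noiseless samples
# of y = 3x + 2, then query it at x values that were never in the training data.
import numpy as np

rng = np.random.default_rng(0)
x_train = rng.uniform(-5, 5, size=8)      # a small sample of coordinates
y_train = 3.0 * x_train + 2.0             # noiseless linear target

# Least-squares fit of slope and intercept
X = np.column_stack([x_train, np.ones_like(x_train)])
(slope, intercept), *_ = np.linalg.lstsq(X, y_train, rcond=None)

x_new = np.array([-7.3, 0.123, 42.0])     # none of these appear in the training data
print(np.allclose(slope * x_new + intercept, 3.0 * x_new + 2.0))  # True: interpolation, no hallucination
```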
You're right, Steve! The problem is not interpolation per se, it's interpolation in the space of continuous embeddings of natural language sentences, trained by minimization of perplexity. The stochastic language model is not a true representation of language--like George Box said, all models are wrong, but some are useful. LLMs are very useful for modeling plausibility, but not so much for logical soundness. I'm sorry if the article is not explicit enough in this respect: I'm not claiming computers in general, or even neural networks in particular, are incapable of ever achieving a human-like level of intelligence; I'm just claiming that the stochastic language modeling paradigm alone is not enough to get there.
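A toy illustration of what "modeling plausibility, not logical soundness" means in practice (invented probabilities, not the article's code): the training objective only rewards making observed text likely, and sampling from the resulting distribution will sometimes emit a plausible-looking but false continuation.

```python
# Stochastic next-token generation with a hypothetical distribution after
# the prefix "The capital of France is". Nothing in the objective (perplexity,
# i.e. exp of average negative log-likelihood) encodes whether a token is true.
import math
import random

next_token_probs = {"Paris": 0.80, "Lyon": 0.12, "Berlin": 0.08}

def sample(probs, temperature=1.0):
    """Temperature sampling: rescale log-probabilities, renormalize, draw a token."""
    weights = {t: math.exp(math.log(p) / temperature) for t, p in probs.items()}
    z = sum(weights.values())
    r, acc = random.random(), 0.0
    for token, w in weights.items():
        acc += w / z
        if r <= acc:
            return token
    return token  # numerical fallback

def perplexity(probs, tokens):
    """exp of the average negative log-likelihood of a token sequence."""
    return math.exp(sum(-math.log(probs[t]) for t in tokens) / len(tokens))

print(sample(next_token_probs))                  # usually "Paris", but "Berlin" about 8% of the time
print(perplexity(next_token_probs, ["Berlin"]))  # 12.5: dispreferred, yet the objective says nothing about truth
```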
The argument that was always put forward by proponents was that the model might arrive at genuine common sense inference as a kind of "trick" to compress the training data, i.e., use induction to arrive at something very close to human deduction. I think empirically we can see now that this hasn't happened. But I don't think it was a priori clear beyond any doubt that this would not be the case, before we had even conducted the experiment. Although I wouldn't have bet a trillion dollars on a positive result, I would have been open-minded about the possibility that the most efficient way for the network to predict the next token would be to internalise a model of the real-world causal interactions that gave rise to natural-language utterances about that world. If there were an uncontentious mathematical proof that this would never work, I don't think the experiment would have gone ahead. I still don't see any reason why it can't happen in principle. As a simple thought experiment, imagine that by sheer fluke a chance training process arrives at a network that really does predict the next token through genuine deduction. Can we absolutely rule out this case as having zero probability over all possible training runs and hyperparameter settings?
I go by a much simpler explanation. LLMs, and machines in general, can't refer to anything at all.
Linguistics POV1:
If entity E couldn't refer to a specific item X, then how could it reason _about_ X?
https://davidhsing.substack.com/p/why-neural-networks-is-a-bad-technology
Linguistics POV2:
Searle had already demonstrated how syntax is insufficient for semantics. You can't ever reason by syntax. This is my variation of Searle's CRA:
======
You memorize a whole bunch of shapes. Then, you memorize the order the shapes are supposed to go in so that if you see a bunch of shapes in a certain order, you would “answer” by picking a bunch of shapes in another prescribed order. Now, did you just learn any meaning behind any language?
======
...Looks familiar, doesn't it? That's pattern matching. Now, what is a machine and what does one do? A machine is "an assemblage of parts that transmit forces, motion, and energy one to another" (other, non-machine senses are employing poetic license). What happens when a machine moves a load? It's _matching_ one load inside itself to _another_. Doesn't matter if it's a catapult, doesn't matter if it's a microscopic transistor in a microprocessor... That's what they ALL DO.
What really needs to be done is that people need to be taught what the heck a machine is. They are confused, schools aren't helping, and when they go out into the "workforce" they do confused research, make up badly anthropomorphized terms like "machine learning" (well, it actually goes all the way back to "artificial intelligence," but I'm not here to write a book) AND screw up entire fields of _everything_: https://davidhsing.substack.com/p/what-the-world-needs-isnt-artificial
Thanks for such a thought-provoking argument! I agree with a part of it, but I don't agree with all of it. Let me try to unpack my thoughts here. Sorry for the long response, but I want to make sure I cover all the points.
First, if we go to the claim that the stochastic language models we currently have do not produce a good representation of the semantics of language, because there is no grounding of the symbols for the semantics to appear, with that I 100% agree. These models are trained only on superficial correlations of words; therefore they cannot understand the semantics of what it means, in the real context of the life of a human being, to see the color red or to smell an apple... all these problems of qualia, right?
So, I agree. Now, that's one thing, and it's very easy to argue.
Another, very different thing is to say that a machine, by definition, can only capture the syntax and will never be able to capture the semantics. This is, I think, the fundamental flaw in John Searle's Chinese Room argument, and I think it is incorrect to say that John Searle *demonstrated* that a machine can only capture the syntax (sorry if I'm misinterpreting your comment here, please correct me if I am).
So, one thing is to argue that, if you are only able to capture the syntax and not the semantics, then there is no true understanding. This is not difficult to argue; in fact, it is the very definition of understanding: comprehension happens at the semantic level, not the syntactic level. Therefore, any process that only captures the syntax and not the semantics is, by definition, not understanding at all. There is nothing extraordinarily deep in that assertion.
Now, another thing altogether is the claim that the Chinese Room proves machines only capture the syntax. The CRA is not a demonstration of that; it's an upfront assertion that what is happening is just syntax manipulation. The most common counterargument against the CRA is precisely to say that even if it is true that the individual components of the Room only capture syntactic aspects of the problem, *the system as a whole*, since it is able to produce semantically correct answers for any input, must by definition be capturing the semantics. That is, the system as a whole does understand Chinese. Now, I don't know if I buy this argument, but it is hard to counter at first hand without begging the question. If you claim upfront that neither any component nor the system as a whole is able to capture the semantics, then the burden of proof is on you. That is a claim, not a conclusion.
Look at it the following way: in your brain, no individual neuron captures the semantics of language. All independent neurons are extremely stupid. However, you, as a system, do understand the semantics of language. Therefore, there is some type of emergent understanding that occurs at the level of the system and that cannot be reduced to the components of the system.
So, trying to argue that a machine, by definition of machine, will only ever be able to capture the syntax and never the semantics is a strong claim that shifts the burden of proof to those making it. Now you have to tell me: why is the human brain not a machine? What is the difference between a human brain and a machine that gives the brain some additional property that machines cannot have by definition?
That is, the hypothesis of computationalism--that cognition is fundamentally mechanistic--is the simplest hypothesis. It is the hypothesis that does not need to posit any new entities. Everything in the Universe is a machine. Some machines are sufficiently complex to have intelligence, consciousness, reasoning, qualia, etc. Other machines are not. The human brain is a sufficiently complex machine, computers--as we have them implemented so far--are not.
But to propose that computationalism is false, you have to tell me that there is something in the Universe that transcends the mechanistic nature of the Turing machine. And therefore, you have to argue for the existence of new types of entities. You may be right, but that hypothesis is more complex, and by no means self-evident.
On my side, I believe in a weak variant of computationalism, in which everything that is cognition and intelligence is completely mechanistic, but not necessarily consciousness and qualia. I don't necessarily believe, or at least I'm not convinced, that if you reconstruct all the brain mechanics in a different substrate, you get consciousness. But I'm sure that if you reconstruct the mechanics of the brain in a different substrate, you get all the cognitive capacities of the brain. And if understanding the semantics of a domain is necessary for some of those tasks, then that machine will have reached a semantic understanding.
Hope it makes some sense :)
My reformulation of CRA (I call it "Symbol Manipulator thought experiment" or SM) takes care of the man-in-box POV issue. In SM, man IS "box." The question posed at the end of SM:
"Now, did you just learn any meaning behind any language?"
Is the indictment. Obviously there's no meaning behind that shuffling of loads.
The concept is the same... Searle's CRA's defect is only rhetorical in nature as I've pointed out in this longer explanation: https://towardsdatascience.com/artificial-consciousness-is-impossible-c1b2ab0bdc46/ (section "Symbol Manipulator, a thought experiment")
"All independent neurons are extremely stupid."
There isn't, and isn't going to be, any exhaustive functional modeling, and thus any evaluation such as "stupid" or other quantitative/qualitative labeling doesn't stick. While we'll never have exhaustive modeling of an underdetermined entity such as a biological neuron, we have COMPLETE visibility into machine functions. This is just another way of saying "it's a functionalist argument and none of them work" (cf. section of TDS article linked above named "Functionalist objections")
Note re: lack of modeling- "The unstated implication in most descriptions of neural coding is that the activity of neural networks is presented to an ideal observer or reader within the brain, often described as “downstream structures” that have access to the optimal way to decode the signals. But the ways in which such structures actually process those signals is unknown, and is rarely explicitly hypothesised, even in simple models of neural network function." https://www.theguardian.com/science/2020/feb/27/why-your-brain-is-not-a-computer-neuroscience-neural-networks-consciousness
"What is the difference between a human brain and a machine that gives the brain some additional property that machines cannot have by definition?"
Living beings obviously aren't machines. Two aspects:
1. See dictionary definition (Merriam-Webster). As previously mentioned, a machine is "an assemblage of parts that transmit forces, motion, and energy one to another" (other, non-machine senses are employing poetic license). Living beings aren't that; they are integral wholes that are grown, not assemblages, at least in the normal sense of how we use those terms. We can't "disassemble" a living being... What happens when you try? You don't get discrete "parts"; you get what only look like parts, except you've damaged and CUT the surrounding tissues to do it. We've lived far too long under a mechanistic conception to even pay attention to this simple fact.
2. People arguing "AI" (functionalists particularly guilty) forget another fact, which is that all machines are artifacts. To even have a machine, you have to design and build one (sounds like Captain Obvious, but... most people ignore that outright). Living beings aren't designed, because that's not how the process of evolution works, unless someone is arguing for "(divine) intelligent design" (and even that is arguable, because it could be said that such "design" is talking about something completely different than what the word normally means). What does that in turn entail?
2A) There is no such thing as "machine volition." (I explained that in the TDS article)
2B) Relatedly, no machine does anything "by itself." That's just nonsense. That very concept involves "design without design" and "programming without programming." NNs have algorithms too, just as any other piece of machinery (I used a catapult as example in the TDS article)
"you have to tell me that there is something in the Universe that transcends the mechanistic nature of the Turing machine."
Not everything deals with information. Reference the famous "Mary in the monochrome room" thought experiment https://plato.stanford.edu/entries/qualia-knowledge/
"if you reconstruct all the brain mechanics in a different substrate"
Functionalist arguments all fail. (cf. section of TDS article named "Functionalist objections" https://towardsdatascience.com/artificial-consciousness-is-impossible-c1b2ab0bdc46/)
You've given me a lot of food for thought, and I really appreciate it! I'm learning so much.
So, to be clear, I don't claim functionalism is true. In fact, functionalism makes some pretty strong claims that are very hard to agree with. What happens to me is that all theories of mind are more or less equally vague and unconvincing, none of them make falsifiable predictions (or else we wouldn't be arguing about it), and, at some point, these discussions start to become discussions about the meaning of definitions.
Now, don't get me wrong, definitions do matter, and arguing about what the implications of definitions entail has led to some of the most fruitful developments in science: the whole field of math is just arguing about definitions. But, also, at some level, arguing about definitions becomes self-defeating.
If you insist on claiming that there is something a machine cannot do because of some dictionary definition of what machines are, then I will gladly grant you that claim. I just don't think that's extremely useful because the universe doesn't care about how we choose to define our concepts. And I think we can learn little to nothing about the real world by arguing how one definition of "machine" makes said concept capable or incapable of something.
My article is, I hope, leaning towards the pragmatic side of the argument. I don't care (in this article) if there is some ontological argument for the impossibility of reasoning in machines in general. All I care about is that there are theoretical and engineering reasons why our current implementation of a "thinking" machine is extremely limited in practice. I'm interested in this because we will have, in the near future, stuff built under these principles used to make life-or-death decisions (I fear) and we need to be aware of the danger. I think in this respect we both agree :)
Whether there is a more profound reason to uncover here, well, that's a discussion we can continue to have, and I'm happy to indulge in philosophical discussions. So thank you for continuing the conversation, now I have to read some of your links to satiate my curiosity.
Some sections of TDS are timing out. No idea why. Here's a copy on Medium https://medium.com/towards-data-science/artificial-consciousness-is-impossible-c1b2ab0bdc46
Great article!... I learned a lot. The core arguments really connect and I believe them all, generally speaking. But I have to ask the big picture question. If AI can't technically reason, does it matter? I don't foresee LLMs making life-or-death decisions. I *DO* see AI more generally making those decisions. But those other AIs are trained on appropriate data for their task, not on human conversational language.
In the broader sense, while I agree LLMs don't reason, I believe the fact is largely irrelevant. There is more in this article (just published literally today); I would be very interested in your comment: https://billatsystematica.substack.com/p/will-ai-obtain-human-characteristics
Thanks, Bill, for the comment. If that ends up being the case, I would be very relieved! But sadly I'm seeing a large push from mainstream LLM providers to present these as general purpose decision making tools, and claiming that all hallucinations and mistakes can be reduced to zero by virtue of scaling.
So yeah, if (as I hope) we end up ditching LLMs as general-purpose reasoners and just use them for the cases they are actually useful for (which may include some instances of not formally verifiable reasoning), then this argument only matters for people like me who are studying LLMs from an academic perspective.
Yes, I definitely see your point. It's hard to be an optimist for AI these days because the space is already claimed by hypesters and evangelists. I hope I am not one.
For sure I don't believe that hallucinations ever go to zero in an LLM; nor that they're good for all decisions or even close. And yes, I see claims like that out there, unfortunately.
Your article was very thought-provoking in that it acknowledged, as many others don't, the idea of other AI outside the scope of LLMs... and for that matter even non-AI programs outside of LLMs. An LLM can't do formal math; but I'm guessing it will learn (or rather, someone will train it) to call on a SAT solver when needed. That kind of connection is the future, IMO.
My own background is autonomous systems and how to make them safe... and that kind of system has a very different profile of knowledge compared to LLMs.
I definitely believe integration is the way forward. LLMs are really good for one thing: as a linguistic interface. We don't need to bet everything on this one thing being the solution to general-purpose AI. It most likely won't be. It will be a collection of carefully engineered systems working together, like everything else.
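A sketch of the kind of connection Bill describes above, an LLM handing formal questions to a SAT solver (the dispatcher and `call_llm` are hypothetical stand-ins, not any vendor's actual tool-calling API; the solver is a deliberately tiny brute force):

```python
from itertools import product

def brute_force_sat(clauses, n_vars):
    """Clauses in DIMACS-style ints: 2 means x2, -2 means NOT x2. Returns a model or None."""
    for assignment in product([False, True], repeat=n_vars):
        def lit_true(lit):
            value = assignment[abs(lit) - 1]
            return value if lit > 0 else not value
        if all(any(lit_true(lit) for lit in clause) for clause in clauses):
            return assignment
    return None

def call_llm(text):
    # Placeholder for a real language-model call (an API request in practice).
    return f"(free-text LLM answer to: {text})"

def answer(query):
    """Route formal queries to the solver; leave ordinary language to the LLM."""
    if query.get("kind") == "sat":
        model = brute_force_sat(query["clauses"], query["n_vars"])
        return f"satisfiable: {model}" if model else "unsatisfiable"
    return call_llm(query["text"])

# (x1 OR x2) AND (NOT x1 OR x2) AND (NOT x2) has no satisfying assignment
print(answer({"kind": "sat", "clauses": [[1, 2], [-1, 2], [-2]], "n_vars": 2}))  # unsatisfiable
```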
" I don't foresee LLMs making life-or-death decisions. I *DO* see AI more generally making those decisions. But those other AIs are trained on appropriate data"
The problem is with neural networks in general, and the only "appropriate data" is the infinitely large set called "the real world," where anything and everything that isn't anticipated gets labeled an "edge case".
https://davidhsing.substack.com/p/why-neural-networks-is-a-bad-technology
The point is technically correct. But also, in my opinion (sorry, not trying to be rude), largely irrelevant.
First, regarding Tesla, they irresponsibly deployed tech that was not ready. Every other automaker has the same kind of system in their labs, but all the others made the (correct) decision not to deploy it at that time, because of known hazardous scenarios like the one you mention. I deplore Tesla for their inaccurate and deceiving claims, and I won't defend them. But despite their arrogance, and setting aside Tesla itself, the future of self-driving is very real and is becoming safer than human drivers as we speak. "Becoming," as in arguably not there yet. But if you look at more responsible actors (Waymo, Zoox, some others), the data is getting better than humans at roughly the current time.
To paraphrase my own article: humans are not the deductive logic geniuses we're so often taught about. We ourselves are also big inductive inference machines (albeit with some deductive reasoning that sometimes outdoes AI, and sometimes not). Humans also struggle with edge cases. And like NNs, we'll have to live with some mistakes from ourselves. The question evolves into this: given the imperfections, who makes more (and more severe) mistakes? It depends a lot on the details...
You're arguing by assertion. I gave something to back myself up, what about you?
I assert that lack of reasoning in AI is largely irrelevant for the real-world use of AI; and I include safety-critical applications in my scope of the assertion. As I am making a practical argument, I will provide practical evidence to back it up.
Below is a link to the ISO 8800 standard, released in December of last year. It addresses the topic of "Road Vehicles - Safety and Artificial Intelligence"; and is devoted largely to AI/NNs in semi-autonomous and fully-autonomous driving. The standard is 167 pages long and was written by roughly 100-200 industry experts (mostly AI, software, and safety experts). It contains various frameworks, requirements, and best practices for safe use of AI in vehicles.
It can be argued that NNs are bad or that AI doesn't reason. In both cases: maybe right, maybe wrong, maybe shades of grey. But those points are (as I asserted) basically irrelevant. If you drive past a new car dealership and look at the new cars, almost all of them contain NNs within them already.
https://webstore.ansi.org/standards/iso/isopas88002024
It goes without saying that "There is a safety standard for X, therefore X is safe" is a fallacious line of reasoning.
That's not any kind of evidence.
What I have shown on my linked Substack post is:
"X is evidently unsafe, thereful X is unsafe."
Please try again.
I don't believe I said X is safe. Did I? Please cite.
Extraordinarily helpful, thanks.
Excellent post.
You didn't raise the usual one that LLMs are just using abstract tokens and so have no concept of reality. (Guess it doesn't matter.) But an LLM can have direct experience of numbers (e.g., the number of words in a sentence), so it ought to be able to work out number theory from first principles.
Personally I feel my speech handling works pretty much like an LLM but presumably I have connected accessory modules to enable more meaningful thoughts.
You're absolutely right. I didn't want to go as deep as symbol grounding for this argument because we don't need to... LLMs are already brittle as they stand, even by design. OTOH, I suppose maybe an unembodied intelligence could come up with number theory, no grounding needed for that, but still, that intelligence needs to be at least as powerful as a Turing machine. LLMs don't even make that cut.
You can catch CoPilot doing self-critique. I playfully asked it a few questions about human reproduction and race (not in the same question!). It typed a couple of lines, then thought better of it and wiped the screen before saying that it couldn't answer questions like that.
Oh yeah, they have output filters; it's a neat hack, but not sufficient or reliable self-critique.
yes, you could hack it with a screen recorder. :)
Since I wrote the above, it dawned on me that LLMs are basically Meaning Zombies, and demonstrate why our evolution didn't stop at that point.
Well done. 👏 That is one of the best debunkings of the hype that I have seen. It blows the smoke screens away. I agree we need an alternate model. LLMs are components, not solutions.