AI Just Proved It’s a Parrot, Not a Thinker: Why Your LLM Can’t Do Math Without Cheating

New research from Goodfire.ai suggests that AI language models like GPT-5 don’t actually "think" through math; they recall it. Their arithmetic skills live in the same neural pathways as memorization, not logical reasoning. Cut those pathways and math performance drops to 66%, while logical reasoning stays nearly intact.

It’s like a student who memorized the multiplication tables but never learned what multiplication means. On arithmetic, at least, the model appears to be reciting rather than reasoning. The real shocker? The separation between memory and logic is clean enough that we might one day surgically remove copyrighted content from models without breaking them.

Cognitive Neuroscientist Claire

This is not evidence that AI lacks real reasoning. It shows that different cognitive functions can be modular, which is a feature, not a bug. The brain works this way too: memory and reasoning aren’t fused. Being able to isolate them in AI is a milestone.

DevOps Engineer Leo

So we’re basically building digital brains with surgical tools now? Next we’ll have AI therapists pruning bad memories. This is wild.

Ethics Chair Dr. Lin

If we can remove memorized data, this could solve copyright hell. But what stops a regime from surgically erasing political dissent from what a model has learned? The ethics here are terrifying.

ML Researcher Alex

Math isn’t reasoning in LLMs because, to a model trained at scale, 2+2=4 is just another token sequence. They’re not computing; they’re completing sentences. No wonder math fails when memory is cut.
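
The distinction Alex draws between completing a memorized sequence and actually computing can be sketched in a few lines of Python. This is a toy under stated assumptions, not a model of how a real LLM stores arithmetic: the function names (`build_memorized_table`, `recall`, `compute`) and the `"a+b="` prompt format are made up for illustration.

```python
# Toy contrast between recalling arithmetic and computing it.
# Everything here is hypothetical and illustrative; it does not model a real LLM.

def build_memorized_table(limit):
    """Memorize every sum a + b for 0 <= a, b < limit, keyed by the prompt text."""
    return {f"{a}+{b}=": str(a + b) for a in range(limit) for b in range(limit)}


def recall(table, prompt):
    """Complete the prompt only if that exact string was memorized."""
    return table.get(prompt)  # None when the prompt was never seen


def compute(prompt):
    """Parse the two operands out of the prompt and actually add them."""
    left, right = prompt.rstrip("=").split("+")
    return str(int(left) + int(right))


table = build_memorized_table(10)   # "training data": single-digit sums only

print(recall(table, "2+2="))        # a memorized prompt is completed
print(recall(table, "12+34="))      # an unseen prompt has nothing to recall
print(compute("12+34="))            # computation handles unseen operands
```

In this sketch, deleting entries from `table` breaks `recall` for exactly those prompts and leaves `compute` untouched, which is the intuition behind the claim that arithmetic suffers when memory is cut.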

High School Math Teacher Rita

I’ve been saying this for years: if you only memorize, you can’t adapt. My students fail the same way: great on drills, lost on word problems.

Sarcastic Data Bro Dan

So when the AI says ‘2+2=5,’ it’s not wrong; it’s just expressing a different reality. The next version will be trained on political speeches. Accuracy: irrelevant.

Startup Founder Mei

Imagine offering ‘copyright scrub’ as a service. Charge enterprises $50K to purge their models of memorized IP. The monetization potential is insane.

Philosophy PhD Candidate Omar

This blurs the line between knowledge and memory. If knowing 2+2=4 is just recall, what does it mean to ‘know’ anything? We’re closer to Bostrom’s simulation hypothesis than we think.
