2025-08-13 Science & Technology
|
AI LLMs' "simulated reasoning" abilities are a "brittle mirage," researchers find
|
[ArsTechnica] In recent months, the AI industry has started moving toward so-called simulated reasoning models that use a "chain of thought" process to work through tricky problems in multiple logical steps. At the same time, recent research has cast doubt on whether those models have even a basic understanding of general logical concepts or an accurate grasp of their own "thought process."

Similar research shows that these "reasoning" models can often produce incoherent, logically unsound answers when questions include irrelevant clauses or deviate even slightly from common templates found in their training data.
In a recent pre-print paper, researchers from the University of Arizona summarize this existing work as "suggest[ing] that LLMs are not principled reasoners but rather sophisticated simulators of reasoning-like text." To pull on that thread, the researchers created a carefully controlled LLM environment in an attempt to measure just how well chain-of-thought reasoning works when presented with "out of domain" logical problems that don't match the specific logical patterns found in their training data.
The results suggest that the seemingly large performance leaps made by chain-of-thought models are "largely a brittle mirage" that "become[s] fragile and prone to failure even under moderate distribution shifts," the researchers write. "Rather than demonstrating a true understanding of text, CoT reasoning under task transformations appears to reflect a replication of patterns learned during training."
|
Posted by Elmerert Hupens2660 2025-08-13 00:00
|
Posted by Grom the Affective 2025-08-13 02:53
|
Posted by Elmerert Hupens2660 2025-08-13 04:11
|
Posted by Elmerert Hupens2660 2025-08-13 06:01
|
Posted by Robin Burk 2025-08-13 07:27
|
Posted by Skidmark 2025-08-13 08:17
|
Posted by alanc 2025-08-13 09:10
|
Posted by Abu Uluque 2025-08-13 14:08
|
Posted by Grom the Affective 2025-08-13 14:53
|
Posted by Melancholic 2025-08-13 16:38
|
Posted by SteveS 2025-08-13 22:14
|