Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

sanitation@lemmy.today · 21 小时前

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

Communist@lemmy.frozeninferno.xyz · 5 小时前

No.

https://www.nature.com/articles/d41586-025-02343-x

It’s lying

zbyte64@awful.systems · 2 小时前

You know the “DeepMind and OpenAi models” is the hint that the LLM model is not the one doing the math. The LLM provides a hypothesis and the DeepMind model provides grounding or feedback on whether the hypothesis even makes sense or works.

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase

Just a moment...