Advanced AI models suffer a near-total collapse on classic psychology test as cognitive demands increase
www.psypost.org
sanitation@lemmy.today to Technology@lemmy.world · English · 21 hours ago · 60 comments
Communist@lemmy.frozeninferno.xyz · English · 5 hours ago
No.
https://www.nature.com/articles/d41586-025-02343-x
It’s lying
zbyte64@awful.systems · English · 2 hours ago
You know, the “DeepMind and OpenAI models” wording is the hint that the LLM is not the one doing the math. The LLM provides a hypothesis, and the DeepMind model provides grounding or feedback on whether the hypothesis even makes sense or works.