AIs like ChatGPT fall apart on the classic ‘Stroop’ psychological test, and that could hinder the achievement of artificial general intelligence



  • New study tasked AIs with tackling the ‘Stroop’ test
  • GPT and Claude performed very poorly compared to humans
  • There are nuances here, but the researchers maintain that improving this side of AI is crucial to achieving artificial general intelligence

A recently published study has pointed out a limitation of big-name AI models like ChatGPT. It has caused some controversy because the main research uses now-outdated versions of those models, but there are nuances here, and that does not make the findings irrelevant.

I’ll get to that later, but first, let’s look at the study itself, which was highlighted on Reddit (“New Study Reveals Top AI Models Completely Fail Classic ‘Stroop’ Psychological Attention Test”) and published via Oxford University Press in the journal PNAS Nexus.
