nature.com
Testing theory of mind in large language models and humans - Nature
Across the battery of theory of mind tests, we found that GPT-4 models performed at, or even sometimes above, human ... widely used to test theory of mind in humans [21-24]. However, the mixed