  1. May 20, 2024: Testing two families of large language models (LLMs) (GPT and LLaMA2) on a battery of measurements spanning different theory of mind abilities, Strachan et al. find that the performance of LLMs ...
  2. researchsquare.com

    May 20, 2024: Performance across Theory of Mind tests. Both GPT models performed well across most tests (see Fig. 1 A; 1 B), and showed impressive abilities to reason about social intentions, beliefs, and non-literal utterances. For each test, we conducted a series of two-way Bonferroni-corrected Wilcoxon tests comparing each LLM against human scores. (A minimal sketch of this kind of comparison appears after these results.)
  3. pubmed.ncbi.nlm.nih.gov

    Across the battery of theory of mind tests, we found that GPT-4 models performed at, or even sometimes above, human levels at identifying indirect requests, false beliefs and misdirection, but struggled with detecting faux pas. Faux pas, however, was the only test where LLaMA2 outperformed humans.
  4. Testing theory of mind in large language models and humans, James W. A. Strachan et al. [The snippet contains a flattened results table of per-test values for Human, GPT-4, GPT-3.5, and LLaMA2-70B on the false belief, irony, and faux pas tests; the column alignment is not recoverable from the extraction.]
  5. academia.edu

    This approach enabled us to reveal the existence of specific deviations from human-like behaviour that would have remained hidden using a single theory of mind test, or a single run of each test. Both GPT models exhibited impressive performance in tasks involving beliefs, intentions and non-literal utterances, with GPT-4 exceeding human levels ...
  6. blogs.upm.es

    May 29, 2024: In a recent groundbreaking study published in the renowned journal Nature, a team of researchers from the ASTOUND project consortium explored the theory of mind capabilities of humans and large language models (LLMs) such as GPT-4 and LLaMA2. This study, central to the ASTOUND project (GA 101071191), dives into how well these AI models can track and interpret human mental states, an ability ...
  7. Mar 18, 2024: Specifically, across a battery of Theory of Mind tests, we found that GPT models performed at human levels when recognising indirect requests, false beliefs, and higher-order mental states like misdirection, but were specifically impaired at recognising faux pas. Follow-up studies revealed that this was due to GPT's conservatism in drawing ...
  8. semanticscholar.org

    May 20, 2024: It is demonstrated that large language models exhibit behaviour that is consistent with the outputs of mentalistic inference in humans; the work also highlights the importance of systematic testing to ensure a non-superficial comparison between human and artificial intelligences. At the core of what defines us as humans is the concept of theory of mind: the ability to track other people's mental ...
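Result 2 describes the statistical procedure used in the study: two-sided, Bonferroni-corrected Wilcoxon tests comparing each LLM's per-test scores against human scores. The sketch below illustrates one way such a comparison could look in Python. The scores are hypothetical placeholders, and the use of scipy's mannwhitneyu (the rank-sum form of the Wilcoxon test, suited to independent samples) is an assumption, since the snippet does not specify which Wilcoxon variant the authors used.

```python
# Hedged sketch: two-sided, Bonferroni-corrected Wilcoxon rank-sum tests
# comparing each LLM's scores on a theory-of-mind test against human scores.
# All data below are hypothetical placeholders, not values from the study.
import numpy as np
from scipy.stats import mannwhitneyu  # rank-sum Wilcoxon for independent samples

rng = np.random.default_rng(0)

# Hypothetical per-participant accuracy scores on one theory-of-mind test.
human_scores = rng.uniform(0.6, 1.0, size=50)
llm_scores = {
    "GPT-4": rng.uniform(0.7, 1.0, size=15),
    "GPT-3.5": rng.uniform(0.5, 0.9, size=15),
    "LLaMA2-70B": rng.uniform(0.4, 0.8, size=15),
}

n_tests = len(llm_scores)  # number of comparisons, used for the Bonferroni factor
for model, scores in llm_scores.items():
    stat, p = mannwhitneyu(scores, human_scores, alternative="two-sided")
    p_corrected = min(p * n_tests, 1.0)  # Bonferroni: scale p by the number of tests
    print(f"{model}: U={stat:.1f}, corrected p={p_corrected:.3f}")
```

In practice each test in the battery would get its own set of comparisons, with the correction factor set to the total number of tests performed.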