Evaluating the Language Abilities of Large Language Models vs. Humans: Three Caveats

Leivada, E; Dentella, V; Guenther, F

doi:10.5964/bioling.14391

Datos identificativos

Identificador: imarina:9369657

Handle: https://hdl.handle.net/20.500.11797/imarina9369657

Autores: Leivada, E; Dentella, V; Guenther, F

Resumen:
We identify and analyze three caveats that may arise when analyzing the linguistic abilities of Large Language Models. The problem of unlicensed generalizations refers to the danger of interpreting performance in one task as predictive of the models' overall capabilities, based on the assumption that because a specific task performance is indicative of certain underlying capabilities in humans, the same association holds for models. The human-like paradox refers to the problem of lacking human comparisons, while at the same time attributing human-like abilities to the models. Last, the problem of double standards refers to the use of tasks and methodologies that either cannot be applied to humans or they are evaluated differently in models vs. humans. While we recognize the impressive linguistic abilities of LLMs, we conclude that specific claims about the
Otros:

Enlace a la fuente original: https://bioling.psychopen.eu/index.php/bioling/article/view/14391
Referencia de l'ítem segons les normes APA: Leivada, E; Dentella, V; Guenther, F (2024). Evaluating the Language Abilities of Large Language Models vs. Humans: Three Caveats. Biolinguistics, 18(), e14391-. DOI: 10.5964/bioling.14391
Referencia al articulo segun fuente origial: Biolinguistics. 18 e14391-
DOI del artículo: 10.5964/bioling.14391
Año de publicación de la revista: 2024-01-01
Entidad: Universitat Rovira i Virgili
Versión del articulo depositado: info:eu-repo/semantics/publishedVersion
Fecha de alta del registro: 2026-05-09
Autor/es de la URV: Dentella, Vittoria
Departamento: Estudis Anglesos i Alemanys
URL Documento de licencia: https://repositori.urv.cat/ca/proteccio-de-dades/
Tipo de publicación: Journal Publications
Autor según el artículo: Leivada, E; Dentella, V; Guenther, F
Acceso a la licencia de uso: https://creativecommons.org/licenses/by/3.0/es/
Áreas temáticas: Linguistics and language, Linguistics, Letras / linguística, Language and linguistics, Language & linguistics, Interdisciplinary research in the social sciences, Interdisciplinary research in the humanities, Filosofía, Experimental and cognitive psychology, Ciencias sociales, Ciencias humanas
Direcció de correo del autor: vittoria.dentella@estudiants.urv.cat

Palabras clave:

Probabilities
Probabilitie
Large language models
Grammaticality
Artificial intelligence
Experimental and Cognitive Psychology
Language & Linguistics
Linguistics and Language
Linguistics
Letras / linguística
Language and linguistics
Interdisciplinary research in the social sciences
Interdisciplinary research in the humanities
Filosofía
Ciencias sociales
Ciencias humanas
Documentos:

DocumentPrincipal
Cerca a google

Evaluating the Language Abilities of Large Language Models vs. Humans: Three Caveats

Datos identificativos

Otros:

Palabras clave:

Documentos:

Cerca a google