\'Evaluation des capacit\'es de r\'eponse de larges mod\`eles de langage (LLM) pour des questions d'historiens

Chartier, Mathieu; Dakkoune, Nabil; Bourgeois, Guillaume; Jean, Stéphane

Computer Science > Information Retrieval

arXiv:2406.15173 (cs)

[Submitted on 21 Jun 2024]

Title:Évaluation des capacités de réponse de larges modèles de langage (LLM) pour des questions d'historiens

Authors:Mathieu Chartier, Nabil Dakkoune, Guillaume Bourgeois, Stéphane Jean

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) like ChatGPT or Bard have revolutionized information retrieval and captivated the audience with their ability to generate custom responses in record time, regardless of the topic. In this article, we assess the capabilities of various LLMs in producing reliable, comprehensive, and sufficiently relevant responses about historical facts in French. To achieve this, we constructed a testbed comprising numerous history-related questions of varying types, themes, and levels of difficulty. Our evaluation of responses from ten selected LLMs reveals numerous shortcomings in both substance and form. Beyond an overall insufficient accuracy rate, we highlight uneven treatment of the French language, as well as issues related to verbosity and inconsistency in the responses provided by LLMs.

Comments:	in French language
Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.15173 [cs.IR]
	(or arXiv:2406.15173v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2406.15173

Submission history

From: Mathieu Chartier [view email]
[v1] Fri, 21 Jun 2024 14:19:57 UTC (83 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.IR

< prev | next >

new | recent | 2024-06

Change to browse by:

cs
cs.AI

References & Citations

export BibTeX citation

Computer Science > Information Retrieval

Title:Évaluation des capacités de réponse de larges modèles de langage (LLM) pour des questions d'historiens

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Évaluation des capacités de réponse de larges modèles de langage (LLM) pour des questions d'historiens

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators