ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events

Islakoglu, Duygu Sezen; Kalo, Jan-Christoph

Computer Science > Machine Learning

arXiv:2501.03040 (cs)

[Submitted on 6 Jan 2025]

Title:ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events

Authors:Duygu Sezen Islakoglu, Jan-Christoph Kalo

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have achieved remarkable success in various NLP tasks, yet they still face significant challenges in reasoning and arithmetic. Temporal reasoning, a critical component of natural language understanding, has raised increasing research attention. However, comprehensive testing of Allen's interval relations (e.g., before, after, during) -- a fundamental framework for temporal relationships -- remains underexplored. To fill this gap, we present ChronoSense, a new benchmark for evaluating LLMs' temporal understanding. It includes 16 tasks, focusing on identifying the Allen relation between two temporal events and temporal arithmetic, using both abstract events and real-world data from Wikidata. We assess the performance of seven recent LLMs using this benchmark and the results indicate that models handle Allen relations, even symmetrical ones, quite differently. Moreover, the findings suggest that the models may rely on memorization to answer time-related questions. Overall, the models' low performance highlights the need for improved temporal understanding in LLMs and ChronoSense offers a robust framework for future research in this area. Our dataset and the source code are available at this https URL.

Comments:	14 pages, 2 figures
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2501.03040 [cs.LG]
	(or arXiv:2501.03040v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.03040

Submission history

From: Duygu Sezen Islakoglu [view email]
[v1] Mon, 6 Jan 2025 14:27:41 UTC (269 KB)

Computer Science > Machine Learning

Title:ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators