UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches

Wang, Chao; Wu, Neo; Ning, Lin; Wu, Jiaxing; Liu, Luyang; Xie, Jun; O'Banion, Shawn; Green, Bradley

Computer Science > Machine Learning

arXiv:2408.16966 (cs)

[Submitted on 30 Aug 2024 (v1), last revised 5 Sep 2024 (this version, v2)]

Abstract: Large language models (LLMs) have shown remarkable capabilities in generating user summaries from a long list of raw user activity data. These summaries capture essential user information such as preferences and interests, and are therefore invaluable for LLM-based personalization applications, such as explainable recommender systems. However, the development of new summarization techniques is hindered by the lack of ground-truth labels, the inherent subjectivity of user summaries, and human evaluation, which is often costly and time-consuming. To address these challenges, we introduce UserSumBench, a benchmark framework designed to facilitate iterative development of LLM-based summarization approaches. This framework offers two key components: (1) A reference-free summary quality metric. We show that this metric is effective and aligned with human preferences across three diverse datasets (MovieLens, Yelp and Amazon Review). (2) A novel robust summarization method that leverages a time-hierarchical summarizer and a self-critique verifier to produce high-quality summaries while eliminating hallucination. This method serves as a strong baseline for further innovation in summarization techniques.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2408.16966 [cs.LG]
	(or arXiv:2408.16966v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.16966

Submission history

From: Chao Wang
[v1] Fri, 30 Aug 2024 01:56:57 UTC (683 KB)
[v2] Thu, 5 Sep 2024 23:18:00 UTC (683 KB)
