HistLLM: A Unified Framework for LLM-Based Multimodal Recommendation with User History Encoding and Compression

Zhang, Chen; Hu, Bo; Chen, Weidong; Mao, Zhendong

Computer Science > Information Retrieval

arXiv:2504.10150 (cs)

[Submitted on 14 Apr 2025]

Title:HistLLM: A Unified Framework for LLM-Based Multimodal Recommendation with User History Encoding and Compression

Authors:Chen Zhang, Bo Hu, Weidong Chen, Zhendong Mao

View PDF HTML (experimental)

Abstract:While large language models (LLMs) have proven effective in leveraging textual data for recommendations, their application to multimodal recommendation tasks remains relatively underexplored. Although LLMs can process multimodal information through projection functions that map visual features into their semantic space, recommendation tasks often require representing users' history interactions through lengthy prompts combining text and visual elements, which not only hampers training and inference efficiency but also makes it difficult for the model to accurately capture user preferences from complex and extended prompts, leading to reduced recommendation performance. To address this challenge, we introduce HistLLM, an innovative multimodal recommendation framework that integrates textual and visual features through a User History Encoding Module (UHEM), compressing multimodal user history interactions into a single token representation, effectively facilitating LLMs in processing user preferences. Extensive experiments demonstrate the effectiveness and efficiency of our proposed mechanism.

Subjects:	Information Retrieval (cs.IR); Multimedia (cs.MM)
Cite as:	arXiv:2504.10150 [cs.IR]
	(or arXiv:2504.10150v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2504.10150

Submission history

From: Chen Zhang [view email]
[v1] Mon, 14 Apr 2025 12:01:11 UTC (2,010 KB)

Computer Science > Information Retrieval

Title:HistLLM: A Unified Framework for LLM-Based Multimodal Recommendation with User History Encoding and Compression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:HistLLM: A Unified Framework for LLM-Based Multimodal Recommendation with User History Encoding and Compression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators