Context Retrieval via Normalized Contextual Latent Interaction for Conversational Agent

Liu, Junfeng; Mei, Zhuocheng; Peng, Kewen; Vatsavai, Ranga Raju

Computer Science > Computation and Language

arXiv:2312.00774 (cs)

[Submitted on 1 Dec 2023]

Title:Context Retrieval via Normalized Contextual Latent Interaction for Conversational Agent

Authors:Junfeng Liu, Zhuocheng Mei, Kewen Peng, Ranga Raju Vatsavai

View PDF HTML (experimental)

Abstract:Conversational agents leveraging AI, particularly deep learning, are emerging in both academic research and real-world applications. However, these applications still face challenges, including disrespecting knowledge and facts, not personalizing to user preferences, and enormous demand for computational resources during training and inference. Recent research efforts have been focused on addressing these challenges from various aspects, including supplementing various types of auxiliary information to the conversational agents. However, existing methods are still not able to effectively and efficiently exploit relevant information from these auxiliary supplements to further unleash the power of the conversational agents and the language models they use. In this paper, we present a novel method, PK-NCLI, that is able to accurately and efficiently identify relevant auxiliary information to improve the quality of conversational responses by learning the relevance among persona, chat history, and knowledge background through low-level normalized contextual latent interaction. Our experimental results indicate that PK-NCLI outperforms the state-of-the-art method, PK-FoCus, by 47.80%/30.61%/24.14% in terms of perplexity, knowledge grounding, and training efficiency, respectively, and maintained the same level of persona grounding performance. We also provide a detailed analysis of how different factors, including language model choices and trade-offs on training weights, would affect the performance of PK-NCLI.

Comments:	2023 IEEE International Conference on Data Mining Workshops (ICDMW)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2312.00774 [cs.CL]
	(or arXiv:2312.00774v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2312.00774

Submission history

From: Junfeng Liu [view email]
[v1] Fri, 1 Dec 2023 18:53:51 UTC (694 KB)

Computer Science > Computation and Language

Title:Context Retrieval via Normalized Contextual Latent Interaction for Conversational Agent

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Context Retrieval via Normalized Contextual Latent Interaction for Conversational Agent

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators