LCTG Bench: LLM Controlled Text Generation Benchmark

Kurihara, Kentaro; Mita, Masato; Zhang, Peinan; Sasaki, Shota; Ishigami, Ryosuke; Okazaki, Naoaki

Computer Science > Computation and Language

arXiv:2501.15875 (cs)

[Submitted on 27 Jan 2025]

Title:LCTG Bench: LLM Controlled Text Generation Benchmark

Authors:Kentaro Kurihara, Masato Mita, Peinan Zhang, Shota Sasaki, Ryosuke Ishigami, Naoaki Okazaki

View PDF

Abstract:The rise of large language models (LLMs) has led to more diverse and higher-quality machine-generated text. However, their high expressive power makes it difficult to control outputs based on specific business instructions. In response, benchmarks focusing on the controllability of LLMs have been developed, but several issues remain: (1) They primarily cover major languages like English and Chinese, neglecting low-resource languages like Japanese; (2) Current benchmarks employ task-specific evaluation metrics, lacking a unified framework for selecting models based on controllability across different use cases. To address these challenges, this research introduces LCTG Bench, the first Japanese benchmark for evaluating the controllability of LLMs. LCTG Bench provides a unified framework for assessing control performance, enabling users to select the most suitable model for their use cases based on controllability. By evaluating nine diverse Japanese-specific and multilingual LLMs like GPT-4, we highlight the current state and challenges of controllability in Japanese LLMs and reveal the significant gap between multilingual models and Japanese-specific models.

Comments:	15 pages, 11 figures. Project page: this [URL](this https URL)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2501.15875 [cs.CL]
	(or arXiv:2501.15875v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.15875

Submission history

From: Kentaro Kurihara [view email]
[v1] Mon, 27 Jan 2025 08:59:10 UTC (3,035 KB)

Computer Science > Computation and Language

Title:LCTG Bench: LLM Controlled Text Generation Benchmark

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LCTG Bench: LLM Controlled Text Generation Benchmark

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators