Counterfactual Samples Constructing and Training for Commonsense Statements Estimation

Liu, Chong; Feng, Zaiwen; Liu, Lin; Deng, Zhenyun; Li, Jiuyong; Zhai, Ruifang; Cheng, Debo; Qin, Li

arXiv:2412.20563 (cs)

[Submitted on 29 Dec 2024]


Abstract: Plausibility Estimation (PE) plays a crucial role in enabling language models to objectively comprehend the real world. Although large language models (LLMs) demonstrate remarkable capabilities in PE tasks, they sometimes produce trivial commonsense errors due to the complexity of commonsense knowledge. They lack two key traits of an ideal PE model: a) Language-explainable: relying on critical word segments for decisions, and b) Commonsense-sensitive: detecting subtle linguistic variations in commonsense. To address these issues, we propose a novel model-agnostic method, referred to as Commonsense Counterfactual Samples Generating (CCSG). Training PE models with CCSG encourages them to focus on critical words, thereby enhancing both their language-explainable and commonsense-sensitive capabilities. Specifically, CCSG generates counterfactual samples by strategically replacing key words and introducing low-level dropout within sentences. These counterfactual samples are then incorporated into a sentence-level contrastive training framework to further enhance the model's learning process. Experimental results across nine diverse datasets demonstrate the effectiveness of CCSG in addressing commonsense reasoning challenges, with CCSG showing a 3.07% improvement over state-of-the-art (SOTA) methods.
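The abstract names three ingredients: key-word replacement, low-level dropout, and sentence-level contrastive training. The Python sketch below illustrates the general idea only and is not the authors' implementation. Everything in it is an assumption made for illustration: the helper names (`replace_key_word`, `token_dropout`, `contrastive_loss`), the choice to treat the word-replaced sentence as the negative and the dropout-perturbed sentence as the positive, the InfoNCE-style loss with a single negative, and the toy 2-dimensional vectors standing in for sentence embeddings.

```python
import math
import random


def replace_key_word(tokens, key_index, substitute):
    """Return a copy of tokens with the token at key_index swapped for substitute."""
    out = list(tokens)
    out[key_index] = substitute
    return out


def token_dropout(tokens, rate, rng):
    """Drop each token independently with probability `rate`; keep at least one token."""
    kept = [t for t in tokens if rng.random() >= rate]
    return kept if kept else [tokens[0]]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(anchor, positive, negative, temperature=0.1):
    """InfoNCE-style loss with one positive and one negative (an assumed form)."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = math.exp(cosine(anchor, negative) / temperature)
    return -math.log(pos / (pos + neg))


# Hypothetical statement; the key-word position is chosen by hand here.
statement = ["birds", "can", "fly"]
rng = random.Random(0)

# Assumed negative: swapping the key word changes the plausibility.
counterfactual = replace_key_word(statement, 0, "fish")

# Assumed positive: light dropout should leave the plausibility unchanged.
perturbed = token_dropout(statement, rate=0.1, rng=rng)

# Toy embeddings standing in for a sentence encoder's output.
anchor_vec, positive_vec, negative_vec = [1.0, 0.0], [1.0, 0.0], [0.0, 1.0]
loss = contrastive_loss(anchor_vec, positive_vec, negative_vec)
```

In this toy setup the loss is small when the anchor embedding sits close to the perturbed sentence and far from the counterfactual, which is the behaviour a contrastive objective of this kind rewards. How the paper selects key words, sets the dropout rate, and defines its loss is not stated in the abstract.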

Comments:	14 pages, 4 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.20563 [cs.CL]
	(or arXiv:2412.20563v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.20563

Submission history

From: Liu Chong
[v1] Sun, 29 Dec 2024 20:18:52 UTC (251 KB)
