DISCOS: Bridging the Gap between Discourse Knowledge and Commonsense Knowledge

Fang, Tianqing; Zhang, Hongming; Wang, Weiqi; Song, Yangqiu; He, Bin

doi:10.1145/3442381.3450117

Computer Science > Computation and Language

arXiv:2101.00154 (cs)

[Submitted on 1 Jan 2021 (v1), last revised 18 Feb 2021 (this version, v2)]

Title:DISCOS: Bridging the Gap between Discourse Knowledge and Commonsense Knowledge

Authors:Tianqing Fang, Hongming Zhang, Weiqi Wang, Yangqiu Song, Bin He

View PDF

Abstract:Commonsense knowledge is crucial for artificial intelligence systems to understand natural language. Previous commonsense knowledge acquisition approaches typically rely on human annotations (for example, ATOMIC) or text generation models (for example, COMET.) Human annotation could provide high-quality commonsense knowledge, yet its high cost often results in relatively small scale and low coverage. On the other hand, generation models have the potential to automatically generate more knowledge. Nonetheless, machine learning models often fit the training data well and thus struggle to generate high-quality novel knowledge. To address the limitations of previous approaches, in this paper, we propose an alternative commonsense knowledge acquisition framework DISCOS (from DIScourse to COmmonSense), which automatically populates expensive complex commonsense knowledge to more affordable linguistic knowledge resources. Experiments demonstrate that we can successfully convert discourse knowledge about eventualities from ASER, a large-scale discourse knowledge graph, into if-then commonsense knowledge defined in ATOMIC without any additional annotation effort. Further study suggests that DISCOS significantly outperforms previous supervised approaches in terms of novelty and diversity with comparable quality. In total, we can acquire 3.4M ATOMIC-like inferential commonsense knowledge by populating ATOMIC on the core part of ASER. Codes and data are available at this https URL.

Comments:	WWW 2021 paper. 12 pages and 6 Figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2101.00154 [cs.CL]
	(or arXiv:2101.00154v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2101.00154
Related DOI:	https://doi.org/10.1145/3442381.3450117

Submission history

From: Tianqing Fang [view email]
[v1] Fri, 1 Jan 2021 03:30:38 UTC (1,212 KB)
[v2] Thu, 18 Feb 2021 12:43:37 UTC (2,156 KB)

Computer Science > Computation and Language

Title:DISCOS: Bridging the Gap between Discourse Knowledge and Commonsense Knowledge

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:DISCOS: Bridging the Gap between Discourse Knowledge and Commonsense Knowledge

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators