Hierarchical Graph Topic Modeling with Topic Tree-based Transformer

Zhang, Delvin Ce; Yang, Menglin; Wu, Xiaobao; Zhang, Jiasheng; Lauw, Hady W.

Abstract:Textual documents are commonly connected in a hierarchical graph structure where a central document links to others with an exponentially growing connectivity. Though Hyperbolic Graph Neural Networks (HGNNs) excel at capturing such graph hierarchy, they cannot model the rich textual semantics within documents. Moreover, text contents in documents usually discuss topics of different specificity. Hierarchical Topic Models (HTMs) discover such latent topic hierarchy within text corpora. However, most of them focus on the textual content within documents, and ignore the graph adjacency across interlinked documents. We thus propose a Hierarchical Graph Topic Modeling Transformer to integrate both topic hierarchy within documents and graph hierarchy across documents into a unified Transformer. Specifically, to incorporate topic hierarchy within documents, we design a topic tree and infer a hierarchical tree embedding for hierarchical topic modeling. To preserve both topic and graph hierarchies, we design our model in hyperbolic space and propose Hyperbolic Doubly Recurrent Neural Network, which models ancestral and fraternal tree structure. Both hierarchies are inserted into each Transformer layer to learn unified representations. Both supervised and unsupervised experiments verify the effectiveness of our model.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.11345 [cs.CL]
	(or arXiv:2502.11345v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.11345

Computer Science > Computation and Language

Title:Hierarchical Graph Topic Modeling with Topic Tree-based Transformer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators