Computer Science > Databases
[Submitted on 25 Dec 2017 (v1), last revised 23 May 2018 (this version, v2)]
Title: Recurrent Meta-Structure for Robust Similarity Measure in Heterogeneous Information Networks
Abstract: Similarity measurement is a fundamental task in heterogeneous information network analysis and has been applied in many areas, e.g., product recommendation, clustering, and Web search. Most existing metrics depend on a meta-path or meta-structure specified by users in advance, and are therefore sensitive to that choice. This paper proposes a novel similarity measure for heterogeneous information networks, called Recurrent Meta-Structure-based Similarity (RMSS). The recurrent meta-structure, a schematic structure in heterogeneous information networks, provides a unified framework that integrates all of the meta-paths and meta-structures, so RMSS is robust to their choice. We devise an approach for automatically constructing the recurrent meta-structure. To formalize its semantics, the recurrent meta-structure is decomposed into several recurrent meta-paths and recurrent meta-trees, and we then define the commuting matrices of the recurrent meta-paths and meta-trees. These commuting matrices are combined according to different weights, which can be determined by one of two weighting strategies: a local weighting strategy or a global weighting strategy. RMSS is then defined in terms of the final commuting matrix. Experimental evaluations show that the existing metrics are sensitive to different meta-paths or meta-structures and that the proposed RMSS outperforms the existing metrics on ranking and clustering tasks.
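As a rough illustration of the idea of combining commuting matrices by weight, the following Python sketch builds commuting matrices for two ordinary meta-paths on a toy network and mixes them with fixed weights. It is not the paper's RMSS definition: the toy author/paper/venue network, the helper names (`matmul`, `transpose`, `commuting_matrix`, `combine`), and the equal weights are all assumptions made for illustration. The commuting matrix of a plain meta-path is taken here in the usual sense, as the product of the relation matrices along the path; the commuting matrices of recurrent meta-paths and meta-trees, and the local and global weighting strategies, are defined in the paper itself and are not reproduced.

```python
from typing import List, Sequence

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Multiply two dense matrices stored as nested lists."""
    if len(a[0]) != len(b):
        raise ValueError("inner dimensions do not match")
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def transpose(m: Matrix) -> Matrix:
    """Return the transpose of a dense matrix."""
    return [list(col) for col in zip(*m)]


def commuting_matrix(relations: Sequence[Matrix]) -> Matrix:
    """Product of the relation matrices along a (plain) meta-path."""
    result = relations[0]
    for rel in relations[1:]:
        result = matmul(result, rel)
    return result


def combine(matrices: Sequence[Matrix], weights: Sequence[float]) -> Matrix:
    """Entry-wise weighted sum of equally shaped matrices."""
    if len(matrices) != len(weights):
        raise ValueError("need exactly one weight per matrix")
    rows, cols = len(matrices[0]), len(matrices[0][0])
    return [
        [sum(w * m[i][j] for m, w in zip(matrices, weights)) for j in range(cols)]
        for i in range(rows)
    ]


# Hypothetical toy network: 3 authors, 2 papers, 1 venue.
AP = [[1, 0], [1, 1], [0, 1]]  # author-paper relation (assumed data)
PV = [[1], [1]]                # paper-venue relation (assumed data)

# Commuting matrices for the meta-paths A-P-A and A-P-V-P-A.
apa = commuting_matrix([AP, transpose(AP)])
apvpa = commuting_matrix([AP, PV, transpose(PV), transpose(AP)])

# Equal weights are an arbitrary choice, not one of the paper's strategies.
final = combine([apa, apvpa], [0.5, 0.5])
print(final)
```

In this sketch, entry (i, j) of `final` is a weighted count of path instances connecting authors i and j. A similarity score would still need a normalization step, which is left out here because the abstract does not specify one.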
Submission history
From: Jianbin Huang
[v1] Mon, 25 Dec 2017 05:07:01 UTC (344 KB)
[v2] Wed, 23 May 2018 03:11:06 UTC (417 KB)