Computer Science > Databases
[Submitted on 18 Dec 2018 (this version), latest version 10 May 2020 (v2)]
Title: High-utility itemset mining for subadditive monotone utility functions
Abstract: High-utility Itemset Mining (HUIM) finds itemsets in a transaction database whose utility is no less than a user-defined threshold, where the utility of an itemset is defined as the sum of the utilities of its items. In this paper, we introduce the notion of generalized utility functions, which need not be the sum of individual utilities. In particular, we study subadditive monotone (SM) utility functions and prove that they generalize the HUIM problem described above. Existing HUIM algorithms use upper bounds such as 'Transaction Weighted Utility' and 'Exact-Utility, Remaining Utility' for efficient search-space exploration. We derive analogous and tighter upper bounds for SM utility functions and explain how existing HUIM algorithms of different classes can be adapted using our upper bound. We experimentally compare adaptations of some of the latest algorithms and point out some caveats that should be kept in mind when handling general utility functions.
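To make the terms in the abstract concrete, the following is a minimal sketch of the classical HUIM setting, using the standard textbook definitions of itemset utility and Transaction Weighted Utility (TWU) rather than the paper's own formulation. The toy database, the function names, and the `sqrt_of_sum` utility are illustrative assumptions. `sqrt_of_sum` is just one example of a function that is monotone and subadditive on non-negative utilities, and the TWU-style bound shown for it relies only on monotonicity. It is not the tighter bound the paper derives.

```python
from itertools import combinations
from math import sqrt

# Toy database (assumed for illustration): each transaction maps
# an item to that item's non-negative utility in the transaction.
DB = [
    {"a": 5, "b": 2, "c": 1},
    {"a": 10, "c": 6},
    {"b": 4, "c": 3, "d": 2},
]

def additive(utils):
    """Classical HUIM utility: the sum of the item utilities."""
    return sum(utils)

def sqrt_of_sum(utils):
    """A hypothetical monotone, subadditive utility (not from the paper)."""
    return sqrt(sum(utils))

def itemset_utility(itemset, db, f=additive):
    """Sum of f over the itemset's utilities in every transaction containing it."""
    return sum(
        f([t[i] for i in itemset])
        for t in db
        if all(i in t for i in itemset)
    )

def twu(itemset, db, f=additive):
    """TWU-style bound: f of the whole transaction, over transactions containing the itemset."""
    return sum(
        f(list(t.values()))
        for t in db
        if all(i in t for i in itemset)
    )

def high_utility_itemsets(db, threshold, f=additive):
    """Brute-force enumeration; real algorithms prune with upper bounds instead."""
    items = sorted({i for t in db for i in t})
    return [
        combo
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
        if itemset_utility(combo, db, f) >= threshold
    ]
```

With the additive utility and a threshold of 15, `high_utility_itemsets(DB, 15)` returns `[('a',), ('a', 'c')]`. For any monotone `f`, `twu` is never smaller than `itemset_utility` for the same itemset, and adding items to an itemset can only shrink the set of supporting transactions, which is what lets TWU-based algorithms discard an itemset together with all of its supersets.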
Submission history
From: Siddharth Dawar
[v1] Tue, 18 Dec 2018 07:26:15 UTC (765 KB)
[v2] Sun, 10 May 2020 06:05:54 UTC (938 KB)