Mathematics > Statistics Theory
[Submitted on 3 Dec 2022 (v1), revised 3 Mar 2023 (this version, v2), latest version 18 Jul 2024 (v3)]
Title: A simple extension of Azadkia & Chatterjee's rank correlation to a vector of endogenous variables
Abstract: We propose a direct and natural extension of Azadkia & Chatterjee's rank correlation $T$ introduced in [4] to a set of $q \geq 1$ endogenous variables. The approach converts the original vector-valued problem into a univariate problem and then applies the rank correlation $T$ to it. The novel measure $T^q$ quantifies the scale-invariant extent of functional dependence of an endogenous vector ${\bf Y} = (Y_1,\dots,Y_q)$ on a number of exogenous variables ${\bf X} = (X_1,\dots,X_p)$, $p \geq 1$. It characterizes independence of ${\bf X}$ and ${\bf Y}$ as well as perfect dependence of ${\bf Y}$ on ${\bf X}$, and hence fulfills all the desired characteristics of a measure of predictability. Aiming at maximum interpretability, we provide various general invariance and continuity conditions for $T^q$ as well as novel ordering results for conditional distributions, revealing new insights into the nature of $T$. Building upon the graph-based estimator for $T$ in [4], we present a non-parametric estimator for $T^q$ that is strongly consistent in full generality, i.e., without any distributional assumptions. Based on this estimator, we develop a model-free and dependence-based feature ranking and forward feature selection for multiple-outcome data, and establish tools for identifying networks between random variables. Real case studies illustrate the main aspects of the developed methodology.
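The abstract refers to the graph-based estimator for $T$ from [4] without stating it. As a rough illustration only, the Python sketch below implements one commonly cited form of that nearest-neighbour estimator for the unconditional case, assuming continuous data without ties. The function name `azadkia_chatterjee_T`, the brute-force neighbour search, and the simulated data are assumptions made here for illustration; this is not the authors' code, and the step that combines several such terms into $T^q$ is not shown.

```python
import numpy as np

def azadkia_chatterjee_T(y, x):
    """Nearest-neighbour estimate of T(Y | X) (hypothetical helper).

    Assumes continuous data without ties. Neighbour ties, if any, are
    resolved by taking the first index rather than at random.
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]  # treat a 1-D input as a single exogenous variable
    n = y.shape[0]

    # R_i = #{j : y_j <= y_i} and L_i = #{j : y_j >= y_i}
    R = (y[None, :] <= y[:, None]).sum(axis=1)
    L = (y[None, :] >= y[:, None]).sum(axis=1)

    # M(i): index of the Euclidean nearest neighbour of x_i among the other points
    d = ((x[:, None, :] - x[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d, np.inf)
    M = d.argmin(axis=1)

    num = (n * np.minimum(R, R[M]) - L.astype(float) ** 2).sum()
    den = (L * (n - L)).sum()
    return num / den

# Illustrative usage on simulated data (not from the paper)
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 500)
print(azadkia_chatterjee_T(x ** 2, x))                # Y is a function of X
print(azadkia_chatterjee_T(rng.normal(size=500), x))  # Y independent of X
```

Under these assumptions the estimate should be close to 1 when $Y$ is a noiseless function of $X$ and close to 0 under independence. The pairwise distance matrix needs $O(n^2)$ memory, so a tree-based neighbour search would be preferable for large samples.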
Submission history
From: Sebastian Fuchs
[v1] Sat, 3 Dec 2022 14:24:14 UTC (29 KB)
[v2] Fri, 3 Mar 2023 12:15:25 UTC (193 KB)
[v3] Thu, 18 Jul 2024 08:38:14 UTC (93 KB)