Statistics > Applications
[Submitted on 17 Mar 2016]
Title:Covariate Microaggregation for Logistic Regression: An Application for Analysis of Confidential Data
View PDFAbstract:In the recent past, electronic health records and distributed data networks emerged as a viable resource for medical and scientific research. As the use of confidential patient information from such sources become more common, maintaining privacy of patients is of utmost importance. For a binary disease outcome of interest, we show that the techniques of microaggregation (equivalent to specimen pooling) and \underline{Po}oled \underline{Lo}gistic \underline{R}egression (PoLoR) could be applied for analysis of large and/or distributed data while respecting patient privacy. PoLoR is exactly the same as standard logistic regression, but instead of using individual covariate level, the analysis uses microaggregated covariate level when microaggregation is conditional on the outcome status. Aggregate levels of covariates can be passed from the nodes of the network to the analysis center without revealing individual-level microdata and can be used very easily with standard softwares for estimation of disease odds ratio associated with a set of categorical or continuous covariates. Microaggregation of covariates allows for consistent estimation of the parameters of logistic regression model that can include confounders and transformation of exposure. Additionally, since the microdata can be accessed within nodes, effect modifiers can be accommodated and consistently estimated. For analysis of confidential health data, covariate microaggregation for logistic regression will provide a practical and straightforward alternative to more complicated existing options.
Submission history
From: Paramita Saha-Chauchuri [view email][v1] Thu, 17 Mar 2016 02:48:08 UTC (82 KB)
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.