Iterative Effect-Size Bias in Ridehailing: Measuring Social Bias in Dynamic Pricing of 100 Million Rides

Pandey, Akshat; Caliskan, Aylin

Computer Science > Computers and Society

arXiv:2006.04599v4 (cs)

[Submitted on 8 Jun 2020 (v1), revised 22 Jun 2020 (this version, v4), latest version 3 May 2021 (v6)]

Title:Iterative Effect-Size Bias in Ridehailing: Measuring Social Bias in Dynamic Pricing of 100 Million Rides

Authors:Akshat Pandey, Aylin Caliskan

View PDF

Abstract:Algorithmic bias is the systematic preferential or discriminatory treatment of a group of people by an artificial intelligence system. In this work we develop a random-effects based metric for the analysis of social bias in supervised machine learning prediction models where model outputs depend on U.S. locations. We define a methodology for using U.S. Census data to measure social bias on user attributes legally protected against discrimination, such as ethnicity, sex, and religion, also known as protected attributes. We evaluate our method on the Strategic Subject List (SSL) gun-violence prediction dataset, where we have access to both U.S. Census data as well as ground truth protected attributes for 224,235 individuals in Chicago being assessed for participation in future gun-violence incidents. Our results indicate that quantifying social bias using U.S. Census data provides a valid approach to auditing a supervised algorithmic decision-making system. Using our methodology, we then quantify the potential social biases of 100 million ridehailing samples in the city of Chicago.
This work is the first large-scale fairness analysis of the dynamic pricing algorithms used by ridehailing applications. An analysis of Chicago ridehailing samples in conjunction with American Community Survey data indicates possible disparate impact due to social bias based on age, house pricing, education, and ethnicity in the dynamic fare pricing models used by ridehailing applications, with effect-sizes of 0.74, 0.70, 0.34, and -0.31 (using Cohen's d) for each demographic respectively. Further, our methodology provides a principled approach to quantifying algorithmic bias on datasets where protected attributes are unavailable, given that U.S. geolocations and algorithmic decisions are provided.

Comments:	16 pages, 6 tables, 6 figures
Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2006.04599 [cs.CY]
	(or arXiv:2006.04599v4 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2006.04599

Submission history

From: Akshat Pandey [view email]
[v1] Mon, 8 Jun 2020 13:51:03 UTC (824 KB)
[v2] Fri, 12 Jun 2020 21:06:33 UTC (2,487 KB)
[v3] Fri, 19 Jun 2020 00:45:59 UTC (4,133 KB)
[v4] Mon, 22 Jun 2020 19:43:31 UTC (9,759 KB)
[v5] Thu, 8 Apr 2021 14:42:09 UTC (8,340 KB)
[v6] Mon, 3 May 2021 20:35:37 UTC (8,342 KB)

Computer Science > Computers and Society

Title:Iterative Effect-Size Bias in Ridehailing: Measuring Social Bias in Dynamic Pricing of 100 Million Rides

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:Iterative Effect-Size Bias in Ridehailing: Measuring Social Bias in Dynamic Pricing of 100 Million Rides

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators