Position: Curvature Matrices Should Be Democratized via Linear Operators

Dangel, Felix; Eschenhagen, Runa; Ormaniec, Weronika; Fernandez, Andres; Tatzel, Lukas; Kristiadi, Agustinus

Computer Science > Machine Learning

arXiv:2501.19183 (cs)

[Submitted on 31 Jan 2025]

Title:Position: Curvature Matrices Should Be Democratized via Linear Operators

Authors:Felix Dangel, Runa Eschenhagen, Weronika Ormaniec, Andres Fernandez, Lukas Tatzel, Agustinus Kristiadi

View PDF HTML (experimental)

Abstract:Structured large matrices are prevalent in machine learning. A particularly important class is curvature matrices like the Hessian, which are central to understanding the loss landscape of neural nets (NNs), and enable second-order optimization, uncertainty quantification, model pruning, data attribution, and more. However, curvature computations can be challenging due to the complexity of automatic differentiation, and the variety and structural assumptions of curvature proxies, like sparsity and Kronecker factorization. In this position paper, we argue that linear operators -- an interface for performing matrix-vector products -- provide a general, scalable, and user-friendly abstraction to handle curvature matrices. To support this position, we developed $\textit{curvlinops}$, a library that provides curvature matrices through a unified linear operator interface. We demonstrate with $\textit{curvlinops}$ how this interface can hide complexity, simplify applications, be extensible and interoperable with other libraries, and scale to large NNs.

Comments:	8 pages, 2 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2501.19183 [cs.LG]
	(or arXiv:2501.19183v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.19183

Submission history

From: Felix Dangel [view email]
[v1] Fri, 31 Jan 2025 14:46:30 UTC (1,447 KB)

Computer Science > Machine Learning

Title:Position: Curvature Matrices Should Be Democratized via Linear Operators

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Position: Curvature Matrices Should Be Democratized via Linear Operators

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators