Linguistic Matrix Theory

Kartsaklis, Dimitrios; Ramgoolam, Sanjaye; Sadrzadeh, Mehrnoosh

Computer Science > Computation and Language

arXiv:1703.10252 (cs)

[Submitted on 28 Mar 2017]

Title:Linguistic Matrix Theory

Authors:Dimitrios Kartsaklis, Sanjaye Ramgoolam, Mehrnoosh Sadrzadeh

View PDF

Abstract:Recent research in computational linguistics has developed algorithms which associate matrices with adjectives and verbs, based on the distribution of words in a corpus of text. These matrices are linear operators on a vector space of context words. They are used to construct the meaning of composite expressions from that of the elementary constituents, forming part of a compositional distributional approach to semantics. We propose a Matrix Theory approach to this data, based on permutation symmetry along with Gaussian weights and their perturbations. A simple Gaussian model is tested against word matrices created from a large corpus of text. We characterize the cubic and quartic departures from the model, which we propose, alongside the Gaussian parameters, as signatures for comparison of linguistic corpora. We propose that perturbed Gaussian models with permutation symmetry provide a promising framework for characterizing the nature of universality in the statistical properties of word matrices. The matrix theory framework developed here exploits the view of statistics as zero dimensional perturbative quantum field theory. It perceives language as a physical system realizing a universality class of matrix statistics characterized by permutation symmetry.

Comments:	32 pages, 3 figures
Subjects:	Computation and Language (cs.CL); High Energy Physics - Theory (hep-th); Combinatorics (math.CO)
Report number:	QMUL-PH-17-03
Cite as:	arXiv:1703.10252 [cs.CL]
	(or arXiv:1703.10252v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1703.10252

Submission history

From: Dimitri Kartsaklis [view email]
[v1] Tue, 28 Mar 2017 15:20:52 UTC (118 KB)

Computer Science > Computation and Language

Title:Linguistic Matrix Theory

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Linguistic Matrix Theory

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators