Quantifying Loss of Information in Network-based Dimensionality Reduction Techniques

Zenil, Hector; Kiani, Narsis A.; Tegnér, Jesper

Quantitative Biology > Molecular Networks

arXiv:1504.06249 (q-bio)

[Submitted on 23 Apr 2015 (v1), last revised 27 Aug 2015 (this version, v4)]

Title:Quantifying Loss of Information in Network-based Dimensionality Reduction Techniques

Authors:Hector Zenil, Narsis A. Kiani, Jesper Tegnér

View PDF

Abstract:To cope with the complexity of large networks, a number of dimensionality reduction techniques for graphs have been developed. However, the extent to which information is lost or preserved when these techniques are employed has not yet been clear. Here we develop a framework, based on algorithmic information theory, to quantify the extent to which information is preserved when network motif analysis, graph spectra and spectral sparsification methods are applied to over twenty different biological and artificial networks. We find that the spectral sparsification is highly sensitive to high number of edge deletion, leading to significant inconsistencies, and that graph spectral methods are the most irregular, capturing algebraic information in a condensed fashion but largely losing most of the information content of the original networks. However, the approach shows that network motif analysis excels at preserving the relative algorithmic information content of a network, hence validating and generalizing the remarkable fact that despite their inherent combinatorial possibilities, local regularities preserve information to such an extent that essential properties are fully recoverable across different networks to determine their family group to which they belong to (eg genetic vs social network). Our algorithmic information methodology thus provides a rigorous framework enabling a fundamental assessment and comparison between different data dimensionality reduction methods thereby facilitating the identification and evaluation of the capabilities of old and new methods.

Comments:	29 pages, 6 figures
Subjects:	Molecular Networks (q-bio.MN); Information Theory (cs.IT); Quantitative Methods (q-bio.QM)
Cite as:	arXiv:1504.06249 [q-bio.MN]
	(or arXiv:1504.06249v4 [q-bio.MN] for this version)
	https://doi.org/10.48550/arXiv.1504.06249

Submission history

From: Hector Zenil [view email]
[v1] Thu, 23 Apr 2015 16:49:18 UTC (2,990 KB)
[v2] Sun, 3 May 2015 09:29:44 UTC (2,989 KB)
[v3] Sat, 13 Jun 2015 10:20:22 UTC (2,988 KB)
[v4] Thu, 27 Aug 2015 13:36:30 UTC (2,988 KB)

Quantitative Biology > Molecular Networks

Title:Quantifying Loss of Information in Network-based Dimensionality Reduction Techniques

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Quantitative Biology > Molecular Networks

Title:Quantifying Loss of Information in Network-based Dimensionality Reduction Techniques

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators