English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Association Plots: visualizing cluster-specific associations in high-dimensional correspondence analysis biplots

Gralinska, E., & Vingron, M. (2023). Association Plots: visualizing cluster-specific associations in high-dimensional correspondence analysis biplots. Journal of the Royal Statistical Society - Series C: Applied Statistics, 72(4), 1023-1040. doi:10.1093/jrsssc/qlad039.

Item is

Files

show Files
hide Files
:
Journal of the Royal Statistical Society Series C_Gralinska&Vingron_2023.pdf (Publisher version), 4MB
Name:
Journal of the Royal Statistical Society Series C_Gralinska&Vingron_2023.pdf
Description:
-
OA-Status:
Not specified
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
© The Author(s) 2023.

Locators

show

Creators

show
hide
 Creators:
Gralinska, Elzbieta1, Author                 
Vingron, Martin1, Author                 
Affiliations:
1Transcriptional Regulation (Martin Vingron), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479639              

Content

show
hide
Free keywords: association, correspondence analysis, gene expression, marker genes
 Abstract: In molecular biology, just as in many other fields of science, data often come in the form of matrices or contingency tables with many observations (rows) for a set of variables (columns). While projection methods like principal component analysis or correspondence analysis (CA) can be applied for obtaining an overview of such data, in cases where the matrix is very large the associated loss of information upon projection into two or three dimensions may be dramatic. However, when the set of variables can be grouped into clusters, this opens up a new angle on the data. We focus on the question of which observations are associated to a cluster and distinguish it from other clusters. CA employs a geometry geared towards answering this question. We exploit this feature in order to introduce Association Plots for visualizing cluster-specific observations in complex data. Regardless of the data matrix dimensionality Association Plots are two-dimensional and depict the observations associated to a cluster of variables. We demonstrate our method on two small data sets and then use it to study a challenging genomic data set comprising >10,000 samples. We show that Association Plots can clearly highlight those observations which characterise a cluster of variables.

Details

show
hide
Language(s): eng - English
 Dates: 2023-04-142023-06-082023-08
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1093/jrsssc/qlad039
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Journal of the Royal Statistical Society - Series C: Applied Statistics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Oxford : Oxford University Press
Pages: - Volume / Issue: 72 (4) Sequence Number: - Start / End Page: 1023 - 1040 Identifier: Other: ISSN
CoNE: https://pure.mpg.de/cone/journals/resource/1467-9876