Englander Institute for Precision Medicine

SGI: automatic clinical subgroup identification in omics datasets.

TitleSGI: automatic clinical subgroup identification in omics datasets.
Publication TypeJournal Article
Year of Publication2022
AuthorsBuyukozkan M, Suhre K, Krumsiek J
JournalBioinformatics
Volume38
Issue2
Pagination573-576
Date Published2022 Jan 03
ISSN1367-4811
KeywordsAlgorithms, Diabetes Mellitus, Type 2, DNA Copy Number Variations, Humans, Metabolomics, Software
Abstract

SUMMARY: The 'Subgroup Identification' (SGI) toolbox provides an algorithm to automatically detect clinical subgroups of samples in large-scale omics datasets. It is based on hierarchical clustering trees in combination with a specifically designed association testing and visualization framework that can process an arbitrary number of clinical parameters and outcomes in a systematic fashion. A multi-block extension allows for the simultaneous use of multiple omics datasets on the same samples. In this article, we first describe the functionality of the toolbox and then demonstrate its capabilities through application examples on a type 2 diabetes metabolomics study as well as two copy number variation datasets from The Cancer Genome Atlas.

AVAILABILITY AND IMPLEMENTATION: SGI is an open-source package implemented in R. Package source codes and hands-on tutorials are available at https://github.com/krumsieklab/sgi. The QMdiab metabolomics data is included in the package and can be downloaded from https://doi.org/10.6084/m9.figshare.5904022.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

DOI10.1093/bioinformatics/btab656
Alternate JournalBioinformatics
PubMed ID34529048
PubMed Central IDPMC8723155
Grant ListR01 AG069901 / AG / NIA NIH HHS / United States
U19 AG063744 / AG / NIA NIH HHS / United States
/ / Biomedical Research Program /
/ / Weill Cornell Medical College in Qatar /
/ / Qatar Foundation and multiple grants from the Qatar National Research Fund (QNRF) /

Weill Cornell Medicine Englander Institute for Precision Medicine 413 E 69th Street
Belfer Research Building
New York, NY 10021