Meta-Analytic Gene-Clustering Algorithm for Integrating Multi-Omics and Multi-Study Data. Bioengineering (Basel) 2024 Jun 08;11(6)
Date
06/27/2024Pubmed ID
38927823Pubmed Central ID
PMC11201102DOI
10.3390/bioengineering11060587Scopus ID
2-s2.0-85197927921 (requires institutional sign-in at Scopus site) 1 CitationAbstract
Gene pathways and gene-regulatory networks are used to describe the causal relationship between genes, based on biological experiments. However, many genes are still to be studied to define novel pathways. To address this, a gene-clustering algorithm has been used to group correlated genes together, based on the similarity of their gene expression level. The existing methods cluster genes based on only one type of omics data, which ignores the information from other types. A large sample size is required to achieve an accurate clustering structure for thousands of genes, which can be challenging due to the cost of multi-omics data. Meta-analysis has been used to aggregate the data from multiple studies and improve the analysis results. We propose a computationally efficient meta-analytic gene-clustering algorithm that combines multi-omics datasets from multiple studies, using the fixed effects linear models and a modified weighted correlation network analysis framework. The simulation study shows that the proposed method outperforms existing single omic-based clustering approaches when multi-omics data and/or multiple studies are available. A real data example demonstrates that our meta-analytic method outperforms single-study based methods.
Author List
Kemmo Tsafack U, Ahn KW, Kwitek AE, Lin CWAuthors
Kwang Woo Ahn PhD Director, Professor in the Data Science Institute department at Medical College of WisconsinAnne E. Kwitek PhD Professor in the Physiology department at Medical College of Wisconsin
Chien-Wei Lin PhD Associate Professor in the Data Science Institute department at Medical College of Wisconsin