Joint regression analysis of multiple traits based on genetic relationships

Publication: Contribution to journal, Journal article, Research, peer-reviewed

Documents

  • Fulltext

    Publisher's published version, 1.43 MB, PDF document

MOTIVATION: Polygenic scores (PGSs) are widely available and are used in genomic data analyses for prediction and for understanding genetic architectures. Existing approaches either require SNP-level information, do not infer clusters of traits sharing genetic characteristics, or do not have any immediate predictive properties.

RESULTS: Here, we present geneJAM, a novel clustering and estimation method that uses PGSs to infer a genetic relationship among multiple, simultaneously measured and potentially correlated traits in a multivariate GWAS. Using graphical lasso, we estimate a sparse covariance matrix of the PGSs and obtain clusters of traits sharing genetic characteristics. We use the clusters to specify the structure of the error covariance matrix of a generalized least squares (GLS) model and use the feasible GLS estimator for estimating a linear regression model with a certain unknown degree of correlation between the residuals. The method suits many biology studies in which traits are embedded in genetic functional groups, and it facilitates the development of PGS research. We compare the method with fully parametric techniques on simulated data and illustrate its utility by examining a heterogeneous stock mouse data set from the Wellcome Trust Centre for Human Genetics. We demonstrate that the method successfully identifies clusters of traits and increases precision, power, and computational efficiency.
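The two-stage idea described above (cluster traits from the PGS covariance, then use the clusters to structure the error covariance in a feasible GLS fit) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the geneJAM implementation or its API: the function names `trait_clusters` and `fgls_by_cluster` are hypothetical, the data are simulated, and each trait is assumed to be regressed on an intercept and its own PGS. To stay NumPy-only, the sketch obtains the clusters by thresholding the sample covariance at the penalty level, relying on the known property that the connected components of that thresholded graph coincide with the block structure of the graphical lasso solution at the same penalty; it does not compute the graphical lasso estimate itself.

```python
import numpy as np

def trait_clusters(S, lam):
    """Label traits by connected components of the graph |S_ij| > lam.

    Stand-in for the block structure of a graphical lasso fit with penalty lam.
    """
    q = S.shape[0]
    adj = np.abs(S) > lam
    labels = -np.ones(q, dtype=int)
    current = 0
    for start in range(q):
        if labels[start] >= 0:
            continue
        labels[start] = current
        stack = [start]
        while stack:  # depth-first search over one component
            i = stack.pop()
            for j in np.flatnonzero(adj[i]):
                if labels[j] < 0:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

def fgls_by_cluster(X_list, Y, labels):
    """Feasible GLS with an error covariance that is zero between clusters."""
    n, q = Y.shape
    # Step 1: per-trait OLS to obtain residuals.
    resid = np.empty((n, q))
    for j, Xj in enumerate(X_list):
        bj = np.linalg.lstsq(Xj, Y[:, j], rcond=None)[0]
        resid[:, j] = Y[:, j] - Xj @ bj
    # Step 2: residual covariance, kept only within clusters.
    same = labels[:, None] == labels[None, :]
    Sigma = (resid.T @ resid / n) * same
    W = np.linalg.inv(Sigma)
    # Step 3: GLS normal equations for the stacked system,
    # using Cov(vec(E)) = Sigma (Kronecker) I_n block by block.
    offs = np.concatenate(([0], np.cumsum([Xj.shape[1] for Xj in X_list])))
    A = np.zeros((offs[-1], offs[-1]))
    b = np.zeros(offs[-1])
    for j in range(q):
        for k in range(q):
            A[offs[j]:offs[j + 1], offs[k]:offs[k + 1]] = W[j, k] * (X_list[j].T @ X_list[k])
            b[offs[j]:offs[j + 1]] += W[j, k] * (X_list[j].T @ Y[:, k])
    beta = np.linalg.solve(A, b)
    return [beta[offs[j]:offs[j + 1]] for j in range(q)], Sigma

# Simulated example (all values hypothetical): 4 traits in 2 clusters.
rng = np.random.default_rng(0)
n = 500
group = [0, 0, 1, 1]
P = rng.standard_normal((n, 2))[:, group] + 0.5 * rng.standard_normal((n, 4))
E = 0.8 * rng.standard_normal((n, 2))[:, group] + 0.6 * rng.standard_normal((n, 4))
true_slopes = np.array([1.0, -0.5, 0.8, 0.3])
Y = 0.5 + P * true_slopes + E

labels = trait_clusters(np.cov(P, rowvar=False), lam=0.5)
X_list = [np.column_stack([np.ones(n), P[:, j]]) for j in range(4)]
betas, Sigma_hat = fgls_by_cluster(X_list, Y, labels)
print("cluster labels:", labels)
print("estimated slopes:", [round(float(b[1]), 3) for b in betas])
```

Trait-specific designs are used here because, when every trait shares the same design matrix, the GLS estimate coincides with per-trait OLS and the covariance structure would not affect the coefficients. The threshold `lam=0.5` is chosen for the simulated data only; in practice the penalty would need to be tuned.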

AVAILABILITY AND IMPLEMENTATION: GeneJAM is implemented in R and available at: https://github.com/abuchardt/geneJAM.

Original language: English
Article number: vbad192
Journal: Bioinformatics Advances
Volume: 4
Issue number: 1
Number of pages: 16
DOI
Status: Published - 2024

Bibliographical note

© The Author(s) 2024. Published by Oxford University Press.

ID: 380747241