Bayesian Variable Selection in Searching for Additive and Dominant Effects in Genome-Wide Data

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorPeltola, Tomi
dc.contributor.authorMarttinen, Pekka
dc.contributor.authorJula, Antti
dc.contributor.authorSalomaa, Veikko
dc.contributor.authorPerola, Markus
dc.contributor.authorVehtari, Aki
dc.contributor.departmentDepartment of Computer Scienceen
dc.date.accessioned2017-05-11T06:43:29Z
dc.date.available2017-05-11T06:43:29Z
dc.date.issued2012
dc.description.abstractAlthough complex diseases and traits are thought to have multifactorial genetic basis, the common methods in genome-wide association analyses test each variant for association independent of the others. This computational simplification may lead to reduced power to identify variants with small effect sizes and requires correcting for multiple hypothesis tests with complex relationships. However, advances in computational methods and increase in computational resources are enabling the computation of models that adhere more closely to the theory of multifactorial inheritance. Here, a Bayesian variable selection and model averaging approach is formulated for searching for additive and dominant genetic effects. The approach considers simultaneously all available variants for inclusion as predictors in a linear genotype-phenotype mapping and averages over the uncertainty in the variable selection. This leads to naturally interpretable summary quantities on the significances of the variants and their contribution to the genetic basis of the studied trait. We first characterize the behavior of the approach in simulations. The results indicate a gain in the causal variant identification performance when additive and dominant variation are simulated, with a negligible loss of power in purely additive case. An application to the analysis of high- and low-density lipoprotein cholesterol levels in a dataset of 3895 Finns is then presented, demonstrating the feasibility of the approach at the current scale of single-nucleotide polymorphism data. We describe a Markov chain Monte Carlo algorithm for the computation and give suggestions on the specification of prior parameters using commonly available prior information. An open-source software implementing the method is available at http://www.lce.hut.fi/research/mm/bmagwa/ and https://github.com/to-mi/.en
dc.description.versionPeer revieweden
dc.format.extent1-11
dc.format.mimetypeapplication/pdf
dc.identifier.citationPeltola , T , Marttinen , P , Jula , A , Salomaa , V , Perola , M & Vehtari , A 2012 , ' Bayesian Variable Selection in Searching for Additive and Dominant Effects in Genome-Wide Data ' , PloS one , vol. 7 , no. 1 , e29115 , pp. 1-11 . https://doi.org/10.1371/journal.pone.0029115en
dc.identifier.doi10.1371/journal.pone.0029115
dc.identifier.issn1932-6203
dc.identifier.otherPURE UUID: 079f721b-8684-4e77-b5bc-f3dd58f2a78e
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/079f721b-8684-4e77-b5bc-f3dd58f2a78e
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/12866920/journal.pone.0029115.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/25443
dc.identifier.urnURN:NBN:fi:aalto-201705113827
dc.language.isoenen
dc.relation.ispartofseriesPLOS ONEen
dc.relation.ispartofseriesVolume 7, issue 1en
dc.rightsopenAccessen
dc.titleBayesian Variable Selection in Searching for Additive and Dominant Effects in Genome-Wide Dataen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files