Browsing by Author "Marttinen, Pekka, Dr., Aalto University, Department of Information and Computer Science, Finland"
Now showing 1 - 1 of 1
Results Per Page
Sort Options
Item Sparse Bayesian Linear Models: Computational Advances and Applications in Epidemiology(Aalto University, 2014) Peltola, Tomi; Vehtari, Aki, Prof., Aalto University, Department of Biomedical Engineering and Computational Science, Finland; Marttinen, Pekka, Dr., Aalto University, Department of Information and Computer Science, Finland; Lääketieteellisen tekniikan ja laskennallisen tieteen laitos; Department of Biomedical Engineering and Computational Science; Perustieteiden korkeakoulu; School of Science; Lampinen, Jouko, Prof., Aalto University, Department of Biomedical Engineering and Computational Science, FinlandRecent advances in measurement technologies have transformed the landscape of studies in the genetic and metabolic determinants of diseases and other complex traits. DNA and blood samples can be cost- and time-efficiently interrogated for millions of genetic markers and hundreds of circulating metabolites. While the scale and unbiased nature of the characterization of the individual samples creates opportunities for new discoveries, they also pose a challenge for the statistical analysis of the data. One approach for tackling the issues, and a focus of much recent research in statistical methodology, is searching for linear relationships with a sparsity assumption, that is, the presence of only a limited number of practically relevant relationships among the vast number of possibilities. This thesis studies aspects of the statistical modelling and computation with the linearity and sparsity assumptions in the framework of Bayesian data analysis. First, a hierarchical extension of the spike and slab prior distribution for sparse linear regression modelling, to allow additive and dominant effects in genome-wide association analysis, is presented. The model is applied to search for genetic markers related to blood cholesterol levels. A tailored, finitely adaptive Markov chain Monte Carlo algorithm is studied for the computation. Second, an approach for constructing deterministic Gaussian approximations for Bayesian linear latent variable models using the expectation propagation method is described. The main advance is an efficient numerical solution to the moment integrals for bilinear probability factors. Third, a model for the prediction of the risk of adverse cardiovascular events in diabetic individuals using candidate biomarkers is presented. The model is extended hierarchically to include data from non-diabetic individuals. Shrinkage priors and projective covariate selection are applied to identify biomarkers with predictive value. The results of the studies demonstrate benefits from the hierarchical Bayesian modelling. Despite the advances here and generally in the literature, the computation in sparse models and large datasets remains challenging. On the other hand, given the fast pace in the development of deterministic approximation methods, assessing their role in predictive covariate selection would seem timely.