Survival Modeling Using Factor Analysis Data Integration.

 |  Login

Show simple item record

dc.contributor Aalto-yliopisto fi
dc.contributor Aalto University en
dc.contributor.advisor Marttinen, Pekka Ali, Mehreen 2015-11-19T11:30:39Z 2015-11-19T11:30:39Z 2015-11-05
dc.description.abstract Biology proves that complex diseases are a result of an interplay of genetics and environmental factors. This study aims to combine both by integrating `multi-omics' data with clinical data, thus helping biological and medical researchers in the process of disease diagnosis, patient stratification, disease mechanism analysis and effective treatment decisions. Multi-view biological data from a cohort from National Institute for Health and Welfare (THL), Finland, has been explored using factor models. Factor models reduce high-dimensional data into lower-dimensional factor space. Factor analysis (FA) is the simplest factor model that represents each data feature as a weighted sum of latent factors, separating noise. Bayesian multi-view group-sparse factor analysis (GFA) is another factor model that has been examined in this study. GFA is an extension of FA with sparsity added to the model. GFA is applied on high-dimensional data where features can be naturally divided into different groups (views). Unlike FA, GFA can record component (latent factor) activity for views (groups of related features), this makes GFA a well-suited model for multi-view data sets. Survival models have been utilized to make cardiovascular disease (CVD) risk predictions based on the dependencies between the multiple views as represented by factor models. Cox proportional hazard model is applied to analyze data until a CVD risk event occurs and the output variable is time. This study will provide a stepping stone for exploring GFA, in combination with Cox survival model, for a better latent factor representation of multi-view data sets. en
dc.format.extent 66 + 9
dc.language.iso en en
dc.title Survival Modeling Using Factor Analysis Data Integration. en
dc.type G2 Pro gradu, diplomityö en Perustieteiden korkeakoulu fi
dc.subject.keyword Bayesian data analysis en
dc.subject.keyword factor models en
dc.subject.keyword survival models en
dc.subject.keyword risk predictions en
dc.subject.keyword cardiovascular disease en
dc.identifier.urn URN:NBN:fi:aalto-201511205260
dc.programme.major Computational Systems Biology en
dc.programme.mcode IL3013
dc.type.ontasot Master's thesis en
dc.type.ontasot Diplomityö fi
dc.contributor.supervisor Kaski, Samuel
dc.programme Master's Degree Programme in Computational and Systems Biology (euSYSBIO) en

Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search archive

Advanced Search

article-iconSubmit a publication


My Account