Block HSIC Lasso: Model-free biomarker detection for ultra-high dimensional data
Loading...
Access rights
openAccess
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Date
2019-07-15
Department
Major/Subject
Mcode
Degree programme
Language
en
Pages
i427-i435
Series
Bioinformatics, Volume 35, issue 14
Abstract
Motivation: Finding non-linear relationships between biomolecules and a biological outcome is computationally expensive and statistically challenging. Existing methods have important drawbacks, including among others lack of parsimony, non-convexity and computational overhead. Here we propose block HSIC Lasso, a non-linear feature selector that does not present the previous drawbacks. Results: We compare block HSIC Lasso to other state-of-the-art feature selection techniques in both synthetic and real data, including experiments over three common types of genomic data: gene-expression microarrays, single-cell RNA sequencing and genome-wide association studies. In all cases, we observe that features selected by block HSIC Lasso retain more information about the underlying biology than those selected by other techniques. As a proof of concept, we applied block HSIC Lasso to a single-cell RNA sequencing experiment on mouse hippocampus. We discovered that many genes linked in the past to brain development and function are involved in the biological differences between the types of neurons.Description
| openaire: EC/H2020/666003/EU//IC-3i-PhD
Keywords
Other note
Citation
Climente-González, H, Azencott, C A, Kaski, S & Yamada, M 2019, ' Block HSIC Lasso : Model-free biomarker detection for ultra-high dimensional data ', Bioinformatics, vol. 35, no. 14, btz333, pp. i427-i435 . https://doi.org/10.1093/bioinformatics/btz333