Efficient Transfer Learning with Sequential and Multi-Modal Approaches for Electronic Health Records

Loading...
Thumbnail Image

URL

Journal Title

Journal ISSN

Volume Title

School of Science | Doctoral thesis (article-based) | Defence date: 2024-10-25

Date

Major/Subject

Mcode

Degree programme

Language

en

Pages

66 + app. 96

Series

Aalto University publication series DOCTORAL THESES, 197/2024

Abstract

The digital transformation in healthcare has dramatically increased data availability, yet the potential for data-driven insights is frequently constrained by the quality of data. Securing high-quality data is particularly challenging in fields like healthcare, where expert involvement is crucial for gathering, annotating, and ensuring data quality. This thesis applies deep learning to Electronic Health Records (EHR) to enhance predictive accuracy and operational efficiency. Deep learning models are particularly adept at capturing complex and non-linear relationships present in EHR data, but they require extensive training datasets to be effective. This study explores and develops ways to employ transfer learning to effectively mitigate these data constraints. The thesis tackles four key research questions aimed at improving healthcare outcomes using EHR data. It begins by enhancing prediction accuracy for healthcare utilization through an RNN model with multi-headed attention, which significantly outperforms traditional count-based models and shows robust time generalization. The study then introduces SANSformer, a custom-built, attention-free sequential model optimized for EHR specifics. This model excels in predicting healthcare demand, particularly in diverse patient subgroups, while managing limited data scenarios via transfer learning. Thirdly, the thesis explores the enhancement of neural network similarity metrics in assessing functional similarities, particularly in the context of transfer learning and model performance. It introduces a covariate adjustment to correct traditional metrics, which are often misled by input data structures, ensuring they reflect true functional similarities. Lastly, it explores the integration of expert annotations into the medical CLIP model, eCLIP, which utilizes radiologist eye-gaze heatmaps to substantially improve the quality of embeddings and sample efficiency in multi-modal medical imaging. The findings from this thesis highlight the significant potential of deep learning to enhance prediction of healthcare outcomes by addressing the unique challenges of EHR data. The research adapts sophisticated deep learning models to meet the complex demands of EHR data and introduces novel techniques like covariate adjustment for similarity metrics and integrating expert annotations to set a foundation for further advancements in healthcare analytics.

Description

Supervising professor

Marttinen, Pekka, Prof., Aalto University, Department of Computer Science, Finland

Thesis advisor

Marttinen, Pekka, Prof., Aalto University, Department of Computer Science, Finland

Other note

Parts

  • [Publication 1]: Yogesh Kumar, Henri Salo, Tuomo Nieminen, Kristian Vepsalainen, Sangita Kulathinal, Pekka Marttinen. Predicting utilization of healthcare services from individul disease trajectories using RNNs with multiheaded attention. Machine Learning for Health (ML4H) at NeurIPS 2019 (Proceedings of Machine Learning Research), 116:93-111, 2020. https://urn.fi/URN:NBN:fi-fe2019121948937
  • [Publication 2]: Yogesh Kumar, Alexander Ilin, Henri Salo, Sangita Kulathinal, Maarit K Leinonen, Pekka Marttinen. Self-Supervised Forecasting in Electronic Health Records with Attention-Free Models. IEEE Transactions on Artificial Intelligence, vol. 5, 08:3926-3938 Aug 2024.
    DOI: 10.1109/TAI.2024.3353164 View at publisher
  • [Publication 3]: Tianyu Cu, Yogesh Kumar, Pekka Marttinen, Samuel Kaski. Deconfounded Representation Similarity for Comparison of Neural Networks. Advances in Neural Information Processing Systems, 35:19138–19151, 2022. https://urn.fi/URN:NBN:fi:aalto-202306053612.
  • [Publication 4]: Yogesh Kumar, Pekka Marttinen. Improving Medical Multi-modal Contrastive Learning with Expert Annotations. Accepted for publication in 18th European Conference on Computer Vision (ECCV), 2024. https://arxiv.org/abs/2403.10153.
    DOI: 10.48550/arXiv.2403.10153 View at publisher

Citation