Efficient Transfer Learning with Sequential and Multi-Modal Approaches for Electronic Health Records

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorMarttinen, Pekka, Prof., Aalto University, Department of Computer Science, Finland
dc.contributor.authorKumar, Yogesh
dc.contributor.departmentTietotekniikan laitosfi
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.labMachine Learning for Health (ML4H)en
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorMarttinen, Pekka, Prof., Aalto University, Department of Computer Science, Finland
dc.date.accessioned2024-10-15T09:00:36Z
dc.date.available2024-10-15T09:00:36Z
dc.date.defence2024-10-25
dc.date.issued2024
dc.description.abstractThe digital transformation in healthcare has dramatically increased data availability, yet the potential for data-driven insights is frequently constrained by the quality of data. Securing high-quality data is particularly challenging in fields like healthcare, where expert involvement is crucial for gathering, annotating, and ensuring data quality. This thesis applies deep learning to Electronic Health Records (EHR) to enhance predictive accuracy and operational efficiency. Deep learning models are particularly adept at capturing complex and non-linear relationships present in EHR data, but they require extensive training datasets to be effective. This study explores and develops ways to employ transfer learning to effectively mitigate these data constraints. The thesis tackles four key research questions aimed at improving healthcare outcomes using EHR data. It begins by enhancing prediction accuracy for healthcare utilization through an RNN model with multi-headed attention, which significantly outperforms traditional count-based models and shows robust time generalization. The study then introduces SANSformer, a custom-built, attention-free sequential model optimized for EHR specifics. This model excels in predicting healthcare demand, particularly in diverse patient subgroups, while managing limited data scenarios via transfer learning. Thirdly, the thesis explores the enhancement of neural network similarity metrics in assessing functional similarities, particularly in the context of transfer learning and model performance. It introduces a covariate adjustment to correct traditional metrics, which are often misled by input data structures, ensuring they reflect true functional similarities. Lastly, it explores the integration of expert annotations into the medical CLIP model, eCLIP, which utilizes radiologist eye-gaze heatmaps to substantially improve the quality of embeddings and sample efficiency in multi-modal medical imaging. The findings from this thesis highlight the significant potential of deep learning to enhance prediction of healthcare outcomes by addressing the unique challenges of EHR data. The research adapts sophisticated deep learning models to meet the complex demands of EHR data and introduces novel techniques like covariate adjustment for similarity metrics and integrating expert annotations to set a foundation for further advancements in healthcare analytics.en
dc.format.extent66 + app. 96
dc.format.mimetypeapplication/pdfen
dc.identifier.isbn978-952-64-2031-8 (electronic)
dc.identifier.isbn978-952-64-2030-1 (printed)
dc.identifier.issn1799-4942 (electronic)
dc.identifier.issn1799-4934 (printed)
dc.identifier.issn1799-4934 (ISSN-L)
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/131219
dc.identifier.urnURN:ISBN:978-952-64-2031-8
dc.language.isoenen
dc.opnClemmensen, Line, Dr., Technical University of Denmark (DTU)
dc.publisherAalto Universityen
dc.publisherAalto-yliopistofi
dc.relation.haspart[Publication 1]: Yogesh Kumar, Henri Salo, Tuomo Nieminen, Kristian Vepsalainen, Sangita Kulathinal, Pekka Marttinen. Predicting utilization of healthcare services from individul disease trajectories using RNNs with multiheaded attention. Machine Learning for Health (ML4H) at NeurIPS 2019 (Proceedings of Machine Learning Research), 116:93-111, 2020. https://urn.fi/URN:NBN:fi-fe2019121948937
dc.relation.haspart[Publication 2]: Yogesh Kumar, Alexander Ilin, Henri Salo, Sangita Kulathinal, Maarit K Leinonen, Pekka Marttinen. Self-Supervised Forecasting in Electronic Health Records with Attention-Free Models. IEEE Transactions on Artificial Intelligence, vol. 5, 08:3926-3938 Aug 2024. Full text in Acris/Aaltodoc: https://urn.fi/URN:NBN:fi:aalto-202402072346. DOI: 10.1109/TAI.2024.3353164
dc.relation.haspart[Publication 3]: Tianyu Cu, Yogesh Kumar, Pekka Marttinen, Samuel Kaski. Deconfounded Representation Similarity for Comparison of Neural Networks. Advances in Neural Information Processing Systems, 35:19138–19151, 2022. https://urn.fi/URN:NBN:fi:aalto-202306053612.
dc.relation.haspart[Publication 4]: Yogesh Kumar, Pekka Marttinen. Improving Medical Multi-modal Contrastive Learning with Expert Annotations. Accepted for publication in 18th European Conference on Computer Vision (ECCV), 2024. https://arxiv.org/abs/2403.10153. DOI: 10.48550/arXiv.2403.10153
dc.relation.ispartofseriesAalto University publication series DOCTORAL THESESen
dc.relation.ispartofseries197/2024
dc.revMotani, Mehul, Dr., National University of Singapore, Singapore
dc.revJutzeler, Catherine, Dr., ETH Zurich, Switzerland
dc.subject.keyworddeep learningen
dc.subject.keywordtransfer learningen
dc.subject.keywordhealthcareen
dc.subject.otherComputer scienceen
dc.titleEfficient Transfer Learning with Sequential and Multi-Modal Approaches for Electronic Health Recordsen
dc.typeG5 Artikkeliväitöskirjafi
dc.type.dcmitypetexten
dc.type.ontasotDoctoral dissertation (article-based)en
dc.type.ontasotVäitöskirja (artikkeli)fi
local.aalto.acrisexportstatuschecked 2024-10-25_1352
local.aalto.archiveyes
local.aalto.formfolder2024_10_15_klo_08_21
local.aalto.infraScience-IT

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
isbn9789526420318.pdf
Size:
2.97 MB
Format:
Adobe Portable Document Format