Contrastive learning with time-aware transformer on EHR data

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorRenkonen, Risto
dc.contributor.advisorKoskinen, Miika
dc.contributor.authorTran, Linh
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorMarttinen, Pekka
dc.date.accessioned2025-12-17T18:04:50Z
dc.date.available2025-12-17T18:04:50Z
dc.date.issued2025-10-31
dc.description.abstractPatient representation learning has been a hot topic in the medical fields in recent years due to the fact that personal indexes of each person can be different from the population levels. Therefore, researchers have been trying contrastive learning using EHR data to achieve a personalized representation of each patient, resulting in more individual and specific recommendations. Earlier contrastive learning work has made use of scan images or clinical notes during hospitalization to analyze the survival status or predict the upcoming events of patients. However, the impact of both categorical and numerical data in a multimodal setting has not yet been focused on, even though hospitals and clinical sites have a considerable amount of such information collected over years. Therefore, in this paper, we concentrate on contrastive learning using multimodal data, also including special types with hierarchical structure such as ICD-10 and medication codes. We conduct exhaustive experiment with different types of data to see how each of them can be augmented for a contrastive learning framework. We also come up with a novel pretrained model schema for understanding the hierarchy of ICD-10 and medication codes, which is integrated into a unified multimodal model. For learning the temporal and semantic information of patients event sequences in a unified manner, we follow SimCLR framework with TAAT being the main encoder module and InfoNCE loss acting as the objective function. The obtained results show that our pretrained models can perfectly understand the inherent hierarchy within ICD-10 and medication codes, and our augmentation methods work very well for both normal data types and special data types with hierarchical structure. The preliminary findings from the multimodal model suggest that it has learned meaningful latent representations for patients and can form clusters of the ones with similar features, and these representations can be utilized for downstream tasks such as prediction and classification. We believe this would contribute a further step toward the goal of personalized healthcare with artificial intelligence.en
dc.format.extent71
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/141281
dc.identifier.urnURN:NBN:fi:aalto-202512179390
dc.language.isoenen
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesen
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesfi
dc.programmeMaster's Programme in Computer, Communication and Information Sciencessv
dc.programme.majorMachine Learning, Data Science and Artificial Intelligenceen
dc.subject.keywordcontrastive learningen
dc.subject.keywordEHR dataen
dc.subject.keyworddata augmentationen
dc.subject.keywordtime-aware transformeren
dc.subject.keywordhierarchical structureen
dc.subject.keywordBERTen
dc.titleContrastive learning with time-aware transformer on EHR dataen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessyes

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
master_Tran_Linh_2025.pdf
Size:
3.9 MB
Format:
Adobe Portable Document Format