Natural Language Processing for Healthcare: Text Representation, Multitask Learning, and Applications

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorMarttinen, Pekka, Prof., Aalto University, Department of Computer Science, Finland
dc.contributor.authorJi, Shaoxiong
dc.contributor.departmentTietotekniikan laitosfi
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorMarttinen, Pekka, Prof., Aalto University, Department of Computer Science, Finland
dc.date.accessioned2023-03-07T10:00:08Z
dc.date.available2023-03-07T10:00:08Z
dc.date.defence2023-03-22
dc.date.issued2023
dc.description.abstractThe emergence of deep learning algorithms in natural language processing has boosted the development of intelligent medical information systems. Firstly, this dissertation explores effective text encoding for clinical text. We propose a dilated convolutional attention network with dilated convolutions to capture complex medical patterns in long clinical notes by exponentially increasing the receptive field with the dilation size. Furthermore, we propose to utilize embedding injection and gated information propagation in the medical note encoding module for better representation learning of the lengthy clinical text. To capture the interaction between notes and codes, we explicitly model the underlying dependency between notes and codes and utilize textual descriptions of medical codes as external knowledge. We also adopt the contextualized graph embeddings to learn contextual information and causal relationships between text mentions such as drugs taken and adverse reactions. We also conduct an empirical analysis on the effectiveness of transfer learning with language model pretraining to clinical text encoding and medical code prediction. We develop a hierarchical encoding model to equip the pretrained language models with the capacity to encode long clinical notes. We further study the effect of pretraining in different domains and with different strategies. The comprehensive quantitative analysis shows that hierarchical encoding can capture interactions between distant words to some extent. Then, this dissertation investigates the multitask learning paradigm and its applications to healthcare. Multitask learning, motivated by human learning from previous tasks to help with a new task, makes full use of the information contained in each task and shares information between related tasks through common parameters. We adopt multitask learning for medical code prediction and demonstrate the benefits of leveraging multiple coding schemes. We design a recalibrated aggregation module to generate clinical document features with better quality and less noise in the shared modules of multitask networks. Finally, we consider the task context to improve multitask learning for healthcare. We propose to use a domain-adaptive pretrained model and hypernetwork-guided multitask heads to learn shared representation modules and task-specific predictors. Specifically, the domain-adaptive pretrained model is directly pretrained in the target domain of clinical applications. Task embeddings as task context are used to generate task-specific parameters with hypernetworks. Experiments show that the proposed hypernetwork-guided multitask learning method can achieve better predictive performance and semantic task information can improve the generalizability of the task-conditioned multitask model.en
dc.format.extent56 + app. 135
dc.format.mimetypeapplication/pdfen
dc.identifier.isbn978-952-64-1131-6 (electronic)
dc.identifier.isbn978-952-64-1130-9 (printed)
dc.identifier.issn1799-4942 (electronic)
dc.identifier.issn1799-4934 (printed)
dc.identifier.issn1799-4934 (ISSN-L)
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/119968
dc.identifier.urnURN:ISBN:978-952-64-1131-6
dc.language.isoenen
dc.opnGinter, Filip, Prof., University of Turku, Finland
dc.publisherAalto Universityen
dc.publisherAalto-yliopistofi
dc.relation.haspart[Publication 1]: Shaoxiong Ji, Erik Cambria, and Pekka Marttinen. Dilated Convolutional Attention Network for Medical Code Assignment from Clinical Text. In Proceedings of the 3rd Clinical Natural Language Processing Workshop, Virtual, pages 73-78, 2020
dc.relation.haspart[Publication 2]: Shaoxiong Ji, Shirui Pan, and Pekka Marttinen. Medical Code Assignment with Gated Convolution and Note-Code Interaction. In Findings of the Association for Computational Linguistics: ACLIJCNLP 2021, Virtual, pages 1034-1043, 2021. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-202202091828. DOI: 10.18653/v1/2021.findings-acl.89
dc.relation.haspart[Publication 3]: Shaoxiong Ji, Matti Hölttä, and Pekka Marttinen. Does the Magic of BERT Apply to Medical Code Assignment? A Quantitative Study. Computers in Biology and Medicine, Volume 139, 104998, 2021. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-2021110410005. DOI: 10.1016/j.compbiomed.2021.104998
dc.relation.haspart[Publication 4]: Wei Sun, Shaoxiong Ji, Erik Cambria, and Pekka Marttinen. Multitask Recalibrated Aggregation Network for Medical Code Prediction. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML-PKDD), vol 12978, Springer, Cham. 2021. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-2021111710276. DOI: 10.1007/978-3-030-86514-6_23
dc.relation.haspart[Publication 5]: Wei Sun, Shaoxiong Ji, Erik Cambria, and Pekka Marttinen. Multitask Balanced and Recalibrated Network for Medical Code Prediction. ACM Transactions on Intelligent Systems and Technology, Aug 2022. Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-202211306749. DOI: 10.1145/3563041
dc.relation.haspart[Publication 6]: Shaoxiong Ji and Pekka Marttinen. Patient Outcome and Zero-shot Diagnosis Prediction with Hypernetwork-guided Multitask Learning. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
dc.relation.haspart[Publication 7]: Shaoxiong Ji, Wei Sun, Hang Dong, Honghan Wu, and Pekka Marttinen. A Unified Review of Deep Learning for Automated Medical Coding. arXiv preprint arXiv:2201.02797, Jan 2022
dc.relation.haspart[Publication 8]: Ya Gao, Shaoxiong Ji, Tongxuan Zhang, Prayag Tiwari, and Pekka Marttinen. Contextualized Graph Embeddings for Adverse Drug Event Detection. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML-PKDD), Springer, Cham. 2022
dc.relation.ispartofseriesAalto University publication series DOCTORAL THESESen
dc.relation.ispartofseries11/2023
dc.revVerberne, Suzan, Assoc. Prof., Leiden University, Netherlands
dc.revDalianis, Hercules, Prof., Stockholm University, Sweden
dc.subject.keywordnatural language processingen
dc.subject.keywordhealthcare applicationsen
dc.subject.keywordtext representationen
dc.subject.keywordmultitask learningen
dc.subject.otherComputer scienceen
dc.titleNatural Language Processing for Healthcare: Text Representation, Multitask Learning, and Applicationsen
dc.typeG5 Artikkeliväitöskirjafi
dc.type.dcmitypetexten
dc.type.ontasotDoctoral dissertation (article-based)en
dc.type.ontasotVäitöskirja (artikkeli)fi
local.aalto.acrisexportstatuschecked 2023-03-28_1336
local.aalto.archiveyes
local.aalto.formfolder2023_03_06_klo_14_09
local.aalto.infraScience-IT

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
isbn9789526411316.pdf
Size:
368.51 KB
Format:
Adobe Portable Document Format