Patient Outcome and Zero-shot Diagnosis Prediction with Hypernetwork-guided Multitask Learning

Loading...
Thumbnail Image

Access rights

openAccess
publishedVersion

URL

Journal Title

Journal ISSN

Volume Title

A4 Artikkeli konferenssijulkaisussa

Date

2023

Major/Subject

Mcode

Degree programme

Language

en

Pages

Series

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pp. 589–598

Abstract

Multitask deep learning has been applied to patient outcome prediction from text, taking clinical notes as input and training deep neural networks with a joint loss function of multiple tasks. However, the joint training scheme of multitask learning suffers from inter-task interference, and diagnosis prediction among the multiple tasks has the generalizability issue due to rare diseases or unseen diagnoses. To solve these challenges, we propose a hypernetwork-based approach that generates task-conditioned parameters and coefficients of multitask prediction heads to learn task-specific prediction and balance the multitask learning. We also incorporate semantic task information to improve the generalizability of our task-conditioned multitask model. Experiments on early and discharge notes extracted from the real-world MIMIC database show our method can achieve better performance on multitask patient outcome prediction than strong baselines in most cases. Besides, our method can effectively handle the scenario with limited information and improve zero-shot prediction on unseen diagnosis categories.

Description

| openaire: EC/H2020/101016775/EU//INTERVENE

Keywords

Other note

Citation

Ji, S & Marttinen, P 2023, Patient Outcome and Zero-shot Diagnosis Prediction with Hypernetwork-guided Multitask Learning . in A Vlachos & I Augenstein (eds), Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics . Association for Computational Linguistics, pp. 589–598, Conference of the European Chapter of the Association for Computational Linguistics, Dubrovnik, Croatia, 02/05/2023 . https://doi.org/10.18653/v1/2023.eacl-main.43